Sample records for high-performance parallel coupler

  1. Refractive index engineering of high performance coupler for compact photonic integrated circuits

    NASA Astrophysics Data System (ADS)

    Liu, Lu; Zhou, Zhiping

    2017-04-01

    High performance couplers are highly desired in many applications, but their design is limited by the nearly unchangeable refractive index of the material. To tackle this issue, a refractive index engineering method is investigated, which can be realized by subwavelength gratings. Subwavelength gratings are periodic structures with pitches small enough to locally synthesize the refractive index of photonic waveguides, which allows direct control of the optical profile as well as an easier fabrication process. This review provides an introduction to the basics of subwavelength structures and pays special attention to the design strategies of some representative examples of subwavelength grating devices, including edge couplers, fiber-chip grating couplers, directional couplers and multimode interference couplers. Benefiting from the subwavelength grating, which can engineer the refractive index as well as birefringence and dispersion, these devices show better performance than their conventional counterparts.

  2. Highly efficient coupler for dielectric slot waveguides and hybrid plasmonic waveguides

    NASA Astrophysics Data System (ADS)

    Yu, Jiyao; Ohtera, Yasuo; Yamada, Hirohito

    2018-05-01

    A compact, highly efficient optical coupler for dielectric slot waveguides and hybrid plasmonic waveguides based on transition layers (air slot grooves) was investigated. The power-coupling efficiency of 75% for the direct coupling case increased to 90% following the insertion of an intermediate section. By performing time-averaged Poynting vector analysis, we successfully separated the factors of transmission, reflection, and radiation at the coupler interface. We found that the insertion of optimal air grooves into the coupler structure contributed to the improvement of coupling performance. The proposed compact structure is characterized by a high transmission efficiency, low reflection, small length, and broad-band spectrum response.

  3. Widely tunable long-period waveguide grating couplers

    NASA Astrophysics Data System (ADS)

    Bai, Y.; Liu, Q.; Lor, K. P.; Chiang, K. S.

    2006-12-01

    We demonstrate experimentally two widely tunable optical couplers formed with parallel long-period polymer waveguide gratings. One of the couplers consists of two parallel gratings and shows a peak coupling efficiency of ~34%. The resonance wavelength of the coupler can be tuned thermally with a sensitivity of 4.7 nm/°C. The experimental results agree well with the coupled-mode analysis. The other coupler consists of an array of ten widely separated gratings. A peak coupling efficiency of ~11% is obtained between the two best matched gratings in the array and the resonance wavelength can be tuned thermally with a sensitivity of -3.8 nm/°C. These couplers have the potential to be further developed into practical broadband add/drop multiplexers and signal dividers.

  4. A Proposal for a High-Voltage Transmission Line Directional Coupler

    DOE PAGES

    Olsen, R. G.; Li, Zhi

    2017-02-01

    Directional couplers are devices generally used in high frequency transmission lines and waveguides that respond to forward and reverse traveling waves separately. Hence they can be used to either measure standing wave ratio in the steady state or to determine the direction of a propagating transient wave. Here, a design is proposed for a directional coupler to be used on multimode high voltage transmission lines. Its performance is analyzed and several suggestions are made for improving its design.

  5. Role of reinforcement couplers in serviceability performance of concrete members

    NASA Astrophysics Data System (ADS)

    Ng, P. L.; Guan, G. X.; Kwan, A. K. H.

    2017-10-01

    Connection of reinforcing bars by couplers is a common form of reinforcement splicing. However, the variation of stiffness at the location of couplers and the potentially excessive residual slips are suspected to cause adverse impact on the serviceability, especially for structural members subjected to repeated loading. This paper studies the role of couplers in the serviceability performance of concrete members. Relevant provisions in design codes are reviewed and compared. Laboratory tests are conducted to investigate the slip behaviour of couplers. A section analysis approach based on equivalent stiffness model is proposed to account for the effects of couplers, and formulations of crack width calculation are explored for use in structural design.

  6. Influence of load by high power on the optical coupler

    NASA Astrophysics Data System (ADS)

    Bednarek, Lukas; Poboril, Radek; Vanderka, Ales; Hajek, Lukas; Nedoma, Jan; Vasinek, Vladimir

    2016-12-01

    The aging of optical components is currently a topic of considerable interest. Some investigations therefore accelerate the aging of optical components by thermal, high-power and gamma loads. This paper examines the influence of loading by a laser with high optical power on the transmission parameters of an optical coupler. The investigated coupler has one input and eight outputs (1x8). The high-power load is realized using a fiber laser with a cascade configuration of EDFA amplifiers. The output power of the amplifier is approximately 250 mW. The duration of each load ranges from 104 hours to 139 hours. After each load, the input power and the output powers of all branches are measured. The following parameters of the optical coupler are then calculated: the insertion losses of the individual branches, split ratio, total losses, homogeneity of the losses and cross-talk between different branches. All measurements are performed at wavelengths of 1310 nm and 1550 nm. Each optical power is measured 20 times to exclude statistical measurement error. After measurement, the coupler is connected to the amplifier for the next load cycle. The paper contains an evaluation of the results of the coupler before and after four load cycles.

  7. Folded waveguide coupler

    DOEpatents

    Owens, Thomas L.

    1988-03-01

    A resonant cavity waveguide coupler for ICRH of a magnetically confined plasma. The coupler consists of a series of inter-leaved metallic vanes disposed within an enclosure analogous to a very wide, simple rectangular waveguide that has been "folded" several times. At the mouth of the coupler, a polarizing plate is provided which has coupling apertures aligned with selected folds of the waveguide through which rf waves are launched with magnetic fields of the waves aligned in parallel with the magnetic fields confining the plasma being heated to provide coupling to the fast magnetosonic wave within the plasma in the frequency range of about 50-200 MHz. A shorting plate terminates the back of the cavity at a distance approximately equal to one-half the guide wavelength from the mouth of the coupler to ensure that the electric field of the waves launched through the polarizing plate apertures is small while the magnetic field is near a maximum. Power is fed into the coupler folded cavity by means of an input coaxial line feed arrangement at a point which provides an impedance match between the cavity and the coaxial input line.

  8. Single Fiber Star Couplers. [optical waveguides for spacecraft communication]

    NASA Technical Reports Server (NTRS)

    Asawa, C. K.

    1979-01-01

    An ion exchange process was developed and used in the fabrication of state-of-the-art planar star couplers for distribution of optical radiation between optical fibers. An 8 x 8 planar transmission star coupler was packaged for evaluation purposes with sixteen fiber connectors and sixteen pigtails. Likewise a transmission star coupler and an eight-port reflection star coupler with eight-fiber ribbons rigidly attached to these couplers, and a planar coupler with silicon guides and a parallel channel guide with pigtails were also fabricated. Optical measurements of the transmission star couplers are included with a description of the manufacturing process.

  9. Optimization of nonbinary slanted surface-relief gratings as high-efficiency broadband couplers for light guides.

    PubMed

    Bai, Benfeng; Laukkanen, Janne; Kuittinen, Markku; Siitonen, Samuli

    2010-10-01

    We propose and investigate the use of slanted surface-relief gratings with nonbinary profiles as high-efficiency broadband couplers for light guides. First, a Chandezon-method-based rigorous numerical formulation is presented for modeling the slanted gratings with overhanging profiles. Then, two typical types of slanted grating couplers--a sinusoidal one and a trapezoidal one--are studied and optimized numerically, both exhibiting a high coupling efficiency of over 50% over the full band of a white LED under normal illumination by unpolarized light. Reasonable structural parameters with good tolerance have been obtained for the optimized designs. It is found that the performance of the couplers depends little on the grating profile shape, but primarily on the grating period and the slant angle of the ridge. The underlying mechanism is analyzed by the equivalence rules of gratings, which provide useful guidelines for the design and fabrication of the couplers. A preliminary investigation has been performed on the fabrication and replication of the slanted overhanging grating couplers, which shows the feasibility of fabrication with mature microfabrication techniques and the prospects for mass production.

  10. K-Band Substrate Integrated Waveguide (SIW) Coupler

    NASA Astrophysics Data System (ADS)

    Khalid, N.; Ibrahim, S. Z.; Hoon, W. F.

    2018-03-01

    This paper presents a coupler designed on a Rogers RO4003 substrate. The four-port network coupler operates at 18-26 GHz and is designed using the substrate integrated waveguide (SIW) method. SIWs are high performance broadband interconnects with excellent immunity to electromagnetic interference and are suitable for microwave and millimetre-wave electronics applications, as well as wideband systems. The coupler designs are investigated using the CST Microwave Studio simulation tool. The proposed couplers are capable of covering the frequency range and provide better scattering parameter (S-parameter) performance. The technology is successfully applied to millimetre-wave and microwave applications. Designs and results are presented and discussed in this paper. The simulated operating band of the proposed coupler extends from 18 to 26 GHz, a percentage bandwidth of 36.36%.
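
The 36.36% figure quoted in this record is consistent with the usual definition of percentage (fractional) bandwidth, taken relative to the arithmetic centre frequency. The following is a minimal sketch of that arithmetic; the function name is hypothetical, and the centre-frequency definition is an assumption, since the abstract does not state which definition the authors used.

```python
def fractional_bandwidth_percent(f_low_ghz: float, f_high_ghz: float) -> float:
    """Bandwidth as a percentage of the arithmetic centre frequency.

    Assumption: centre frequency = (f_low + f_high) / 2.
    """
    centre_ghz = (f_low_ghz + f_high_ghz) / 2.0
    return 100.0 * (f_high_ghz - f_low_ghz) / centre_ghz


# Band edges taken from the abstract: 18 GHz and 26 GHz.
print(f"{fractional_bandwidth_percent(18.0, 26.0):.2f} %")  # prints "36.36 %"
```

With a 22 GHz centre and an 8 GHz span, the ratio is 8/22, which reproduces the quoted value.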

  11. RF Couplers for Normal-Conducting Photoinjector of High-Power CW FEL

    NASA Astrophysics Data System (ADS)

    Kurennoy, Sergey; Schrage, Dale; Wood, Richard; Schultheiss, Tom; Rathke, John; Young, Lloyd

    2004-05-01

    A high-current emittance-compensated RF photoinjector is a key enabling technology for a high-power CW FEL. A preliminary design of a normal-conducting, 2.5-cell pi-mode, 700-MHz CW RF photoinjector that will be built for demonstration purposes is completed. This photoinjector will be capable of accelerating a 100-mA electron beam (3 nC per bunch at 35 MHz bunch repetition rate) to 2.7 MeV while providing an emittance below 7 mm-mrad at the wiggler. More than 1 MW of RF power will be fed into the photoinjector cavity through two ridge-loaded tapered waveguides. The waveguides are coupled to the cavity by "dog-bone" irises cut in a thick wall. Due to CW operation of the photoinjector, the cooling of the coupler irises is a rather challenging thermal management project. This paper presents results of a detailed electromagnetic modeling of the coupler-cavity system, which has been performed to select the coupler design that minimizes the iris heating due to RF power loss in its walls.
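
The beam parameters in this record can be cross-checked with simple arithmetic: average current is bunch charge times repetition rate, and beam power is current times the energy gain per unit charge. The sketch below uses only the numbers quoted in the abstract; the function names are hypothetical.

```python
def average_current_a(bunch_charge_c: float, rep_rate_hz: float) -> float:
    """Average beam current in amperes for a CW bunch train."""
    return bunch_charge_c * rep_rate_hz


def beam_power_w(current_a: float, energy_gain_ev: float) -> float:
    """Beam power in watts; for electrons, energy in eV is numerically the voltage in V."""
    return current_a * energy_gain_ev


# Values from the abstract: 3 nC per bunch, 35 MHz repetition rate, 2.7 MeV.
current = average_current_a(3e-9, 35e6)
print(f"average current: {current * 1e3:.0f} mA")
print(f"beam power at the nominal 100 mA: {beam_power_w(0.100, 2.7e6) / 1e3:.0f} kW")
```

The product 3 nC x 35 MHz gives 105 mA, which matches the "100-mA" beam quoted in the abstract to within rounding.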

  12. Low-crosstalk orbital angular momentum fiber coupler design.

    PubMed

    Zhang, Zhishen; Gan, Jiulin; Heng, Xiaobo; Li, Muqiao; Li, Jiong; Xu, Shanhui; Yang, Zhongmin

    2017-05-15

    A fiber coupler for low-crosstalk orbital angular momentum mode beam splitter is proposed with the structure of two separate and parallel microfibers. By properly setting the center-to-center distance between microfibers, the crosstalk is less than -20 dB, which means that the purity of the needed OAM mode in output port is higher than 99%. For a fixed overlapping length, high coupling efficiency (>97%) is achieved in 1545-1560 nm. The operating wavelength is tuned to the whole C-band by using the thermosensitive liquid. So the designed coupler can achieve the tunable coupling ratio over the whole C-band, which is a prospective component for the further OAM fiber system.
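
The link between a -20 dB crosstalk level and a mode purity above 99% can be illustrated with a short calculation. This is a sketch under a stated assumption: crosstalk is taken as the ratio of unwanted-mode power to wanted-mode power, and purity as the wanted fraction of the total. The abstract does not give its exact definitions, and the function names are hypothetical.

```python
def db_to_power_ratio(level_db: float) -> float:
    """Convert a level in dB to a linear power ratio."""
    return 10.0 ** (level_db / 10.0)


def mode_purity(crosstalk_db: float) -> float:
    """Fraction of output power in the wanted mode.

    Assumption: crosstalk = P_unwanted / P_wanted, so
    purity = P_wanted / (P_wanted + P_unwanted) = 1 / (1 + ratio).
    """
    return 1.0 / (1.0 + db_to_power_ratio(crosstalk_db))


print(f"power ratio at -20 dB: {db_to_power_ratio(-20.0):.4f}")
print(f"purity at -20 dB: {100.0 * mode_purity(-20.0):.2f} %")
```

At -20 dB the unwanted power is 1% of the wanted power, so any crosstalk below that level leaves more than 99% of the output in the wanted mode.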

  13. Total internal reflection-evanescent coupler for fiber-to-waveguide integration of planar optoelectric devices.

    PubMed

    Lu, Zhaolin; Prather, Dennis W

    2004-08-01

    We present a method for parallel coupling from a single-mode fiber, or fiber ribbon, into a silicon-on-insulator waveguide for integration with silicon optoelectronic circuits. The coupler incorporates the advantages of the vertically tapered waveguides and prism couplers, yet offers the flexibility of planar integration. The coupler can be fabricated by use of either wafer polishing technology or gray-scale photolithography. When optimal coupling is achieved in our experimental setup, the coupler can be packaged by epoxy bonding to form a fiber-waveguide parallel coupler or connector. Two-dimensional electromagnetic calculation predicts a coupling efficiency of 77% (-1.14-dB insertion loss) for a silicon-to-silicon coupler with a uniform tunnel layer. The coupling efficiency is experimentally achieved to be 46% (-3.4-dB insertion loss), excluding the loss in silicon and the reflections from the input surface and the output facet.
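
The paired efficiency and insertion-loss figures in this record follow from the standard decibel conversion for power ratios. A minimal sketch, with hypothetical function names, that reproduces the quoted pairs:

```python
import math


def efficiency_to_db(efficiency: float) -> float:
    """Express a power coupling efficiency (0 < efficiency <= 1) in dB."""
    return 10.0 * math.log10(efficiency)


def db_to_efficiency(level_db: float) -> float:
    """Inverse conversion: dB level back to a linear power efficiency."""
    return 10.0 ** (level_db / 10.0)


# Efficiencies quoted in the abstract: 77% (calculated) and 46% (measured).
print(f"77% -> {efficiency_to_db(0.77):.2f} dB")  # prints "77% -> -1.14 dB"
print(f"46% -> {efficiency_to_db(0.46):.1f} dB")  # prints "46% -> -3.4 dB"
```

The sign convention here matches the abstract, which reports insertion loss as a negative dB value.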

  14. Linearized electrooptic polymeric directional coupler modulator

    NASA Astrophysics Data System (ADS)

    Hung, Yu-Chueh

    External linearized modulators are required in high-performance analog optical communication systems since the performance of conventional modulators, such as Mach-Zehnder modulators, is degraded by distortions caused by the nonlinearity of their transfer functions. Various linearization schemes have been proposed to increase the dynamic range of an analog optical link. Most of the optical schemes involve multiple Mach-Zehnder modulators, either in parallel or series configuration, combined with strict balance of RF and bias control. This is a significant challenge when it comes to practical implementation. In this dissertation, a linearized two-section directional coupler modulator made from electrooptic polymer is presented. The coupling coefficient of each section is tailored by properly tuning the refractive index contrast, which can be easily done using the photobleaching technique in polymer technology. A two-tone test was performed to evaluate the linearity of the modulator, and the spur-free dynamic range (SFDR) shows a 7.5 dB improvement compared to a conventional Mach-Zehnder modulator. This scheme avoids multiple modulators or complicated modulation synchronization and demonstrates a compact design in real implementation. Most of the linearization schemes to date consider only the direct detection mode of operation. However, the RF output characteristics at the detection side are determined differently by various system parameters if a coherent link is implemented instead. Therefore, different considerations of linearization have to be examined for this kind of application. In the second part of this dissertation, the impact of various modulation scenarios on the system performance of an analog coherent optical link is addressed. It is shown that a directional coupler modulator is better suited to increasing the dynamic range in coherent optical links. Specific designs of a directional coupler modulator show an SFDR improvement of 20 dB compared

  15. Inexpensive 3dB coupler for POF communication by injection-molding production

    NASA Astrophysics Data System (ADS)

    Haupt, M.; Fischer, U. H. P.

    2011-01-01

    POFs (polymer optical fibers) gradually replace traditional communication media such as copper and glass within short distance communication systems. Primarily, this is due to their cost-effectiveness and easy handling. POFs are used in various fields of optical communication, e.g. the automotive sector or in-house communication. So far, however, only a few key components for a POF communication network are available. Even basic components, such as splices and couplers, are fabricated manually. Therefore, these circumstances result in high costs and fluctuations in components' performance. Available couplers have high insertion losses due to their manufacturing method. This can only be compensated by higher power budgets. In order to produce couplers with higher performances new fabrication methods are indispensable. A cheap and effective way to produce couplers for POF communication systems is injection molding. The paper gives an overview of couplers available on market, compares their performances, and shows a way to produce couplers by means of injection molding.

  16. The comparison of two methods to manufacture fused biconical tapered optical fiber coupler

    NASA Astrophysics Data System (ADS)

    Wang, Yue; Liu, Hairong

    2009-08-01

    The optical fiber coupler is a directional coupler that is a crucial component of optical fiber communication systems. The fused biconical taper is the most important method in the manufacture of optical fiber couplers, with the advantages of low excess loss, precise coupling ratio, good consistency and stability. In this paper we introduce a new method to manufacture optical fiber couplers and compare the new manufacturing process with the traditional one. In the traditional process, two optical fibers are placed in parallel and then tied together in a knot. In the new process, a new fiber placement scheme is introduced. Two optical fibers are placed in parallel in the middle of the fixture, and then, to bring the bare parts of the fibers as close together as possible, a high-temperature-resistant material is used to bind both ends of the fibers where the cladding has not been removed. Contrast tests show that with the improved fiber placement method, the variation of optical power in the directional arm and the coupler arm during fiber pulling is smoother and steadier. However, the excess loss (EL) generated during pulling is slightly higher than with the traditional knot-tying method. The tests show that the new fiber placement method is feasible in practice for manufacturing couplers with a low coupling ratio, but control of the EL needs further study.

  17. Directional coupler based on an elliptic cylindrical nanowire hybrid plasmonic waveguide.

    PubMed

    Zeng, Dezheng; Zhang, Li; Xiong, Qiulin; Ma, Junxian

    2018-06-01

    We present what we believe is a novel directional coupler based on an elliptic cylindrical nanowire hybrid plasmonic waveguide. Using the finite element method, the electric field distributions of y-polarized symmetric and antisymmetric modes of the coupler are compared, and the coupling and transmission characteristics are analyzed; then the optimized separation distance between the two parallel waveguides, 100 nm, is obtained. This optimized architecture fits in the weak coupling regime. Furthermore, the energy transfer is studied, and the performances of the directional coupler are evaluated, including excess loss, coupling degree, and directionality. The results show that when the separation distance is set to 100 nm, the coupling length reaches the shorter value of 1.646 μm, and the propagation loss is as low as 0.076 dB/μm, and the maximum energy transfer can reach 80%. The proposed directional coupler features good energy confinement, ultracompact and low propagation loss, which has potential application in dense photonic-integrated circuits and other photonic devices.
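
To put the two headline figures of this record side by side, the propagation loss accumulated over one coupling length can be estimated by multiplying the loss per unit length by the length. This is only a rough sketch: it assumes the quoted 0.076 dB/μm applies uniformly along the 1.646 μm coupling length, which the abstract does not state explicitly, and the function names are hypothetical.

```python
def accumulated_loss_db(loss_db_per_um: float, length_um: float) -> float:
    """Propagation loss in dB over a given length, assuming uniform loss."""
    return loss_db_per_um * length_um


def transmitted_fraction(loss_db: float) -> float:
    """Linear power fraction that survives a given loss in dB."""
    return 10.0 ** (-loss_db / 10.0)


# Values from the abstract: 0.076 dB/um propagation loss, 1.646 um coupling length.
loss = accumulated_loss_db(0.076, 1.646)
print(f"loss over one coupling length: {loss:.3f} dB")
print(f"power surviving propagation loss: {100.0 * transmitted_fraction(loss):.1f} %")
```

This estimate covers propagation loss only; it does not by itself account for the 80% maximum energy transfer reported in the abstract, which depends on the coupling behaviour as well.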

  18. High Performance Parallel Computational Nanotechnology

    NASA Technical Reports Server (NTRS)

    Saini, Subhash; Craw, James M. (Technical Monitor)

    1995-01-01

    At a recent press conference, NASA Administrator Dan Goldin encouraged NASA Ames Research Center to take a lead role in promoting research and development of advanced, high-performance computer technology, including nanotechnology. Manufacturers of leading-edge microprocessors currently perform large-scale simulations in the design and verification of semiconductor devices and microprocessors. Recently, the need for this intensive simulation and modeling analysis has greatly increased, due in part to the ever-increasing complexity of these devices, as well as the lessons of experiences such as the Pentium fiasco. Simulation, modeling, testing, and validation will be even more important for designing molecular computers because of the complex specification of millions of atoms, thousands of assembly steps, as well as the simulation and modeling needed to ensure reliable, robust and efficient fabrication of the molecular devices. The software for this capacity does not exist today, but it can be extrapolated from the software currently used in molecular modeling for other applications: semi-empirical methods, ab initio methods, self-consistent field methods, Hartree-Fock methods, molecular mechanics; and simulation methods for diamondoid structures. In as much as it seems clear that the application of such methods in nanotechnology will require powerful, highly powerful systems, this talk will discuss techniques and issues for performing these types of computations on parallel systems. We will describe system design issues (memory, I/O, mass storage, operating system requirements, special user interface issues, interconnects, bandwidths, and programming languages) involved in parallel methods for scalable classical, semiclassical, quantum, molecular mechanics, and continuum models; molecular nanotechnology computer-aided designs (NanoCAD) techniques; visualization using virtual reality techniques of structural models and assembly sequences; software required to

  19. 49 CFR 179.14 - Coupler vertical restraint system.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... mating coupler (or simulated coupler) having only frictional vertical force resistance at the mating interface; or a mating coupler (or simulated coupler) having the capabilities described in paragraph (a) of this section; (2) The testing apparatus shall simulate the vertical coupler performance at the mating...

  20. Silicon-based highly-efficient fiber-to-waveguide coupler for high index contrast systems

    NASA Astrophysics Data System (ADS)

    Nguyen, Victor; Montalbo, Trisha; Manolatou, Christina; Agarwal, Anu; Hong, Ching-yin; Yasaitis, John; Kimerling, L. C.; Michel, Jurgen

    2006-02-01

    A coupler to efficiently transfer broadband light from a single-mode optical fiber to a single-mode high-index contrast waveguide has been fabricated on a silicon substrate. We utilized a novel coupling scheme, with a vertically asymmetric design consisting of a stepwise parabolic graded index profile combined with a horizontal taper, to simultaneously confine light in both directions. Coupling efficiency has been measured as a function of the device dimensions. The optimal coupling efficiency is achieved for structures whose length equals the focal distance of the graded index and whose input width is close to the mode field diameter of the fiber. The fabricated structure is compact, robust and highly efficient, with an insertion loss of 2.2 dB at 1550 nm. The coupler exhibits less than 1 dB variation in coupling efficiency in the measured spectral range from 1520 nm to 1620 nm. The lowest insertion loss of 1.9 dB is measured at 1540 nm. The coupler design offers highly efficient coupling for single mode waveguides of core indices up to 2.2.

  1. Full scale tank car coupler impact tests

    DOT National Transportation Integrated Search

    2003-11-15

    Full scale tests were performed to investigate various aspects of tank car behavior during coupler impacts. A tank car was equipped with 37 accelerometers and an instrumented coupler. Two series of full scale coupler impact tests, comprising ...

  2. Global Magnetohydrodynamic Simulation Using High Performance FORTRAN on Parallel Computers

    NASA Astrophysics Data System (ADS)

    Ogino, T.

    High Performance Fortran (HPF) is one of the modern and common techniques for achieving high performance parallel computation. We have translated a 3-dimensional magnetohydrodynamic (MHD) simulation code of the Earth's magnetosphere from VPP Fortran to HPF/JA on the Fujitsu VPP5000/56 vector-parallel supercomputer; the MHD code was fully vectorized and fully parallelized in VPP Fortran. The overall performance and capability of the HPF MHD code were shown to be almost comparable to those of the VPP Fortran code. A 3-dimensional global MHD simulation of the Earth's magnetosphere was performed at a speed of over 400 Gflops with an efficiency of 76.5% on the VPP5000/56 in vector and parallel computation that permitted comparison with catalog values. We have concluded that fluid and MHD codes that are fully vectorized and fully parallelized in VPP Fortran can be translated with relative ease to HPF/JA, and a code in HPF/JA may be expected to perform comparably to the same code written in VPP Fortran.

  3. Substrate integrated waveguide (SIW) 3 dB coupler for K-Band applications

    NASA Astrophysics Data System (ADS)

    Khalid, Nurehansafwanah; Zuraidah Ibrahim, Siti; Wee, Fwen Hoon; Shazuani Mahmud, Farah

    2017-11-01

    This paper presents a coupler designed on Rogers RO4003C with thickness (h) 0.508 mm and relative permittivity (ɛr) 3.55. The four-port network coupler operates in K-band (18-27 GHz) and is designed using the substrate integrated waveguide (SIW) method. The reflection coefficient and isolation coefficient of the proposed SIW coupler are below -10 dB. The coupler is also required to provide a 90° phase shift between the coupled port and the output. SIWs are high performance broadband interconnects with excellent immunity to electromagnetic interference and are suitable for use in microwave and communication electronics, as well as in systems with increased bandwidth. The coupler designs are investigated using the CST Microwave Studio simulation tool. The parameters of the proposed couplers are varied to cover the frequency range 21-24 GHz and to give better scattering parameter (S-parameter) performance.

  4. Development of fundamental power coupler for C-ADS superconducting elliptical cavities

    NASA Astrophysics Data System (ADS)

    Gu, Kui-Xiang; Bing, Feng; Pan, Wei-Min; Huang, Tong-Ming; Ma, Qiang; Meng, Fan-Bo

    2017-06-01

    5-cell elliptical cavities have been selected for the main linac of the China Accelerator Driven sub-critical System (C-ADS) in the medium energy section. According to the design, each cavity should be driven with radio frequency (RF) energy up to 150 kW by a fundamental power coupler (FPC). As the cavities work with high quality factor and high accelerating gradient, the coupler should keep the cavity from contamination in the assembly procedure. To fulfil the requirements, a single-window coaxial type coupler was designed with the capabilities of handling high RF power, class 10 clean room assembly, and heat load control. This paper presents the coupler design and gives details of RF design, heat load optimization and thermal analysis as well as multipacting simulations. In addition, a primary high power test has been performed and is described in this paper. Supported by China ADS Project (XDA03020000) and National Natural Science Foundation of China (11475203)

  5. Development and performance of a new version of the OASIS coupler, OASIS3-MCT_3.0

    NASA Astrophysics Data System (ADS)

    Craig, Anthony; Valcke, Sophie; Coquart, Laure

    2017-09-01

    OASIS is coupling software developed primarily for use in the climate community. It provides the ability to couple different models with low implementation and performance overhead. OASIS3-MCT is the latest version of OASIS. It includes several improvements compared to OASIS3, including elimination of a separate hub coupler process, parallelization of the coupling communication and run-time grid interpolation, and the ability to easily reuse mapping weight files. OASIS3-MCT_3.0 is the latest release and includes the ability to couple between components running sequentially on the same set of tasks as well as to couple within a single component between different grids or decompositions such as physics, dynamics, and I/O. OASIS3-MCT has been tested with different configurations on up to 32 000 processes, with components running on high-resolution grids with up to 1.5 million grid cells, and with over 10 000 2-D coupling fields. Several new features will be available in OASIS3-MCT_4.0, and some of those are also described.

  6. A high performance parallel algorithm for 1-D FFT

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Agarwal, R.C.; Gustavson, F.G.; Zubair, M.

    1994-12-31

    In this paper the authors propose a parallel high performance FFT algorithm based on a multi-dimensional formulation. They use this to solve a commonly encountered FFT based kernel on a distributed memory parallel machine, the IBM scalable parallel system, SP1. The kernel requires a forward FFT computation of an input sequence, multiplication of the transformed data by a coefficient array, and finally an inverse FFT computation of the resultant data. They show that the multi-dimensional formulation helps in reducing the communication costs and also improves the single node performance by effectively utilizing the memory system of the node. They implemented this kernel on the IBM SP1 and observed a performance of 1.25 GFLOPS on a 64-node machine.
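
The kernel described in this record (forward FFT, pointwise multiplication by a coefficient array, inverse FFT) and the idea of a multi-dimensional formulation can be illustrated serially. The sketch below is not the authors' parallel algorithm: it uses the textbook index-splitting decomposition of a length n1*n2 transform into short transforms plus twiddle factors, with a naive DFT as the building block, and all function names are hypothetical.

```python
import cmath


def dft(x, sign=-1):
    """Naive O(n^2) DFT. sign=-1 is the forward transform, sign=+1 the unscaled inverse."""
    n = len(x)
    return [sum(x[j] * cmath.exp(sign * 2j * cmath.pi * j * k / n) for j in range(n))
            for k in range(n)]


def dft_two_dimensional(x, n1, n2, sign=-1):
    """Length n1*n2 DFT computed through an n1 x n2 (two-dimensional) view of the data.

    Input index j = j1*n2 + j2, output index k = k1 + n1*k2.
    """
    n = n1 * n2
    assert len(x) == n
    # Step 1: for each j2, a length-n1 DFT over j1 (stride-n2 samples).
    cols = [dft([x[j1 * n2 + j2] for j1 in range(n1)], sign) for j2 in range(n2)]
    # Step 2: multiply by twiddle factors exp(sign * 2*pi*i * j2*k1 / n).
    twiddled = [[cols[j2][k1] * cmath.exp(sign * 2j * cmath.pi * j2 * k1 / n)
                 for j2 in range(n2)] for k1 in range(n1)]
    # Step 3: for each k1, a length-n2 DFT over j2.
    rows = [dft(twiddled[k1], sign) for k1 in range(n1)]
    # Step 4: scatter results to their output positions k = k1 + n1*k2.
    out = [0j] * n
    for k1 in range(n1):
        for k2 in range(n2):
            out[k1 + n1 * k2] = rows[k1][k2]
    return out


def fft_kernel(x, coeffs, n1, n2):
    """Forward transform, pointwise multiply by coeffs, then scaled inverse transform."""
    spectrum = dft_two_dimensional(x, n1, n2, sign=-1)
    scaled = [s * c for s, c in zip(spectrum, coeffs)]
    return [v / (n1 * n2) for v in dft_two_dimensional(scaled, n1, n2, sign=+1)]


signal = [complex(i, -i) for i in range(12)]
identity = fft_kernel(signal, [1.0] * 12, 3, 4)  # all-ones coefficients return the input
print(max(abs(a - b) for a, b in zip(signal, identity)) < 1e-9)  # prints "True"
```

In a distributed setting the short transforms in steps 1 and 3 are independent of each other, which is what makes this kind of formulation attractive for reducing communication; how the authors map it onto the SP1 is not described in the abstract.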

  7. Twist-induced tuning in tapered fiber couplers.

    PubMed

    Birks, T A

    1989-10-01

    The power-splitting ratio of fused tapered single-mode fiber couplers can be reversibly tuned by axial twisting without affecting loss. The twist-tuning behavior of a range of different tapered couplers is described. A simple expression for twist-tuning can be derived by representing the effects of twist by a change in the refractive index profile. Good agreement between this expression and experimental results is demonstrated. Repeated tuning over tens of thousands of cycles is found not to degrade coupler performance, and a number of practical applications, including a freely tunable tapered coupler, are described.

  8. RF Conditioning and Testing of Fundamental Power Couplers for SNS Superconducting Cavity Production

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    M. Stirbet; G.K. Davis; M. A. Drury

    The Spallation Neutron Source (SNS) makes use of 33 medium beta (0.61) and 48 high beta (0.81) superconducting cavities. Each cavity is equipped with a fundamental power coupler, which should withstand the full klystron power of 550 kW in full reflection for the duration of an RF pulse of 1.3 msec at 60 Hz repetition rate. Before assembly to a superconducting cavity, the vacuum components of the coupler are submitted to acceptance procedures consisting of preliminary quality assessments, cleaning and clean room assembly, vacuum leak checks and baking under vacuum, followed by conditioning and RF high power testing. Similar acceptance procedures (except clean room assembly and baking) were applied for the airside components of the coupler. All 81 fundamental power couplers for SNS superconducting cavity production have been RF power tested at JLAB Newport News and, beginning in April 2004, at SNS Oak Ridge. This paper gives details of coupler processing and RF high power-assessed performances.

  9. Analysis of the car body stability performance after coupler jack-knifing during braking

    NASA Astrophysics Data System (ADS)

    Guo, Lirong; Wang, Kaiyun; Chen, Zaigang; Shi, Zhiyong; Lv, Kaikai; Ji, Tiancheng

    2018-06-01

    This paper aims to improve car body stability performance by optimising locomotive parameters when coupler jack-knifing occurs during braking. In order to prevent car body instability behaviour caused by coupler jack-knifing, a multi-locomotive simulation model and a series of field braking tests are developed to analyse the influence of the secondary suspension and the secondary lateral stopper on the car body stability performance during braking. According to simulation and test results, increasing secondary lateral stiffness helps to limit the car body yaw angle during braking. However, it seriously affects the dynamic performance of the locomotive. For the secondary lateral stopper, its lateral stiffness and free clearance have a significant influence on improving the car body stability capacity, and have less effect on the dynamic performance of the locomotive. An optimised measure was proposed and adopted on the test locomotive. For the optimised locomotive, the lateral stiffness of the secondary lateral stopper is increased to 7875 kN/m, while its free clearance is decreased to 10 mm. The optimised locomotive has excellent dynamic and safety performance. Compared with the original locomotive, the maximum car body yaw angle and coupler rotation angle of the optimised locomotive were reduced by 59.25% and 53.19%, respectively, according to the practical application. The maximum derailment coefficient was 0.32, and the maximum wheelset lateral force was 39.5 kN. Hence, reasonable parameters of the secondary lateral stopper can improve the car body stability capacity and the running safety of the heavy haul locomotive.

  10. Mid-IR fused fiber couplers

    NASA Astrophysics Data System (ADS)

    Stevens, G.; Woodbridge, T.

    2016-03-01

    We present results from our recent efforts on developing single-mode fused couplers in ZBLAN fibre. We have developed a custom fusion workstation for working with lower melting temperature fibres, such as ZBLAN and chalcogenide fibres. Our workstation uses a precisely controlled electrical heater designed to operate at temperatures between 100 - 250°C as our heat source. The heated region of the fibres was also placed in an inert atmosphere to avoid the formation of microcrystal inclusions during fusion. We first developed a process for pulling adiabatic tapers in 6/125 μm ZBLAN fibre. The tapers were measured actively during manufacture using a 2000 nm source. The process was automated so that the heater temperature and motor speed automatically adjusted to pull the taper at constant tension. This process was then further developed so that we could fuse and draw two parallel 6/125 μm ZBLAN fibres, forming a single-mode coupler. Low ratio couplers (1-10%) that could be used as power monitors were manufactured that had an excess loss of 0.76 dB. We have also manufactured 50/50 splitters and wavelength division multiplexers (WDMs). However, the excess loss of these devices was typically 2 - 3 dB. The increased losses were due to localised necking and surface defects forming as the tapers were pulled further to achieve a greater coupling ratio. Initial experiments with chalcogenide fibre have shown that our process can be readily adapted for chalcogenide fibres. A 5% coupler with 1.5 dB insertion loss was manufactured using commercial off-the-shelf (COTS) fibres.

  11. Waveguide silicon nitride grating coupler

    NASA Astrophysics Data System (ADS)

    Litvik, Jan; Dolnak, Ivan; Dado, Milan

    2016-12-01

    Grating couplers are among the most widely used elements for coupling light between optical fibers and photonic integrated components. The silicon-on-insulator platform provides strong confinement of light and allows high integration. In this work, using simulations, we have designed a broadband silicon nitride surface grating coupler. The Fourier-eigenmode expansion and finite difference time domain methods are used to optimize the design of the grating coupler structure. The fully etched, single-etch-step grating coupler is based on a standard silicon-on-insulator wafer with a 0.55 μm Si3N4 waveguide layer. The optimized structure at 1550 nm wavelength yields a peak coupling efficiency of -2.6635 dB (54.16%) with a 1-dB bandwidth up to 80 nm. This is a promising route to low-cost fabrication using a complementary metal-oxide-semiconductor fabrication process.
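
    Coupling efficiencies in these records are quoted both in decibels and as percentages. The conversion between the two is sketched below using the standard power-ratio definition; the helper names are hypothetical.

```python
import math


def db_to_fraction(value_db):
    """Convert a power ratio in dB to a linear fraction."""
    return 10 ** (value_db / 10)


def fraction_to_db(fraction):
    """Convert a linear power fraction to dB."""
    return 10 * math.log10(fraction)
```

    For example, `db_to_fraction(-2.6635)` is about 0.5416, consistent with the 54.16% quoted in this abstract.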

  12. Dynamic performance of high speed solenoid valve with parallel coils

    NASA Astrophysics Data System (ADS)

    Kong, Xiaowu; Li, Shizhen

    2014-07-01

    The methods of improving the dynamic performance of a high speed on/off solenoid valve include increasing the magnetic force of the armature and the slew rate of the coil current, and decreasing the mass and stroke of the moving parts. An increase in magnetic force usually leads to a decrease in current slew rate, which can increase the delay time of the dynamic response of the solenoid valve. Using a high voltage to drive the coil can resolve this contradiction, but a high driving voltage also leads to higher cost and reduced safety and reliability. In this paper, a new scheme of parallel coils is investigated, in which the single coil of the solenoid is replaced by parallel coils with the same ampere-turns. Based on the mathematical model of the high speed solenoid valve, the theoretical formula for the delay time of the solenoid valve is deduced. Both the theoretical analysis and the dynamic simulation show that, as far as the delay time of the solenoid valve is concerned, the effect of dividing a single coil into N parallel sub-coils is close to that of driving the single coil with N times the original driving voltage. A specific test bench is designed to measure the dynamic performance of the high speed on/off solenoid valve. The experimental results also show that both the delay time and the switching time of the solenoid valve can be decreased greatly by adopting the parallel coil scheme. This research presents a simple and practical method to improve the dynamic performance of high speed on/off solenoid valves.

  13. Performance of the round window soft coupler for the backward stimulation of the cochlea in a temporal bone model.

    PubMed

    Gostian, Antoniu-Oreste; Schwarz, David; Mandt, Philipp; Anagiotos, Andreas; Ortmann, Magdalene; Pazen, David; Beutner, Dirk; Hüttenbrink, Karl-Bernd

    2016-11-01

    The round window vibroplasty is a feasible option for the treatment of conductive, sensorineural and mixed hearing loss. Although clinical data suggest a satisfying clinical outcome with various coupling methods, the most efficient technique for coupling the floating mass transducer to the round window is still a matter of debate. For this purpose, a soft silicone coupler has recently been developed that aims to ease and optimize the stimulation of the round window membrane by this middle ear implant. We performed a temporal bone study evaluating the performance of the soft coupler compared to coupling with individually shaped cartilage, perichondrium and the titanium round window coupler with loads up to 20 mN at the unaltered and fully exposed round window niche. The stimulation of the cochlea was measured by the volume velocities of the stapes footplate detected by a laser Doppler vibrometer. The coupling method was found to be a significant factor, with cartilage and perichondrium allowing for the highest volume velocities, followed by the soft and titanium couplers. Exposure of the round window niche allowed for higher volume velocities, while the applied load did not significantly affect the results. The soft coupler allows for good contact with the round window membrane and an effective backward stimulation of the cochlea. Clinical data are mandatory to evaluate the performance of this novel coupling method in vivo.

  14. High Performance Parallel Architectures

    NASA Technical Reports Server (NTRS)

    El-Ghazawi, Tarek; Kaewpijit, Sinthop

    1998-01-01

    Traditional remote sensing instruments are multispectral, where observations are collected at a few different spectral bands. Recently, many hyperspectral instruments, which can collect observations at hundreds of bands, have become operational. Furthermore, there have been ongoing research efforts on ultraspectral instruments that can produce observations at thousands of spectral bands. While these remote sensing technology developments hold great promise for new findings in the area of Earth and space science, they present many challenges. These include the need for faster processing of such increased data volumes, and methods for data reduction. Dimension Reduction is a spectral transformation, aimed at concentrating the vital information and discarding redundant data. One such transformation, which is widely used in remote sensing, is the Principal Components Analysis (PCA). This report summarizes our progress on the development of a parallel PCA and its implementation on two Beowulf cluster configurations: one with a Fast Ethernet switch and the other with a Myrinet interconnection. Details of the implementation and performance results, for typical sets of multispectral and hyperspectral NASA remote sensing data, are presented and analyzed based on the algorithm requirements and the underlying machine configuration. It will be shown that the PCA application is quite challenging and hard to scale on Ethernet-based clusters. However, the measurements also show that a high-performance interconnection network, such as Myrinet, better matches the high communication demand of PCA and can lead to a more efficient PCA execution.
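
    The dimension reduction described in this abstract can be illustrated with a small serial sketch: centre the per-pixel band vectors, form the band covariance matrix, and find its leading eigenvector. Power iteration is used here purely for brevity and is an assumption, not the report's parallel algorithm. All function names are hypothetical, and the covariance matrix is assumed to be non-zero.

```python
import math


def mean_center(pixels):
    """Subtract the per-band mean from every pixel vector."""
    n, bands = len(pixels), len(pixels[0])
    means = [sum(p[b] for p in pixels) / n for b in range(bands)]
    return [[p[b] - means[b] for b in range(bands)] for p in pixels]


def covariance(centered):
    """Sample covariance matrix between spectral bands."""
    n, bands = len(centered), len(centered[0])
    return [[sum(p[i] * p[j] for p in centered) / (n - 1)
             for j in range(bands)] for i in range(bands)]


def leading_component(cov, iterations=200):
    """Approximate the dominant eigenvector by power iteration."""
    bands = len(cov)
    v = [1.0] * bands
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(bands)) for i in range(bands)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


def project(centered, component):
    """Reduce each pixel to its score along one principal component."""
    return [sum(a * b for a, b in zip(p, component)) for p in centered]
```

    In a parallel setting the covariance accumulation is the natural step to distribute, since each node can sum over its own pixels before a global reduction; that communication step is where the interconnect matters.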

  15. Design and experiment of a directional coupler for X-band long pulse high power microwaves.

    PubMed

    Bai, Zhen; Li, Guolin; Zhang, Jun; Jin, Zhenxing

    2013-03-01

    Higher power and longer pulses are the trend in the development of high power microwave (HPM) sources, and measuring the power of HPM becomes problematic because rf breakdown occurs more easily at high power (gigawatt level) and long pulse length (about 100 ns). In order to measure the power of the dominant TM₀₁ mode of an X-band long pulse overmoded HPM source, a directional coupler with stable coupling coefficient, high directivity, and high power handling capacity over a wide band is investigated numerically and experimentally. At the central frequency of 9.4 GHz, the simulation results show that the coupling coefficient is -59.6 dB with a directivity of 35 dB and a power handling capacity of 2 GW. The calibrated coupling coefficient agrees with the simulation results. High power tests are performed on an X-band long pulse HPM source, whose output mode is mainly the TM₀₁ mode, and the results show that the power and waveform measured by the directional coupler are in good agreement with the far-field measurement results.

  16. High-directionality fiber-chip grating coupler with interleaved trenches and subwavelength index-matching structure.

    PubMed

    Benedikovic, Daniel; Alonso-Ramos, Carlos; Cheben, Pavel; Schmid, Jens H; Wang, Shurui; Xu, Dan-Xia; Lapointe, Jean; Janz, Siegfried; Halir, Robert; Ortega-Moñux, Alejandro; Wangüemert-Pérez, J Gonzalo; Molina-Fernández, Iñigo; Fédéli, Jean-Marc; Vivien, Laurent; Dado, Milan

    2015-09-15

    We present the first experimental demonstration of a new fiber-chip grating coupler concept that exploits the blazing effect by interleaving the standard full (220 nm) and shallow etch (70 nm) trenches in a 220 nm thick silicon layer. The high directionality is obtained by controlling the separation between the deep and shallow trenches to achieve constructive interference in the upward direction and destructive interference toward the silicon substrate. Utilizing this concept, the grating directionality can be maximized independent of the bottom oxide thickness. The coupler also includes a subwavelength-engineered index-matching region, designed to reduce the reflectivity at the interface between the injection waveguide and the grating. We report a measured fiber-chip coupling efficiency of -1.3 dB, the highest coupling efficiency achieved to date for a surface grating coupler in a 220 nm silicon-on-insulator platform fabricated in a conventional dual-etch process without high-index overlays or bottom mirrors.

  17. Performance improvement of optical fiber coupler with electric heating versus gas heating.

    PubMed

    Shuai, Cijun; Gao, Chengde; Nie, Yi; Peng, Shuping

    2010-08-20

    Gas heating has been widely used in the process of fused biconical tapering. However, the instability and asymmetric flame temperature of gas heating degrade the performance of optical devices fabricated by this method. To overcome the problems resulting from gas combustion, an electric heater is designed and manufactured using a metal-ceramic (MoSi2) as the heating material. Our experimental data show that the fused-taper machine with an electric heater has improved the performance of optical devices by increasing the consistency of the extinction ratio, excess loss, and the splitting ratio over that of the previous gas heating mode. Microcrystallizations and microcracks were observed at the fused region of the polarization-maintaining (PM) fiber coupler and at the taper region with scanning electron microscopy and atomic force microscopy, respectively. The distribution of the microcrystallizations and microcracks is nonuniform along the fiber with gas heating, while their distribution is rather uniform with electric heating. These findings show that the novel optical fiber coupler with an electric heater has improved the performance of optical fiber devices by affecting the consistency of the optical parameters and micromorphology of the surface of PM fiber.

  18. Effect of external index of refraction on multimode fiber couplers.

    PubMed

    Wang, G Z; Murphy, K A; Claus, R O

    1995-12-20

    The dependence of the performance of fused-taper multimode fiber couplers on the refractive index of the material surrounding the taper region has been investigated both theoretically and experimentally. It has been identified that for a 2 × 2 multimode fiber coupler there is a range of output-power-coupling ratios for which the effect of the external refractive index is negligible. When the coupler is tapered beyond this region, the performance becomes dependent on the external index of refraction and lossy. To analyze the multimode coupler-loss mechanism, we develop a two-dimensional ray-optics model that incorporates trapped cladding-mode loss and core-mode loss through frustrated total internal reflection. Computer-simulation results support the experimental observations. Related issues such as coupler fabrication and packaging are also discussed.

  19. Nonuniform transmission line codirectional couplers for hybrid MIMIC and superconductive applications

    NASA Astrophysics Data System (ADS)

    Uysal, Sener; Turner, Charles W.; Watkins, John

    1994-03-01

    A new design approach for thin-film codirectional quadrature couplers and their applications is described. An in-depth analysis and semi-empirical design curves are presented for these couplers. Forward-wave coupling is achieved by making use of the difference between even- and odd-mode phase velocities. Modified nonuniform codirectional couplers with a dummy channel for continuously decreasing or increasing taper and employing wiggly, serpentined and smooth coupled edges have been designed and tested. It is found that a wiggly coupler can achieve a 50% length reduction compared to a smooth-edge coupler. A further 60% length reduction compared to a wiggly coupler is achieved by a serpentine coupler. Coupler performance for wiggly and serpentined configurations is computed by choosing a realizable phase velocity function for a given coupler length. Either constant 90° or -90° phase shift is possible with these couplers, giving significant design flexibility in some applications. The results for a Ku-band Sigma-Delta Magic-T circuit employing a 0 dB wiggly coupler and a -3 dB smooth-edge coupler are also presented.

  20. 49 CFR 215.123 - Defective couplers.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ..., DEPARTMENT OF TRANSPORTATION RAILROAD FREIGHT CAR SAFETY STANDARDS Freight Car Components Draft System § 215.123 Defective couplers. A railroad may not place or continue in service a car, if— (a) The car is... automatically with the adjacent car; (b) The car has a coupler that has a crack in the highly stressed junction...

  1. 49 CFR 215.123 - Defective couplers.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ..., DEPARTMENT OF TRANSPORTATION RAILROAD FREIGHT CAR SAFETY STANDARDS Freight Car Components Draft System § 215.123 Defective couplers. A railroad may not place or continue in service a car, if— (a) The car is... automatically with the adjacent car; (b) The car has a coupler that has a crack in the highly stressed junction...

  2. 49 CFR 215.123 - Defective couplers.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ..., DEPARTMENT OF TRANSPORTATION RAILROAD FREIGHT CAR SAFETY STANDARDS Freight Car Components Draft System § 215.123 Defective couplers. A railroad may not place or continue in service a car, if— (a) The car is... automatically with the adjacent car; (b) The car has a coupler that has a crack in the highly stressed junction...

  3. An Interdigitated Coupler with Defect Ground Structure

    DTIC Science & Technology

    2015-07-01

    branch-line coupler. In [8], DGS is used to microstrip forward-wave coupler for size-reduction. In fact, DGS have been widely used from the concept put...substantially. REFERENCE [1] Bialkowski M E, Seman N, Leong M S. Design of a compact ultra wideband 3 dB microstrip-slot coupler with high return losses and...Pozar D M. Microwave engineering. John Wiley & Sons, 2009. [4] You S J, Liao W. A multi-layer coupled-line power divider. Antennas, Propagation and EM

  4. Design of compact surface optical coupler based on vertically curved silicon waveguide for high-numerical-aperture single-mode optical fiber

    NASA Astrophysics Data System (ADS)

    Atsumi, Yuki; Yoshida, Tomoya; Omoda, Emiko; Sakakibara, Youichi

    2017-09-01

    A surface optical coupler based on a vertically curved Si waveguide was designed for coupling with high-numerical aperture single-mode optical fibers with a mode-field diameter of 5 µm. This coupler has a quite small device size, with a height of approximately 12 µm, achieved by introducing an effective spot-size converter configured with the combination of an extremely short Si exponential-inverse taper and a dome-structured SiO2 lens formed on the coupler top. The designed coupler shows high-efficiency optical coupling, with a loss of 0.8 dB for TE polarized light, as well as broad-band coupling with a 0.5-dB-loss band of 420 nm.

  5. Seismic Performance of Columns with Grouted Couplers in Idaho Accelerated Bridge Construction Applications

    DOT National Transportation Integrated Search

    2016-10-16

    In Accelerated Bridge Construction (ABC) methods, one way to connect prefabricated columns is by using grouted steel bar couplers. As of October 2016, in the U.S., only Utah DOT allows the use of grouted couplers in plastic hinge locations in seismic ...

  6. Laser-To-Fibre Couplers In Optical Recording Applications

    NASA Astrophysics Data System (ADS)

    Ophey, W. G.; Benschop, J. P. H.

    1988-02-01

    In optical recording, the use of single-mode fibres can considerably increase the coupling efficiency of the laser light into the light path. Important here is the performance of the laser-to-fibre coupler used. A mathematical treatment of different kinds of laser-to-fibre couplers is presented using scalar diffraction theory in order to obtain the field incident on the front end of the fibre. In this case the coupling efficiency of a laser-to-fibre coupler, using an aberrated light source (astigmatism) with an asymmetric far-field pattern, can easily be calculated.

  7. SOI ring resonators with controllable MMI coupler sections

    NASA Astrophysics Data System (ADS)

    Hu, Youfang; Gardes, Frédéric Y.; Mashanovich, Goran Z.; Reed, Graham T.

    2011-01-01

    A ring resonator using a single 2×2 MMI as the coupler section has the distinct advantages of low sensitivity to fabrication error, temperature, wavelength and polarisation. However, the coupling coefficient of the 2×2 MMI coupler is fixed; hence, the performance of this type of device is limited, e.g. a transmission spectrum with high extinction ratio is difficult to achieve. We have designed and simulated ring resonators with coupler sections consisting of two 2×2 MMIs and phase shifters, so that the coupling efficiency can be varied from 0% to 100% with relative ease. For a single ring resonator, the transmission spectrum can be controlled to achieve an extinction ratio of >20 dB and a spectral bandwidth of <1 nm. For a multiple ring filter, the transmission spectrum can be controlled to achieve an extinction ratio of >30 dB and a bandwidth of <1 nm; in addition, a flat-top transmission spectrum is also achievable. The whole device has a footprint of approximately 200 μm by 100 μm.
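
    The way two fixed 2×2 couplers plus a phase shifter give a coupling ratio tunable from 0% to 100% can be sketched with a simple transfer-matrix model. This is a minimal illustration that assumes ideal, lossless 50/50 couplers with the transfer matrix (1/√2)[[1, j], [j, 1]] and a single phase shifter on one arm; the exact arrangement in the paper is not given in the abstract, and the function name is hypothetical.

```python
import cmath
import math


def tunable_coupler_split(phase_shift_rad):
    """Return (bar_power, cross_power) for light entering port 1.

    Models 50/50 coupler -> phase shifter on arm 1 -> 50/50 coupler.
    """
    s = 1 / math.sqrt(2)
    # First 50/50 coupler: unit amplitude enters port 1 only.
    arm1, arm2 = s, 1j * s
    # Phase shifter acts on arm 1.
    arm1 *= cmath.exp(1j * phase_shift_rad)
    # Second 50/50 coupler recombines the two arms.
    bar = s * (arm1 + 1j * arm2)
    cross = s * (1j * arm1 + arm2)
    return abs(bar) ** 2, abs(cross) ** 2
```

    Under these assumptions the cross-port power follows cos²(φ/2): a phase shift of 0 sends all power to the cross port, π sends all of it to the bar port, and π/2 gives an even split.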

  8. Mitigation of multipacting, enhanced by gas condensation on the high power input coupler of a superconducting RF module, by comprehensive warm aging

    NASA Astrophysics Data System (ADS)

    Wang, Chaoen; Chang, Lung-Hai; Chang, Mei-Hsia; Chen, Ling-Jhen; Chung, Fu-Tsai; Lin, Ming-Chyuan; Liu, Zong-Kai; Lo, Chih-Hung; Tsai, Chi-Lin; Yeh, Meng-Shu; Yu, Tsung-Chi

    2017-11-01

    Excitation of multipacting, enhanced by gas condensation on cold surfaces of the high power input coupler in a SRF module, poses the highest challenge for reliable SRF operation under high average RF power. This could prevent the light source SRF module from being operated with a desired high beam current. Off-line long-term reliability tests have been conducted for the newly constructed 500-MHz SRF KEKB type modules at an accelerating RF voltage of 1.6 MV to enable prediction of their operational reliability in the 3-GeV Taiwan Photon Source (TPS), since prediction from mere production performance by conventional horizontal test is presently unreliable. As expected, operational difficulties resulting from multipacting, enhanced by gas condensation, have been identified in the course of the long-term reliability test. Our present hypothesis is that gas condensation can be slowed down by preserving the vacuum pressure at the power coupler close to that reached just after its cool down to liquid helium temperatures. This is achievable by reduction of the power coupler out-gassing rate through comprehensive warm aging. Its feasibility and effectiveness have been experimentally verified in a second long-term reliability test. Our success opens the possibility to operate the SRF module free of multipacting trouble and opens a new direction to improve the operational performance of next generation SRF modules in light sources with high beam currents.

  9. Inverse design of near unity efficiency perfectly vertical grating couplers.

    PubMed

    Michaels, Andrew; Yablonovitch, Eli

    2018-02-19

    Efficient coupling between integrated optical waveguides and optical fibers is essential to the success of silicon photonics. While many solutions exist, perfectly vertical grating couplers that scatter light out of a waveguide in the direction normal to the waveguide's top surface are an ideal candidate due to their potential to reduce packaging complexity. Designing such couplers with high efficiencies, however, has proven difficult. In this paper, we use inverse electromagnetic design techniques to optimize a high efficiency two-layer perfectly vertical silicon grating coupler. Our base design achieves a chip-to-fiber coupling efficiency of 99.2% (-0.035 dB) at 1550 nm. Using this base design as a starting point, we run subsequent constrained optimizations to realize vertical couplers with coupling efficiencies over 96% and back reflections of less than -40 dB which can be fabricated using 65 nm-resolution lithography. These results demonstrate a new path forward for designing fabrication-tolerant ultra high efficiency grating couplers.

  10. Very short intracavity directional coupler for high-speed communication

    NASA Astrophysics Data System (ADS)

    Griffel, Giora

    1993-07-01

    We propose a novel intracavity modulator/switch that consists of a directional-coupler located inside a Fabry-Perot cavity. The back mirror of the cavity has a unit reflectivity so that both input and output signals are at the same side. In this way we obtain a two-port, single side element, with coupling length of 83.5 μm, which is the shortest modulation coupler proposed so far. The upper frequency limit due to photon lifetime is 275 GHz, which is well over the bandwidth constraints of microwave lumped structures. A unified approach for the analysis of this device and other similar structures is presented and discussed.

  11. Thin-Ribbon Tapered Couplers For Dielectric Waveguides

    NASA Technical Reports Server (NTRS)

    Otoshi, Tom Y.; Shimabukuro, Fred I.; Yeh, Cavour

    1996-01-01

    Thin-ribbon tapered couplers proposed for launching electromagnetic waves into dielectric waveguides, which include optical fibers. Intended for use with ribbon dielectric waveguides designed for operation at millimeter or submillimeter wavelengths, made of high-relative-permittivity, low-loss materials and thicknesses comparable to or less than free-space design wavelengths. Coupling efficiencies exceed those of older tapered couplers.

  12. Directional multimode coupler for planar magnonics: Side-coupled magnetic stripes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sadovnikov, A. V., E-mail: sadovnikovav@gmail.com; Nikitov, S. A.; Kotel'nikov Institute of Radioengineering and Electronics, Russian Academy of Sciences, Moscow 125009

    We experimentally demonstrate spin-wave coupling in two laterally adjacent magnetic stripes. By means of Brillouin light scattering spectroscopy, we show that the coupling efficiency depends both on the magnonic waveguides' geometry and the characteristics of spin-wave modes. In particular, the lateral confinement of coupled yttrium-iron-garnet stripes enables control over the spin-wave propagation characteristics. Numerical simulations (in time domain and frequency domain) reveal the nature of intermodal coupling between two magnonic stripes. The proposed topology of multimode magnonic coupler can be utilized as a building block for fabrication of integrated parallel functional and logic devices such as the frequency selective directional coupler or tunable splitter, enabling a number of potential applications for planar magnonics.

  13. Multimode Directional Coupler

    NASA Technical Reports Server (NTRS)

    Simons, Rainee N. (Inventor); Wintucky, Edwin G. (Inventor)

    2016-01-01

    A multimode directional coupler is provided. In some embodiments, the multimode directional coupler is configured to receive a primary signal and a secondary signal at a first port of a primary waveguide. The primary signal is configured to propagate through the primary waveguide and be outputted at a second port of the primary waveguide. The multimode directional coupler also includes a secondary waveguide configured to couple the secondary signal from the primary waveguide with no coupling of the primary signal into the secondary waveguide. The secondary signal is configured to propagate through the secondary waveguide and be outputted from a port of the secondary waveguide.

  14. 30 CFR 77.805 - Cable couplers and connection boxes; minimum design requirements.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 30 Mineral Resources 1 2013-07-01 2013-07-01 false Cable couplers and connection boxes; minimum... WORK AREAS OF UNDERGROUND COAL MINES Surface High-Voltage Distribution § 77.805 Cable couplers and connection boxes; minimum design requirements. (a)(1) Couplers that are used in medium- or high-voltage power...

  15. 30 CFR 77.805 - Cable couplers and connection boxes; minimum design requirements.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 30 Mineral Resources 1 2014-07-01 2014-07-01 false Cable couplers and connection boxes; minimum... WORK AREAS OF UNDERGROUND COAL MINES Surface High-Voltage Distribution § 77.805 Cable couplers and connection boxes; minimum design requirements. (a)(1) Couplers that are used in medium- or high-voltage power...

  16. 30 CFR 77.805 - Cable couplers and connection boxes; minimum design requirements.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 30 Mineral Resources 1 2012-07-01 2012-07-01 false Cable couplers and connection boxes; minimum... WORK AREAS OF UNDERGROUND COAL MINES Surface High-Voltage Distribution § 77.805 Cable couplers and connection boxes; minimum design requirements. (a)(1) Couplers that are used in medium- or high-voltage power...

  17. A high performance linear equation solver on the VPP500 parallel supercomputer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nakanishi, Makoto; Ina, Hiroshi; Miura, Kenichi

    1994-12-31

    This paper describes the implementation of two high performance linear equation solvers developed for the Fujitsu VPP500, a distributed memory parallel supercomputer system. The solvers take advantage of the key architectural features of VPP500--(1) scalability for an arbitrary number of processors up to 222 processors, (2) flexible data transfer among processors provided by a crossbar interconnection network, (3) vector processing capability on each processor, and (4) overlapped computation and transfer. The general linear equation solver based on the blocked LU decomposition method achieves 120.0 GFLOPS performance with 100 processors in the LINPACK Highly Parallel Computing benchmark.
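
    The LU-based solve at the heart of such a solver can be sketched in a few lines. The version below is a plain, unblocked, single-process LU decomposition with partial pivoting, given only to illustrate the factor-then-substitute structure; it is not the blocked, vectorised, distributed method of the paper, and the function name `lu_solve` is hypothetical.

```python
def lu_solve(a, b):
    """Solve a x = b by in-place LU decomposition with partial pivoting."""
    n = len(a)
    lu = [row[:] for row in a]   # work on copies, leave inputs untouched
    x = list(b)
    for k in range(n):
        # Choose the largest pivot in column k to limit rounding error.
        pivot = max(range(k, n), key=lambda r: abs(lu[r][k]))
        if lu[pivot][k] == 0:
            raise ValueError("matrix is singular")
        lu[k], lu[pivot] = lu[pivot], lu[k]
        x[k], x[pivot] = x[pivot], x[k]
        for i in range(k + 1, n):
            lu[i][k] /= lu[k][k]             # multiplier, stored in L
            for j in range(k + 1, n):
                lu[i][j] -= lu[i][k] * lu[k][j]
    # Forward substitution with the unit lower-triangular factor.
    for i in range(n):
        for j in range(i):
            x[i] -= lu[i][j] * x[j]
    # Back substitution with the upper-triangular factor.
    for i in reversed(range(n)):
        for j in range(i + 1, n):
            x[i] -= lu[i][j] * x[j]
        x[i] /= lu[i][i]
    return x
```

    A blocked variant applies the same elimination to panels of columns and updates the trailing submatrix with matrix-matrix products, which is what makes the method suit vector processors and overlapped data transfer.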

  18. Bidirectional optical coupler for plastic optical fibers.

    PubMed

    Sugita, Tatsuya; Abe, Tomiya; Hirano, Kouki; Itoh, Yuzo

    2005-05-20

    We have developed a low-loss bidirectional optical coupler for high-speed optical communication with plastic optical fibers (POFs). The coupler, which is fabricated by an injection molding method that uses poly (methyl methacrylate), has an antisymmetric tapered shape. We show that the coupler has low insertion and branching losses. The tapered shape of the receiving branch reduces beam diameter and increases detection efficiency coupling to a photodetector, whose area is smaller than that of the plastic optical fiber. The possibility of more than 15-m bidirectional transmission with a signaling bit rate up to 500 Mbits/s for simplex step-index POFs is demonstrated.

  19. A coupler for parasitic mode diagnosis in an X-band triaxial klystron amplifier

    NASA Astrophysics Data System (ADS)

    Zhang, Wei; Ju, Jin-chuan; Zhang, Jun; Qi, Zu-min; Zhong, Hui-huang

    2017-10-01

    The traditional methods of parasitic mode excitation diagnosis in an X-band triaxial klystron amplifier (TKA) meet two difficulties: limited installation space and vacuum sealing. In order to solve these issues, a simple and compact coupler with good sealing performance, which can prevent air flow between the main and the auxiliary waveguides, is proposed and investigated experimentally. The coupler is designed with the aperture diffraction theory and the finite-difference time-domain (FDTD) method. The designed coupler consists of a main coaxial waveguide (for microwave transmission) and a rectangular auxiliary waveguide (for parasitic mode diagnosis). The entire coupler structure has been fabricated from a macromolecular polymer that is transparent to microwave signals in the X-band frequency range. A metal coating of about 200 microns has been applied by electroplating to ensure that the device operates well at high power. A small aperture is made in the metal coating. Hence, microwaves can couple through the hole and the wave-transparent medium, whereas air flow is blocked by the wave-transparent medium. The coupling coefficient is analyzed and simulated with CST software. The coupler model is also included in particle-in-cell (PIC) simulation with CHIPIC software and the associated parasitic mode excitation is studied. A frequency component of 11.46 GHz is observed in the FFT of the electric field of the drift tube and its corresponding competition mode appears as the TE61 mode according to the electric field distribution. Besides, a frequency component of 10.8 GHz is also observed in the FFT of the electric field. After optimization of TE61 mode suppression, an experiment of the TKA with the designed coupler is carried out and the parasitic mode excitation at 10.8 GHz is observed through the designed coupler.

  20. The aging process of optical couplers by gamma irradiation

    NASA Astrophysics Data System (ADS)

    Bednarek, Lukas; Marcinka, Ondrej; Perecar, Frantisek; Papes, Martin; Hajek, Lukas; Nedoma, Jan; Vasinek, Vladimir

    2015-08-01

    Scientists have recently discovered that the ageing process of optical elements is faster than was originally anticipated. This is mostly due to the multiple increases of the optical power in optical components, the introduction of wavelength division multiplexers and, overall, the increased flow of traffic in optical communications. This article examines the ageing process of optical couplers and focuses on their performance parameters. It describes the measurement procedure followed by the evaluation of the measurement results. To accelerate the ageing process, gamma irradiation from 60Co was used. The results of the measurements of the optical coupler with one input and eight outputs (1:8) were summarized. The results gained by measuring the optical coupler with one input and four outputs (1:4) as well as the optical couplers with one input and two outputs (1:2) with different split ratios were also processed. The optical powers were measured on the input and the outputs of each branch of each optical coupler at the wavelengths of 1310 nm and 1550 nm. The parameters of the optical couplers were subsequently calculated according to the appropriate formulas. These parameters were the insertion loss of the individual branches, split ratio, total losses, homogeneity of the losses and directionality (i.e., cross-talk between the individual output branches). The gathered data were summarized before and after the first irradiation when the configuration of the couplers was 1:8 and 1:4. The data were summarized after the third irradiation when the configuration of the couplers was 1:2.
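
    The abstract lists the coupler parameters but not the formulas behind them. As a minimal sketch, the following assumes the commonly used definitions (insertion loss per branch, total loss, split ratio, and homogeneity as the spread of the branch insertion losses); the function name and the power readings are hypothetical and are not taken from the paper. Directionality is left out because it needs an additional cross-talk power measurement.

```python
import math


def coupler_parameters(p_in_mw, p_out_mw):
    """Return common splitter figures of merit from measured powers (mW).

    Assumed textbook definitions; the paper's exact formulas are not given.
    """
    total_out_mw = sum(p_out_mw)
    # Insertion loss of each branch: input power relative to that branch's output.
    insertion_loss_db = [10.0 * math.log10(p_in_mw / p) for p in p_out_mw]
    # Total loss: input power relative to the sum of all output powers.
    total_loss_db = 10.0 * math.log10(p_in_mw / total_out_mw)
    # Split ratio: share of the total output power carried by each branch.
    split_ratio = [p / total_out_mw for p in p_out_mw]
    # Homogeneity: spread between the worst and the best branch.
    homogeneity_db = max(insertion_loss_db) - min(insertion_loss_db)
    return {
        "insertion_loss_db": insertion_loss_db,
        "total_loss_db": total_loss_db,
        "split_ratio": split_ratio,
        "homogeneity_db": homogeneity_db,
    }


# Hypothetical readings for a 1:4 coupler, not data from the paper.
example = coupler_parameters(1.0, [0.23, 0.24, 0.22, 0.235])
```

    Running the same function on readings taken before and after each irradiation step would give the kind of before/after comparison the abstract describes.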

  1. Coaxial cable Bragg grating assisted microwave coupler.

    PubMed

    Huang, Jie; Wei, Tao; Fan, Jun; Xiao, Hai

    2014-01-01

    This paper reports a microwave coupler based on two parallel coaxial cable Bragg gratings fabricated by drilling U-grooves across the cables at periodic distances along the cable direction. Electromagnetic field coupling between the two cables was observed at discrete frequencies through both near-end and far-end detection. The coupling frequency and strength can be precisely controlled by varying the grating period and length. The coupling bandwidth may also be controlled through specific grating design. The device physics was also described through a transfer matrix, which matched well with the experimental results.

  2. Inverse design of near unity efficiency perfectly vertical grating couplers

    NASA Astrophysics Data System (ADS)

    Michaels, Andrew; Yablonovitch, Eli

    2018-02-01

    Efficient coupling between integrated optical waveguides and optical fibers is essential to the success of integrated photonics. While many solutions exist, perfectly vertical grating couplers which scatter light out of a waveguide in the direction normal to the waveguide's top surface are an ideal candidate due to their potential to reduce packaging complexity. Designing such couplers with high efficiency, however, has proven difficult. In this paper, we use electromagnetic inverse design techniques to optimize a high efficiency two-layer perfectly vertical silicon grating coupler. Our base design achieves a chip-to-fiber coupling efficiency of over 99% (-0.04 dB) at 1550 nm. Using this base design, we apply subsequent constrained optimizations to achieve vertical couplers with over 96% efficiency which are fabricable using a 65 nm process.

  3. Demonstration of a High-Order Mode Input Coupler for a 220-GHz Confocal Gyrotron Traveling Wave Tube

    NASA Astrophysics Data System (ADS)

    Guan, Xiaotong; Fu, Wenjie; Yan, Yang

    2018-02-01

    A design of a high-order mode input coupler for a 220-GHz confocal gyrotron travelling wave tube is proposed, simulated, and demonstrated by experimental tests. This input coupler is designed to excite the confocal TE06 mode from the rectangular waveguide TE10 mode over a broadband frequency range. Simulation results predict that the optimized conversion loss is about 2.72 dB with a mode purity in excess of 99%. Considering the gyrotron interaction theory, an effective bandwidth of 5 GHz is obtained, in which the beam-wave coupling efficiency is higher than half of the maximum. The field pattern under low power demonstrates that the TE06 mode is successfully excited in the confocal waveguide at 220 GHz. Cold test results from the vector network analyzer show good agreement with simulation results. Both simulation and experimental results illustrate that the reflection at the input port, S11, is sensitive to the perpendicular separation of the two mirrors. This provides an engineering possibility for estimating the assembly precision.

  4. 49 CFR 179.14 - Coupler vertical restraint system.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... system shall be tested under the following conditions: (1) The test coupler shall be tested with a mating coupler (or simulated coupler) having only frictional vertical force resistance at the mating interface; or a mating coupler (or simulated coupler) having the capabilities described in paragraph (a) of this...

  5. Fabrication of 8×8 MMI optical coupler in BK7 by ion-exchange

    NASA Astrophysics Data System (ADS)

    Li, Xia; Li, Xi-Hua; Zhou, Qiang; Jiang, Xiao-Qing; Yang, Jian-Yi; Wang, Ming-Hua

    2005-01-01

    Planar waveguide optical couplers are of prime importance in optical communication and optical signal processing systems. Compared with the optical fiber coupler (OFC), which is fabricated by fused biconical taper technology, planar waveguide couplers offer more compact size, lower loss, better uniformity, and easier manufacture and integration. Multimode interference (MMI) couplers have many advantages, such as compact size, wavelength and polarization insensitivity, fabrication tolerance and low loss, which attract more and more attention. Conventional MMI devices are based on uniform-index waveguides. When the number of input/output waveguides becomes larger, the intrinsic propagation constant error, which will cause bad uniformity of output power, can't be neglected. In fact, most waveguide devices are graded-index. With the enhanced compatibility of the MMI coupler, the performance can be improved at the same time. Prior study shows that graded-index MMI couplers reach the best performance under a certain index contrast. Among many available materials, glass is chosen to be the substrate of the coupler because of its good features, such as low loss, ease of fabrication, low cost, and so on. In this paper, an 8×8 MMI optical coupler is designed based on the principle of graded-index MMI. The coupler is composed of a waveguide, which is designed to support a large number of modes, and several access (usually single-mode) waveguides, which are used to launch light into and recover light from that multimode waveguide. The total length of the device is less than 3.5 centimeters, including S-bends which lead the multiple images to the output of the device with the spacing D = 250 μm to make the device fiber compatible. In this paper, we describe an experimental realization of the 8×8 graded-index MMI optical coupler and the measurement of its performance with a testing laser at a wavelength of 1.55 μm. The device is fabricated by ion-exchange on BK7 glass

  6. 30 CFR 77.805 - Cable couplers and connection boxes; minimum design requirements.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Cable couplers and connection boxes; minimum... connection boxes; minimum design requirements. (a)(1) Couplers that are used in medium- or high-voltage power... materials other than metal. (2) Cable couplers shall be adequate for the intended current and voltage. (3...

  7. 30 CFR 77.805 - Cable couplers and connection boxes; minimum design requirements.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 30 Mineral Resources 1 2011-07-01 2011-07-01 false Cable couplers and connection boxes; minimum... connection boxes; minimum design requirements. (a)(1) Couplers that are used in medium- or high-voltage power... materials other than metal. (2) Cable couplers shall be adequate for the intended current and voltage. (3...

  8. Angle selective fiber coupler.

    PubMed

    Barnoski, M K; Morrison, R J

    1976-01-01

    Angle selective input coupling through the side of a slightly tapered section of Corning highly multimode fiber has been experimentally demonstrated for the first time. This coupling technique allows the possibility of fabricating bidirectional (duplex) couplers for systems employing single strands of multimode, low loss fiber.

  9. High-performance parallel analysis of coupled problems for aircraft propulsion

    NASA Technical Reports Server (NTRS)

    Felippa, C. A.; Farhat, C.; Lanteri, S.; Gumaste, U.; Ronaghi, M.

    1994-01-01

    Applications of high-performance parallel computation are described for the analysis of complete jet engines, treated as a multidisciplinary coupled problem. The coupled problem involves interaction of structures with gas dynamics, heat conduction and heat transfer in aircraft engines. The methodology issues addressed include: consistent discrete formulation of coupled problems with emphasis on coupling phenomena; effect of partitioning strategies, augmentation and temporal solution procedures; sensitivity of response to problem parameters; and methods for interfacing multiscale discretizations in different single fields. The computer implementation issues addressed include: parallel treatment of coupled systems; domain decomposition and mesh partitioning strategies; data representation in object-oriented form and mapping to hardware driven representation; and tradeoff studies between partitioning schemes and fully coupled treatment.

  10. Scalable Unix commands for parallel processors : a high-performance implementation.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ong, E.; Lusk, E.; Gropp, W.

    2001-06-22

    We describe a family of MPI applications we call the Parallel Unix Commands. These commands are natural parallel versions of common Unix user commands such as ls, ps, and find, together with a few similar commands particular to the parallel environment. We describe the design and implementation of these programs and present some performance results on a 256-node Linux cluster. The Parallel Unix Commands are open source and freely available.

  11. Mode-converting coupler for silicon-on-sapphire devices

    NASA Astrophysics Data System (ADS)

    Zlatanovic, S.; Offord, B. W.; Owen, M.; Shimabukuro, R.; Jacobs, E. W.

    2015-02-01

    Silicon-on-sapphire devices are attractive for mid-infrared optical applications up to 5 microns due to the low loss of both silicon and sapphire in this wavelength band. Designing efficient couplers for silicon-on-sapphire devices presents a challenge due to a highly confined mode in silicon and large values of refractive index of both silicon and sapphire. Here, we present design, fabrication, and measurements of a mode-converting coupler for silicon-on-sapphire waveguides. We utilize a mode converter layout that consists of a large waveguide that overlays a silicon inverse tapered waveguide. While this geometry was previously utilized for silicon-on-oxide devices, the novelty is in using materials that are compatible with the silicon-on-sapphire platform. In the current coupler the overlaying waveguide is made of silicon nitride. Silicon nitride is the material of choice because of the large index of refraction and low absorption from near-infrared to mid-infrared. The couplers were fabricated using a 0.25 micron silicon-on-sapphire process. The measured coupling loss from tapered lensed silica fibers to the silicon was 4.8 dB/coupler. We will describe some challenges in the fabrication process and discuss ways to overcome them.

  12. Scalable High Performance Computing: Direct and Large-Eddy Turbulent Flow Simulations Using Massively Parallel Computers

    NASA Technical Reports Server (NTRS)

    Morgan, Philip E.

    2004-01-01

    This final report contains reports of research related to the tasks "Scalable High Performance Computing: Direct and Large-Eddy Turbulent Flow Simulations Using Massively Parallel Computers" and "Develop High-Performance Time-Domain Computational Electromagnetics Capability for RCS Prediction, Wave Propagation in Dispersive Media, and Dual-Use Applications". The discussion of Scalable High Performance Computing reports on three objectives: validate, assess scalability, and apply two parallel flow solvers for three-dimensional Navier-Stokes flows; develop and validate a high-order parallel solver for Direct Numerical Simulations (DNS) and Large Eddy Simulation (LES) problems; and investigate and develop a high-order Reynolds averaged Navier-Stokes turbulence model. The discussion of High-Performance Time-Domain Computational Electromagnetics reports on five objectives: enhancement of an electromagnetics code (CHARGE) to be able to effectively model antenna problems; utilize lessons learned in high-order/spectral solution of swirling 3D jets to apply to solving electromagnetics project; transition a high-order fluids code, FDL3DI, to be able to solve Maxwell's Equations using compact-differencing; develop and demonstrate improved radiation absorbing boundary conditions for high-order CEM; and extend high-order CEM solver to address variable material properties. The report also contains a review of work done by the systems engineer.

  13. Internal Mirror Optical Fiber Couplers

    NASA Astrophysics Data System (ADS)

    Shin, Jong-Dug

    A fusion splicing technique has been used to produce angled dielectric mirrors in multimode and single-mode silica fibers. These mirrored fiber couplers serve as compact directional couplers with low excess optical loss (~0.2 dB for multimode and 0.5 dB for single mode at 1.3 μm) and excellent mechanical properties. The reflectance is found to be wavelength dependent and strongly polarization dependent, as expected. Far-field scans of the reflected output power measured with a white-light source show a pattern which is almost circularly symmetric. The splitting ratio in a multimode coupler measured with a laser source is much less dependent on input coupling conditions than in conventional fused biconical-taper couplers. Spectral properties of multilayer fiber mirrors have been investigated experimentally, and a matrix analysis has been used to explain the results.

  14. High Performance Input/Output for Parallel Computer Systems

    NASA Technical Reports Server (NTRS)

    Ligon, W. B.

    1996-01-01

    The goal of our project is to study the I/O characteristics of parallel applications used in Earth Science data processing systems such as Regional Data Centers (RDCs) or EOSDIS. Our approach is to study the runtime behavior of typical programs and the effect of key parameters of the I/O subsystem both under simulation and with direct experimentation on parallel systems. Our three-year activity has focused on two items: developing a test bed that facilitates experimentation with parallel I/O, and studying representative programs from the Earth science data processing application domain. The Parallel Virtual File System (PVFS) has been developed for use on a number of platforms including the Tiger Parallel Architecture Workbench (TPAW) simulator, the Intel Paragon, a cluster of DEC Alpha workstations, and the Beowulf system (at CESDIS). PVFS provides considerable flexibility in configuring I/O in a UNIX-like environment. Access to key performance parameters facilitates experimentation. We have studied several key applications from levels 1, 2 and 3 of the typical RDC processing scenario including instrument calibration and navigation, image classification, and numerical modeling codes. We have also considered large-scale scientific database codes used to organize image data.

  15. Modeling of a Single-Notch Microfiber Coupler for High-Sensitivity and Low Detection-Limit Refractive Index Sensing.

    PubMed

    Zhang, Jiali; Shi, Lei; Zhu, Song; Xu, Xinbiao; Zhang, Xinliang

    2016-05-11

    A highly sensitive refractive index sensor with low detection limit based on an asymmetric optical microfiber coupler is proposed. It is composed of a silica optical microfiber and an As₂Se₃ optical microfiber. Due to the asymmetry of the microfiber materials, a single-notch transmission spectrum is demonstrated by the large refractive index difference between the two optical microfibers. The bandwidth of the asymmetric structure is over one order of magnitude narrower than that of the symmetric coupler. Therefore, the asymmetric optical microfiber coupler based sensor can reach over one order of magnitude smaller detection limit, which is defined as the minimal detectable refractive index change caused by the surrounding analyte. With the advantage of large evanescent field, the results also show that a sensitivity of up to 3212 nm per refractive index unit with a bandwidth of 12 nm is achieved with the asymmetric optical microfiber coupler. Furthermore, a maximum sensitivity of 4549 nm per refractive index unit can be reached when the radii of the silica optical microfiber and As₂Se₃ optical microfiber are 0.5 μm and 0.128 μm, respectively. This sensor component may have important potential for low detection-limit physical and biochemical sensing applications.

  16. CW all optical self switching in nonlinear chalcogenide nano plasmonic directional coupler

    NASA Astrophysics Data System (ADS)

    Motamed-Jahromi, Leila; Hatami, Mohsen

    2018-04-01

    In this paper we obtain the coupling coefficient of a plasmonic directional coupler (PDC) made up of two parallel monolayer waveguides filled with highly nonlinear chalcogenide material for the TM mode in the continuous-wave (CW) regime. In addition, we assume each waveguide acts as a perturbation to the other waveguide. Four nonlinear-coupled equations are derived. Transfer distances are numerically calculated and used for deriving the length of an all-optical switch. The length of the designed switch is in the range of 10-1000 μm, and the switching power is in the range of 1-100 W/m. The obtained values are suitable for designing all-optical elements in integrated optical circuits.

  17. Fiber-chip edge coupler with large mode size for silicon photonic wire waveguides.

    PubMed

    Papes, Martin; Cheben, Pavel; Benedikovic, Daniel; Schmid, Jens H; Pond, James; Halir, Robert; Ortega-Moñux, Alejandro; Wangüemert-Pérez, Gonzalo; Ye, Winnie N; Xu, Dan-Xia; Janz, Siegfried; Dado, Milan; Vašinek, Vladimír

    2016-03-07

    Fiber-chip edge couplers are extensively used in integrated optics for coupling of light between planar waveguide circuits and optical fibers. In this work, we report on a new fiber-chip edge coupler concept with large mode size for silicon photonic wire waveguides. The coupler allows direct coupling with conventional cleaved optical fibers with large mode size while circumventing the need for lensed fibers. The coupler is designed for 220 nm silicon-on-insulator (SOI) platform. It exhibits an overall coupling efficiency exceeding 90%, as independently confirmed by 3D Finite-Difference Time-Domain (FDTD) and fully vectorial 3D Eigenmode Expansion (EME) calculations. We present two specific coupler designs, namely for a high numerical aperture single mode optical fiber with 6 µm mode field diameter (MFD) and a standard SMF-28 fiber with 10.4 µm MFD. An important advantage of our coupler concept is the ability to expand the mode at the chip edge without leading to high substrate leakage losses through buried oxide (BOX), which in our design is set to 3 µm. This remarkable feature is achieved by implementing in the SiO2 upper cladding thin high-index Si3N4 layers. The Si3N4 layers increase the effective refractive index of the upper cladding near the facet. The index is controlled along the taper by subwavelength refractive index engineering to facilitate adiabatic mode transformation to the silicon wire waveguide while the Si-wire waveguide is inversely tapered along the coupler. The mode overlap optimization at the chip facet is carried out with a full vectorial mode solver. The mode transformation along the coupler is studied using 3D-FDTD simulations and with fully-vectorial 3D-EME calculations. The couplers are optimized for operating with transverse electric (TE) polarization and the operating wavelength is centered at 1.55 µm.

  18. L-shaped fiber-chip grating couplers with high directionality and low reflectivity fabricated with deep-UV lithography.

    PubMed

    Benedikovic, Daniel; Alonso-Ramos, Carlos; Pérez-Galacho, Diego; Guerber, Sylvain; Vakarin, Vladyslav; Marcaud, Guillaume; Le Roux, Xavier; Cassan, Eric; Marris-Morini, Delphine; Cheben, Pavel; Boeuf, Frédéric; Baudot, Charles; Vivien, Laurent

    2017-09-01

    Grating couplers enable position-friendly interfacing of silicon chips by optical fibers. The conventional coupler designs call upon comparatively complex architectures to afford efficient light coupling to sub-micron silicon-on-insulator (SOI) waveguides. Conversely, the blazing effect in double-etched gratings provides high coupling efficiency with reduced fabrication intricacy. In this Letter, we demonstrate for the first time, to the best of our knowledge, the realization of an ultra-directional L-shaped grating coupler, seamlessly fabricated by using 193 nm deep-ultraviolet (deep-UV) lithography. We also include a subwavelength index engineered waveguide-to-grating transition that provides an eight-fold reduction of the grating reflectivity, down to 1% (-20 dB). A measured coupling efficiency of -2.7 dB (54%) is achieved, with a bandwidth of 62 nm. These results open promising prospects for the implementation of efficient, robust, and cost-effective coupling interfaces for sub-micrometric SOI waveguides, as desired for large-volume applications in silicon photonics.
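
    Several records in this list quote efficiencies both in decibels and as percentages, for example -2.7 dB (54%) and 1% (-20 dB) above. The short sketch below shows the standard power-ratio conversion behind such pairs; the function names are illustrative only.

```python
import math


def fraction_to_db(fraction):
    """Convert a power ratio (0 < fraction <= 1 for a loss) to decibels."""
    return 10.0 * math.log10(fraction)


def db_to_fraction(db):
    """Convert a power ratio in decibels back to a linear fraction."""
    return 10.0 ** (db / 10.0)


coupling_fraction = db_to_fraction(-2.7)   # roughly 0.54, i.e. about 54%
reflectivity_db = fraction_to_db(0.01)     # 1% expressed in dB
```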

  19. Parallel Reaction Monitoring: A Targeted Experiment Performed Using High Resolution and High Mass Accuracy Mass Spectrometry

    PubMed Central

    Rauniyar, Navin

    2015-01-01

    The parallel reaction monitoring (PRM) assay has emerged as an alternative method of targeted quantification. The PRM assay is performed in a high resolution and high mass accuracy mode on a mass spectrometer. This review presents the features that make PRM a highly specific and selective method for targeted quantification using quadrupole-Orbitrap hybrid instruments. In addition, this review discusses the label-based and label-free methods of quantification that can be performed with the targeted approach. PMID:26633379

  20. Analysis of the rectangular resonator with butterfly MMI coupler using SOI

    NASA Astrophysics Data System (ADS)

    Kim, Sun-Ho; Park, Jun-Hee; Kim, Eudum; Jeon, Su-Jin; Kim, Ji-Hoon; Choi, Young-Wan

    2018-02-01

    We propose a rectangular resonator sensor structure with a butterfly MMI coupler using SOI. It consists of the rectangular resonator, a total internal reflection (TIR) mirror, and the butterfly MMI coupler. The rectangular resonator is expected to be used in bio and chemical sensors because of the advantages of using an MMI coupler and the absence of bending loss, unlike ring resonators. The butterfly MMI coupler can miniaturize the device compared to a conventional MMI by using a linear butterfly shape instead of a square in the MMI part. The width, height, and slab height of the rib type waveguide are designed to be 1.5 μm, 1.5 μm, and 0.9 μm, respectively. This structure is designed as a single mode. When designing the TIR mirror, we considered the Goos-Hänchen shift and critical angle. We designed a 3:1 MMI coupler because the rectangular resonator has no bending loss. The width of the MMI is designed to be 4.5 μm and we optimize the length of the butterfly MMI coupler using the finite-difference time-domain (FDTD) method for a higher Q-factor. Its performance is equal to that of a conventional MMI even though the length is reduced by 1/3. As a result of the simulation, a Q-factor of 7381 is obtained for the rectangular resonator.

  1. Suppression of multipacting in high power RF couplers operating with superconducting cavities

    NASA Astrophysics Data System (ADS)

    Ostroumov, P. N.; Kazakov, S.; Morris, D.; Larter, T.; Plastun, A. S.; Popielarski, J.; Wei, J.; Xu, T.

    2017-06-01

    Capacitive input couplers based on a 50 Ω coaxial transmission line are frequently used to transmit RF power to superconducting (SC) resonators operating in CW mode. It is well known that coaxial transmission lines are prone to the multipacting phenomenon over a wide range of RF power levels and operating frequencies. The Facility for Rare Isotope Beams (FRIB) being constructed at Michigan State University includes two types of quarter wave SC resonators (QWR) operating at 80.5 MHz and two types of half wave SC resonators (HWR) operating at 322 MHz. As was reported in ref. [1], a capacitive input coupler used with HWRs was experiencing strong multipacting that resulted in a long conditioning time prior to cavity testing at design levels of accelerating fields. We have developed an insert into the 50 Ω coaxial transmission line that provides the opportunity to bias the RF coupler antenna and protect the amplifier from the bias potential in the case of breakdown in DC isolation. Two such devices have been built and are currently used for the off-line testing of 8 HWRs installed in the cryomodule.

  2. OBLIMAP 2.0: a fast climate model-ice sheet model coupler including online embeddable mapping routines

    NASA Astrophysics Data System (ADS)

    Reerink, Thomas J.; van de Berg, Willem Jan; van de Wal, Roderik S. W.

    2016-11-01

    This paper accompanies the second OBLIMAP open-source release. The package is developed to map climate fields between a general circulation model (GCM) and an ice sheet model (ISM) in both directions by using optimal aligned oblique projections, which minimize distortions. The curvature of the surfaces of the GCM and ISM grid differ, both grids may be irregularly spaced and the ratio of the grids is allowed to differ largely. OBLIMAP's stand-alone version is able to map data sets that differ in various aspects on the same ISM grid. Each grid may either coincide with the surface of a sphere, an ellipsoid or a flat plane, while the grid types might differ. Re-projection of, for example, ISM data sets is also facilitated. This is demonstrated by relevant applications concerning the major ice caps. As the stand-alone version also applies to the reverse mapping direction, it can be used as an offline coupler. Furthermore, OBLIMAP 2.0 is an embeddable GCM-ISM coupler, suited for high-frequency online coupled experiments. A new fast scan method is presented for structured grids as an alternative for the former time-consuming grid search strategy, realising a performance gain of several orders of magnitude and enabling the mapping of high-resolution data sets with a much larger number of grid nodes. Further, a highly flexible masked mapping option is added. The limitation of the fast scan method with respect to unstructured and adaptive grids is discussed together with a possible future parallel Message Passing Interface (MPI) implementation.

  3. Ultrashort hybrid metal-insulator plasmonic directional coupler.

    PubMed

    Noghani, Mahmoud Talafi; Samiei, Mohammad Hashem Vadjed

    2013-11-01

    An ultrashort plasmonic directional coupler based on the hybrid metal-insulator slab waveguide is proposed and analyzed at the telecommunication wavelength of 1550 nm. It is first analyzed using the supermode theory based on mode analysis via the transfer matrix method in the interaction region. Then the 2D model of the coupler, including transition arms, is analyzed using a commercial finite-element method simulator. The hybrid slab waveguide is composed of a metallic layer of silver and two dielectric layers of silica (SiO2) and silicon (Si). The coupler is optimized to have a minimum coupling length and to transfer maximum power considering the layer thicknesses as optimization variables. The resulting coupling length in the submicrometer region along with a noticeable power transfer efficiency are advantages of the proposed coupler compared to previously reported plasmonic couplers.
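
    The supermode picture mentioned in this abstract can be illustrated with the standard lossless two-mode result: the coupling length is L_c = λ / (2 (n_even - n_odd)), and the power transferred to the second waveguide after a distance z is sin²(π z / (2 L_c)). The sketch below is a simplification that ignores the propagation loss present in plasmonic waveguides, and the effective indices are hypothetical values chosen only to show how a large index difference yields a submicrometer coupling length.

```python
import math


def coupling_length_um(wavelength_um, n_even, n_odd):
    """Coupling length from the effective indices of the even and odd supermodes."""
    return wavelength_um / (2.0 * (n_even - n_odd))


def cross_power_fraction(z_um, coupling_length):
    """Fraction of power in the second waveguide after z (lossless, symmetric case)."""
    return math.sin(math.pi * z_um / (2.0 * coupling_length)) ** 2


# Hypothetical supermode indices at 1550 nm, not values from the paper.
lc_um = coupling_length_um(1.55, 2.9, 1.9)
half_way = cross_power_fraction(lc_um / 2.0, lc_um)  # power split at half the coupling length
```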

  4. A Ratiometric Wavelength Measurement Based on a Silicon-on-Insulator Directional Coupler Integrated Device

    PubMed Central

    Wang, Pengfei; Hatta, Agus Muhamad; Zhao, Haoyu; Zheng, Jie; Farrell, Gerald; Brambilla, Gilberto

    2015-01-01

    A ratiometric wavelength measurement based on a Silicon-on-Insulator (SOI) integrated device is proposed and designed, which consists of directional couplers acting as two edge filters with opposite spectral responses. The optimal separation distance between two parallel silicon waveguides and the interaction length of the directional coupler are designed to meet the desired spectral response by using local supermodes. The wavelength discrimination ability of the designed ratiometric structure is demonstrated by a beam propagation method numerically and then is verified experimentally. The experimental results have shown a general agreement with the theoretical models. The ratiometric wavelength system demonstrates a resolution of better than 50 pm at a wavelength around 1550 nm with ease of assembly and calibration. PMID:26343668
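
    The ratiometric principle described above can be sketched with two idealized edge filters whose transmission (in dB) varies linearly with wavelength with opposite slopes. The slopes, reference wavelength, and loss values below are assumptions made for illustration; the real device response comes from the directional couplers and is not linear in general. The point of the sketch is that the ratio of the two outputs is independent of the input power and maps one-to-one to wavelength.

```python
SLOPE_UP_DB_PER_NM = 0.5     # hypothetical rising edge filter
SLOPE_DOWN_DB_PER_NM = -0.5  # hypothetical falling edge filter
REF_NM = 1550.0              # wavelength at which both filters have equal loss


def edge_filter_db(wavelength_nm, slope_db_per_nm, loss_at_ref_db=-10.0):
    """Transmission in dB of an idealized linear edge filter."""
    return loss_at_ref_db + slope_db_per_nm * (wavelength_nm - REF_NM)


def measured_ratio_db(wavelength_nm, input_power_dbm):
    """Ratio (in dB) of the two detector readings; the input power cancels out."""
    p1 = input_power_dbm + edge_filter_db(wavelength_nm, SLOPE_UP_DB_PER_NM)
    p2 = input_power_dbm + edge_filter_db(wavelength_nm, SLOPE_DOWN_DB_PER_NM)
    return p1 - p2


def wavelength_from_ratio(ratio_db):
    """Invert the calibration: recover the wavelength from the measured ratio."""
    return REF_NM + ratio_db / (SLOPE_UP_DB_PER_NM - SLOPE_DOWN_DB_PER_NM)


recovered_nm = wavelength_from_ratio(measured_ratio_db(1552.3, input_power_dbm=-7.5))
```

    Using two opposite slopes doubles the ratio change per nanometre compared with a single edge filter and a flat reference arm, which is one reason such a layout can help resolution.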

  5. 30 CFR 75.805 - Couplers.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... ground check continuity conductor shall be broken first and the ground conductors shall be broken last.... [Statutory Provisions] Couplers that are used with medium-voltage or high-voltage power circuits shall be of the three-phase type with a full metallic shell, except that the Secretary may permit, under such...

  6. Optimization of fiber grating couplers on SOI using advanced search algorithms.

    PubMed

    Wohlfeil, Benjamin; Zimmermann, Lars; Petermann, Klaus

    2014-06-01

    A one-dimensional fiber grating coupler is derived from a waveguide with random etches using implementations of particle swarm and genetic algorithms. The resulting gratings yield a theoretical coupling efficiency of up to 1.1 dB and prompt clear design rules for the layout of highly efficient fiber grating couplers.

  7. Element of an inductive coupler

    DOEpatents

    Hall, David R.; Fox, Joe

    2006-08-15

    An element for an inductive coupler in a downhole component comprises magnetically conductive material, which is disposed in a recess in annular housing. The magnetically conductive material forms a generally circular trough. The circular trough comprises an outer generally U-shaped surface, an inner generally U-shaped surface, and two generally planar surfaces joining the inner and outer surfaces. The element further comprises pressure relief grooves in at least one of the surfaces of the circular trough. The pressure relief grooves may be scored lines. Preferably the pressure relief grooves are parallel to the magnetic field generated by the magnetically conductive material. The magnetically conductive material is selected from the group consisting of soft iron, ferrite, a nickel iron alloy, a silicon iron alloy, a cobalt iron alloy, and a mu-metal. Preferably, the annular housing is a metal ring.

  8. A History of the Chemical Innovations in Silver-Halide Materials for Color Photography II. Color-Forming Development, Part 5. Coupler Innovations after the 1970's—Two-Equivalent Coupler and DIR Coupler

    NASA Astrophysics Data System (ADS)

    Oishi, Yasushi

    From the 1970s on, several manufacturers including Fuji Film, Konica and Agfa-Gevaert participated in innovating color photographic materials by adding their own coupler chemistry to the technological architecture built by Kodak before then. One area of their major advances was the development of couplers having a coupling-off organic group. One functional form was the two-equivalent coupler, which made the dye-forming process efficient and made the photosensitive layers slim. Another was the DIR coupler, which dramatically improved the image quality of color negative materials. In this paper a historical overview of these innovations is constructed from the technical documents, mainly patents.

  9. Design and development of ultra-wideband 3 dB hybrid coupler for Ion cyclotron resonance frequency heating in tokamak.

    PubMed

    Yadav, Rana Pratap; Kumar, Sunil; Kulkarni, S V

    2014-04-01

    Design and development of a high power ultra-wideband, 3 dB tandem hybrid coupler is presented and its application in ICRF heating of the tokamak is discussed. In order to achieve the desired frequency band of 38-112 MHz and 200 kW power handling capability, the 3 dB hybrid coupler is developed using two 3-element 8.34 ± 0.2 dB coupled lines sections in tandem. In multi-element coupled lines, junctions are employed for the joining of coupled elements that produce the undesirable reactance called junction discontinuity effect. The effect becomes prominent in the high power multi-element coupled lines for high frequency (HF) and very high frequency (VHF) applications because of larger structural dimensions. The junction discontinuity effect significantly degrades coupling and output performance relative to the theoretical predictions. For the analysis of the junction discontinuity effect and its compensation, a theoretical approach has been developed and generalized for an n-element coupled lines section. The theory has been applied in the development of the 3 dB hybrid coupler. The fabricated hybrid coupler has been experimentally characterized using a vector network analyzer, and the obtained results are in good agreement with the developed theory.
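
    The tandem arithmetic in this abstract can be checked numerically. The sketch below assumes two ideal, lossless backward-wave coupled-line sections, for which the voltage coupling factor is k = sin θ and a tandem connection adds the coupling angles. The function name is hypothetical, and the junction discontinuity effect studied in the paper is ignored.

```python
import math

def tandem_coupling_db(section1_db: float, section2_db: float) -> float:
    """Overall coupling (dB) of two ideal coupled-line sections in tandem."""
    # Convert each coupling value in dB to a voltage coupling factor.
    k1 = 10.0 ** (-section1_db / 20.0)
    k2 = 10.0 ** (-section2_db / 20.0)
    # Ideal sections: k = sin(theta); a tandem connection adds the angles.
    theta_total = math.asin(k1) + math.asin(k2)
    return -20.0 * math.log10(math.sin(theta_total))

overall_db = tandem_coupling_db(8.34, 8.34)
print(f"two 8.34 dB sections in tandem: {overall_db:.2f} dB")
```

    Each 8.34 dB section corresponds to a coupling angle of about 22.5°, so the pair reaches roughly 45°, which is the equal power split of a 3 dB hybrid.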

  10. Wet-chemical fabrication of a single leakage-channel grating coupler

    NASA Astrophysics Data System (ADS)

    Weisenbach, Lori; Zelinski, Brian J. J.; Roncone, Ronald L.; Burke, James J.

    1995-04-01

    We demonstrate the fabrication of a unique optical device, the single leakage-channel grating coupler, using sol-gel techniques. Design specifications are outlined to establish the material criteria for the sol-gel compositions. Material choice and preparation are described. We evaluate the characteristics and performance of the single leakage-channel grating coupler by comparing the predicted and the measured branching ratios. The branching ratio of the solution-derived device is within 3% of the theoretically predicted value.

  11. Magnetic field sensor based on cascaded microfiber coupler with magnetic fluid

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mao, Lianmin; Su, Delong; Wang, Zhaofang

    A kind of magnetic field sensor based on cascaded microfiber coupler with magnetic fluid is proposed and experimentally demonstrated. The magnetic fluid is utilized as the cladding of the fused regions of the cascaded microfiber coupler. As the interference valley wavelength of the sensing structure is sensitive to the ambient variation, considering the magnetic-field-dependent refractive index of magnetic fluid, the proposed structure is employed for magnetic field sensing. The effective coupling length for each coupling region of the as-fabricated cascaded microfiber coupler is 6031 μm. The achieved sensitivity is 125 pm/Oe, which is about three times larger than that of the previous similar structure based on the single microfiber coupler. Experimental results indicate that the sensing sensitivity can be easily improved by increasing the effective coupling length or cascading more microfiber couplers. The proposed magnetic field sensor is attractive due to its low cost, immunity to electromagnetic interference, as well as high sensitivity, and it also has potential in other tunable all-fiber photonic devices, such as filters.

  12. High-performance parallel approaches for three-dimensional light detection and ranging point clouds gridding

    NASA Astrophysics Data System (ADS)

    Rizki, Permata Nur Miftahur; Lee, Heezin; Lee, Minsu; Oh, Sangyoon

    2017-01-01

    With the rapid advance of remote sensing technology, the amount of three-dimensional point-cloud data has increased extraordinarily, requiring faster processing in the construction of digital elevation models. There have been several attempts to accelerate the computation using parallel methods; however, little attention has been given to investigating different approaches for selecting the most suited parallel programming model for a given computing environment. We present our findings and insights identified by implementing three popular high-performance parallel approaches (message passing interface, MapReduce, and GPGPU) on time-demanding but accurate kriging interpolation. The performances of the approaches are compared by varying the size of the grid and input data. In our empirical experiment, we demonstrate the significant acceleration by all three approaches compared to a C-implemented sequential-processing method. In addition, we also discuss the pros and cons of each method in terms of usability, complexity, infrastructure, and platform limitation to give readers a better understanding of utilizing those parallel approaches for gridding purposes.

  13. Computational Performance of a Parallelized Three-Dimensional High-Order Spectral Element Toolbox

    NASA Astrophysics Data System (ADS)

    Bosshard, Christoph; Bouffanais, Roland; Clémençon, Christian; Deville, Michel O.; Fiétier, Nicolas; Gruber, Ralf; Kehtari, Sohrab; Keller, Vincent; Latt, Jonas

    In this paper, a comprehensive performance review of an MPI-based high-order three-dimensional spectral element method C++ toolbox is presented. The focus is put on the performance evaluation of several aspects with a particular emphasis on the parallel efficiency. The performance evaluation is analyzed with help of a time prediction model based on a parameterization of the application and the hardware resources. A tailor-made CFD computation benchmark case is introduced and used to carry out this review, stressing the particular interest for clusters with up to 8192 cores. Some problems in the parallel implementation have been detected and corrected. The theoretical complexities with respect to the number of elements, to the polynomial degree, and to communication needs are correctly reproduced. It is concluded that this type of code has a nearly perfect speed up on machines with thousands of cores, and is ready to make the step to next-generation petaflop machines.

  14. Parallel-vector unsymmetric Eigen-Solver on high performance computers

    NASA Technical Reports Server (NTRS)

    Nguyen, Duc T.; Jiangning, Qin

    1993-01-01

    The popular QR algorithm for computing all eigenvalues of an unsymmetric matrix is reviewed. Among the basic components of the QR algorithm, this study concluded that the reduction of an unsymmetric matrix to Hessenberg form (before applying the QR algorithm itself) can be done effectively by exploiting the vector speed and multiple processors offered by modern high-performance computers. Numerical examples of several test cases have indicated that the proposed parallel-vector algorithm for converting a given unsymmetric matrix to Hessenberg form offers computational advantages over the existing algorithm. The time saving obtained by the proposed methods increases as the problem size increases.
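
    The Hessenberg reduction step named in this abstract can be sketched as follows. This is a plain sequential Householder reduction in pure Python, given only to show what the step computes. It is not the parallel-vector algorithm of the report, and the function name and test matrix are hypothetical.

```python
import math

def hessenberg(a):
    """Reduce a square matrix (list of lists) to upper Hessenberg form
    with Householder similarity transforms; returns a new matrix."""
    n = len(a)
    h = [row[:] for row in a]
    for k in range(n - 2):
        # Part of column k that lies on and below the subdiagonal.
        x = [h[i][k] for i in range(k + 1, n)]
        norm_x = math.sqrt(sum(t * t for t in x))
        if norm_x == 0.0:
            continue  # column is already reduced
        # Householder vector v = x + sign(x0) * |x| * e1.
        v = x[:]
        v[0] += math.copysign(norm_x, x[0])
        vv = sum(t * t for t in v)
        m = len(v)
        # Left multiply by P = I - 2 v v^T / (v^T v): acts on rows k+1..n-1.
        for j in range(n):
            f = 2.0 * sum(v[i] * h[k + 1 + i][j] for i in range(m)) / vv
            for i in range(m):
                h[k + 1 + i][j] -= f * v[i]
        # Right multiply by P: acts on columns k+1..n-1, keeping similarity.
        for i in range(n):
            f = 2.0 * sum(h[i][k + 1 + j] * v[j] for j in range(m)) / vv
            for j in range(m):
                h[i][k + 1 + j] -= f * v[j]
    return h

A = [[4.0, 1.0, 2.0, 3.0],
     [2.0, 3.0, 0.0, 1.0],
     [1.0, 5.0, 2.0, 7.0],
     [3.0, 1.0, 6.0, 1.0]]
H = hessenberg(A)
```

    Because each step is a similarity transform, H has the same eigenvalues as A, and all entries below the first subdiagonal are zero up to rounding. The QR iteration then works on H at lower cost per sweep than on the full matrix.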

  15. Compact broadband polarization beam splitter using a symmetric directional coupler with sinusoidal bends.

    PubMed

    Zhang, Fan; Yun, Han; Wang, Yun; Lu, Zeqin; Chrostowski, Lukas; Jaeger, Nicolas A F

    2017-01-15

    We design and demonstrate a compact broadband polarization beam splitter (PBS) using a symmetric directional coupler with sinusoidal bends on a silicon-on-insulator platform. The sinusoidal bends in our PBS suppress the power exchange between two parallel symmetric strip waveguides for the transverse-electric (TE) mode, while allowing for the maximum power transfer to the adjacent waveguide for the transverse-magnetic (TM) mode. Our PBS has a nominal coupler length of 8.55 μm, and it has an average extinction ratio (ER) of 12.0 dB for the TE mode, an average ER of 20.1 dB for the TM mode, an average polarization isolation (PI) of 20.6 dB for the through port, and an average PI of 11.5 dB for the cross port, all over a bandwidth of 100 nm.

  16. Geometric optimisation of an accurate cosine correcting optic fibre coupler for solar spectral measurement.

    PubMed

    Cahuantzi, Roberto; Buckley, Alastair

    2017-09-01

    Making accurate and reliable measurements of solar irradiance is important for understanding performance in the photovoltaic energy sector. In this paper, we present design details and performance of a number of fibre optic couplers for use in irradiance measurement systems employing remote light sensors applicable for either spectrally resolved or broadband measurement. The angular and spectral characteristics of different coupler designs are characterised and compared with existing state-of-the-art commercial technology. The new coupler designs are fabricated from polytetrafluoroethylene (PTFE) rods and operate through forward scattering of incident sunlight on the front surfaces of the structure into an optic fibre located in a cavity to the rear of the structure. The PTFE couplers exhibit up to 4.8% variation in scattered transmission intensity between 425 nm and 700 nm and show minimal specular reflection, making the designs accurate and reliable over the visible region. Through careful geometric optimization, near-perfect cosine dependence of the angular response of the coupler can be achieved. The PTFE designs represent a significant improvement over the state of the art with less than 0.01% error compared with ideal cosine response for angles of incidence up to 50°.

  17. High-Performance Parallel Analysis of Coupled Problems for Aircraft Propulsion

    NASA Technical Reports Server (NTRS)

    Felippa, C. A.; Farhat, C.; Park, K. C.; Gumaste, U.; Chen, P.-S.; Lesoinne, M.; Stern, P.

    1997-01-01

    Applications are described of high-performance computing methods to the numerical simulation of complete jet engines. The methodology focuses on the partitioned analysis of the interaction of the gas flow with a flexible structure and with the fluid mesh motion driven by structural displacements. The latter is treated by an ALE technique that models the fluid mesh motion as that of a fictitious mechanical network laid along the edges of near-field elements. New partitioned analysis procedures to treat this coupled three-component problem were developed. These procedures involved delayed corrections and subcycling, and have been successfully tested on several massively parallel computers, including the iPSC-860, Paragon XP/S and the IBM SP2. The NASA-sponsored ENG10 program was used for the global steady state analysis of the whole engine. This program uses a regular FV-multiblock-grid discretization in conjunction with circumferential averaging to include effects of blade forces, loss, combustor heat addition, blockage, bleeds and convective mixing. A load-balancing preprocessor for parallel versions of ENG10 was developed as well as the capability for the first full 3D aeroelastic simulation of a multirow engine stage. This capability was tested on the IBM SP2 parallel supercomputer at NASA Ames.

  18. Apodized grating coupler using fully-etched nanostructures

    NASA Astrophysics Data System (ADS)

    Wu, Hua; Li, Chong; Li, Zhi-Yong; Guo, Xia

    2016-08-01

    A two-dimensional apodized grating coupler for interfacing between single-mode fiber and photonic circuit is demonstrated in order to bridge the mode gap between the grating coupler and optical fiber. The grating grooves of the grating couplers are realized by columns of fully etched nanostructures, which are utilized to digitally tailor the effective refractive index of each groove in order to obtain the Gaussian-like output diffractive mode and then enhance the coupling efficiency. Compared with that of the uniform grating coupler, the coupling efficiency of the apodized grating coupler is increased by 4.3% and 5.7%, respectively, for the nanoholes and nanorectangles as the refractive-index tuning layer. Project supported by the National Natural Science Foundation of China (Grant Nos. 61222501, 61335004, and 61505003), the Specialized Research Fund for the Doctoral Program of Higher Education of China (Grant No. 20111103110019), the Postdoctoral Science Foundation of Beijing Funded Project, China (Grant No. Q6002012201502), and the Science and Technology Research Project of Jiangxi Provincial Education Department, China (Grant No. GJJ150998).

  19. Fluoride-fiber-based side-pump coupler for high-power fiber lasers at 2.8 μm.

    PubMed

    Schäfer, C A; Uehara, H; Konishi, D; Hattori, S; Matsukuma, H; Murakami, M; Shimizu, S; Tokita, S

    2018-05-15

    A side-pump coupler made of fluoride fibers was fabricated and tested. The tested device had a coupling efficiency of 83% and was driven with an incident pump power of up to 83.5 W, demonstrating high-power operation. Stable laser output of 15 W at a wavelength of around 2.8 μm was achieved over 1 h when using an erbium-doped double-clad fiber as the active medium. To the best of our knowledge, this is the first time a fluoride-glass-fiber-based side-pump coupler has been developed. A test with two devices demonstrated further power scalability.

  20. Ultralow loss, high Q, four port resonant couplers for quantum optics and photonics.

    PubMed

    Rokhsari, H; Vahala, K J

    2004-06-25

    We demonstrate a low-loss, optical four port resonant coupler (add-drop geometry), using ultrahigh Q (>10^8) toroidal microcavities. Different regimes of operation are investigated by variation of coupling between resonator and fiber taper waveguides. As a result, waveguide-to-waveguide power transfer efficiency of 93% (0.3 dB loss) and nonresonant insertion loss of 0.02% (<0.001 dB) for narrow bandwidth (57 MHz) four port couplers are achieved in this work. The combination of low-loss, fiber compatibility, and wafer-scale design would be suitable for a variety of applications ranging from quantum optics to photonic networks.
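
    The percentage and decibel figures quoted in this abstract are two forms of the same quantities, related by loss = -10 log10(efficiency). A short sketch of the conversion follows; the function name is hypothetical.

```python
import math

def efficiency_to_loss_db(efficiency: float) -> float:
    """Loss in dB for a power transfer efficiency in the range (0, 1]."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in (0, 1]")
    return -10.0 * math.log10(efficiency)

# 93% waveguide-to-waveguide transfer efficiency, quoted as 0.3 dB loss.
transfer_loss_db = efficiency_to_loss_db(0.93)

# 0.02% nonresonant insertion loss means 99.98% of the power is transmitted.
insertion_loss_db = efficiency_to_loss_db(1.0 - 0.0002)

print(f"transfer loss:  {transfer_loss_db:.3f} dB")
print(f"insertion loss: {insertion_loss_db:.5f} dB")
```

    The first value comes out slightly above 0.3 dB and the second below 0.001 dB, consistent with the rounded figures in the abstract.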

  1. Parameters that affect parallel processing for computational electromagnetic simulation codes on high performance computing clusters

    NASA Astrophysics Data System (ADS)

    Moon, Hongsik

    What is the impact of multicore and associated advanced technologies on computational software for science? Most researchers and students have multicore laptops or desktops for their research and they need computing power to run computational software packages. Computing power was initially derived from Central Processing Unit (CPU) clock speed. That changed when increases in clock speed became constrained by power requirements. Chip manufacturers turned to multicore CPU architectures and associated technological advancements to create the CPUs for the future. Most software applications benefited from the increased computing power the same way that increases in clock speed helped applications run faster. However, for Computational ElectroMagnetics (CEM) software developers, this change was not an obvious benefit - it appeared to be a detriment. Developers were challenged to find a way to correctly utilize the advancements in hardware so that their codes could benefit. The solution was parallelization and this dissertation details the investigation to address these challenges. Prior to multicore CPUs, advanced computer technologies were compared with the performance using benchmark software and the metric was FLoating-point Operations Per Second (FLOPS), which indicates system performance for scientific applications that make heavy use of floating-point calculations. Is FLOPS an effective metric for parallelized CEM simulation tools on new multicore systems? Parallel CEM software needs to be benchmarked not only by FLOPS but also by the performance of other parameters related to type and utilization of the hardware, such as CPU, Random Access Memory (RAM), hard disk, network, etc. The codes need to be optimized for more than just FLOPS and new parameters must be included in benchmarking. In this dissertation, the parallel CEM software named High Order Basis Based Integral Equation Solver (HOBBIES) is introduced. This code was developed to address the needs of the

  2. Grating-assisted surface acoustic wave directional couplers

    NASA Astrophysics Data System (ADS)

    Golan, G.; Griffel, G.; Seidman, A.; Croitoru, N.

    1991-07-01

    Physical properties of novel grating-assisted Y directional couplers are examined using the coupled-mode theory. A general formalism for the analysis of the lateral perturbed directional coupler properties is presented. Explicit expressions for waveguide key parameters such as coupling length, grating period, and other structural characterizations, are obtained. The influence of other physical properties such as time and frequency response or cutoff conditions are also analyzed. A plane grating-assisted directional coupler is presented and examined as a basic component in the integrated acoustic technology.

  3. 49 CFR 215.123 - Defective couplers.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... automatically with the adjacent car; (b) The car has a coupler that has a crack in the highly stressed junction... knuckle that is broken or cracked on the inside pulling face of the knuckle. (d) The car has a knuckle pin...) Missing; (ii) Inoperative; (iii) Bent; (iv) Cracked; or (v) Broken. ...

  4. 49 CFR 215.123 - Defective couplers.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... automatically with the adjacent car; (b) The car has a coupler that has a crack in the highly stressed junction... knuckle that is broken or cracked on the inside pulling face of the knuckle. (d) The car has a knuckle pin...) Missing; (ii) Inoperative; (iii) Bent; (iv) Cracked; or (v) Broken. ...

  5. Integrated optical XY coupler

    DOEpatents

    Vawter, G. Allen; Hadley, G. Ronald

    1997-01-01

    An integrated optical XY coupler having two converging input waveguide arms meeting in a central section and a central output waveguide arm and two diverging flanking output waveguide arms emanating from the central section. In-phase light from the input arms constructively interferes in the central section to produce a single mode output in the central output arm with the rest of the light being collected in the flanking output arms. Crosstalk between devices on a substrate is minimized by this collection of the out-of-phase light by the flanking output arms of the XY coupler.

  6. Integrated optical XY coupler

    DOEpatents

    Vawter, G.A.; Hadley, G.R.

    1997-05-06

    An integrated optical XY coupler having two converging input waveguide arms meeting in a central section and a central output waveguide arm and two diverging flanking output waveguide arms emanating from the central section. In-phase light from the input arms constructively interferes in the central section to produce a single mode output in the central output arm with the rest of the light being collected in the flanking output arms. Crosstalk between devices on a substrate is minimized by this collection of the out-of-phase light by the flanking output arms of the XY coupler. 9 figs.

  7. Detuning related coupler kick variation of a superconducting nine-cell 1.3 GHz cavity

    NASA Astrophysics Data System (ADS)

    Hellert, Thorsten; Dohlus, Martin

    2018-04-01

    Superconducting TESLA-type cavities are widely used to accelerate electrons in long bunch trains, such as in high repetition rate free electron lasers. The TESLA cavity is equipped with two higher order mode couplers and a fundamental power coupler (FPC), which break the axial symmetry of the cavity. The passing electrons therefore experience axially asymmetrical coupler kicks, which depend on the transverse beam position at the couplers and the rf phase. The resulting emittance dilution has been studied in detail in the literature. However, the kick induced by the FPC depends explicitly on the ratio of the forward to the backward traveling waves at the coupler, which has received little attention. The intention of this paper is to present the concept of discrete coupler kicks with a novel approach of separating the field disturbances related to the standing wave and a reflection dependent part. Particular attention is directed to the role of the penetration depth of the FPC antenna, which determines the loaded quality factor of the cavity. The developed beam transport model is compared to dedicated experiments at FLASH and European XFEL. Both the observed transverse coupling and detuning related coupler kick variations are in good agreement with the model. Finally, the expected trajectory variations due to coupler kick variations at European XFEL are investigated and results of numerical studies are presented.

  8. High-Performance Parallel Analysis of Coupled Problems for Aircraft Propulsion

    NASA Technical Reports Server (NTRS)

    Felippa, C. A.; Farhat, C.; Park, K. C.; Gumaste, U.; Chen, P.-S.; Lesoinne, M.; Stern, P.

    1996-01-01

    This research program dealt with the application of high-performance computing methods to the numerical simulation of complete jet engines. The program was initiated in January 1993 by applying two-dimensional parallel aeroelastic codes to the interior gas flow problem of a bypass jet engine. The fluid mesh generation, domain decomposition and solution capabilities were successfully tested. Attention was then focused on methodology for the partitioned analysis of the interaction of the gas flow with a flexible structure and with the fluid mesh motion driven by these structural displacements. The latter is treated by an ALE technique that models the fluid mesh motion as that of a fictitious mechanical network laid along the edges of near-field fluid elements. New partitioned analysis procedures to treat this coupled three-component problem were developed during 1994 and 1995. These procedures involved delayed corrections and subcycling, and have been successfully tested on several massively parallel computers, including the iPSC-860, Paragon XP/S and the IBM SP2. For the global steady-state axisymmetric analysis of a complete engine we have decided to use the NASA-sponsored ENG10 program, which uses a regular FV-multiblock-grid discretization in conjunction with circumferential averaging to include effects of blade forces, loss, combustor heat addition, blockage, bleeds and convective mixing. A load-balancing preprocessor for parallel versions of ENG10 was developed. During 1995 and 1996 we developed the capability for the first full 3D aeroelastic simulation of a multirow engine stage. This capability was tested on the IBM SP2 parallel supercomputer at NASA Ames. Benchmark results were presented at the 1996 Computational Aeroscience meeting.

  9. Reconfigurable nanoscale spin-wave directional coupler

    PubMed Central

    Wang, Qi; Pirro, Philipp; Verba, Roman; Slavin, Andrei; Hillebrands, Burkard; Chumak, Andrii V.

    2018-01-01

    Spin waves, and their quanta magnons, are prospective data carriers in future signal processing systems because Gilbert damping associated with the spin-wave propagation can be made substantially lower than the Joule heat losses in electronic devices. Although individual spin-wave signal processing devices have been successfully developed, the challenging contemporary problem is the formation of two-dimensional planar integrated spin-wave circuits. Using both micromagnetic modeling and analytical theory, we present an effective solution of this problem based on the dipolar interaction between two laterally adjacent nanoscale spin-wave waveguides. The developed device based on this principle can work as a multifunctional and dynamically reconfigurable signal directional coupler performing the functions of a waveguide crossing element, tunable power splitter, frequency separator, or multiplexer. The proposed design of a spin-wave directional coupler can be used both in digital logic circuits intended for spin-wave computing and in analog microwave signal processing devices. PMID:29376117

  10. Reconfigurable nanoscale spin-wave directional coupler.

    PubMed

    Wang, Qi; Pirro, Philipp; Verba, Roman; Slavin, Andrei; Hillebrands, Burkard; Chumak, Andrii V

    2018-01-01

    Spin waves, and their quanta magnons, are prospective data carriers in future signal processing systems because Gilbert damping associated with the spin-wave propagation can be made substantially lower than the Joule heat losses in electronic devices. Although individual spin-wave signal processing devices have been successfully developed, the challenging contemporary problem is the formation of two-dimensional planar integrated spin-wave circuits. Using both micromagnetic modeling and analytical theory, we present an effective solution of this problem based on the dipolar interaction between two laterally adjacent nanoscale spin-wave waveguides. The developed device based on this principle can work as a multifunctional and dynamically reconfigurable signal directional coupler performing the functions of a waveguide crossing element, tunable power splitter, frequency separator, or multiplexer. The proposed design of a spin-wave directional coupler can be used both in digital logic circuits intended for spin-wave computing and in analog microwave signal processing devices.

  11. Tissue Viscoelasticity Imaging Using Vibration and Ultrasound Coupler Gel

    NASA Astrophysics Data System (ADS)

    Yamakawa, Makoto; Shiina, Tsuyoshi

    2012-07-01

    In tissue diagnosis, both elasticity and viscosity are important indexes. Therefore, we propose a method for evaluating tissue viscoelasticity by applying vibration that is usually performed in elastography and using an ultrasound coupler gel with known viscoelasticity. In this method, we use three viscoelasticity parameters based on the coupler strain and tissue strain: the strain ratio as an elasticity parameter, and the phase difference and the normalized hysteresis loop area as viscosity parameters. In the agar phantom experiment, using these viscoelasticity parameters, we were able to estimate the viscoelasticity distribution of the phantom. In particular, the strain ratio and the phase difference were robust to strain estimation error.

  12. (abstract) The Design of a Benign Fail-safe Mechanism Using a Low-melting-point Metal Alloy Coupler

    NASA Technical Reports Server (NTRS)

    Blomquist, Richard S.

    1995-01-01

    Because the alpha proton X-ray spectrometer (APXS) sensor head on the Mars Pathfinder rover, Sojourner, is placed on Martian soil by the deployment mechanism (ADM), the rover would be crippled if the actuator fails when the mechanism is in its deployed position, as rover ground clearance is then reduced to zero. This paper describes the unique fail-safe mounted on the ADM, especially the use of a low-temperature-melting alloy as a coupler device. The final form of the design is a low-melting-point metal pellet coupler, made from Cerrobend, in parallel with a Negator spring pack. In its solid state, the metal rigidly connects the driver (the actuator) and the driven part (the mechanism). When commanded, a strip heater wrapped around the coupler melts the metal pellet (at 60 °C), allowing the driven part to turn independently of the driver. The Negator spring retracts the mechanism to its fully stowed position. This concept meets all the design criteria, and provides an added benefit: when the metal hardens, the coupler once again rigidly connects the actuator and the mechanism. The concept presented here can easily be applied to other applications. Anywhere release devices are needed, low-melting-point couplers can be considered. The issues to be concerned with are thermal isolation, proper setting of the parts before actuation, and possible outgassing. However, when these issues are overcome, the resulting release mechanism promises to be the lightest, simplest, and most power-conserving alternative available.

  13. Simplified flangeless unisex waveguide coupler assembly

    DOEpatents

    Michelangelo, Dimartino; Moeller, Charles P.

    1993-01-01

    A unisex coupler assembly is disclosed capable of providing a leak tight coupling for waveguides with axial alignment of the waveguides and rotational capability. The sealing means of the coupler assembly are not exposed to RF energy, and the coupler assembly does not require the provision of external flanges on the waveguides. In a preferred embodiment, O ring seals are not used and the coupler assembly is, therefore, bakeable at a temperature up to about 150 °C. The coupler assembly comprises a split collar which clamps around the waveguides and a second collar which fastens to the split collar. The split collar contains an inner annular groove. Each of the waveguides is provided with an external annular groove which receives a retaining ring. The split collar is clamped around one of the waveguides with the inner annular groove of the split collar engaging the retaining ring carried in the external annular groove in the waveguide. The second collar is then slipped over the second waveguide behind the annular groove and retaining ring therein and the second collar is coaxially secured by fastening means to the split collar to draw the respective waveguides together by coaxial force exerted by the second collar against the retaining ring on the second waveguide. A sealing ring is placed against an external sealing surface at a reduced external diameter end formed on one waveguide to sealingly engage a corresponding sealing surface on the other waveguide as the waveguides are urged toward each other.

  14. Simplified flangeless unisex waveguide coupler assembly

    DOEpatents

    Michelangelo, D.; Moeller, C.P.

    1993-05-04

    A unisex coupler assembly is disclosed capable of providing a leak tight coupling for waveguides with axial alignment of the waveguides and rotational capability. The sealing means of the coupler assembly are not exposed to RF energy, and the coupler assembly does not require the provision of external flanges on the waveguides. In a preferred embodiment, O ring seals are not used and the coupler assembly is, therefore, bakeable at a temperature up to about 150 °C. The coupler assembly comprises a split collar which clamps around the waveguides and a second collar which fastens to the split collar. The split collar contains an inner annular groove. Each of the waveguides is provided with an external annular groove which receives a retaining ring. The split collar is clamped around one of the waveguides with the inner annular groove of the split collar engaging the retaining ring carried in the external annular groove in the waveguide. The second collar is then slipped over the second waveguide behind the annular groove and retaining ring therein and the second collar is coaxially secured by fastening means to the split collar to draw the respective waveguides together by coaxial force exerted by the second collar against the retaining ring on the second waveguide. A sealing ring is placed against an external sealing surface at a reduced external diameter end formed on one waveguide to sealingly engage a corresponding sealing surface on the other waveguide as the waveguides are urged toward each other.

  15. Simplified flangeless unisex waveguide coupler assembly

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Michelangelo, D.; Moeller, C.P.

    1993-05-04

    A unisex coupler assembly is disclosed capable of providing a leak tight coupling for waveguides with axial alignment of the waveguides and rotational capability. The sealing means of the coupler assembly are not exposed to RF energy, and the coupler assembly does not require the provision of external flanges on the waveguides. In a preferred embodiment, O ring seals are not used and the coupler assembly is, therefore, bakeable at a temperature up to about 150 C. The coupler assembly comprises a split collar which clamps around the waveguides and a second collar which fastens to the split collar. The split collar contains an inner annular groove. Each of the waveguides is provided with an external annular groove which receives a retaining ring. The split collar is clamped around one of the waveguides with the inner annular groove of the split collar engaging the retaining ring carried in the external annular groove in the waveguide. The second collar is then slipped over the second waveguide behind the annular groove and retaining ring therein and the second collar is coaxially secured by fastening means to the split collar to draw the respective waveguides together by coaxial force exerted by the second collar against the retaining ring on the second waveguide. A sealing ring is placed against an external sealing surface at a reduced external diameter end formed on one waveguide to sealingly engage a corresponding sealing surface on the other waveguide as the waveguides are urged toward each other.

  16. High-performance parallel analysis of coupled problems for aircraft propulsion

    NASA Technical Reports Server (NTRS)

    Felippa, C. A.; Farhat, C.; Chen, P.-S.; Gumaste, U.; Lesoinne, M.; Stern, P.

    1995-01-01

    This research program deals with the application of high-performance computing methods to the numerical simulation of complete jet engines. The program was initiated in 1993 by applying two-dimensional parallel aeroelastic codes to the interior gas flow problem of a by-pass jet engine. The fluid mesh generation, domain decomposition and solution capabilities were successfully tested. Attention was then focused on methodology for the partitioned analysis of the interaction of the gas flow with a flexible structure and with the fluid mesh motion driven by these structural displacements. The latter is treated by an ALE technique that models the fluid mesh motion as that of a fictitious mechanical network laid along the edges of near-field fluid elements. New partitioned analysis procedures to treat this coupled 3-component problem were developed in 1994. These procedures involved delayed corrections and subcycling, and have been successfully tested on several massively parallel computers. For the global steady-state axisymmetric analysis of a complete engine we have decided to use the NASA-sponsored ENG10 program, which uses a regular FV-multiblock-grid discretization in conjunction with circumferential averaging to include effects of blade forces, loss, combustor heat addition, blockage, bleeds and convective mixing. A load-balancing preprocessor for parallel versions of ENG10 has been developed. It is planned to use the steady-state global solution provided by ENG10 as input to a localized three-dimensional FSI analysis for engine regions where aeroelastic effects may be important.

  17. Two-channel highly sensitive sensors based on 4 × 4 multimode interference couplers

    NASA Astrophysics Data System (ADS)

    Le, Trung-Thanh

    2017-12-01

    We propose a new kind of microring resonator (MRR) based on 4 × 4 multimode interference (MMI) couplers for multichannel and highly sensitive chemical and biological sensors. The proposed sensor structure has advantages of compactness and high sensitivity compared with the reported sensing structures. By using the transfer matrix method (TMM) and numerical simulations, the designs of the sensor based on silicon waveguides are optimized and demonstrated in detail. We apply our structure to detect glucose and ethanol concentrations simultaneously. A high sensitivity of 9000 nm/RIU and a detection limit of 2 × 10⁻⁴ for glucose sensing, and a sensitivity of 6000 nm/RIU and a detection limit of 1.3 × 10⁻⁵ for ethanol sensing, are achieved.

  18. Wireless power transfer magnetic couplers

    DOEpatents

    Wu, Hunter; Gilchrist, Aaron; Sealy, Kylee

    2016-01-19

    A magnetic coupler is disclosed for wireless power transfer systems. A ferrimagnetic component is capable of guiding a magnetic field. A wire coil is wrapped around at least a portion of the ferrimagnetic component. A screen is capable of blocking leakage magnetic fields. The screen may be positioned to cover at least one side of the ferrimagnetic component and the coil. A distance across the screen may be at least six times an air gap distance between the ferrimagnetic component and a receiving magnetic coupler.

  19. Unidirectional complex grating assisted couplers

    NASA Astrophysics Data System (ADS)

    Greenberg, Maxim; Orenstein, Meir

    2004-08-01

    We present a novel concept which enables the realization of unidirectional and irreversible grating assisted couplers by using a gain-loss modulated medium to eliminate the reversibility. Employing a matched periodic modulation of both refractive index and loss (gain), we achieve a unidirectional energy transfer between the modes of the coupler, which translates to light transmission from one waveguide to another while disabling the inverse transmission. The importance of self coupling coefficients is explored as well, and a feasible implementation, where the real and imaginary perturbations are implemented in different waveguides, is presented.

  20. Microstructure analysis in the coupling region of fiber coupler with a novel electrical micro-heater

    NASA Astrophysics Data System (ADS)

    Shuai, Cijun; Gao, Chengde; Nie, Yi; Hu, Huanlong; Peng, Shuping

    2011-12-01

    Fused-tapered fiber couplers are widely used in optical-fiber communication, optical-fiber sensors and optical signal processing. Their optical performance is mainly determined by the glass properties in the coupling region. In this study, the effect of the fused biconical taper (FBT) process on the glass microstructure of a fiber coupler was investigated by testing the microstructure of the cross-section of the coupling region. The fiber coupler is fabricated with a novel home-designed electrical heater. Our experimental results show that the boundary between fiber core and fiber cladding becomes vague or indistinct after FBT under transmission electron microscopy (TEM) and that Ge²⁺ in the fiber core diffuses into the fiber cladding. Crystallizations are observed in the coupling region under scanning electron microscope (SEM) and microscopic infrared (IR), and the micro crystallizations become smaller as the drawing speed increases. The wave number of the fiberglass increases after FBT and is proportional to the drawing speed. The analysis of the microstructure in the coupling region explores the mechanism behind the improvement in the performance of fiber couplers and can be used to guide the fabrication process.

  1. High-efficiency fiber-to-chip grating couplers realized using an advanced CMOS-compatible silicon-on-insulator platform.

    PubMed

    Vermeulen, D; Selvaraja, S; Verheyen, P; Lepage, G; Bogaerts, W; Absil, P; Van Thourhout, D; Roelkens, G

    2010-08-16

    A new generation of Silicon-on-Insulator fiber-to-chip grating couplers which use a silicon overlay to enhance the directionality and thereby the coupling efficiency is presented. Devices are realized on a 200 mm wafer in a CMOS pilot line. The fabricated fiber couplers show a coupling efficiency of -1.6 dB and a 3 dB bandwidth of 80 nm.
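
    The coupling efficiency above is quoted in decibels. As a reading aid, the short Python sketch below converts between decibels and linear power fractions using the standard definition dB = 10·log10(P_out/P_in). The function names are illustrative and do not come from the cited paper.

```python
import math


def db_to_fraction(db: float) -> float:
    """Convert a power ratio in decibels to a linear fraction."""
    return 10.0 ** (db / 10.0)


def fraction_to_db(fraction: float) -> float:
    """Convert a linear power fraction to decibels."""
    return 10.0 * math.log10(fraction)


if __name__ == "__main__":
    # Coupling efficiency reported in the abstract, as a linear fraction.
    efficiency = db_to_fraction(-1.6)
    print(f"-1.6 dB corresponds to {efficiency:.1%} of the input power")
    # A "3 dB bandwidth" is bounded by the points at roughly half the peak power.
    print(f"Half power corresponds to {fraction_to_db(0.5):.2f} dB")
```

    On this definition, -1.6 dB is roughly 69% of the input power coupled, and the 3 dB bandwidth is the wavelength span over which the efficiency stays within about a factor of two of its peak.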

  2. Flexible polymeric rib waveguide with self-align couplers system

    PubMed Central

    Huang, Cheng-Sheng; Wang, Wei-Chih

    2011-01-01

    The authors report a polymer-based rib waveguide with a U-shaped self-aligning fiber coupler system, made using a simple micromolding process with SU8 as the molding material and polydimethylsiloxane as the waveguide material. The material is used for its good optical transparency, low surface tension, biocompatibility, and durability. Furthermore, the material is highly formable. This unique molding technique provides a means of keeping the material and manufacturing costs to a minimum. The self-aligning fiber coupler system also provides a fast and simple means of light coupling. The flexible nature of the waveguide material makes this process ideal for a potential wearable optical sensor. PMID:22171151

  3. Inband radar cross section of phased arrays with parallel feeds

    NASA Astrophysics Data System (ADS)

    Flokas, Vassilios

    1994-06-01

    Approximate formulas for the inband radar cross section of arrays with parallel feeds are presented. To obtain the formulas, multiple reflections are neglected, and devices of the same type are assumed to have identical electrical performance. The approximate results were compared to the results obtained using a scattering matrix formulation. Both methods were in agreement in predicting RCS lobe positions, levels, and behavior with scanning. The advantages of the approximate method are its computational efficiency and its flexibility in handling an arbitrary number of coupler levels.

  4. Coupler for remote manipulators

    NASA Technical Reports Server (NTRS)

    Rudmann, A. A.

    1980-01-01

    Reliable, low-cost coupler aligns and grasps moving and rotating objects. The coupling mechanism may be used in handling radioactive materials, in underwater exploration, and in other remote-manipulator applications.

  5. Fiber-optic couplers as displacement sensors

    NASA Astrophysics Data System (ADS)

    Baruch, Martin C.; Gerdt, David W.; Adkins, Charles M.

    2003-04-01

    We introduce the novel concept of using a fiber-optic coupler as a versatile displacement sensor. Comparatively long fiber-optic couplers, with a coupling region of approximately 10 mm, are manufactured using standard communication SM fiber and placed in a looped-back configuration. The result is a displacement sensor, which is robust and highly sensitive over a wide dynamic range. This displacement sensor resolves 1-2 μm over distances of 1-1.5 mm and is characterized by the essential absence of a 'spring constant' plaguing other strain gauge-type sensors. Consequently, it is possible to couple to extremely weak vibrations, such as the skin displacement affected by arterial heart beat pulsations. Used as a wrist-worn heartbeat monitor, the fidelity of the arterial pulse signal has been shown to be so high that it is possible to not only determine heartbeat and breathing rates, but to implement a new single-point blood pressure measurement scheme which does not squeeze the arm. In an application as a floor vibration sensor for the non-intrusive monitoring of independently living elderly, the sensor has been shown to resolve the distinct vibration spectra of different persons and different events.

  6. Performance of the Galley Parallel File System

    NASA Technical Reports Server (NTRS)

    Nieuwejaar, Nils; Kotz, David

    1996-01-01

    As the input/output (I/O) needs of parallel scientific applications increase, file systems for multiprocessors are being designed to provide applications with parallel access to multiple disks. Many parallel file systems present applications with a conventional Unix-like interface that allows the application to access multiple disks transparently. This interface conceals the parallelism within the file system, which increases the ease of programmability, but makes it difficult or impossible for sophisticated programmers and libraries to use knowledge about their I/O needs to exploit that parallelism. Furthermore, most current parallel file systems are optimized for a different workload than they are being asked to support. We introduce Galley, a new parallel file system that is intended to efficiently support realistic parallel workloads. Initial experiments, reported in this paper, indicate that Galley is capable of providing high-performance I/O to applications that access data in patterns that have been observed to be common.

  7. Memory Benchmarks for SMP-Based High Performance Parallel Computers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yoo, A B; de Supinski, B; Mueller, F

    2001-11-20

    As the speed gap between CPU and main memory continues to grow, memory accesses increasingly dominate the performance of many applications. The problem is particularly acute for symmetric multiprocessor (SMP) systems, where the shared memory may be accessed concurrently by a group of threads running on separate CPUs. Unfortunately, several key issues governing memory system performance in current systems are not well understood. Complex interactions between the levels of the memory hierarchy, buses or switches, DRAM back-ends, system software, and application access patterns can make it difficult to pinpoint bottlenecks and determine appropriate optimizations, and the situation is even more complex for SMP systems. To partially address this problem, we formulated a set of multi-threaded microbenchmarks for characterizing and measuring the performance of the underlying memory system in SMP-based high-performance computers. We report our use of these microbenchmarks on two important SMP-based machines. This paper has four primary contributions. First, we introduce a microbenchmark suite to systematically assess and compare the performance of different levels in SMP memory hierarchies. Second, we present a new tool based on hardware performance monitors to determine a wide array of memory system characteristics, such as cache sizes, quickly and easily; by using this tool, memory performance studies can be targeted to the full spectrum of performance regimes with many fewer data points than is otherwise required. Third, we present experimental results indicating that the performance of applications with large memory footprints remains largely constrained by memory. Fourth, we demonstrate that thread-level parallelism further degrades memory performance, even for the latest SMPs with hardware prefetching and switch-based memory interconnects.

  8. Design of high-performance parallelized gene predictors in MATLAB.

    PubMed

    Rivard, Sylvain Robert; Mailloux, Jean-Gabriel; Beguenane, Rachid; Bui, Hung Tien

    2012-04-10

    This paper proposes a method of implementing parallel gene prediction algorithms in MATLAB. The proposed designs are based on either Goertzel's algorithm or on FFTs and have been implemented using varying amounts of parallelism on a central processing unit (CPU) and on a graphics processing unit (GPU). Results show that an implementation using a straightforward approach can require over 4.5 h to process 15 million base pairs (bps) whereas a properly designed one could perform the same task in less than five minutes. In the best case, a GPU implementation can yield these results in 57 s. The present work shows how parallelism can be used in MATLAB for gene prediction in very large DNA sequences to produce results that are over 270 times faster than a conventional approach. This is significant as MATLAB is typically overlooked due to its apparent slow processing time even though it offers a convenient environment for bioinformatics. From a practical standpoint, this work proposes two strategies for accelerating genome data processing which rely on different parallelization mechanisms. Using a CPU, the work shows that direct access to the MEX function increases execution speed and that the PARFOR construct should be used in order to take full advantage of the parallelizable Goertzel implementation. When the target is a GPU, the work shows that data needs to be segmented into manageable sizes within the GFOR construct before processing in order to minimize execution time.
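
    The abstract names Goertzel's algorithm but does not describe how it is applied. The sketch below is a minimal, sequential Python illustration, not the authors' MATLAB code. It assumes the common "period-3" approach to DFT-based gene prediction: each window of DNA is turned into four binary indicator sequences (one per base), and the spectral power at bin N/3 is summed across them. The window handling and function names are hypothetical.

```python
import math


def goertzel_power(samples, k):
    """Return |X[k]|^2 for the length-N DFT of `samples` via Goertzel's recurrence."""
    n = len(samples)
    coeff = 2.0 * math.cos(2.0 * math.pi * k / n)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        # Second-order recurrence: s[i] = x[i] + coeff * s[i-1] - s[i-2]
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2


def period3_power(window):
    """Sum the power at DFT bin N/3 over the four base-indicator sequences.

    Assumes len(window) is a multiple of 3 so that bin N/3 is an integer.
    """
    n = len(window)
    if n == 0 or n % 3 != 0:
        raise ValueError("window length must be a positive multiple of 3")
    k = n // 3
    total = 0.0
    for base in "ACGT":
        indicator = [1.0 if b == base else 0.0 for b in window]
        total += goertzel_power(indicator, k)
    return total


if __name__ == "__main__":
    print(period3_power("ATG" * 10))  # strongly 3-periodic window
    print(period3_power("A" * 30))    # no 3-periodicity
```

    Goertzel's recurrence evaluates a single DFT bin in O(N) operations per window, which is why it competes with a full FFT when only one frequency is needed. Each window is independent of the others, which is presumably the property that the parallel CPU and GPU implementations described above exploit.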

  9. Comparison of Psychophysical and Physical Measurements of Real Ear to Coupler Differences.

    PubMed

    Koning, Raphael; Wouters, Jan; Francart, Tom

    2015-01-01

    The purpose of the study is to compare real ear to coupler difference (RECD) curves based on physical and psychophysical measures. For the physically measured RECD, the RECD was measured with real ear and coupler measurements for the ear simulator and HA1- and HA2 2-cc couplers. The psychophysically measured RECDs were derived from audiogram measures. RECDs were measured in 19 normally hearing subjects. The coupler measurement was done with the probe microphone and the coupler microphone itself. Psychophysically measured RECDs were derived for all subjects by measuring the audiogram in sound field and with an ER-3A insert phone. Reference data were obtained for the three coupler types. It was possible to derive the RECD curve with psychophysical methods. There was no overall statistical difference between the physically and psychophysically measured RECD curves for the HA2 2-cc coupler and the ear simulator. The standard deviation was, however, much higher for the psychophysically derived RECD, indicating that physically measured RECDs are more precise than psychophysically derived RECDs. For the physical RECD measurements, the coupler microphone should be used for the coupler measurement. Physically measured RECDs were validated on group level by the reliable derivation of the RECD curve from audiogram measures.

  10. Four-Pass Coupler for Laser-Diode-Pumped Solid-State Laser

    NASA Technical Reports Server (NTRS)

    Coyle, Donald B.

    2008-01-01

    A four-pass optical coupler affords increased (in comparison with related prior two-pass optical couplers) utilization of light generated by a laser diode in side pumping of a solid-state laser slab. The original application for which this coupler was conceived involves a neodymium-doped yttrium aluminum garnet (Nd:YAG) crystal slab, which, when pumped by a row of laser diodes at a wavelength of 809 nm, lases at a wavelength of 1,064 nm. Heretofore, typically, a thin laser slab has been pumped in two passes, the second pass occurring by virtue of reflection of pump light from a highly reflective thin film on the side opposite the side through which the pump light enters. In two-pass pumping, a Nd:YAG slab having a thickness of 2 mm (which is typical) absorbs about 84 percent of the 809-nm pump light power, leaving about 16 percent of the pump light power to travel back toward the laser diodes. This unused power can cause localized heating of the laser diodes, thereby reducing their lifetimes. Moreover, if the slab is thinner than 2 mm, then even more unused power travels back toward the laser diodes. The four-pass optical coupler captures most of this unused pump light and sends it back to the laser slab for two more passes. As a result, the slab absorbs more pump light, as though it were twice as thick. The gain and laser cavity beam quality of a smaller laser slab in conjunction with this optical coupler can thus be made comparable to those of a larger two-pass-pumped laser slab.
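
    The absorption figures above can be checked with a short calculation. The Python sketch below assumes simple exponential (Beer-Lambert) absorption and an idealized coupler that returns all of the unabsorbed light. The abstract only says "most", so the four-pass result is an upper bound rather than a reported value. The absorption coefficient is inferred from the 84 percent two-pass figure and is not given in the source.

```python
import math


def absorbed_fraction(alpha_per_mm: float, thickness_mm: float, passes: int) -> float:
    """Fraction of pump power absorbed after `passes` traversals of the slab.

    Assumes Beer-Lambert absorption and no loss at the reflections.
    """
    return 1.0 - math.exp(-alpha_per_mm * thickness_mm * passes)


# Infer the absorption coefficient from the abstract's figure:
# two passes through a 2 mm slab absorb about 84 percent of the pump light.
ALPHA_PER_MM = -math.log(1.0 - 0.84) / (2.0 * 2)

if __name__ == "__main__":
    two_pass = absorbed_fraction(ALPHA_PER_MM, 2.0, 2)
    four_pass = absorbed_fraction(ALPHA_PER_MM, 2.0, 4)
    thin_four_pass = absorbed_fraction(ALPHA_PER_MM, 1.0, 4)
    print(f"2 mm slab, two passes:  {two_pass:.1%}")
    print(f"2 mm slab, four passes: {four_pass:.1%}")
    print(f"1 mm slab, four passes: {thin_four_pass:.1%}")
```

    Under these assumptions, doubling the number of passes squares the unabsorbed fraction (0.16 becomes about 0.026). A 1 mm slab pumped in four passes absorbs as much as a 2 mm slab pumped in two, which is the sense in which the slab behaves "as though it were twice as thick".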

  11. Improvements for the stability of heavy-haul couplers with arc surface contact

    NASA Astrophysics Data System (ADS)

    Wu, Guosong; Wang, Huang; Yao, Yuan

    2018-03-01

    To investigate the stability mechanism of heavy-haul couplers with arc surface contact, geometry and force analyses were conducted according to the friction circle theory. To improve the stability of the coupler, four improvements were proposed: increasing the secondary lateral stiffness of locomotives, adding a restoring bumpstop at the end of the coupler, increasing the arc surface radii, and changing the clearance and stiffness of the secondary lateral stopping block. A multi-body dynamics model with four heavy-haul locomotives and three detailed couplers was established to simulate emergency braking. In addition, the coupler yaw instability was tested to investigate the effects of relevant parameters on the coupler stability. The results show that increasing the secondary lateral stiffness of locomotives, adding a bumpstop with a smaller bumpstop gap, increasing the arc surface radii, and increasing the stiffness and decreasing the clearance of the secondary lateral stopping block are conducive to improving the stability of the coupler with arc surface contact.

  12. Visualizing Parallel Computer System Performance

    NASA Technical Reports Server (NTRS)

    Malony, Allen D.; Reed, Daniel A.

    1988-01-01

    Parallel computer systems are among the most complex of man's creations, making satisfactory performance characterization difficult. Despite this complexity, there are strong, indeed, almost irresistible, incentives to quantify parallel system performance using a single metric. The fallacy lies in succumbing to such temptations. A complete performance characterization requires not only an analysis of the system's constituent levels, it also requires both static and dynamic characterizations. Static or average behavior analysis may mask transients that dramatically alter system performance. Although the human visual system is remarkably adept at interpreting and identifying anomalies in false color data, the importance of dynamic, visual scientific data presentation has only recently been recognized. Large, complex parallel systems pose equally vexing performance interpretation problems. Data from hardware and software performance monitors must be presented in ways that emphasize important events while eliding irrelevant details. Design approaches and tools for performance visualization are the subject of this paper.

  13. The microvascular anastomotic coupler for venous anastomoses in free flap breast reconstruction improves outcomes

    PubMed Central

    Rozen, Warren Matthew; Chowdhry, Muhammad; Patel, Nakul Gamanlal; Chow, Whitney T.H.; Griffiths, Matthew; Ramakrishnan, Venkat V.

    2016-01-01

    Background: Venous couplers are ubiquitous around the world and are a useful tool for the reconstructive microsurgeon. A systematic review of coupler performance studies demonstrated a thrombosis rate range of 0% to 3%, whilst the average time of using the device is 5 minutes. There is sparse published data on cost analysis and the impact of operator experience on the anastomotic coupler device success. Improvements in outcomes other than time benefits have also not been shown. This study aims to address these deficiencies in the literature. Methods: A retrospective clinical study was undertaken, aiming to compare equivalent groups of patients that had free flap surgery with venous micro-anastomoses with those that had sutured anastomoses. The cohort comprised all patients undergoing microsurgical breast reconstruction at the St Andrew’s Centre for Plastic Surgery & Burns from January 2009 to December 2014. Results: Between January 2010 and December 2014, 1,064 patients underwent 1,206 free flap breast reconstructions. The average age of patients was 50 years. Seventy percent of patients underwent mastectomy and immediate reconstruction during this period with the remaining 30% having a delayed reconstruction. The 1,206 free flaps comprised 83 transverse myocutaneous gracilis (TMG) flaps and 1,123 deep inferior epigastric artery perforator (DIEP) flaps. In total the coupler was used in 319 flaps, 26% of the cohort. There was a statistically significant clinical benefit in using the anastomotic coupler for venous anastomosis. Overall, the return to theatre rate was 12.69% whilst the overall flap loss rate was 0.75%. The overall coupler failure rate was significantly less at 1.4% whilst sutured vein failure rate was 3.57% (P=0.001). Conclusions: The anastomotic coupler for venous anastomosis in free flap surgery is associated with reduced operating times, reduced take-backs to theatre and cost benefits. This is the first study to demonstrate clear clinical benefits

  14. Performance Evaluation in Network-Based Parallel Computing

    NASA Technical Reports Server (NTRS)

    Dezhgosha, Kamyar

    1996-01-01

    Network-based parallel computing is emerging as a cost-effective alternative for solving many problems which require use of supercomputers or massively parallel computers. The primary objective of this project has been to conduct experimental research on performance evaluation for clustered parallel computing. First, a testbed was established by augmenting our existing SUNSPARCs' network with PVM (Parallel Virtual Machine), which is a software system for linking clusters of machines. Second, a set of three basic applications was selected. The applications consist of a parallel search, a parallel sort, and a parallel matrix multiplication. These application programs were implemented in the C programming language under PVM. Third, we conducted performance evaluation under various configurations and problem sizes. Alternative parallel computing models and workload allocations for application programs were explored. The performance metric was limited to elapsed time or response time, which in the context of parallel computing can be expressed in terms of speedup. The results reveal that the overhead of communication latency between processes is in many cases the restricting factor for performance. That is, coarse-grain parallelism, which requires less frequent communication between processes, will result in higher performance in network-based computing. Finally, we are in the final stages of installing an Asynchronous Transfer Mode (ATM) switch and four ATM interfaces (each 155 Mbps) which will allow us to extend our study to newer applications, performance metrics, and configurations.
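
    For readers unfamiliar with the metric, the Python sketch below shows the usual definitions of speedup (serial elapsed time divided by parallel elapsed time) and parallel efficiency (speedup divided by the number of processes). The timings in the usage example are hypothetical and do not come from the report.

```python
def speedup(serial_seconds: float, parallel_seconds: float) -> float:
    """Speedup S = T_serial / T_parallel."""
    if serial_seconds <= 0 or parallel_seconds <= 0:
        raise ValueError("elapsed times must be positive")
    return serial_seconds / parallel_seconds


def efficiency(serial_seconds: float, parallel_seconds: float, processes: int) -> float:
    """Parallel efficiency E = S / p; a value of 1.0 means ideal linear scaling."""
    if processes < 1:
        raise ValueError("processes must be at least 1")
    return speedup(serial_seconds, parallel_seconds) / processes


if __name__ == "__main__":
    # Hypothetical elapsed times for one job on 1 and on 8 workstations.
    t_serial, t_parallel, workers = 100.0, 25.0, 8
    print(f"speedup:    {speedup(t_serial, t_parallel):.2f}")
    print(f"efficiency: {efficiency(t_serial, t_parallel, workers):.2f}")
```

    An efficiency well below 1 on a cluster of workstations is consistent with the report's observation that communication latency between processes is often the limiting factor.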

  15. Venous coupler use for free-flap breast reconstructions: specific analyses of TMG and DIEP flaps.

    PubMed

    Bodin, Frédéric; Brunetti, Stefania; Dissaux, Caroline; Erik, A Sauleau; Facca, Sybille; Bruant-Rodier, Catherine; Liverneaux, Philippe

    2015-05-01

    The purpose of this report was to present the results of comparisons of anastomotic data and flap complications in the use of venous coupler in breast reconstruction with the transverse musculocutaneous gracilis (TMG) flap and the deep inferior epigastric perforator (DIEP) flap. Over a three-year period, 95 patients suffering from breast cancer were treated with mastectomy and breast reconstruction using free flaps. We performed 121 mechanical venous anastomoses for 105 flap procedures (80 DIEP and 25 TMG). The coupler size, anastomotic duration, number of anastomoses and postoperative complications were assessed for the entire series. The coupling device was perfectly suitable for all end-to-end anastomoses between the vein(s) of the flap and the internal mammary vein(s). No venous thrombosis occurred. The mean anastomotic time did not significantly differ between the DIEP (330 seconds) and TMG flap procedures (352 seconds) (P = 0.069). Additionally, there were no differences in coupling time observed following a comparison of seven coupler sizes (P = 0.066). The mean coupler size used during the TMG flap procedure was smaller than that used with the DIEP (2.4 mm versus 2.8 mm) (P < 0.001). The mean size was also smaller when double venous anastomoses were required compared to single anastomosis (2.4 mm versus 2.9 mm) (P < 0.001). The double branching was more frequent with the TMG flap (28%) than with the DIEP flap (11%). The coupler size used was smaller for the TMG procedure and when double venous anastomosis was performed. Additionally, anastomotic time was not affected by the flap type or coupler size used or by anastomosis number. © 2014 Wiley Periodicals, Inc.

  16. Switching of transmission resonances in a two-channels coupler: A Boundary Wall Method scattering study

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nunes, A.; Zanetti, F.M.; Lyra, M.L., E-mail: marcelo@fis.ufal.br

    2016-10-15

    In this work, we study the transmission characteristics of a two-channel coupler model system using the Boundary Wall Method (BWM) to determine the solution of the corresponding scattering problem of an incident plane wave. We show that the BWM provides detailed information regarding the transmission resonances. In particular, we focus on the case of single channel input aiming to explore the energy switching performance of the coupler. We show that the coupler geometry can be tailored to allow for the first transmission resonances to be predominantly transmitted on specific output channels, an important characteristic for the realization of logical operations. Highlights: • The switching performance of a coupled waveguide device is studied via the boundary wall method. • The method efficiently identifies all resonant transmission modes. • Energy switching is controlled and optimized as a function of the device geometry.

  17. The high performance parallel algorithm for Unified Gas-Kinetic Scheme

    NASA Astrophysics Data System (ADS)

    Li, Shiyi; Li, Qibing; Fu, Song; Xu, Jinxiu

    2016-11-01

    A high performance parallel algorithm for UGKS is developed to simulate three-dimensional internal and external flows on arbitrary grid systems. The physical domain and velocity domain are divided into different blocks and distributed according to the two-dimensional Cartesian topology with intra-communicators in physical domain for data exchange and other intra-communicators in velocity domain for sum reduction to moment integrals. Numerical results of three-dimensional cavity flow and flow past a sphere agree well with the results from the existing studies and validate the applicability of the algorithm. The scalability of the algorithm is tested on both small (1-16) and large (729-5832) numbers of processors. The tested speed-up ratio is near linear and thus the efficiency is around 1, which reveals the good scalability of the present algorithm.

  18. GPU based cloud system for high-performance arrhythmia detection with parallel k-NN algorithm.

    PubMed

    Tae Joon Jun; Hyun Ji Park; Hyuk Yoo; Young-Hak Kim; Daeyoung Kim

    2016-08-01

    In this paper, we propose a GPU-based cloud system for high-performance arrhythmia detection. The Pan-Tompkins algorithm is used for QRS detection, and we optimized the beat classification algorithm with K-Nearest Neighbor (K-NN). To support high-performance beat classification on the system, we parallelized the beat classification algorithm with CUDA to execute it on virtualized GPU devices on the cloud system. The MIT-BIH Arrhythmia database is used for validation of the algorithm. The system achieved a detection rate of about 93.5%, which is comparable to previous research, while our algorithm executes 2.5 times faster than a CPU-only detection algorithm.

  19. High-energy physics software parallelization using database techniques

    NASA Astrophysics Data System (ADS)

    Argante, E.; van der Stok, P. D. V.; Willers, I.

    1997-02-01

    A programming model for software parallelization, called CoCa, is introduced that copes with problems caused by typical features of high-energy physics software. By basing CoCa on the database transaction paradigm, the complexity induced by the parallelization is for a large part transparent to the programmer, resulting in a higher level of abstraction than the native message passing software. CoCa is implemented on a Meiko CS-2 and on a SUN SPARCcenter 2000 parallel computer. On the CS-2, the performance is comparable with the performance of native PVM and MPI.

  20. Evaluation of the Repeatability and Accuracy of the Wideband Real-Ear-to-Coupler Difference.

    PubMed

    Vaisberg, Jonathan M; Folkeard, Paula; Pumford, John; Narten, Philipp; Scollie, Susan

    2018-06-01

    The real-ear-to-coupler difference (RECD) is an ANSI standardized method for estimating ear canal sound pressure level (SPL) thresholds and assisting in the prediction of real-ear aided responses. It measures the difference in dB between the SPL produced in the ear canal and the SPL produced in an HA-1 2-cc coupler by the same sound source. Recent evidence demonstrates that extended high-frequency bandwidth, beyond the hearing aid bandwidth typically measured, is capable of providing additional clinical benefit. The industry has, in turn, moved toward developing hearing aids and verification equipment capable of producing and measuring extended high-frequency audible output. As a result, a revised RECD procedure conducted using a smaller, 0.4-cc coupler, known as the wideband-RECD (wRECD), has been introduced to facilitate extended high-frequency coupler-based measurements up to 12.5 kHz. This study aimed to (1) compare test-retest repeatability between the RECD and wRECD and (2) measure absolute agreement between the RECD and wRECD when both are referenced to a common coupler. RECDs and wRECDs were measured bilaterally in adult ears by calculating the dB difference in SPL between the ear canal and coupler responses. Real-ear probe microphone measures were completed twice per ear per participant for both foam-tip and customized earmold couplings using the Audioscan Verifit 1 and Verifit 2 fitting systems, followed by measurements in the respective couplers. Twenty-one adults (mean age = 67 yr, range = 19-78) with typical aural anatomy (as determined by measures of impedance and otoscopy) participated in this study, leading to a sample size of 42 ears. Repeatability within RECD and wRECD was assessed for each coupling configuration using a repeated-measures analysis of variance (ANOVA) with test-retest and frequency as within-participants factors. Repeatability between the RECD and wRECD was assessed within each configuration using a repeated-measures ANOVA with

  1. Bi-wavelength two dimensional chirped grating couplers for low cost WDM PON transceivers

    NASA Astrophysics Data System (ADS)

    Xu, Lin; Chen, Xia; Li, Chao; Tsang, Hon Ki

    2011-04-01

    We propose and demonstrate a bi-wavelength two dimensional (2D) waveguide grating coupler on silicon-on-insulator which has efficient coupling of optical light with two-wavelength bands independently between standard optical single mode fibers and nanophotonic waveguides. The details of design are described and the measurement results as well as system performance are experimentally characterized. The bi-wavelength grating coupler can be used as wavelength-division-multiplexing (WDM) splitter/combiner for monolithically silicon integrated transceivers, potentially meeting the low cost requirements for future WDM passive optical network (PON).

  2. Surface acoustic waves voltage controlled directional coupler

    NASA Astrophysics Data System (ADS)

    Golan, G.; Griffel, G.; Yanilov, E.; Ruschin, S.; Seidman, A.; Croitoru, N.

    1988-10-01

    An important condition for the development of surface wave integrated-acoustic devices is the ability to guide and control the propagation of the acoustic energy. This can be implemented by deposition of metallic "loading" channels on an anisotropic piezoelectric substrate. Deposition of such two parallel channels causes an effective coupling of acoustic energy from one channel to the other. A basic requirement for this coupling effect is the existence of the two basic modes: a symmetrical and a nonsymmetrical one. A mode map that shows the number of sustained modes as a function of the device parameters (i.e., channel width; distance between channels; material velocity; and acoustical exciting frequency) is presented. This kind of map can help significantly in the design process of such a device. In this paper we devise an advanced acoustical "Y" coupler with the ability to control its effective coupling by an externally applied voltage, thereby causing modulation of the output intensities of the signals.

  3. NAS Parallel Benchmark. Results 11-96: Performance Comparison of HPF and MPI Based NAS Parallel Benchmarks. 1.0

    NASA Technical Reports Server (NTRS)

    Saini, Subash; Bailey, David; Chancellor, Marisa K. (Technical Monitor)

    1997-01-01

    High Performance Fortran (HPF), the high-level language for parallel Fortran programming, is based on Fortran 90. HPF was defined by an informal standards committee known as the High Performance Fortran Forum (HPFF) in 1993, and modeled on TMC's CM Fortran language. Several HPF features have since been incorporated into the draft ANSI/ISO Fortran 95, the next formal revision of the Fortran standard. HPF allows users to write a single parallel program that can execute on a serial machine, a shared-memory parallel machine, or a distributed-memory parallel machine. HPF eliminates the complex, error-prone task of explicitly specifying how, where, and when to pass messages between processors on distributed-memory machines, or when to synchronize processors on shared-memory machines. HPF is designed in a way that allows the programmer to code an application at a high level, and then selectively optimize portions of the code by dropping into message-passing or calling tuned library routines as 'extrinsics'. Compilers supporting High Performance Fortran features first appeared in late 1994 and early 1995 from Applied Parallel Research (APR), Digital Equipment Corporation, and The Portland Group (PGI). IBM introduced an HPF compiler for the IBM RS/6000 SP/2 in April of 1996. Over the past two years, these implementations have shown steady improvement in terms of both features and performance. The performance of various hardware/programming model (HPF and MPI (message passing interface)) combinations will be compared, based on latest NAS (NASA Advanced Supercomputing) Parallel Benchmark (NPB) results, thus providing a cross-machine and cross-model comparison. Specifically, HPF based NPB results will be compared with MPI based NPB results to provide perspective on performance currently obtainable using HPF versus MPI or versus hand-tuned implementations such as those supplied by the hardware vendors. In addition we would also present NPB (Version 1.0) performance results for

  4. Incident polarization angle and temperature dependence of polarization and spectral response characteristics in optical fiber couplers.

    PubMed

    Namihira, Y; Kawazawa, T; Wakabayashi, H

    1991-03-20

    The incident polarization angle and temperature dependence of the polarization and spectral response characteristics of three different types of fiber coupler are presented. The couplers are (1) the biconical-fused-twisted-taper single-mode fiber (coupler A), (2) the asymmetric-etched-fused-taper wavelength division multiplex (coupler B), and (3) the biconical-polished polarization maintaining fiber (coupler C), respectively. It is confirmed experimentally that the polarization characteristics of couplers A and B vary greatly with temperature, but those of coupler C are independent of temperature. Also, the wavelength dependence characteristics of the power splitting ratio of couplers B and C have almost no change with temperature. However, the wavelength dependence of coupler A is greatly changed with temperature. Comparing couplers A and B, it is postulated that the sinusoidal variations of the polarization state vs the incident polarization angle are due to the stress birefringence caused by the fiber twisting when the fused fiber coupler is fabricated and packaged.

  5. DIRECTIONAL COUPLERS

    DOEpatents

    Nigg, D.J.

    1961-12-01

    A directional coupler of small size is designed. Stripline conductors of non-rectilinear configuration, separated from each other by a thin dielectric spacer, cross each other at least at two locations at right angles, thus providing practically pure capacitive coupling which substantially eliminates undesirable inductive coupling. The conductors are sandwiched between a pair of ground planes. The coupling factor is dependent only on the thickness and dielectric constant of the dielectric spacer at the point of conductor crossover. (AEC)

  6. InGaN directional coupler made with a one-step etching technique

    NASA Astrophysics Data System (ADS)

    Gao, Xumin; Yuan, Jialei; Yang, Yongchao; Zhang, Shuai; Shi, Zheng; Li, Xin; Wang, Yongjin

    2017-06-01

    We propose, fabricate and characterize an on-chip integration of light source, InGaN waveguide, directional coupler and photodiode, in which AlGaN layers are used as top and bottom optical claddings to form an InGaN waveguide for guiding the in-plane emitted light from the InGaN/GaN multiple-quantum-well light-emitting diode (MQW-LED). The difference in etch rate caused by different exposure windows leads to an etching depth discrepancy using the one-step etching technique, which forms the InGaN directional coupler with the overlapped underlying slab. Light propagation results directly confirm effective light coupling in the InGaN directional coupler, which is achieved through high-order guided modes. The InGaN waveguide couples the modulated light from the InGaN/GaN MQW-LED and transfers part of light to the coupled waveguide via the InGaN directional coupler. The in-plane InGaN/GaN MQW-photodiode absorbs the guided light by the coupled InGaN waveguide and induces the photocurrent. The on-chip InGaN photonic integration experimentally demonstrates an in-plane light communication with a data transmission of 50 Mbps.

  7. Parallel STEPS: Large Scale Stochastic Spatial Reaction-Diffusion Simulation with High Performance Computers.

    PubMed

    Chen, Weiliang; De Schutter, Erik

    2017-01-01

    Stochastic, spatial reaction-diffusion simulations have been widely used in systems biology and computational neuroscience. However, the increasing scale and complexity of models and morphologies have exceeded the capacity of any serial implementation. This led to the development of parallel solutions that benefit from the boost in performance of modern supercomputers. In this paper, we describe an MPI-based, parallel operator-splitting implementation for stochastic spatial reaction-diffusion simulations with irregular tetrahedral meshes. The performance of our implementation is first examined and analyzed with simulations of a simple model. We then demonstrate its application to real-world research by simulating the reaction-diffusion components of a published calcium burst model in both Purkinje neuron sub-branch and full dendrite morphologies. Simulation results indicate that our implementation is capable of achieving super-linear speedup for balanced loading simulations with reasonable molecule density and mesh quality. In the best scenario, a parallel simulation with 2,000 processes runs more than 3,600 times faster than its serial SSA counterpart, and achieves more than 20-fold speedup relative to parallel simulation with 100 processes. In a more realistic scenario with dynamic calcium influx and data recording, the parallel simulation with 1,000 processes and no load balancing is still 500 times faster than the conventional serial SSA simulation.
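
    The speed-up figures quoted above translate directly into parallel efficiencies. The short calculation below uses only the numbers reported in the abstract (which are stated as lower bounds, "more than"); the function name parallel_efficiency is an illustrative choice. An efficiency above 1 corresponds to the super-linear speed-up mentioned in the text.

```python
def parallel_efficiency(speedup, n_procs, n_ref=1):
    """Measured speed-up divided by the ideal speed-up n_procs / n_ref."""
    return speedup / (n_procs / n_ref)


# Best scenario: 2,000 processes, more than 3,600x faster than serial.
best_vs_serial = parallel_efficiency(3600, 2000)
# Same run compared with the 100-process run: more than 20-fold speed-up.
best_vs_100 = parallel_efficiency(20, 2000, n_ref=100)
# Realistic scenario: 1,000 processes, 500x faster than serial.
realistic = parallel_efficiency(500, 1000)
```

    With these inputs the best case gives an efficiency of at least 1.8 against the serial run and at least 1.0 against the 100-process run, while the realistic scenario without load balancing gives about 0.5.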

  8. Cross-guide Moreno directional coupler in empty substrate integrated waveguide

    NASA Astrophysics Data System (ADS)

    Miralles, E.; Belenguer, A.; Esteban, H.; Boria, V.

    2017-05-01

    Substrate integrated waveguides (SIWs) combine the advantages of rectangular waveguides (low losses) and planar circuits (low cost and low profile). Empty substrate integrated waveguide (ESIW) has been proposed as a novel configuration in SIWs recently. This technology significantly reduces the losses of conventional SIW by removing its inner dielectric. The cross-guide directional coupler is a well-known low-profile design for having a broadband waveguide coupler. In this paper a cross-guide coupler with ESIW technique is proposed. In such a manner, the device can be integrated with microwave circuits and other printed circuit board components. It is the first time that a cross-guide coupler is implemented in ESIW technology. The designed, fabricated, and measured device presents good results as a matter of insertion loss of 1 dB (including transitions), reflection under 20 dB, coupling between 19.5 and 21.5 dB, and directivity higher than 15 dB over targeted frequency range from 12.4 GHz to 18 GHz. The coupler implemented in ESIW improves the directivity when compared to similar solutions in other empty substrate integrated waveguide solutions.

  9. HPC-NMF: A High-Performance Parallel Algorithm for Nonnegative Matrix Factorization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kannan, Ramakrishnan; Sukumar, Sreenivas R.; Ballard, Grey M.

    NMF is a useful tool for many applications in different domains such as topic modeling in text mining, background separation in video analysis, and community detection in social networks. Despite its popularity in the data mining community, there is a lack of efficient distributed algorithms to solve the problem for big data sets. We propose a high-performance distributed-memory parallel algorithm that computes the factorization by iteratively solving alternating non-negative least squares (NLS) subproblems for W and H. It maintains the data and factor matrices in memory (distributed across processors), uses MPI for interprocessor communication, and, in the dense case, provably minimizes communication costs (under mild assumptions). As opposed to previous implementation, our algorithm is also flexible: It performs well for both dense and sparse matrices, and allows the user to choose any one of the multiple algorithms for solving the updates to low rank factors W and H within the alternating iterations.
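
    The alternating structure can be illustrated with a deliberately small, serial sketch restricted to rank 1, where each non-negative least squares subproblem has a closed-form solution. This is not the HPC-NMF algorithm itself (which is distributed, uses MPI and handles general rank); the function name rank1_anls is hypothetical, and the sketch assumes a non-negative, non-zero input matrix.

```python
def rank1_anls(A, iters=50):
    """Rank-1 NMF A ~ w h^T by alternating non-negative least squares.

    For rank 1, min_{w_i >= 0} sum_j (A[i][j] - w_i * h_j)**2 is solved by
    w_i = max(0, <A[i], h> / <h, h>), and symmetrically for h.
    Assumes A is non-negative and not all zeros.
    """
    m, n = len(A), len(A[0])
    h = [1.0] * n
    w = [0.0] * m
    for _ in range(iters):
        hh = sum(x * x for x in h)
        w = [max(0.0, sum(A[i][j] * h[j] for j in range(n)) / hh) for i in range(m)]
        ww = sum(x * x for x in w)
        h = [max(0.0, sum(A[i][j] * w[i] for i in range(m)) / ww) for j in range(n)]
    return w, h


# Exactly rank-1 test matrix: outer product of [1, 2] and [3, 4, 5].
A = [[3.0, 4.0, 5.0], [6.0, 8.0, 10.0]]
w, h = rank1_anls(A)
residual = max(abs(A[i][j] - w[i] * h[j]) for i in range(2) for j in range(3))
```

    For general rank, each half-step becomes a full NLS solve per row or column, which is where the choice among multiple update algorithms mentioned in the abstract comes in.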

  10. Design and fabrication of N x N optical couplers based on organic polymer optical waveguides

    NASA Astrophysics Data System (ADS)

    Krchnavek, Robert R.; Rode, Daniel L.

    1994-08-01

    In this report, we examine the design and fabrication of a planar, 10x10 optical coupler utilizing photopolymerizable organic polymers. Background information on the theory of operation of the coupler culminating in a set of design equations is presented. The details of the material processing are described, including the preparation of monomer mixtures that result in single-mode polymer waveguides (lambda = 1300 nm) that have core dimensions approximately equal to those of single-mode fiber. This is necessary to insure high coupling efficiency between the planar device and optical fiber. A unique method of aligning and attaching optical fibers to the coupler is demonstrated. This method relies on patterned alignment ways, a transcision cut, and single-mode D-fiber. A theoretical analysis of the in situ monitoring technique used to fabricate the single-mode D-fiber is presented and compared favorably with the experimental results. Finally, the 10x10 coupler is characterized. We have measured an excess loss of approximately 8 dB.
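
    To put the reported excess loss in context, the following arithmetic converts it to a power fraction and to a per-port loss. It assumes the usual definition of excess loss (total output power relative to input power, in dB) and perfectly uniform splitting over the 10 output ports, neither of which is stated in the abstract.

```python
import math


def db_to_fraction(loss_db):
    """Fraction of power remaining after a loss of loss_db decibels."""
    return 10 ** (-loss_db / 10)


def fraction_to_db(fraction):
    """Loss in decibels corresponding to a remaining power fraction."""
    return -10 * math.log10(fraction)


n_ports = 10           # 10x10 coupler (from the abstract)
excess_loss_db = 8.0   # approximate measured excess loss (from the abstract)

total_fraction = db_to_fraction(excess_loss_db)   # share of input power reaching all outputs
per_port_fraction = total_fraction / n_ports      # assumes uniform splitting
per_port_loss_db = fraction_to_db(per_port_fraction)
```

    Under these assumptions roughly 16% of the input power reaches the outputs in total, and each output sees about 18 dB of loss (10 dB from the ideal 1:10 split plus the 8 dB excess).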

  11. High Performance Computing at NASA

    NASA Technical Reports Server (NTRS)

    Bailey, David H.; Cooper, D. M. (Technical Monitor)

    1994-01-01

    The speaker will give an overview of high performance computing in the U.S. in general and within NASA in particular, including a description of the recently signed NASA-IBM cooperative agreement. The latest performance figures of various parallel systems on the NAS Parallel Benchmarks will be presented. The speaker was one of the authors of the NAS (Numerical Aerodynamic Simulation) Parallel Benchmarks, which are now widely cited in the industry as a measure of sustained performance on realistic high-end scientific applications. It will be shown that significant progress has been made by the highly parallel supercomputer industry during the past year or so, with several new systems, based on high-performance RISC processors, that now deliver superior performance per dollar compared to conventional supercomputers. Various pitfalls in reporting performance will be discussed. The speaker will then conclude by assessing the general state of the high performance computing field.

  12. Effect of twist on single-mode fiber-optic 3 × 3 couplers

    NASA Astrophysics Data System (ADS)

    Chen, Dandan; Ji, Minning; Peng, Lei

    2018-01-01

    In the fabricating process of a 3 × 3 fused tapered coupler, the three fibers are usually twisted to bring them into close contact. The effect of twist on 3 × 3 fused tapered couplers is investigated in this paper. It is found that although a linear 3 × 3 coupler may theoretically realize an equal power splitting ratio when twisted by a specific angle, it is hard to fabricate in practice because the twist angle and the coupler's length must be determined in advance. An equilateral 3 × 3 coupler, by contrast, can not only realize an approximately equal power splitting ratio theoretically but can also be fabricated just by controlling the elongation length. The effect of twist on the equilateral 3 × 3 coupler lies in the relationship between the equal ratio error and the twist angle: the larger the twist angle, the larger the equal ratio error may be. The twist angle usually should be no larger than 90° over one coupling period length in order to keep the equal ratio error small enough. The simulation results agree well with the experimental data.

  13. pWeb: A High-Performance, Parallel-Computing Framework for Web-Browser-Based Medical Simulation.

    PubMed

    Halic, Tansel; Ahn, Woojin; De, Suvranu

    2014-01-01

    This work presents pWeb, a new language and compiler for parallelization of client-side compute intensive web applications such as surgical simulations. The recently introduced HTML5 standard has enabled creating unprecedented applications on the web. Low performance of the web browser, however, remains the bottleneck of computationally intensive applications, including visualization of complex scenes, real time physical simulations and image processing, compared to native ones. The newly proposed language is built upon web workers for multithreaded programming in HTML5. The language provides fundamental functionalities of parallel programming languages as well as the fork/join parallel model, which is not supported by web workers. The language compiler automatically generates an equivalent parallel script that complies with the HTML5 standard. A case study on realistic rendering for surgical simulations demonstrates enhanced performance with a compact set of instructions.

  14. 30 CFR 75.805 - Couplers.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... shall be grounded to the ground conductor in the cable. The coupler shall be constructed so that the ground check continuity conductor shall be broken first and the ground conductors shall be broken last...

  15. Performance of the SERI parallel-passage dehumidifier

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schlepp, D.; Barlow, R.

    1984-09-01

    The key component in improving the performance of solar desiccant cooling systems is the dehumidifier. A parallel-passage geometry for the desiccant dehumidifier has been identified as meeting key criteria of low pressure drop, high mass transfer efficiency, and compact size. An experimental program to build and test a small-scale prototype of this design was undertaken in FY 1982, and the results are presented in this report. Computer models to predict the adsorption/desorption behavior of desiccant dehumidifiers were updated to take into account the geometry of the bed and predict potential system performance using the new component design. The parallel-passage design proved to have high mass transfer effectiveness and low pressure drop over a wide range of test conditions typical of desiccant cooling system operation. The prototype dehumidifier averaged 93% effectiveness at pressure drops of less than 50 Pa at design point conditions. Predictions of system performance using models validated with the experimental data indicate that system thermal coefficients of performance (COPs) of 1.0 to 1.2 and electrical COPs above 8.5 are possible using this design.

  16. High-performance parallel analysis of coupled problems for aircraft propulsion

    NASA Technical Reports Server (NTRS)

    Felippa, C. A.; Farhat, C.; Lanteri, S.; Maman, N.; Piperno, S.; Gumaste, U.

    1994-01-01

    This research program deals with the application of high-performance computing methods for the analysis of complete jet engines. We have initiated this program by applying the two dimensional parallel aeroelastic codes to the interior gas flow problem of a bypass jet engine. The fluid mesh generation, domain decomposition, and solution capabilities were successfully tested. We then focused attention on methodology for the partitioned analysis of the interaction of the gas flow with a flexible structure and with the fluid mesh motion that results from these structural displacements. This is treated by a new arbitrary Lagrangian-Eulerian (ALE) technique that models the fluid mesh motion as that of a fictitious mass-spring network. New partitioned analysis procedures to treat this coupled three-component problem are developed. These procedures involved delayed corrections and subcycling. Preliminary results on the stability, accuracy, and MPP computational efficiency are reported.

  17. National Combustion Code: Parallel Implementation and Performance

    NASA Technical Reports Server (NTRS)

    Quealy, A.; Ryder, R.; Norris, A.; Liu, N.-S.

    2000-01-01

    The National Combustion Code (NCC) is being developed by an industry-government team for the design and analysis of combustion systems. CORSAIR-CCD is the current baseline reacting flow solver for NCC. This is a parallel, unstructured grid code which uses a distributed memory, message passing model for its parallel implementation. The focus of the present effort has been to improve the performance of the NCC flow solver to meet combustor designer requirements for model accuracy and analysis turnaround time. Improving the performance of this code contributes significantly to the overall reduction in time and cost of the combustor design cycle. This paper describes the parallel implementation of the NCC flow solver and summarizes its current parallel performance on an SGI Origin 2000. Earlier parallel performance results on an IBM SP-2 are also included. The performance improvements which have enabled a turnaround of less than 15 hours for a 1.3 million element fully reacting combustion simulation are described.

  18. High-performance parallel processors based on star-coupled wavelength division multiplexing optical interconnects

    DOEpatents

    Deri, Robert J.; DeGroot, Anthony J.; Haigh, Ronald E.

    2002-01-01

    As the performance of individual elements within parallel processing systems increases, increased communication capability between distributed processor and memory elements is required. There is great interest in using fiber optics to improve interconnect communication beyond that attainable using electronic technology. Several groups have considered WDM, star-coupled optical interconnects. The invention uses a fiber optic transceiver to provide low latency, high bandwidth channels for such interconnects using a robust multimode fiber technology. Instruction-level simulation is used to quantify the bandwidth, latency, and concurrency required for such interconnects to scale to 256 nodes, each operating at 1 GFLOPS performance. Performance has been shown to scale to approximately 100 GFLOPS for scientific application kernels using a small number of wavelengths (8 to 32), only one wavelength received per node, and achievable optoelectronic bandwidth and latency.

  19. Magnetic Shielding Design for Coupler of Wireless Electric Vehicle Charging Using Finite Element Analysis

    NASA Astrophysics Data System (ADS)

    Zhao, W. N.; Yang, X. J.; Yao, C.; Ma, D. G.; Tang, H. J.

    2017-10-01

    Inductive power transfer (IPT) is a practical and preferable method for wireless electric vehicle (EV) charging, which has proved to be safe, convenient and reliable. Due to the air gap in the magnetic coupler, the magnetic field coupling decreases and the magnetic leakage increases significantly compared to a traditional transformer, and this may cause the magnetic flux density around the coupler to exceed the safety limit for humans. Magnetic shielding should therefore be added to the winding made from litz wire to enhance the magnetic field coupling in the working area and reduce magnetic field strength in the non-working area. Magnetic shielding can be achieved by adding high-permeability material or high-conductivity material. For high-permeability material, the magnetic reluctance is much lower than that of the surrounding air, so most of the magnetic flux lines pass through the high-permeability material rather than the surrounding air. For high-conductivity material, the eddy current in the material produces a reverse magnetic field to achieve magnetic shielding. This paper studies the effect of the two types of shielding material on the coupler for wireless EV charging and designs a combined shielding made from high-permeability material and high-conductivity material. The investigation is carried out with the help of finite element analysis.

  20. Parallel STEPS: Large Scale Stochastic Spatial Reaction-Diffusion Simulation with High Performance Computers

    PubMed Central

    Chen, Weiliang; De Schutter, Erik

    2017-01-01

    Stochastic, spatial reaction-diffusion simulations have been widely used in systems biology and computational neuroscience. However, the increasing scale and complexity of models and morphologies have exceeded the capacity of any serial implementation. This led to the development of parallel solutions that benefit from the boost in performance of modern supercomputers. In this paper, we describe an MPI-based, parallel operator-splitting implementation for stochastic spatial reaction-diffusion simulations with irregular tetrahedral meshes. The performance of our implementation is first examined and analyzed with simulations of a simple model. We then demonstrate its application to real-world research by simulating the reaction-diffusion components of a published calcium burst model in both Purkinje neuron sub-branch and full dendrite morphologies. Simulation results indicate that our implementation is capable of achieving super-linear speedup for balanced loading simulations with reasonable molecule density and mesh quality. In the best scenario, a parallel simulation with 2,000 processes runs more than 3,600 times faster than its serial SSA counterpart, and achieves more than 20-fold speedup relative to parallel simulation with 100 processes. In a more realistic scenario with dynamic calcium influx and data recording, the parallel simulation with 1,000 processes and no load balancing is still 500 times faster than the conventional serial SSA simulation. PMID:28239346

  1. Inductive coupler for downhole components and method for making same

    DOEpatents

    Hall, David R.; Hall, Jr., H. Tracy; Pixton, David S.; Dahlgren, Scott; Sneddon, Cameron; Fox, Joe; Briscoe, Michael A.

    2006-10-03

    An inductive coupler for downhole components. The inductive coupler includes an annular housing having a recess defined by a bottom portion and two opposing side wall portions. At least one side wall portion includes a lip extending toward but not reaching the other side wall portion. A plurality of generally U-shaped MCEI segments, preferably comprised of ferrite, are disposed in the recess and aligned so as to form a circular trough. The coupler further includes a conductor disposed within the circular trough and a polymer filling spaces between the segments, the annular housing and the conductor.

  2. High-performance parallel interface to synchronous optical network gateway

    DOEpatents

    St. John, Wallace B.; DuBois, David H.

    1996-01-01

    A system of sending and receiving gateways interconnects high speed data interfaces, e.g., HIPPI interfaces, through fiber optic links, e.g., a SONET network. An electronic stripe distributor distributes bytes of data from a first interface at the sending gateway onto parallel fiber optics of the fiber optic link to form transmitted data. An electronic stripe collector receives the transmitted data on the parallel fiber optics and reforms the data into a format effective for input to a second interface at the receiving gateway. Preferably, an error correcting syndrome is constructed at the sending gateway and sent with a data frame so that transmission errors can be detected and corrected on a real-time basis. Since the high speed data interface operates faster than any of the fiber optic links, the transmission rate must be adapted to match the available number of fiber optic links, so the sending and receiving gateways monitor the availability of fiber links and adjust the data throughput accordingly. In another aspect, the receiving gateway must have sufficient available buffer capacity to accept an incoming data frame. A credit-based flow control system provides for continuously updating the sending gateway on the available buffer capacity at the receiving gateway.
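
    The stripe distributor and stripe collector can be pictured with a minimal sketch. The patent abstract only says that bytes are distributed onto parallel fibers; the round-robin ordering used here and the function names stripe and collect are assumptions made for illustration.

```python
def stripe(data: bytes, n_links: int):
    """Distribute bytes round-robin over n_links parallel links (assumed ordering)."""
    return [data[i::n_links] for i in range(n_links)]


def collect(stripes, total_len: int) -> bytes:
    """Reassemble the original byte order from the per-link stripes."""
    n_links = len(stripes)
    out = bytearray(total_len)
    for i, s in enumerate(stripes):
        out[i::n_links] = s  # link i carried bytes i, i + n_links, i + 2*n_links, ...
    return bytes(out)


frame = b"HIPPI frame payload"
lanes = stripe(frame, 4)
restored = collect(lanes, len(frame))
```

    Because the number of links is a parameter, the same pair of functions also covers the case in which fewer links are available and the throughput has to be adjusted.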

  3. High-performance parallel interface to synchronous optical network gateway

    DOEpatents

    St. John, W.B.; DuBois, D.H.

    1996-12-03

    Disclosed is a system of sending and receiving gateways that interconnects high speed data interfaces, e.g., HIPPI interfaces, through fiber optic links, e.g., a SONET network. An electronic stripe distributor distributes bytes of data from a first interface at the sending gateway onto parallel fiber optics of the fiber optic link to form transmitted data. An electronic stripe collector receives the transmitted data on the parallel fiber optics and reforms the data into a format effective for input to a second interface at the receiving gateway. Preferably, an error correcting syndrome is constructed at the sending gateway and sent with a data frame so that transmission errors can be detected and corrected on a real-time basis. Since the high speed data interface operates faster than any of the fiber optic links, the transmission rate must be adapted to match the available number of fiber optic links, so the sending and receiving gateways monitor the availability of fiber links and adjust the data throughput accordingly. In another aspect, the receiving gateway must have sufficient available buffer capacity to accept an incoming data frame. A credit-based flow control system provides for continuously updating the sending gateway on the available buffer capacity at the receiving gateway. 7 figs.
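
    Credit-based flow control of the kind mentioned above can be sketched as follows. The class names, the one-credit-per-frame granularity and the in-process method calls standing in for link messages are all assumptions; the patent abstract does not describe the mechanism at this level of detail.

```python
from collections import deque


class Receiver:
    """Receiving gateway with a bounded frame buffer."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.buffer = deque()

    def accept(self, frame):
        # The sender's credit count is what keeps this from overflowing.
        assert len(self.buffer) < self.capacity
        self.buffer.append(frame)

    def drain(self, sender):
        """Hand one frame to the local interface and return a credit."""
        frame = self.buffer.popleft()
        sender.grant(1)
        return frame


class Sender:
    """Sending gateway that transmits only while it holds credits."""

    def __init__(self, initial_credits):
        self.credits = initial_credits

    def send(self, frame, receiver):
        if self.credits == 0:
            return False  # no buffer space known to be free; hold the frame
        self.credits -= 1
        receiver.accept(frame)
        return True

    def grant(self, n):
        self.credits += n


rx = Receiver(capacity=2)
tx = Sender(initial_credits=2)
results = [tx.send(f, rx) for f in ("a", "b", "c")]  # third frame is held back
first = rx.drain(tx)                                  # frees one slot, returns a credit
retry = tx.send("c", rx)
```

    The sender starts with as many credits as the receiver has buffer slots, so a frame is only ever transmitted when space for it is guaranteed.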

  4. Structural and dynamic analysis of an ultra short intracavity directional coupler

    NASA Astrophysics Data System (ADS)

    Gravé, Ilan; Griffel, Giora; Daou, Youssef; Golan, Gadi

    1997-01-01

    A recently proposed intracavity directional coupler is analysed. Exact analytic expressions for important parameters such as the transmission ratio, the coupling length, and the photon lifetime are given. We show that by controlling the mirror reflectivities of the cavity, it is theoretically possible to reduce the coupling length to a zero limit. The photon lifetime, which governs the dynamic properties of the structure, sets an upper frequency limit of a few hundreds of GHz, which is well over the bandwidth limitation of microwave lumped or travelling wave electrodes. This novel family of intracavity couplers has important applications in the realization of integrated optics circuits for high-speed computing, data processing, and communication.

  5. Implementation and performance of parallel Prolog interpreter

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wei, S.; Kale, L.V.; Balkrishna, R.

    1988-01-01

    In this paper, the authors discuss the implementation of a parallel Prolog interpreter on different parallel machines. The implementation is based on the REDUCE--OR process model which exploits both AND and OR parallelism in logic programs. It is machine independent as it runs on top of the chare-kernel--a machine-independent parallel programming system. The authors also give the performance of the interpreter running a diverse set of benchmark programs on parallel machines including shared memory systems: an Alliant FX/8, Sequent and a MultiMax, and a non-shared memory system: Intel iPSC/32 hypercube, in addition to its performance on a multiprocessor simulation system.

  6. Waveguide couplers with new power splitting ratios made possible by cascading of short multimode interference sections

    NASA Astrophysics Data System (ADS)

    Feng, David J. Y.; Lay, T. S.; Chang, T. Y.

    2007-02-01

    We show that it is possible to obtain 2 x 2 waveguide couplers with new power splitting ratios for cross coupling of 7%, 64%, 80% and 93% by cascading two short MMI sections. These couplers have simple geometry and low loss. They offer valuable new possibilities for designing waveguide power taps, high-Q ring resonators, ladder-structure optical filters, and loop-mirror partial reflectors.

  7. Outcomes and reliability of the flow coupler in postoperative monitoring of head and neck free flaps.

    PubMed

    Fujiwara, Rance J T; Dibble, Jacqueline M; Larson, Scott V; Pierce, Matthew L; Mehra, Saral

    2018-04-01

    To assess the accuracy and reliability of the flow coupler relative to the implantable arterial Doppler probe in postoperative monitoring of head and neck free flaps. Retrospective single-institution study, April 2015 to March 2017. Both the venous flow coupler and arterial Doppler were employed in 120 consecutive head and neck free flap cases. When Doppler signal loss occurred, flaps were evaluated by physical exam to determine whether signal loss was a true positive necessitating operating room takeback. Sensitivity, specificity, and false positive rate (FPR) were recorded for each device. Logistic regression was conducted to identify user trends over time. Eleven of 120 patients (9.2%) required takeback, 10 from venous thrombosis and one from arterial thrombosis. Permanent signal loss (PSL) occurred in the flow coupler in all takebacks; PSL occurred in the arterial Doppler only in the case of arterial thrombosis. Salvage rate was 9/11 (81.8%). For the flow coupler, sensitivity was 100%, specificity 86.4%, and FPR 13.6%. For the arterial probe, sensitivity was 9.1%, specificity 97.1%, and FPR 2.9%. A 4.1% decrease in false positives with each additional flow coupler use was observed. Monitoring the vein via flow coupler has high sensitivity in identifying vascular compromise compared to the arterial probe, especially for venous thrombosis. There is moderate FPR; this decreases with increased usage and, when supplemented with physical examination, does not result in unnecessary takebacks. The flow coupler can be a valuable tool in postoperative monitoring of head and neck free flaps. 4. Laryngoscope, 128:812-817, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
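
    The metrics reported in this record follow from a 2 x 2 confusion table. The sketch below shows the standard definitions in Python; the counts in the usage example are hypothetical and are not taken from the study. Note that the false positive rate is the complement of specificity, which is consistent with the reported pairs (86.4% and 13.6%; 97.1% and 2.9%).

```python
def diagnostic_metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    """Sensitivity, specificity and false positive rate from confusion counts."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return {
        "sensitivity": tp / (tp + fn),   # true positives among all actual positives
        "specificity": tn / (tn + fp),   # true negatives among all actual negatives
        "fpr": fp / (fp + tn),           # equals 1 - specificity
    }


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    m = diagnostic_metrics(tp=9, fp=5, tn=85, fn=1)
    for name, value in m.items():
        print(f"{name}: {value:.3f}")
```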

  8. Integration experiences and performance studies of A COTS parallel archive systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Hsing-bung; Scott, Cody; Grider, Gary

    2010-01-01

    Current and future Archive Storage Systems have been asked to (a) scale to very high bandwidths, (b) scale in metadata performance, (c) support policy-based hierarchical storage management capability, (d) scale in supporting changing needs of very large data sets, (e) support standard interface, and (f) utilize commercial-off-the-shelf (COTS) hardware. Parallel file systems have been asked to do the same thing but at one or more orders of magnitude faster in performance. Archive systems continue to move closer to file systems in their design due to the need for speed and bandwidth, especially metadata searching speeds such as more caching and less robust semantics. Currently the number of extreme highly scalable parallel archive solutions is very small especially those that will move a single large striped parallel disk file onto many tapes in parallel. We believe that a hybrid storage approach of using COTS components and innovative software technology can bring new capabilities into a production environment for the HPC community much faster than the approach of creating and maintaining a complete end-to-end unique parallel archive software solution. In this paper, we relay our experience of integrating a global parallel file system and a standard backup/archive product with a very small amount of additional code to provide a scalable, parallel archive. Our solution has a high degree of overlap with current parallel archive products including (a) doing parallel movement to/from tape for a single large parallel file, (b) hierarchical storage management, (c) ILM features, (d) high volume (non-single parallel file) archives for backup/archive/content management, and (e) leveraging all free file movement tools in Linux such as copy, move, ls, tar, etc. We have successfully applied our working COTS Parallel Archive System to the current world's first petaflop/s computing system, LANL's Roadrunner, and demonstrated its capability to address requirements of future

  9. Integration experiments and performance studies of a COTS parallel archive system

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Hsing-bung; Scott, Cody; Grider, Gary

    2010-06-16

    Current and future Archive Storage Systems have been asked to (a) scale to very high bandwidths, (b) scale in metadata performance, (c) support policy-based hierarchical storage management capability, (d) scale in supporting changing needs of very large data sets, (e) support standard interface, and (f) utilize commercial-off-the-shelf (COTS) hardware. Parallel file systems have been asked to do the same thing but at one or more orders of magnitude faster in performance. Archive systems continue to move closer to file systems in their design due to the need for speed and bandwidth, especially metadata searching speeds such as more caching and less robust semantics. Currently the number of extreme highly scalable parallel archive solutions is very small especially those that will move a single large striped parallel disk file onto many tapes in parallel. We believe that a hybrid storage approach of using COTS components and innovative software technology can bring new capabilities into a production environment for the HPC community much faster than the approach of creating and maintaining a complete end-to-end unique parallel archive software solution. In this paper, we relay our experience of integrating a global parallel file system and a standard backup/archive product with a very small amount of additional code to provide a scalable, parallel archive. Our solution has a high degree of overlap with current parallel archive products including (a) doing parallel movement to/from tape for a single large parallel file, (b) hierarchical storage management, (c) ILM features, (d) high volume (non-single parallel file) archives for backup/archive/content management, and (e) leveraging all free file movement tools in Linux such as copy, move, ls, tar, etc. We have successfully applied our working COTS Parallel Archive System to the current world's first petaflop/s computing system, LANL's Roadrunner machine, and demonstrated its capability to address requirements

  10. High-Performance Psychometrics: The Parallel-E Parallel-M Algorithm for Generalized Latent Variable Models. Research Report. ETS RR-16-34

    ERIC Educational Resources Information Center

    von Davier, Matthias

    2016-01-01

    This report presents results on a parallel implementation of the expectation-maximization (EM) algorithm for multidimensional latent variable models. The developments presented here are based on code that parallelizes both the E step and the M step of the parallel-E parallel-M algorithm. Examples presented in this report include item response…

  11. High-performance computing — an overview

    NASA Astrophysics Data System (ADS)

    Marksteiner, Peter

    1996-08-01

    An overview of high-performance computing (HPC) is given. Different types of computer architectures used in HPC are discussed: vector supercomputers, high-performance RISC processors, various parallel computers like symmetric multiprocessors, workstation clusters, massively parallel processors. Software tools and programming techniques used in HPC are reviewed: vectorizing compilers, optimization and vector tuning, optimization for RISC processors; parallel programming techniques like shared-memory parallelism, message passing and data parallelism; and numerical libraries.

  12. Multitasking TORT under UNICOS: Parallel performance models and measurements

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Barnett, A.; Azmy, Y.Y.

    1999-09-27

    The existing parallel algorithms in the TORT discrete ordinates code were updated to function in a UNICOS environment. A performance model for the parallel overhead was derived for the existing algorithms. The largest contributors to the parallel overhead were identified and a new algorithm was developed. A parallel overhead model was also derived for the new algorithm. The results of the comparison of parallel performance models were compared to applications of the code to two TORT standard test problems and a large production problem. The parallel performance models agree well with the measured parallel overhead.

  13. Multitasking TORT Under UNICOS: Parallel Performance Models and Measurements

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Azmy, Y.Y.; Barnett, D.A.

    1999-09-27

    The existing parallel algorithms in the TORT discrete ordinates code were updated to function in a UNICOS environment. A performance model for the parallel overhead was derived for the existing algorithms. The largest contributors to the parallel overhead were identified and a new algorithm was developed. A parallel overhead model was also derived for the new algorithm. The results of the comparison of parallel performance models were compared to applications of the code to two TORT standard test problems and a large production problem. The parallel performance models agree well with the measured parallel overhead.

  14. Studying quick coupler efficiency in working attachment system of single-bucket power shovel

    NASA Astrophysics Data System (ADS)

    Duganova, E. V.; Zagorodniy, N. A.; Solodovnikov, D. N.; Korneyev, A. S.

    2018-03-01

    A prototype of a quick-disconnect connector (quick coupler) with an unloaded retention mechanism was developed from the analysis of typical quick couplers used as intermediate elements for power shovels of different manufacturers. A method is presented, allowing building a simulation model of the quick coupler prototype as an alternative to physical modeling for further studies.

  15. Homemade Buckeye-Pi: A Learning Many-Node Platform for High-Performance Parallel Computing

    NASA Astrophysics Data System (ADS)

    Amooie, M. A.; Moortgat, J.

    2017-12-01

    We report on the "Buckeye-Pi" cluster, the supercomputer developed in The Ohio State University School of Earth Sciences from 128 inexpensive Raspberry Pi (RPi) 3 Model B single-board computers. Each RPi is equipped with fast Quad Core 1.2GHz ARMv8 64bit processor, 1GB of RAM, and 32GB microSD card for local storage. Therefore, the cluster has a total RAM of 128GB that is distributed on the individual nodes and a flash capacity of 4TB with 512 processors, while it benefits from low power consumption, easy portability, and low total cost. The cluster uses the Message Passing Interface protocol to manage the communications between each node. These features render our platform the most powerful RPi supercomputer to date and suitable for educational applications in high-performance-computing (HPC) and handling of large datasets. In particular, we use the Buckeye-Pi to implement optimized parallel codes in our in-house simulator for subsurface media flows with the goal of achieving a massively-parallelized scalable code. We present benchmarking results for the computational performance across various number of RPi nodes. We believe our project could inspire scientists and students to consider the proposed unconventional cluster architecture as a mainstream and a feasible learning platform for challenging engineering and scientific problems.
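
    The aggregate figures quoted in this record can be checked from the per-node specification. The short Python sketch below does the arithmetic; it assumes 1 TB = 1024 GB, and the function name `cluster_totals` is hypothetical.

```python
def cluster_totals(nodes: int, ram_gb: int, storage_gb: int, cores: int) -> dict[str, float]:
    """Aggregate resources of a cluster of identical nodes."""
    return {
        "ram_gb": nodes * ram_gb,
        "storage_tb": nodes * storage_gb / 1024,  # assumes 1 TB = 1024 GB
        "cores": nodes * cores,
    }


if __name__ == "__main__":
    # 128 Raspberry Pi 3 Model B boards: quad-core CPU, 1 GB RAM, 32 GB microSD each.
    totals = cluster_totals(nodes=128, ram_gb=1, storage_gb=32, cores=4)
    print(totals)  # {'ram_gb': 128, 'storage_tb': 4.0, 'cores': 512}
```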

  16. A high-speed linear algebra library with automatic parallelism

    NASA Technical Reports Server (NTRS)

    Boucher, Michael L.

    1994-01-01

    Parallel or distributed processing is key to getting highest performance workstations. However, designing and implementing efficient parallel algorithms is difficult and error-prone. It is even more difficult to write code that is both portable to and efficient on many different computers. Finally, it is harder still to satisfy the above requirements and include the reliability and ease of use required of commercial software intended for use in a production environment. As a result, the application of parallel processing technology to commercial software has been extremely small even though there are numerous computationally demanding programs that would significantly benefit from application of parallel processing. This paper describes DSSLIB, which is a library of subroutines that perform many of the time-consuming computations in engineering and scientific software. DSSLIB combines the high efficiency and speed of parallel computation with a serial programming model that eliminates many undesirable side-effects of typical parallel code. The result is a simple way to incorporate the power of parallel processing into commercial software without compromising maintainability, reliability, or ease of use. This gives significant advantages over less powerful non-parallel entries in the market.

  17. Event parallelism: Distributed memory parallel computing for high energy physics experiments

    NASA Astrophysics Data System (ADS)

    Nash, Thomas

    1989-12-01

    This paper describes the present and expected future development of distributed memory parallel computers for high energy physics experiments. It covers the use of event parallel microprocessor farms, particularly at Fermilab, including both ACP multiprocessors and farms of MicroVAXES. These systems have proven very cost effective in the past. A case is made for moving to the more open environment of UNIX and RISC processors. The 2nd Generation ACP Multiprocessor System, which is based on powerful RISC system, is described. Given the promise of still more extraordinary increases in processor performance, a new emphasis on point to point, rather than bussed, communication will be required. Developments in this direction are described.

  18. Design and analysis of O-S-C triple band wavelength division demultiplexer using cascaded MMI couplers

    NASA Astrophysics Data System (ADS)

    Chack, Devendra; Kumar, V.; Raghuwanshi, Sanjeev Kumar; Singh, Dev Prakash

    2017-01-01

    A compact triple O-S-C band wavelength demultiplexer consisting of series cascaded multimode interference (MMI) couplers is presented in this paper. The first MMI coupler is used to drop the wavelengths of 1510 nm and 1550 nm at the bar port while directing the wavelength of 1300 nm into the cross port. Another MMI coupler is then designed to separate the wavelength of 1510 nm into one port and the wavelength of 1550 nm into another port. The triple wavelength demultiplexer function is achieved by choosing a suitable refractive index of the guiding region and geometrical parameters such as the width and length of the MMI coupler. Numerical simulation with the finite difference beam propagation method (BPM) has been utilized to design and optimize the operation of the proposed triple wavelength demultiplexer. The simulation results show that the insertion losses for the O, S and C bands are 1.884 dB, 1.452 dB and 2.568 dB, respectively, with isolations for each output waveguide ranging from 10 dB to 28.72 dB. The 3-dB bandwidths of insertion loss for 1300 nm, 1510 nm and 1550 nm are 80 nm, 20 nm and 10 nm, respectively.

  19. Omnidirectional spin-wave nanograting coupler

    PubMed Central

    Yu, Haiming; Duerr, G.; Huber, R.; Bahr, M.; Schwarze, T.; Brandl, F.; Grundler, D.

    2013-01-01

    Magnonics as an emerging nanotechnology offers functionalities beyond current semiconductor technology. Spin waves used in cellular nonlinear networks are expected to speed up technologically demanding tasks such as image processing and speech recognition at low power consumption. However, efficient coupling to microelectronics poses a vital challenge. Previously developed techniques for spin-wave excitation (for example, by using parametric pumping in a cavity) may not allow for the relevant downscaling or provide only individual point-like sources. Here we demonstrate that a grating coupler of periodically nanostructured magnets provokes multidirectional emission of short-wavelength spin waves with giantly enhanced amplitude compared with a bare microwave antenna. Exploring the dependence on ferromagnetic materials, lattice constants and the applied magnetic field, we find the magnonic grating coupler to be more versatile compared with gratings in photonics and plasmonics. Our results allow one to convert, in particular, straight microwave antennas into omnidirectional emitters for short-wavelength spin waves, which are key to cellular nonlinear networks and integrated magnonics. PMID:24189978

  20. Experience in highly parallel processing using DAP

    NASA Technical Reports Server (NTRS)

    Parkinson, D.

    1987-01-01

    Distributed Array Processors (DAP) have been in day to day use for ten years and a large amount of user experience has been gained. The profile of user applications is similar to that of the Massively Parallel Processor (MPP) working group. Experience has shown that contrary to expectations, highly parallel systems provide excellent performance on so-called dirty problems such as the physics part of meteorological codes. The reasons for this observation are discussed. The arguments against replacing bit processors with floating point processors are also discussed.

  1. Double-clad photonic crystal fiber coupler for compact nonlinear optical microscopy imaging.

    PubMed

    Fu, Ling; Gu, Min

    2006-05-15

    A 1 x 2 double-clad photonic crystal fiber coupler is fabricated by the fused tapered method, showing a low excess loss of 1.1 dB and a splitting ratio of 97/3 over the entire visible and near-infrared wavelength range. In addition to the property of splitting the laser power, the double-clad feature of the coupler facilitates the separation of a near-infrared single-mode beam from a visible multimode beam, which is ideal for nonlinear optical microscopy imaging. In conjunction with a gradient-index lens, this coupler is used to construct a miniaturized microscope based on two-photon fluorescence and second-harmonic generation. Three-dimensional nonlinear optical images demonstrate potential applications of the coupler to compact all-fiber and nonlinear optical microscopy and endoscopy.

  2. Modeling Cooperative Threads to Project GPU Performance for Adaptive Parallelism

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Meng, Jiayuan; Uram, Thomas; Morozov, Vitali A.

    Most accelerators, such as graphics processing units (GPUs) and vector processors, are particularly suitable for accelerating massively parallel workloads. On the other hand, conventional workloads are developed for multi-core parallelism, which often scale to only a few dozen OpenMP threads. When hardware threads significantly outnumber the degree of parallelism in the outer loop, programmers are challenged with efficient hardware utilization. A common solution is to further exploit the parallelism hidden deep in the code structure. Such parallelism is less structured: parallel and sequential loops may be imperfectly nested within each other, neighboring inner loops may exhibit different concurrency patterns (e.g. Reduction vs. Forall), yet have to be parallelized in the same parallel section. Many input-dependent transformations have to be explored. A programmer often employs a larger group of hardware threads to cooperatively walk through a smaller outer loop partition and adaptively exploit any encountered parallelism. This process is time-consuming and error-prone, yet the risk of gaining little or no performance remains high for such workloads. To reduce risk and guide implementation, we propose a technique to model workloads with limited parallelism that can automatically explore and evaluate transformations involving cooperative threads. Eventually, our framework projects the best achievable performance and the most promising transformations without implementing GPU code or using physical hardware. We envision our technique to be integrated into future compilers or optimization frameworks for autotuning.
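
    The loop structure this record describes can be pictured with a small sequential Python sketch. It is not the authors' model and contains no GPU code; it only marks where the Forall and Reduction patterns sit inside a short outer loop. The function name `process` and the scaling factor are hypothetical.

```python
def process(rows: list[list[float]]) -> list[float]:
    """Outer loop with limited parallelism wrapping two neighboring inner loops."""
    results = []
    for row in rows:                      # outer loop: only len(rows) independent iterations
        scaled = [2.0 * x for x in row]   # inner "Forall": every iteration is independent
        total = 0.0
        for x in scaled:                  # inner "Reduction": iterations combine into one value
            total += x
        results.append(total)             # sequential statement outside the inner loops (imperfect nesting)
    return results


if __name__ == "__main__":
    print(process([[1.0, 2.0], [3.0]]))  # [6.0, 6.0]
```

    With only a few rows, threads assigned one per outer iteration would leave most of an accelerator idle, which is why the inner loops become the target for cooperative threads.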

  3. Heat-driven thermoacoustic cryocooler operating at liquid hydrogen temperature with a unique coupler

    NASA Astrophysics Data System (ADS)

    Hu, J. Y.; Luo, E. C.; Li, S. F.; Yu, B.; Dai, W.

    2008-05-01

    A heat-driven thermoacoustic cryocooler is constructed. A unique coupler composed of a tube, reservoir, and elastic diaphragm is introduced to couple a traveling-wave thermoacoustic engine (TE) and two-stage pulse tube refrigerator (PTR). The amplitude of the pressure wave generated in the engine is first amplified in the coupler and the wave then passes into the refrigerator to pump heat. The TE uses nitrogen as its working gas and the PTR still uses helium as its working gas. With this coupler, the efficiency of the system is doubled. The engine and coupler match at a much lower operating frequency, which is of great benefit for the PTR to obtain a lower cooling temperature. The coupling place between the coupler and engine is also optimized. The onset problem is effectively solved. With these improvements, the heat-driven thermoacoustic cryocooler reaches a lowest temperature of 18.1 K, which demonstrates heat-driven thermoacoustic refrigeration technology used for cooling at liquid hydrogen temperatures.

  4. Design of the new couplers for C-ADS RFQ

    NASA Astrophysics Data System (ADS)

    Shi, Ai-Min; Sun, Lie-Peng; Zhang, Zhou-Li; Xu, Xian-Bo; Shi, Long-Bo; Li, Chen-Xing; Wang, Wen-Bin

    2015-04-01

    A new special coupler with a kind of bowl-shaped ceramic window for a proton linear accelerator named the Chinese Accelerator Driven System (C-ADS) at the Institute of Modern Physics (IMP) has been simulated and constructed and a continuous wave (CW) beam commissioning through a four-meter long radio frequency quadrupole (RFQ) was completed by the end of July 2014. In the conditioning and beam experiments, some problems such as sparking and thermal issues emerged gradually. Finally, two new couplers were passed with almost 110 kW CW power and 120 kW pulsed mode, respectively. The 10 mA intensity beam experiments have now been completed, and the couplers during the operation had no thermal or electro-magnetic problems. The detailed design and results are presented in the paper. Supported by Strategic Priority Research Program of Chinese Academy of Sciences (XDA03020500)

  5. Relaxed tolerance adiabatic silicon coupler for high I/O port-density optical interconnects (Conference Presentation)

    NASA Astrophysics Data System (ADS)

    Fard, Erfan; Norwood, Robert A.; Peyghambarian, Nasser N.; Koch, Thomas L.

    2017-02-01

    Widespread deployment of silicon photonics will benefit strongly from improved high-port-density interconnect solutions between chips, interposers, and other waveguide fabrics. We present an adiabatic silicon waveguide to polymer waveguide coupler design incorporating strong vertical asymmetries offering high efficiency, small footprint, and improved tolerance to lateral misalignment. The design incorporates a standard 450nm-wide silicon waveguide tapered down to 50nm over a distance of 200μm with a 1.6μm-thick polymer waveguide having a 4μm-wide core atop the taper. The coupler exhibits <0.1dB loss for both TE and TM modes based on 3-dimensional finite element modeling. Moreover, the modeled device exhibits less than 0.1dB excess loss with lateral misalignment of +/-2μm between polymer and silicon waveguide for TE mode, and 0.2dB excess loss with +/-1.6μm offset for the TM mode, and 1dB excess loss for both TE and TM modes with +/-2.7μm misalignment. This taper design should enable reduction in manufacturing costs due to a reduced on-chip footprint and the potential for lower-precision, higher-throughput assembly tools. The authors would like to acknowledge the support of AIM Photonics. This material is based on research sponsored by Air Force Research Laboratory under agreement number FA8650-15-2-5220. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of Air Force Research Laboratory or the U.S. Government.

  6. Fiber pigtailed thin wall capillary coupler for excitation of microsphere WGM resonator.

    PubMed

    Wang, Hanzheng; Lan, Xinwei; Huang, Jie; Yuan, Lei; Kim, Cheol-Woon; Xiao, Hai

    2013-07-01

    In this paper, we demonstrate a fiber pigtailed thin wall capillary coupler for excitation of Whispering Gallery Modes (WGMs) of microsphere resonators. The coupler is made by fusion-splicing an optical fiber with a capillary tube and consequently etching the capillary wall to a thickness of a few microns. Light is coupled through the peripheral contact between inserted microsphere and the etched capillary wall. The coupling efficiency as a function of the wall thickness was studied experimentally. WGM resonance with a Q-factor of 1.14 × 10^4 was observed using a borosilicate glass microsphere with a diameter of 71 μm. The coupler operates in the reflection mode and provides a robust mechanical support to the microsphere resonator. It is expected that the new coupler may find broad applications in sensors, optical filters and lasers.

  7. National Combustion Code Parallel Performance Enhancements

    NASA Technical Reports Server (NTRS)

    Quealy, Angela; Benyo, Theresa (Technical Monitor)

    2002-01-01

    The National Combustion Code (NCC) is being developed by an industry-government team for the design and analysis of combustion systems. The unstructured grid, reacting flow code uses a distributed memory, message passing model for its parallel implementation. The focus of the present effort has been to improve the performance of the NCC code to meet combustor designer requirements for model accuracy and analysis turnaround time. Improving the performance of this code contributes significantly to the overall reduction in time and cost of the combustor design cycle. This report describes recent parallel processing modifications to NCC that have improved the parallel scalability of the code, enabling a two hour turnaround for a 1.3 million element fully reacting combustion simulation on an SGI Origin 2000.

  8. Optical damage observed in the LHMEL II output coupler

    NASA Astrophysics Data System (ADS)

    Eric, John J.; Bagford, John O.; Devlin, Christie L. H.; Hull, Robert J.; Seibert, Daniel B.

    2008-01-01

    During the annual NIST calibration testing done at the LHMEL facility in FY06 on its high energy Carbon-Dioxide lasers, the LHMEL II device suffered severe damage to the internal surface of its ZnSe output coupler optics. The damage occurred during a high power, short duration run and it was believed to have been the result of a significant amount of surface contaminants interacting with the LHMEL cavity beam. Initial theories as to the source of the contamination led to the inspection of the vacuum grease that seals the piping that supplies the source gases to the laser cavity. Other contamination sources were considered, and analysis was conducted in an effort to identify the material found at the damage sites on the optic, but the tests were mainly inconclusive. Some procedure changes were initiated to identify possible contamination before high energy laser operation in an attempt to mitigate and possibly prevent the continued occurrence of damage to the output coupler window. This paper is to illustrate the type and extent of the damage encountered, highlight some of the theories as to the contamination source, and serve as a notice as to the severity and consequences of damage that is possible even due to small amounts of foreign material in a high energy laser environment.

  9. Parallel high-performance grid computing: capabilities and opportunities of a novel demanding service and business class allowing highest resource efficiency.

    PubMed

    Kepper, Nick; Ettig, Ramona; Dickmann, Frank; Stehr, Rene; Grosveld, Frank G; Wedemann, Gero; Knoch, Tobias A

    2010-01-01

    Especially in the life-science and the health-care sectors the huge IT requirements are imminent due to the large and complex systems to be analysed and simulated. Grid infrastructures play here a rapidly increasing role for research, diagnostics, and treatment, since they provide the necessary large-scale resources efficiently. Whereas grids were first used for huge number crunching of trivially parallelizable problems, increasingly parallel high-performance computing is required. Here, we show for the prime example of molecular dynamic simulations how the presence of large grid clusters including very fast network interconnects within grid infrastructures allows now parallel high-performance grid computing efficiently and thus combines the benefits of dedicated super-computing centres and grid infrastructures. The demands for this service class are the highest since the user group has very heterogeneous requirements: i) two to many thousands of CPUs, ii) different memory architectures, iii) huge storage capabilities, and iv) fast communication via network interconnects, are all needed in different combinations and must be considered in a highly dedicated manner to reach highest performance efficiency. Beyond, advanced and dedicated i) interaction with users, ii) the management of jobs, iii) accounting, and iv) billing, not only combines classic with parallel high-performance grid usage, but more importantly is also able to increase the efficiency of IT resource providers. Consequently, the mere "yes-we-can" becomes a huge opportunity like e.g. the life-science and health-care sectors as well as grid infrastructures by reaching higher level of resource efficiency.

  10. The high throughput investigation of polyphenolic couplers in biodegradable packaging materials

    NASA Astrophysics Data System (ADS)

    Lochhead, Robert Y.; Haynes, Camille T.; Jones, Stephen R.; Smith, Virginia

    2006-01-01

    create a coupler from the hydrogen-bonded coacervate formed between a polyphenolic compound and polyvinylpyrrolidone, and to use this to exfoliate and couple montmorillonite nanoparticles to polycaprolactone. To achieve this, solubility parameter mapping of candidate polymeric couplers, polycaprolactone and target polyphenolic compounds was undertaken. This was used as a screening process in predicting incompatibilities and eliminating unpromising materials that were soluble in the same materials as the polycaprolactone and the polyvinylpyrrolidone. High throughput generation of Hansen-Hoy solubility diagrams coupled with simple techniques like high throughput FT-IR spectroscopy and polarized light microscopy provide a powerful tool for the evaluation of compatibility between formulation components. We were able to quickly evaluate over 110 food-contact-approved phenolic compounds, select the two promising candidates and eliminate all of the rest by evaluating their propensity for compatibility and hydrogen bonding.

  11. High Performance Computing Based Parallel Hierarchical Modal Association Clustering (HPAR HMAC)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Patlolla, Dilip R; Surendran Nair, Sujithkumar; Graves, Daniel A.

    For many applications, clustering is a crucial step in order to gain insight into the makeup of a dataset. The best approach to a given problem often depends on a variety of factors, such as the size of the dataset, time restrictions, and soft clustering requirements. The HMAC algorithm seeks to combine the strengths of 2 particular clustering approaches: model-based and linkage-based clustering. One particular weakness of HMAC is its computational complexity. HMAC is not practical for mega-scale data clustering. For high-definition imagery, a user would have to wait months or years for a result; for a 16-megapixel image, the estimated runtime skyrockets to over a decade! To improve the execution time of HMAC, it is reasonable to consider a multi-core implementation that utilizes available system resources. An existing implementation (Ray and Cheng 2014) divides the dataset into N partitions, one for each thread, prior to executing the HMAC algorithm. This implementation benefits from 2 types of optimization: parallelization and divide-and-conquer. By running each partition in parallel, the program is able to accelerate computation by utilizing more system resources. Although the parallel implementation provides considerable improvement over the serial HMAC, it still suffers from poor computational complexity, O(N²). Once the maximum number of cores on a system is exhausted, the program exhibits slower behavior. We now consider a modification to HMAC that involves a recursive partitioning scheme. Our modification aims to exploit divide-and-conquer benefits seen by the parallel HMAC implementation. At each level in the recursion tree, partitions are divided into 2 sub-partitions until a threshold size is reached. When the partition can no longer be divided without falling below threshold size, the base HMAC algorithm is applied. This results in a significant speedup over the parallel HMAC.
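
    The recursive partitioning scheme described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `recursive_partition`, the `base_cluster` callback (a stand-in for the base HMAC algorithm, which is not reproduced here), and the plain concatenation of per-partition results are all assumptions made for the example.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")
Cluster = List[T]


def recursive_partition(
    points: Sequence[T],
    threshold: int,
    base_cluster: Callable[[Sequence[T]], List[Cluster]],
) -> List[Cluster]:
    """Bisect `points` until a further split would fall below `threshold`.

    `base_cluster` is a hypothetical stand-in for the base clustering
    algorithm; it is applied only at the leaves of the recursion tree.
    """
    half = len(points) // 2
    # Stop when the smaller half would drop below the threshold size.
    if half < threshold:
        return base_cluster(points)
    left = recursive_partition(points[:half], threshold, base_cluster)
    right = recursive_partition(points[half:], threshold, base_cluster)
    # Assumption: leaf results are simply concatenated; the abstract does
    # not say how results from different partitions are merged.
    return left + right


def one_cluster_per_leaf(points: Sequence[T]) -> List[Cluster]:
    # Trivial placeholder: treat every leaf partition as a single cluster.
    return [list(points)]


leaves = recursive_partition(list(range(10)), 2, one_cluster_per_leaf)
print(leaves)
```

    With ten items and a threshold of 2, the recursion stops at leaves of two or three items, so the base algorithm only ever sees small partitions. The two recursive calls are independent, which is why they could in principle be dispatched to separate workers.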

  12. Portability and Cross-Platform Performance of an MPI-Based Parallel Polygon Renderer

    NASA Technical Reports Server (NTRS)

    Crockett, Thomas W.

    1999-01-01

    Visualizing the results of computations performed on large-scale parallel computers is a challenging problem, due to the size of the datasets involved. One approach is to perform the visualization and graphics operations in place, exploiting the available parallelism to obtain the necessary rendering performance. Over the past several years, we have been developing algorithms and software to support visualization applications on NASA's parallel supercomputers. Our results have been incorporated into a parallel polygon rendering system called PGL. PGL was initially developed on tightly-coupled distributed-memory message-passing systems, including Intel's iPSC/860 and Paragon, and IBM's SP2. Over the past year, we have ported it to a variety of additional platforms, including the HP Exemplar, SGI Origin2000, Cray T3E, and clusters of Sun workstations. In implementing PGL, we have had two primary goals: cross-platform portability and high performance. Portability is important because (1) our manpower resources are limited, making it difficult to develop and maintain multiple versions of the code, and (2) NASA's complement of parallel computing platforms is diverse and subject to frequent change. Performance is important in delivering adequate rendering rates for complex scenes and ensuring that parallel computing resources are used effectively. Unfortunately, these two goals are often at odds. In this paper we report on our experiences with portability and performance of the PGL polygon renderer across a range of parallel computing platforms.

  13. Flat-Passband 3 × 3 Interleaving Filter Designed With Optical Directional Couplers in Lattice Structure

    NASA Astrophysics Data System (ADS)

    Wang, Qi Jie; Zhang, Ying; Soh, Yeng Chai

    2005-12-01

    This paper presents a novel lattice optical delay-line circuit using 3 × 3 directional couplers to implement three-port optical interleaving filters. It is shown that the proposed circuit can deliver three channels of 2π/3 phase-shifted interleaving transmission spectra if the coupling ratios of the last two directional couplers are selected appropriately. The other performance requirements of an optical interleaver can be achieved by designing the remaining part of the lattice circuit. A recursive synthesis design algorithm is developed to calculate the design parameters of the lattice circuit that will yield the desired filter response. As illustrative examples, interleavers with maximally flat-top passband transmission and with given transmission performance on passband ripples and passband bandwidth, respectively, are designed to verify the effectiveness of the proposed design scheme.

  14. Concentric ring flywheel with hooked ring carbon fiber separator/torque coupler

    DOEpatents

    Kuklo, Thomas C.

    1999-01-01

    A concentric ring flywheel with expandable separators, which function as torque couplers, between the rings to take up the gap formed between adjacent rings due to differential expansion between different radius rings during rotation of the flywheel. The expandable separators or torque couplers include a hook-like section at an upper end which is positioned over an inner ring and a shelf-like or flange section at a lower end onto which the next adjacent outer ring is positioned. As the concentric rings are rotated the gap formed by the differential expansion there between is partially taken up by the expandable separators or torque couplers to maintain torque and centering attachment of the concentric rings.

  15. Concentric ring flywheel with hooked ring carbon fiber separator/torque coupler

    DOEpatents

    Kuklo, T.C.

    1999-07-20

    A concentric ring flywheel with expandable separators, which function as torque couplers, between the rings to take up the gap formed between adjacent rings due to differential expansion between different radius rings during rotation of the flywheel. The expandable separators or torque couplers include a hook-like section at an upper end which is positioned over an inner ring and a shelf-like or flange section at a lower end onto which the next adjacent outer ring is positioned. As the concentric rings are rotated the gap formed by the differential expansion there between is partially taken up by the expandable separators or torque couplers to maintain torque and centering attachment of the concentric rings. 2 figs.

  16. An Overview of High-performance Parallel Big Data transfers over multiple network channels with Transport Layer Security (TLS) and TLS plus Perfect Forward Secrecy (PFS)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fang, Chin; Corttrell, R. A.

    This Technical Note provides an overview of high-performance parallel Big Data transfers with and without encryption for data in-transit over multiple network channels. It shows that with the parallel approach, it is feasible to carry out high-performance parallel "encrypted" Big Data transfers without serious impact to throughput. But other impacts, e.g. the energy-consumption part should be investigated. It also explains our rationales of using a statistics-based approach for gaining understanding from test results and for improving the system. The presentation is of high-level nature. Nevertheless, at the end we will pose some questions and identify potentially fruitful directions for future work.

  17. Ultracompact and high efficient silicon-based polarization splitter-rotator using a partially-etched subwavelength grating coupler

    PubMed Central

    Xu, Yin; Xiao, Jinbiao

    2016-01-01

    On-chip polarization manipulation is pivotal for the silicon-on-insulator material platform to realize polarization-transparent circuits and polarization-division-multiplexing transmissions, where polarization splitters and rotators are fundamental components. In this work, we propose an ultracompact and highly efficient silicon-based polarization splitter-rotator (PSR) using a partially-etched subwavelength grating (SWG) coupler. The proposed PSR consists of a taper-integrated SWG coupler combined with a partially-etched waveguide between the input and output strip waveguides, so that the input transverse-electric (TE) mode couples and converts to the output transverse-magnetic (TM) mode at the cross port, while the input TM mode remains well confined in the strip waveguide during propagation and exits directly from the bar port with nearly negligible coupling. Moreover, to better separate the input polarizations, an additional tapered waveguide extended from the partially-etched waveguide is also added. The results show an ultracompact PSR only 8.2 μm in length, which is the shortest reported so far. The polarization conversion loss and efficiency are 0.12 dB and 98.52%, respectively, together with crosstalk and reflection loss of −31.41/−22.43 dB and −34.74/−33.13 dB for the input TE/TM mode at a wavelength of 1.55 μm. These attributes make the present device suitable for constructing on-chip compact photonic integrated circuits with polarization independence. PMID:27306112

  18. Design and fabrication of multimode interference couplers based on digital micro-mirror system

    NASA Astrophysics Data System (ADS)

    Wu, Sumei; He, Xingdao; Shen, Chenbo

    2008-03-01

    Multimode interference (MMI) couplers, based on the self-imaging effect (SIE), are accepted popularly in integrated optics. According to the importance of MMI devices, in this paper, we present a novel method to design and fabricate MMI couplers. A technology of maskless lithography to make MMI couplers based on a smart digital micro-mirror device (DMD) system is proposed. A 1×4 MMI device is designed as an example, which shows the present method is efficient and cost-effective.

  19. A traveling-wave forward coupler design for a new accelerating mode in a silicon woodpile accelerator

    DOE PAGES

    Wu, Ziran; Lee, Chunghun H.; Wootton, Kent P.; ...

    2016-03-01

    Silicon woodpile photonic crystals provide a base structure that can be used to build a three-dimensional dielectric waveguide system for high-gradient laser driven acceleration. A new woodpile waveguide design that hosts a phase synchronous, centrally confined accelerating mode is proposed. Compared with previously discovered silicon woodpile accelerating modes, this mode shows advantages in terms of better electron beam loading and higher achievable acceleration gradient. Several traveling-wave coupler design schemes developed for multi-cell RF cavity accelerators are adapted to the woodpile power coupler design for this new accelerating mode. Design of a forward coupled, highly efficient silicon woodpile accelerator is achieved. Simulation shows high efficiency of over 75% of the drive laser power coupled to this fundamental accelerating mode, with less than 15% backward wave scattering. The estimated acceleration gradient, when the coupler structure is driven at the damage threshold fluence of silicon at its operating 1.506 μm wavelength, can reach 185 MV/m. Lastly, a 17-layer woodpile waveguide structure was successfully fabricated, and the measured bandgap is in excellent agreement with simulation.

  20. A traveling-wave forward coupler design for a new accelerating mode in a silicon woodpile accelerator

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wu, Ziran; Lee, Chunghun H.; Wootton, Kent P.

    Silicon woodpile photonic crystals provide a base structure that can be used to build a three-dimensional dielectric waveguide system for high-gradient laser driven acceleration. A new woodpile waveguide design that hosts a phase synchronous, centrally confined accelerating mode is proposed. Compared with previously discovered silicon woodpile accelerating modes, this mode shows advantages in terms of better electron beam loading and higher achievable acceleration gradient. Several traveling-wave coupler design schemes developed for multi-cell RF cavity accelerators are adapted to the woodpile power coupler design for this new accelerating mode. Design of a forward coupled, highly efficient silicon woodpile accelerator is achieved. Simulation shows high efficiency of over 75% of the drive laser power coupled to this fundamental accelerating mode, with less than 15% backward wave scattering. The estimated acceleration gradient, when the coupler structure is driven at the damage threshold fluence of silicon at its operating 1.506 μm wavelength, can reach 185 MV/m. Lastly, a 17-layer woodpile waveguide structure was successfully fabricated, and the measured bandgap is in excellent agreement with simulation.

  1. A study of polaritonic transparency in couplers made from excitonic materials

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Singh, Mahi R.; Racknor, Chris

    2015-03-14

    We have studied light-matter interaction in quantum dot and exciton-polaritonic coupler hybrid systems. The coupler is made by embedding two slabs of an excitonic material (CdS) into a host excitonic material (ZnO). An ensemble of non-interacting quantum dots is doped in the coupler. The bound exciton-polariton states are calculated in the coupler using the transfer matrix method in the presence of the coupling between the external light (photons) and excitons. These bound exciton-polaritons interact with the excitons present in the quantum dots, and the coupler acts as a reservoir. The Schrödinger equation method has been used to calculate the absorption coefficient in quantum dots. It is found that when the distance between the two slabs (CdS) is greater than the decay length of evanescent waves, the absorption spectrum has two peaks and one minimum. The minimum corresponds to a transparent state in the system. However, when the distance between the slabs is smaller than the decay length of evanescent waves, the absorption spectrum has three peaks and two transparent states. In other words, one transparent state can be switched to two transparent states when the distance between the two layers is modified. This could be achieved by applying stress and strain fields. It is also found that transparent states can be switched on and off by applying an external control laser field.

  2. Deriving the real-ear SPL of audiometric data using the "coupler to dial difference" and the "real ear to coupler difference".

    PubMed

    Munro, K J; Davis, J

    2003-04-01

    The purpose of the study was to compare the measured real-ear sound pressure level (SPL) of audiometer output with the derived real-ear SPL obtained by adding the coupler to dial difference (CDD) and real-ear to coupler difference (RECD) to the audiometer dial reading. The real-ear SPL and RECD were measured in one ear of 16 normally hearing subjects using a probe-tube microphone. The CDD transform and the RECD transfer function were measured in an HA1 and an HA2 2-cc coupler using an EAR-LINK foam ear-tip or a customized earmold. The RECD transfer function was measured using the EARTone ER 3A and the Audioscan RE770 insert earphone. The procedures were very reliable with mean differences on retest of less than 1 dB. The mean difference between the measured and derived real-ear SPL was generally less than 1 dB and rarely exceeded 3 dB in any subject. The CDD measured for an individual audiometer and the RECD measured for an individual ear can be used to derive a valid estimate of real-ear SPL when it has not been possible to measure this directly.
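
    The derivation described in this abstract is a simple sum in decibels: the audiometer dial reading plus the coupler to dial difference (CDD) plus the real-ear to coupler difference (RECD). A minimal sketch follows; the function name and the numeric values are hypothetical and chosen only to illustrate the arithmetic, not taken from the study.

```python
def derived_real_ear_spl(dial_reading_db: float, cdd_db: float, recd_db: float) -> float:
    """Estimate real-ear SPL (dB SPL) from the audiometer dial reading.

    dial_reading_db: audiometer dial setting in dB
    cdd_db:          coupler to dial difference for this audiometer, in dB
    recd_db:         real-ear to coupler difference for this ear, in dB
    """
    # Adding the CDD gives the SPL in the 2-cc coupler; adding the RECD then
    # gives the estimated SPL in the real ear.
    return dial_reading_db + cdd_db + recd_db


# Hypothetical example values (not from the paper).
print(derived_real_ear_spl(dial_reading_db=50.0, cdd_db=5.5, recd_db=8.0))
```

    In practice the CDD and RECD are frequency dependent, so the sum would be evaluated separately at each audiometric frequency.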

  3. Improving estimations of greenhouse gas transfer velocities by atmosphere-ocean couplers in Earth-System and regional models

    NASA Astrophysics Data System (ADS)

    Vieira, V. M. N. C. S.; Sahlée, E.; Jurus, P.; Clementi, E.; Pettersson, H.; Mateus, M.

    2015-09-01

    Earth-System and regional models, forecasting climate change and its impacts, simulate atmosphere-ocean gas exchanges using classical yet overly simple generalizations that rely on wind speed as the sole mediator while neglecting factors such as sea-surface agitation, atmospheric stability, current drag with the bottom, rain and surfactants. These have been shown to be fundamental for accurate estimates, particularly in the coastal ocean, where a significant part of the atmosphere-ocean greenhouse gas exchanges occurs. We include several of these factors in a customizable algorithm proposed as the basis of novel couplers of the atmospheric and oceanographic model components. We tested performance with measured and simulated data from the European coastal ocean and found that our algorithm forecasts greenhouse gas exchanges largely different from those forecast by the generalization currently in use. Our algorithm allows calculus vectorization and parallel processing, improving computational speed roughly 12× on a single CPU core, an essential feature for Earth-System model applications.

  4. Design of the 1.5 MW, 30-96 MHz ultra-wideband 3 dB high power hybrid coupler for Ion Cyclotron Resonance Frequency (ICRF) heating in fusion grade reactor.

    PubMed

    Yadav, Rana Pratap; Kumar, Sunil; Kulkarni, S V

    2016-01-01

    The design and development procedure of a strip-line based 1.5 MW, 30-96 MHz, ultra-wideband high power 3 dB hybrid coupler is presented, and its applicability in ion cyclotron resonance heating (ICRH) in tokamaks is discussed. For high power handling capability, the spacing between conductors and ground needs to be very large. Hence other structural parameters such as strip width, strip thickness, coupling gap, and junction also become large. They can be increased only up to an optimum limit, beyond which various constraints such as fabrication tolerance, discontinuities, and excitation of higher TE and TM modes become prominent and significantly deteriorate the desired parameters of the coupled-line system. In the designed hybrid coupler, two 8.34 dB coupled lines are connected in tandem to obtain the desired coupling of 3 dB, and air is used as the dielectric. The spacing between ground and conductors is taken as 0.164 m for 1.5 MW power handling capability. To obtain the desired spacing, each of the 8.34 dB segments is designed with inner dimensions of 3.6 × 1.0 × 40 cm; the constraints have been identified, compensated for, and applied in designing the 1.5 MW hybrid coupler, as presented in the paper.
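
    The statement that two 8.34 dB coupled lines in tandem give 3 dB coupling can be checked with a short calculation. The sketch below assumes ideal, lossless, matched quadrature couplers, for which a section with voltage coupling k = sin θ combines in tandem to sin 2θ; this idealized relation and the function name are assumptions for illustration, not the authors' design procedure.

```python
import math


def tandem_coupling_db(section_coupling_db: float) -> float:
    """Coupling (in dB) of two identical ideal couplers connected in tandem.

    Assumes lossless, matched quadrature couplers, so that a section with
    voltage coupling k = sin(theta) yields sin(2 * theta) overall.
    """
    k = 10 ** (-section_coupling_db / 20)   # dB -> voltage coupling factor
    theta = math.asin(k)                    # equivalent coupling angle
    k_total = math.sin(2 * theta)           # tandem connection doubles the angle
    return -20 * math.log10(k_total)        # back to dB (positive number)


print(f"{tandem_coupling_db(8.34):.2f} dB")
```

    For an 8.34 dB section the coupling angle is close to 22.5 degrees, so the tandem pair sits near 45 degrees, which corresponds to an equal (3 dB) power split.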

  5. Design of the 1.5 MW, 30-96 MHz ultra-wideband 3 dB high power hybrid coupler for Ion Cyclotron Resonance Frequency (ICRF) heating in fusion grade reactor

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yadav, Rana Pratap, E-mail: ranayadav97@gmail.com; Kumar, Sunil; Kulkarni, S. V.

    2016-01-15

    The design and development procedure of a strip-line based 1.5 MW, 30-96 MHz, ultra-wideband high power 3 dB hybrid coupler is presented, and its applicability in ion cyclotron resonance heating (ICRH) in tokamaks is discussed. For high power handling capability, the spacing between conductors and ground needs to be very large. Hence other structural parameters such as strip width, strip thickness, coupling gap, and junction also become large. They can be increased only up to an optimum limit, beyond which various constraints such as fabrication tolerance, discontinuities, and excitation of higher TE and TM modes become prominent and significantly deteriorate the desired parameters of the coupled-line system. In the designed hybrid coupler, two 8.34 dB coupled lines are connected in tandem to obtain the desired coupling of 3 dB, and air is used as the dielectric. The spacing between ground and conductors is taken as 0.164 m for 1.5 MW power handling capability. To obtain the desired spacing, each of the 8.34 dB segments is designed with inner dimensions of 3.6 × 1.0 × 40 cm; the constraints have been identified, compensated for, and applied in designing the 1.5 MW hybrid coupler, as presented in the paper.

  6. Design of the 1.5 MW, 30-96 MHz ultra-wideband 3 dB high power hybrid coupler for Ion Cyclotron Resonance Frequency (ICRF) heating in fusion grade reactor

    NASA Astrophysics Data System (ADS)

    Yadav, Rana Pratap; Kumar, Sunil; Kulkarni, S. V.

    2016-01-01

    The design and development procedure of a strip-line based 1.5 MW, 30-96 MHz, ultra-wideband high power 3 dB hybrid coupler is presented, and its applicability in ion cyclotron resonance heating (ICRH) in tokamaks is discussed. For high power handling capability, the spacing between conductors and ground needs to be very large. Hence other structural parameters such as strip width, strip thickness, coupling gap, and junction also become large. They can be increased only up to an optimum limit, beyond which various constraints such as fabrication tolerance, discontinuities, and excitation of higher TE and TM modes become prominent and significantly deteriorate the desired parameters of the coupled-line system. In the designed hybrid coupler, two 8.34 dB coupled lines are connected in tandem to obtain the desired coupling of 3 dB, and air is used as the dielectric. The spacing between ground and conductors is taken as 0.164 m for 1.5 MW power handling capability. To obtain the desired spacing, each of the 8.34 dB segments is designed with inner dimensions of 3.6 × 1.0 × 40 cm; the constraints have been identified, compensated for, and applied in designing the 1.5 MW hybrid coupler, as presented in the paper.

  7. Integrating Cache Performance Modeling and Tuning Support in Parallelization Tools

    NASA Technical Reports Server (NTRS)

    Waheed, Abdul; Yan, Jerry; Saini, Subhash (Technical Monitor)

    1998-01-01

    With the resurgence of distributed shared memory (DSM) systems based on cache-coherent Non Uniform Memory Access (ccNUMA) architectures and increasing disparity between memory and processor speeds, data locality overheads are becoming the greatest bottlenecks in the way of realizing the potential high performance of these systems. While parallelization tools and compilers help users port their sequential applications to a DSM system, a lot of time and effort is needed to tune the memory performance of these applications to achieve reasonable speedup. In this paper, we show that integrating cache performance modeling and tuning support within a parallelization environment can alleviate this problem. The Cache Performance Modeling and Prediction Tool (CPMP) employs trace-driven simulation techniques without the overhead of generating and managing detailed address traces. CPMP predicts the cache performance impact of source code level "what-if" modifications in a program to assist a user in the tuning process. CPMP is built on top of a customized version of the Computer Aided Parallelization Tools (CAPTools) environment. Finally, we demonstrate how CPMP can be applied to tune a real Computational Fluid Dynamics (CFD) application.

  8. Parallel Markov chain Monte Carlo - bridging the gap to high-performance Bayesian computation in animal breeding and genetics.

    PubMed

    Wu, Xiao-Lin; Sun, Chuanyu; Beissinger, Timothy M; Rosa, Guilherme Jm; Weigel, Kent A; Gatti, Natalia de Leon; Gianola, Daniel

    2012-09-25

    Most Bayesian models for the analysis of complex traits are not analytically tractable and inferences are based on computationally intensive techniques. This is true of Bayesian models for genome-enabled selection, which uses whole-genome molecular data to predict the genetic merit of candidate animals for breeding purposes. In this regard, parallel computing can overcome the bottlenecks that can arise from series computing. Hence, a major goal of the present study is to bridge the gap to high-performance Bayesian computation in the context of animal breeding and genetics. Parallel Monte Carlo Markov chain algorithms and strategies are described in the context of animal breeding and genetics. Parallel Monte Carlo algorithms are introduced as a starting point including their applications to computing single-parameter and certain multiple-parameter models. Then, two basic approaches for parallel Markov chain Monte Carlo are described: one aims at parallelization within a single chain; the other is based on running multiple chains, yet some variants are discussed as well. Features and strategies of the parallel Markov chain Monte Carlo are illustrated using real data, including a large beef cattle dataset with 50K SNP genotypes. Parallel Markov chain Monte Carlo algorithms are useful for computing complex Bayesian models, which does not only lead to a dramatic speedup in computing but can also be used to optimize model parameters in complex Bayesian models. Hence, we anticipate that use of parallel Markov chain Monte Carlo will have a profound impact on revolutionizing the computational tools for genomic selection programs.
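
    The second approach mentioned in this abstract, running multiple chains side by side, can be sketched as below. This is only an illustration under stated assumptions: the target is a standard normal density rather than a genomic model, the sampler is a plain random-walk Metropolis algorithm, and the names `log_target`, `run_chain` and `run_parallel_chains` are hypothetical. A thread pool is used to keep the example self-contained; for CPU-bound chains in Python a process pool would usually be substituted to obtain a real speedup.

```python
import math
import random
from concurrent.futures import ThreadPoolExecutor
from typing import List


def log_target(x: float) -> float:
    # Log-density of a standard normal, up to an additive constant.
    return -0.5 * x * x


def run_chain(seed: int, n_steps: int = 5000, step: float = 1.0) -> List[float]:
    """One random-walk Metropolis chain with its own seeded generator."""
    rng = random.Random(seed)
    x = 0.0
    samples = []
    for _ in range(n_steps):
        proposal = x + rng.gauss(0.0, step)
        log_ratio = log_target(proposal) - log_target(x)
        # Accept with probability min(1, target(proposal) / target(x)).
        if rng.random() < math.exp(min(0.0, log_ratio)):
            x = proposal
        samples.append(x)
    return samples


def run_parallel_chains(seeds: List[int], n_steps: int = 5000) -> List[List[float]]:
    # Each chain is independent, so the chains can be dispatched concurrently.
    with ThreadPoolExecutor(max_workers=len(seeds)) as pool:
        return list(pool.map(lambda s: run_chain(s, n_steps), seeds))


chains = run_parallel_chains([1, 2, 3, 4], n_steps=2000)
pooled = [x for chain in chains for x in chain]
print(len(chains), sum(pooled) / len(pooled))
```

    Because the chains share no state, their samples can be pooled after the run, and giving each chain a distinct seed keeps the results reproducible.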

  9. Parallel Markov chain Monte Carlo - bridging the gap to high-performance Bayesian computation in animal breeding and genetics

    PubMed Central

    2012-01-01

    Background: Most Bayesian models for the analysis of complex traits are not analytically tractable and inferences are based on computationally intensive techniques. This is true of Bayesian models for genome-enabled selection, which uses whole-genome molecular data to predict the genetic merit of candidate animals for breeding purposes. In this regard, parallel computing can overcome the bottlenecks that can arise from series computing. Hence, a major goal of the present study is to bridge the gap to high-performance Bayesian computation in the context of animal breeding and genetics. Results: Parallel Monte Carlo Markov chain algorithms and strategies are described in the context of animal breeding and genetics. Parallel Monte Carlo algorithms are introduced as a starting point including their applications to computing single-parameter and certain multiple-parameter models. Then, two basic approaches for parallel Markov chain Monte Carlo are described: one aims at parallelization within a single chain; the other is based on running multiple chains, yet some variants are discussed as well. Features and strategies of the parallel Markov chain Monte Carlo are illustrated using real data, including a large beef cattle dataset with 50K SNP genotypes. Conclusions: Parallel Markov chain Monte Carlo algorithms are useful for computing complex Bayesian models, which does not only lead to a dramatic speedup in computing but can also be used to optimize model parameters in complex Bayesian models. Hence, we anticipate that use of parallel Markov chain Monte Carlo will have a profound impact on revolutionizing the computational tools for genomic selection programs. PMID:23009363

  10. Experimental study of switching in a p-i(MQW)-n vertical coupler

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cavailles, J.A.; Erman, M.; Woodbridge, K.

    1989-11-01

    Electrically controlled switching in a vertically arranged directional coupler with GaAs/GaAlAs multiple quantum well waveguides is demonstrated. Coupling lengths and extinction parameters are determined by using a sample processed in such a way that injection conditions are well defined and that the coupler length can be varied continuously.

  11. Performance Analysis of Multilevel Parallel Applications on Shared Memory Architectures

    NASA Technical Reports Server (NTRS)

    Biegel, Bryan A. (Technical Monitor); Jost, G.; Jin, H.; Labarta, J.; Gimenez, J.; Caubet, J.

    2003-01-01

    Parallel programming paradigms include process level parallelism, thread level parallelization, and multilevel parallelism. This viewgraph presentation describes a detailed performance analysis of these paradigms for Shared Memory Architecture (SMA). This analysis uses the Paraver Performance Analysis System. The presentation includes diagrams of a flow of useful computations.

  12. Overlapping-image multimode interference couplers with a reduced number of self-images for uniform and nonuniform power splitting

    NASA Astrophysics Data System (ADS)

    Bachmann, M.; Besse, P. A.; Melchior, H.

    1995-10-01

    Overlapping-image multimode interference (MMI) couplers, a new class of devices, permit uniform and nonuniform power splitting. A theoretical description directly relates coupler geometry to image intensities, positions, and phases. Among many possibilities of nonuniform power splitting, examples of 1 × 2 couplers with ratios of 15:85 and 28:72 are given. An analysis of uniform power splitters includes the well-known 2 × N and 1 × N MMI couplers. Applications of MMI couplers include mode filters, mode splitters-combiners, and mode converters.

  13. Element for use in an inductive coupler for downhole drilling components

    DOEpatents

    Hall, David R.; Hall, Jr., H. Tracy; Pixton, David S.; Dahlgren, Scott; Fox, Joe; Sneddon, Cameron

    2006-08-29

    The present invention includes an element for use in an inductive coupler in a downhole component. The element includes a plurality of ductile, generally U-shaped leaves that are electrically conductive. The leaves are less than about 0.0625" thick and are separated by an electrically insulating material. These leaves are aligned so as to form a generally circular trough. The invention also includes an inductive coupler for use in downhole components, the inductive coupler including an annular housing having a recess with a magnetically conductive, electrically insulating (MCEI) element disposed in the recess. The MCEI element includes a plurality of segments where each segment further includes a plurality of ductile, generally U-shaped electrically conductive leaves. Each leaf is less than about 0.0625" thick and separated from the otherwise adjacent leaves by electrically insulating material. The segments and leaves are aligned so as to form a generally circular trough. The inductive coupler further includes an insulated conductor disposed within the generally circular trough. A polymer fills spaces between otherwise adjacent segments, the annular housing, insulated conductor, and further fills the circular trough.

  14. Waveguide Multimode Directional Coupler for Harvesting Harmonic Power from the Output of Traveling-Wave Tube Amplifiers

    NASA Technical Reports Server (NTRS)

    Simons, Rainee N.; Wintucky, Edwin G.

    2017-01-01

    The paper presents the design, fabrication, and test results for a novel waveguide multimode directional coupler (MDC). The coupler, fabricated from dissimilar frequency band waveguides, is capable of isolating power at the 2nd harmonic frequency from the fundamental power at the output port of a high power traveling-wave tube amplifier. The major advantage of the MDC is significantly lower insertion loss compared to a diplexer. The presentation slides for the paper that was approved are attached. The tracking number for the paper that was approved is TN 37015.

  15. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers

    DOE PAGES

    Abraham, Mark James; Murtola, Teemu; Schulz, Roland; ...

    2015-07-15

    GROMACS is one of the most widely used open-source and free software codes in chemistry, used primarily for dynamical simulations of biomolecules. It provides a rich set of calculation types, preparation and analysis tools. Several advanced techniques for free-energy calculations are supported. In version 5, it reaches new performance heights, through several new and enhanced parallelization algorithms. These work on every level: SIMD registers inside cores, multithreading, heterogeneous CPU–GPU acceleration, state-of-the-art 3D domain decomposition, and ensemble-level parallelization through built-in replica exchange and the separate Copernicus framework. Finally, the latest best-in-class compressed trajectory storage format is supported.

  16. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Abraham, Mark James; Murtola, Teemu; Schulz, Roland

    GROMACS is one of the most widely used open-source and free software codes in chemistry, used primarily for dynamical simulations of biomolecules. It provides a rich set of calculation types, preparation and analysis tools. Several advanced techniques for free-energy calculations are supported. In version 5, it reaches new performance heights, through several new and enhanced parallelization algorithms. These work on every level: SIMD registers inside cores, multithreading, heterogeneous CPU–GPU acceleration, state-of-the-art 3D domain decomposition, and ensemble-level parallelization through built-in replica exchange and the separate Copernicus framework. Finally, the latest best-in-class compressed trajectory storage format is supported.

  17. Performance of the Wavelet Decomposition on Massively Parallel Architectures

    NASA Technical Reports Server (NTRS)

    El-Ghazawi, Tarek A.; LeMoigne, Jacqueline; Zukor, Dorothy (Technical Monitor)

    2001-01-01

    Traditionally, Fourier Transforms have been utilized for performing signal analysis and representation. But although it is straightforward to reconstruct a signal from its Fourier transform, no local description of the signal is included in its Fourier representation. To alleviate this problem, Windowed Fourier transforms and then wavelet transforms have been introduced, and it has been proven that wavelets give a better localization than traditional Fourier transforms, as well as a better division of the time- or space-frequency plane than Windowed Fourier transforms. Because of these properties and after the development of several fast algorithms for computing the wavelet representation of any signal, in particular the Multi-Resolution Analysis (MRA) developed by Mallat, wavelet transforms have increasingly been applied to signal analysis problems, especially real-life problems, in which speed is critical. In this paper we present and compare efficient wavelet decomposition algorithms on different parallel architectures. We report and analyze experimental measurements, using NASA remotely sensed images. Results show that our algorithms achieve significant performance gains on current high performance parallel systems, and meet scientific applications and multimedia requirements. The extensive performance measurements collected over a number of high-performance computer systems have revealed important architectural characteristics of these systems, in relation to the processing demands of the wavelet decomposition of digital images.
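
    As an illustration of the multi-resolution idea mentioned in this abstract, the following is a minimal serial sketch of a multi-level Haar wavelet decomposition. The Haar filter, the function names, and the sample signal are assumptions chosen for brevity; this is not the authors' parallel algorithm, and the abstract does not say which wavelet filters were used.

```python
import math

def haar_step(signal):
    """One level of the orthonormal Haar transform: returns (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail

def haar_decompose(signal, levels):
    """Repeatedly split the approximation, keeping the detail of every level."""
    approx = list(signal)
    details = []
    for _ in range(levels):
        if len(approx) < 2 or len(approx) % 2 != 0:
            raise ValueError("signal length must be divisible by 2**levels")
        approx, detail = haar_step(approx)
        details.append(detail)
    return approx, details

def haar_reconstruct(approx, details):
    """Invert haar_decompose by merging details from coarsest to finest."""
    s = 1.0 / math.sqrt(2.0)
    current = list(approx)
    for detail in reversed(details):
        merged = []
        for a, d in zip(current, detail):
            merged.append((a + d) * s)
            merged.append((a - d) * s)
        current = merged
    return current

# Hypothetical 8-sample signal, decomposed over 3 levels.
signal = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
coarse, details = haar_decompose(signal, 3)
restored = haar_reconstruct(coarse, details)
```

    Each level halves the length of the approximation, and the pairwise sums and differences within a level are independent of one another, which is the kind of structure a parallel implementation can distribute across processors.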

  18. Experimental study of a VBG-based Tm : YLF slab laser at different output coupler parameters

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Duan, X M; Ding, Y; Dai, T Y

    2015-04-30

    The performance of a Tm : YLF slab laser is studied at different output coupler parameters. Use is made of a 20-mm-long a-cut slab crystal doped with 2.5 at. % thulium ions. With a volume Bragg grating and a Fabry – Perot etalon, the selected output wavelength of this Tm : YLF slab laser is 1908 nm. For the optimised output coupler with a transmission of 20% and a radius of curvature of 300 mm, the output power exceeds 74.1 W and the slope efficiency with respect to the absorbed pump power reaches 48.4%. In addition, the beam quality of the Tm : YLF slab laser is improved. (lasers)

  19. A Tandem Coupler for Terahertz Integrated Circuits

    NASA Technical Reports Server (NTRS)

    Reck, Theodore J.; Deal, William; Chattopadhyay, Goutam

    2013-01-01

    A coplanar waveguide 3 dB quadrature coupler operating from 500 to 700 GHz is designed, fabricated and measured. On-wafer measurements demonstrate an amplitude balance of +/-2 dB and phase balance of +/-20 deg.

  20. Hybrid unidirectional meta-coupler for vertical incidence to a high-refractive-index waveguide in telecom wavelength.

    PubMed

    Gong, Chensheng; Zhang, Jianhao; He, Sailing

    2017-12-15

    Unidirectional optical manipulation, especially the coupling from a vertical light beam to a waveguide unidirectionally, is desirable in photonic integration. We first propose a hybrid unidirectional meta-coupler for vertical incidence to a high-refractive-index waveguide in telecom wavelength; a periodic plasmonic metasurface composed of metal-insulator-metal unit cells is used for phase matching. Three designs are given for devices working around wavelengths 0.85, 1.31, and 1.55 μm. The simulated coupling efficiencies are all around 70%, and the 1 dB coupling bandwidths are 29, 82, and 105 nm, respectively. Our approach paves the way for the applications of optical metasurfaces to planar lightwave circuits.

  1. Enhancement of coupling ratios in SOI based asymmetrical optical directional couplers

    NASA Astrophysics Data System (ADS)

    Pendam, Nagaraju; Vardhani, Chunduru Parvatha

    2017-11-01

    A novel slab structured asymmetrical optical directional coupler with S-bend waveguides on a silicon-on-insulator (SOI) platform has been designed using the R-Soft CAD tool. The beam propagation method (BPM) is used for light propagation analysis. The simulation results of asymmetrical optical directional couplers are reported. We find that the asymmetrical directional coupler has lower coupling ratios and higher extinction ratios with waveguide parameters such as width, wavelength, waveguide spacing, and coupling length. Simulation results indicate that the coupling efficiency for transverse electric (TE) and transverse magnetic (TM) modes can exceed 95%, with an extinction ratio of about 6 dB, when the coupling length is 6 mm for both polarization modes; the insertion loss is 17 dB at the same coupling length of 6 mm and a central wavelength of 1550 nm.

  2. Multiplexed Energy Coupler for Rotating Equipment

    NASA Technical Reports Server (NTRS)

    Zhao, Xiaoliang

    2011-01-01

    A multiplexing antenna assembly can efficiently couple AC signal/energy into, or out of, rotating equipment. The unit only passes AC energy while blocking DC energy. Concentric tubes that are sliced into multiple pieces are assembled together so that, when a piece from an outer tube aligns well with an inner tube piece, efficient energy coupling is achieved through a capacitive scheme. With N outer pieces and M inner pieces, an effective N x M combination can be achieved in a multiplexed manner. The energy coupler is non-contact, which is useful if isolation from rotating and stationary parts is required. Additionally, the innovation can operate in high temperatures. Applications include rotating structure sensing, non-contact energy transmission, etc.

  3. Proposal for fabrication-tolerant SOI polarization splitter-rotator based on cascaded MMI couplers and an assisted bi-level taper

    PubMed Central

    Wang, Jing; Qi, Minghao; Xuan, Yi; Huang, Haiyang; Li, You; Li, Ming; Chen, Xin; Jia, Qi; Sheng, Zhen; Wu, Aimin; Li, Wei; Wang, Xi; Zou, Shichang; Gan, Fuwan

    2014-01-01

    A novel silicon-on-insulator (SOI) polarization splitter-rotator (PSR) with a large fabrication tolerance is proposed based on cascaded multimode interference (MMI) couplers and an assisted mode-evolution taper. The tapers are designed to adiabatically convert the input TM0 mode into the TE1 mode, which will output as the TE0 mode after being processed by the subsequent MMI mode converter, 90-degree phase shifter (PS) and MMI 3 dB coupler. The numerical simulation results show that the proposed device has a < 0.5 dB insertion loss with < −17 dB crosstalk in the C optical communication band. Fabrication tolerance analysis is also performed with respect to the deviations of MMI coupler width, PS width, slab height and upper-cladding refractive index, showing that this device could work well even when affected by considerable fabrication errors. With such robust performance and a large bandwidth, this device offers potential applications for CMOS-compatible polarization diversity, especially in the booming 100 Gb/s coherent optical communications based on silicon photonics technology. PMID:25402029

  4. Proposal for fabrication-tolerant SOI polarization splitter-rotator based on cascaded MMI couplers and an assisted bi-level taper.

    PubMed

    Wang, Jing; Qi, Minghao; Xuan, Yi; Huang, Haiyang; Li, You; Li, Ming; Chen, Xin; Jia, Qi; Sheng, Zhen; Wu, Aimin; Li, Wei; Wang, Xi; Zou, Shichang; Gan, Fuwan

    2014-11-17

    A novel silicon-on-insulator (SOI) polarization splitter-rotator (PSR) with a large fabrication tolerance is proposed based on cascaded multimode interference (MMI) couplers and an assisted mode-evolution taper. The tapers are designed to adiabatically convert the input TM(0) mode into the TE(1) mode, which will output as the TE(0) mode after being processed by the subsequent MMI mode converter, 90-degree phase shifter (PS) and MMI 3 dB coupler. The numerical simulation results show that the proposed device has a < 0.5 dB insertion loss with < -17 dB crosstalk in the C optical communication band. Fabrication tolerance analysis is also performed with respect to the deviations of MMI coupler width, PS width, slab height and upper-cladding refractive index, showing that this device could work well even when affected by considerable fabrication errors. With such robust performance and a large bandwidth, this device offers potential applications for CMOS-compatible polarization diversity, especially in the booming 100 Gb/s coherent optical communications based on silicon photonics technology.

  5. Integrated-optical directional coupler biosensor

    NASA Astrophysics Data System (ADS)

    Luff, B. J.; Harris, R. D.; Wilkinson, J. S.; Wilson, R.; Schiffrin, D. J.

    1996-04-01

    We present measurements of biomolecular binding reactions, using a new type of integrated-optical biosensor based on a planar directional coupler structure. The device is fabricated by Ag+ - Na+ ion exchange in glass, and definition of the sensing region is achieved by use of transparent fluoropolymer isolation layers formed by thermal evaporation. The suitability of the sensor for application to the detection of environmental pollutants is considered.

  6. A parallel calibration utility for WRF-Hydro on high performance computers

    NASA Astrophysics Data System (ADS)

    Wang, J.; Wang, C.; Kotamarthi, V. R.

    2017-12-01

    Successful modeling of complex hydrological processes comprises establishing an integrated hydrological model which simulates the hydrological processes in each water regime, calibrating and validating the model performance based on observation data, and estimating the uncertainties from different sources, especially those associated with parameters. Such a model system requires large computing resources and often has to be run on High Performance Computers (HPC). The recently developed WRF-Hydro modeling system provides a significant advancement in the capability to simulate regional water cycles more completely. The WRF-Hydro model has a large range of parameters such as those in the input table files (GENPARM.TBL, SOILPARM.TBL and CHANPARM.TBL) and several distributed scaling factors such as OVROUGHRTFAC. These parameters affect the behavior and outputs of the model and thus may need to be calibrated against the observations in order to obtain a good modeling performance. Having a parameter calibration tool specifically for automated calibration and uncertainty estimates of the WRF-Hydro model can provide significant convenience for the modeling community. In this study, we developed a customized tool using the parallel version of the model-independent parameter estimation and uncertainty analysis tool, PEST, to enable it to run on HPC with PBS and SLURM workload manager and job scheduler. We also developed a series of PEST input file templates that are specifically for WRF-Hydro model calibration and uncertainty analysis. Here we will present a case study of a flood that occurred in April 2013 over the Midwest. The sensitivity and uncertainties are analyzed using the customized PEST tool we developed.

  7. Interfacing Computer Aided Parallelization and Performance Analysis

    NASA Technical Reports Server (NTRS)

    Jost, Gabriele; Jin, Haoqiang; Labarta, Jesus; Gimenez, Judit; Biegel, Bryan A. (Technical Monitor)

    2003-01-01

    When porting sequential applications to parallel computer architectures, the program developer will typically go through several cycles of source code optimization and performance analysis. We have started a project to develop an environment where the user can jointly navigate through program structure and performance data information in order to make efficient optimization decisions. In a prototype implementation we have interfaced the CAPO computer aided parallelization tool with the Paraver performance analysis tool. We describe both tools and their interface and give an example for how the interface helps within the program development cycle of a benchmark code.

  8. Multistability and switching in oppositely-directed saturated coupler

    NASA Astrophysics Data System (ADS)

    Nithyanandan, K.; Shafeeque Ali, A. K.; Porsezian, K.; Nishad, M. P. M.; Tchofo Dinda, P.; Grelu, Ph.

    2018-06-01

    We investigate theoretically the optical multistability that takes place in a two-core oppositely-directed saturated coupler (ODSC) having negative index material (NIM) channel. The dynamics are studied using the Lagrangian variational method, and analytical solutions are constructed with Jacobi elliptic functions. The ODSC exhibits a bandgap as a consequence of the effective feedback mechanism due to the opposite directionality of the phase velocity and the Poynting vector in the NIM channel. Depending on the strength of the nonlinear saturation, the system admits multiple stable states. Considering the additional degrees of design freedom with respect to conventional nonlinear couplers, the ODSC could become an attractive choice for all-optical switching. The existence of multiple transmission resonance windows could also facilitate the realization of gap solitons.

  9. Parallel Processing at the High School Level.

    ERIC Educational Resources Information Center

    Sheary, Kathryn Anne

    This study investigated the ability of high school students to cognitively understand and implement parallel processing. Data indicates that most parallel processing is being taught at the university level. Instructional modules on C, Linux, and the parallel processing language, P4, were designed to show that high school students are highly…

  10. Theoretical and experimental analysis of a linear accelerator endowed with single feed coupler with movable short-circuit.

    PubMed

    Dal Forno, Massimo; Craievich, Paolo; Penco, Giuseppe; Vescovo, Roberto

    2013-11-01

    The front-end injection systems of the FERMI@Elettra linac produce high brightness electron beams that define the performance of the Free Electron Laser. The photoinjector mainly consists of the radiofrequency (rf) gun and of two S-band rf structures which accelerate the beam. Accelerating structures endowed with a single feed coupler cause deflection and degradation of the electron beam properties, due to the asymmetry of the electromagnetic field. In this paper, a new type of single feed structure with movable short-circuit is proposed. It has the advantage of having only one waveguide input, but we propose a novel design where the dipolar component is reduced. Moreover, the racetrack geometry allows the quadrupolar component to be reduced. This paper presents the microwave design and the analysis of the particle motion inside the linac. A prototype has been machined at the Elettra facility to verify the new coupler design and the rf field has been measured by adopting the bead-pull method. The results are presented here, showing good agreement with the expectations.

  11. Highly parallel sparse Cholesky factorization

    NASA Technical Reports Server (NTRS)

    Gilbert, John R.; Schreiber, Robert

    1990-01-01

    Several fine grained parallel algorithms were developed and compared to compute the Cholesky factorization of a sparse matrix. The experimental implementations are on the Connection Machine, a distributed memory SIMD machine whose programming model conceptually supplies one processor per data element. In contrast to special purpose algorithms in which the matrix structure conforms to the connection structure of the machine, the focus is on matrices with arbitrary sparsity structure. The most promising algorithm is one whose inner loop performs several dense factorizations simultaneously on a 2-D grid of processors. Virtually any massively parallel dense factorization algorithm can be used as the key subroutine. The sparse code attains execution rates comparable to those of the dense subroutine. Although at present architectural limitations prevent the dense factorization from realizing its potential efficiency, it is concluded that a regular data parallel architecture can be used efficiently to solve arbitrarily structured sparse problems. A performance model is also presented and it is used to analyze the algorithms.
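
    The abstract treats dense Cholesky factorization as the key subroutine of its sparse algorithm. The following is a minimal serial sketch of that dense step, assuming a small symmetric positive definite matrix stored as nested lists; the function name and the sample matrix are illustrative and are not taken from the report, and the Connection Machine data layout is not modeled.

```python
import math

def cholesky(a):
    """Return the lower-triangular factor L with A = L * L^T (A must be SPD)."""
    n = len(a)
    lower = [[0.0] * n for _ in range(n)]
    for j in range(n):
        # Diagonal entry: remove the contribution of columns already computed.
        diag = a[j][j] - sum(lower[j][k] ** 2 for k in range(j))
        if diag <= 0.0:
            raise ValueError("matrix is not positive definite")
        lower[j][j] = math.sqrt(diag)
        # Entries below the diagonal in column j.
        for i in range(j + 1, n):
            partial = sum(lower[i][k] * lower[j][k] for k in range(j))
            lower[i][j] = (a[i][j] - partial) / lower[j][j]
    return lower

# Hypothetical test matrix with an exact integer factor.
matrix = [[4.0, 12.0, -16.0],
          [12.0, 37.0, -43.0],
          [-16.0, -43.0, 98.0]]
factor = cholesky(matrix)
```

    For this matrix the factor is [[2, 0, 0], [6, 1, 0], [-8, 5, 3]]. The entries below the diagonal within one column do not depend on each other, which is one source of the data parallelism that such algorithms can exploit.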

  12. Parallelized direct execution simulation of message-passing parallel programs

    NASA Technical Reports Server (NTRS)

    Dickens, Phillip M.; Heidelberger, Philip; Nicol, David M.

    1994-01-01

    As massively parallel computers proliferate, there is growing interest in finding ways by which performance of massively parallel codes can be efficiently predicted. This problem arises in diverse contexts such as parallelizing computers, parallel performance monitoring, and parallel algorithm development. In this paper we describe one solution where one directly executes the application code, but uses a discrete-event simulator to model details of the presumed parallel machine such as operating system and communication network behavior. Because this approach is computationally expensive, we are interested in its own parallelization, specifically the parallelization of the discrete-event simulator. We describe methods suitable for parallelized direct execution simulation of message-passing parallel programs, and report on the performance of such a system, Large Application Parallel Simulation Environment (LAPSE), we have built on the Intel Paragon. On all codes measured to date, LAPSE predicts performance well, typically within 10 percent relative error. Depending on the nature of the application code, we have observed low slowdowns (relative to natively executing code) and high relative speedups using up to 64 processors.

  13. Measurement of chalcogenide glass optical dispersion using a mid-infrared prism coupler

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Qiao, Hong; Anheier, Norman C.; Musgraves, Jonathan D.

    2011-05-01

    Physical properties of chalcogenide glass, including broadband infrared transparency, high refractive index, low glass transition temperature, and nonlinear properties, make them attractive candidates for advanced mid-infrared (3 to 12 μm) optical designs. Efforts focused at developing new chalcogenide glass formulations and processing methods require rapid quantitative evaluation of their optical contents to guide the materials research. However, characterization of important optical parameters such as optical dispersion remains a slow and costly process, generally with limited accuracy. The recent development of a prism coupler at the Pacific Northwest National Laboratory (PNNL) now enables rapid, high precision measurement of refractive indices at discrete wavelengths from the visible to the mid-infrared. Optical dispersion data of several chalcogenide glass families were collected using this method. Variations in the optical dispersion were correlated to glass composition and compared against measurements using other methods. While this work has been focused on facilitating chalcogenide glass synthesis, mid-infrared prism coupler analysis has broader applications to other mid-infrared optical material development efforts, including oxide glasses and crystalline materials.

  14. Improving parallel I/O autotuning with performance modeling

    DOE PAGES

    Behzad, Babak; Byna, Surendra; Wild, Stefan M.; ...

    2014-01-01

    Various layers of the parallel I/O subsystem offer tunable parameters for improving I/O performance on large-scale computers. However, searching through a large parameter space is challenging. We are working towards an autotuning framework for determining the parallel I/O parameters that can achieve good I/O performance for different data write patterns. In this paper, we characterize parallel I/O and discuss the development of predictive models for use in effectively reducing the parameter space. Furthermore, applying our technique on tuning an I/O kernel derived from a large-scale simulation code shows that the search time can be reduced from 12 hours to 2 hours, while achieving 54X I/O performance speedup.

  15. Pseudo-circulator implemented as a multimode fiber coupler

    NASA Astrophysics Data System (ADS)

    Bulota, F.; Bélanger, P.; Leduc, M.; Boudoux, C.; Godbout, N.

    2016-03-01

    We present a linear all-fiber device exhibiting the functionality of a circulator, albeit for multimode fibers. We define a pseudo-circulator as a linear three-port component that transfers most of a multimode light signal from Port 1 to Port 2, and from Port 2 to Port 3. Unlike a traditional circulator which depends on a nonlinear phenomenon to achieve a non-reciprocal behavior, our device is a linear component that seemingly breaks the principle of reciprocity by exploiting the variations of etendue of the multimode fibers in the coupler. The pseudo-circulator is implemented as a 2x2 asymmetric multimode fiber coupler, fabricated using the fusion-tapering technique. The coupler is asymmetric in its transverse fused section. The two multimode fibers differ in area, thus favoring the transfer of light from the smaller to the bigger fiber. The desired difference of area is obtained by tapering one of the fibers before the fusion process. Using this technique, we have successfully fabricated a pseudo-circulator surpassing in efficiency a 50/50 beam-splitter. Across the visible and near-IR spectrum, the transmission ratio exceeds 77% from Port 1 to Port 2, and 80% from Port 2 to Port 3. The excess loss is less than 0.5 dB, regardless of the entry port.

  16. Highly-Parallel, Highly-Compact Computing Structures Implemented in Nanotechnology

    NASA Technical Reports Server (NTRS)

    Crawley, D. G.; Duff, M. J. B.; Fountain, T. J.; Moffat, C. D.; Tomlinson, C. D.

    1995-01-01

    In this paper, we describe work in which we are evaluating how the evolving properties of nano-electronic devices could best be utilized in highly parallel computing structures. Because of their combination of high performance, low power, and extreme compactness, such structures would have obvious applications in spaceborne environments, both for general mission control and for on-board data analysis. However, the anticipated properties of nano-devices mean that the optimum architecture for such systems is by no means certain. Candidates include single instruction multiple datastream (SIMD) arrays, neural networks, and multiple instruction multiple datastream (MIMD) assemblies.

  17. Total internal reflection-based planar waveguide solar concentrator with symmetric air prisms as couplers.

    PubMed

    Xie, Peng; Lin, Huichuan; Liu, Yong; Li, Baojun

    2014-10-20

    We present a waveguide coupling approach for a planar waveguide solar concentrator. In this approach, total internal reflection (TIR)-based symmetric air prisms are used as couplers to increase the coupler reflectivity and to maximize the optical efficiency. The proposed concentrator consists of a line focusing cylindrical lens array over a planar waveguide. The TIR-based couplers are located at the focal line of each lens to couple the focused sunlight into the waveguide. The optical system was modeled and simulated with commercial ray tracing software (Zemax). Results show that the system used with optimized TIR-based couplers can achieve 70% optical efficiency at 50 × geometrical concentration ratio, resulting in a flux concentration ratio of 35 without an additional secondary concentrator. An acceptance angle of ± 7.5° is achieved in the x-z plane due to the use of a cylindrical lens array as the primary concentrator.
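
    The flux concentration figure quoted in this abstract follows from multiplying the optical efficiency by the geometrical concentration ratio. A minimal sketch of that arithmetic is given below; the function and variable names are illustrative assumptions, and only the 70% and 50 × inputs come from the abstract.

```python
def flux_concentration(optical_efficiency, geometric_concentration):
    """Flux concentration ratio = optical efficiency x geometrical concentration ratio."""
    if not 0.0 <= optical_efficiency <= 1.0:
        raise ValueError("optical efficiency must be a fraction between 0 and 1")
    return optical_efficiency * geometric_concentration

# Values reported in the abstract: 70% efficiency at 50x geometrical concentration.
reported = flux_concentration(0.70, 50)
```

    With these inputs the result is 35, matching the flux concentration ratio stated in the abstract.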

  18. Scalable Performance Environments for Parallel Systems

    NASA Technical Reports Server (NTRS)

    Reed, Daniel A.; Olson, Robert D.; Aydt, Ruth A.; Madhyastha, Tara M.; Birkett, Thomas; Jensen, David W.; Nazief, Bobby A. A.; Totty, Brian K.

    1991-01-01

    As parallel systems expand in size and complexity, the absence of performance tools for these parallel systems exacerbates the already difficult problems of application program and system software performance tuning. Moreover, given the pace of technological change, we can no longer afford to develop ad hoc, one-of-a-kind performance instrumentation software; we need scalable, portable performance analysis tools. We describe an environment prototype based on the lessons learned from two previous generations of performance data analysis software. Our environment prototype contains a set of performance data transformation modules that can be interconnected in user-specified ways. It is the responsibility of the environment infrastructure to hide details of module interconnection and data sharing. The environment is written in C++ with the graphical displays based on X windows and the Motif toolkit. It allows users to interconnect and configure modules graphically to form an acyclic, directed data analysis graph. Performance trace data are represented in a self-documenting stream format that includes internal definitions of data types, sizes, and names. The environment prototype supports the use of head-mounted displays and sonic data presentation in addition to the traditional use of visual techniques.

  19. Flexibility and Performance of Parallel File Systems

    NASA Technical Reports Server (NTRS)

    Kotz, David; Nieuwejaar, Nils

    1996-01-01

    As we gain experience with parallel file systems, it becomes increasingly clear that a single solution does not suit all applications. For example, it appears to be impossible to find a single appropriate interface, caching policy, file structure, or disk-management strategy. Furthermore, the proliferation of file-system interfaces and abstractions makes applications difficult to port. We propose that the traditional functionality of parallel file systems be separated into two components: a fixed core that is standard on all platforms, encapsulating only primitive abstractions and interfaces, and a set of high-level libraries to provide a variety of abstractions and application-programmer interfaces (APIs). We present our current and next-generation file systems as examples of this structure. Their features, such as a three-dimensional file structure, strided read and write interfaces, and I/O-node programs, are specifically designed with the flexibility and performance necessary to support a wide range of applications.

  20. Analysis of a single ring resonator with 2×2 90-degree multimode waveguide turning couplers

    NASA Astrophysics Data System (ADS)

    Chiu, C. L.; Liao, Yen-Hsun

    2016-02-01

    A novel design of a single ring resonator with two low-loss 2×2 90-degree multimode waveguide turning mirror couplers based on an InP structure is presented. The coupling factor of the 2×2 90-degree multimode waveguide turning mirror coupler is inverted from K=0.85 to K=0.15 when one folding is achieved. The 2×2 90-degree turning mirror coupler for K=0.15 is (3/4)Lπ in length, one third the length of the conventional straight 2×2 multimode waveguide interference coupler, which is (9/4)Lπ long for K=0.15. The cavity length of the curved waveguide (90-degree arc length) in this ring resonator with two 2×2 90-degree multimode waveguide turning couplers is half that with two 2×2 MMI couplers (180-degree arc length), so the free spectral range (FSR) is doubled. The output spectral response shows an FSR of 82 GHz for the device, and a contrast of 4 dB and FWHM of 0.24 nm for the drop port. The results of numerical analysis calculated with the transfer functions of a single ring resonator are in agreement with the experimental results.
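
    The link between cavity length and free spectral range in this abstract can be illustrated with the standard ring-resonator relation FSR = c / (n_g · L), where n_g is the group index and L the round-trip cavity length. The sketch below uses that relation; the group index and the cavity lengths are illustrative assumptions and are not values from the paper.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0

def free_spectral_range_hz(cavity_length_m, group_index):
    """FSR of a ring resonator in Hz, using FSR = c / (n_g * L)."""
    if cavity_length_m <= 0 or group_index <= 0:
        raise ValueError("cavity length and group index must be positive")
    return SPEED_OF_LIGHT_M_PER_S / (group_index * cavity_length_m)

# Assumed values for illustration only (not taken from the abstract).
assumed_group_index = 3.5
long_cavity_m = 2.0e-3                # hypothetical longer cavity
short_cavity_m = long_cavity_m / 2.0  # same cavity with its length halved

fsr_long = free_spectral_range_hz(long_cavity_m, assumed_group_index)
fsr_short = free_spectral_range_hz(short_cavity_m, assumed_group_index)
fsr_ratio = fsr_short / fsr_long
```

    Because the FSR is inversely proportional to the cavity length, halving the length doubles the FSR whatever group index is assumed, which is consistent with the factor of two stated in the abstract.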

  1. Implementation of a diode-pumped Nd:YAG laser with quick-change output couplers for high-beam quality 1064 or 532 nm wavelength generation

    NASA Astrophysics Data System (ADS)

    Li, Chun-Hao; Tsai, Ming-Jong

    2009-06-01

    A novel diode-pumped Nd:YAG laser system that employs a fixed active laser medium and a pair of quick-change output couplers on a precision linear stage for 1064 or 532 nm wavelength generation is presented. Fixed elements include a rear mirror, an acousto-optical Q-switch, and a diode-pumped solid-state laser (DPSSL). Movable elements for 1064 nm generation include an intra-cavity aperture as a mode selection element (MSE) and an output coupler. Movable elements for 532 nm generation include an intra-cavity frequency conversion with KTP, an intra-cavity aperture as a mode selection element (MSE), and an output coupler. Under stable operating conditions, the 1064 nm configuration produced a beam propagation ratio of 1.18 whereas the 532 nm configuration produced a beam propagation ratio of 1.1, both of which used an intra-cavity MSE with an aperture of 1.2 mm and a length of 5 mm.

  2. Multiplexing of adjacent vortex modes with the forked grating coupler

    NASA Astrophysics Data System (ADS)

    Nadovich, Christopher T.; Kosciolek, Derek J.; Crouse, David T.; Jemison, William D.

    2017-08-01

    For vortex fiber multiplexing to reach practical commercial viability, simple silicon photonic interfaces with vortex fiber will be required. These interfaces must support multiplexing. Toward this goal, an efficient single-fed multimode Forked Grating Coupler (FGC) for coupling two different optical vortex OAM charges to or from the TE0 and TE1 rectangular waveguide modes has been developed. A simple, apodized device implemented with e-beam lithography and conventional dual-etch processing on an SOI wafer exhibits low crosstalk and reasonable mode match. Advanced designs using this concept are expected to further improve performance.

  3. Interferometric fiber-optic temperature sensor with spiral polarization couplers

    NASA Astrophysics Data System (ADS)

    Cortés, R.; Khomenko, A. V.; Starodumov, A. N.; Arzate, N.; Zenteno, L. A.

    1998-09-01

    A fiber optic temperature sensor, for which the changes in modal birefringence of a short section of a long birefringent fiber are monitored remotely, is described. It employs a white light interferometer, which is formed by two concatenated spiral polarization mode couplers. A new method for white light interferometer output signal processing is described which provides a high accuracy absolute temperature measurement even in discontinuous operation of the sensor. Experimental results are presented for temperature measurements over a 100°C range with resolution of 3×10⁻³ °C.

  4. Microwave coupler and method

    DOEpatents

    Holcombe, C.E.

    1984-11-29

    The present invention is directed to a microwave coupler for enhancing the heating or metallurgical treatment of materials within a cold-wall, rapidly heated cavity as provided by a microwave furnace. The coupling material of the present invention is an alpha-rhombohedral-boron-derivative-structure material such as boron carbide or boron silicide which can be appropriately positioned as a susceptor within the furnace to heat other material or be in powder particulate form so that composites and structures of boron carbide such as cutting tools, grinding wheels and the like can be rapidly and efficiently formed within microwave furnaces.

  5. Microwave coupler and method

    DOEpatents

    Holcombe, Cressie E.

    1985-01-01

    The present invention is directed to a microwave coupler for enhancing the heating or metallurgical treatment of materials within a cold-wall, rapidly heated cavity as provided by a microwave furnace. The coupling material of the present invention is an alpha-rhombohedral-boron-derivative-structure material such as boron carbide or boron silicide which can be appropriately positioned as a susceptor within the furnace to heat other material or be in powder particulate form so that composites and structures of boron carbide such as cutting tools, grinding wheels and the like can be rapidly and efficiently formed within microwave furnaces.

  6. Performance Modeling and Measurement of Parallelized Code for Distributed Shared Memory Multiprocessors

    NASA Technical Reports Server (NTRS)

    Waheed, Abdul; Yan, Jerry

    1998-01-01

    This paper presents a model to evaluate the performance and overhead of parallelizing sequential code using compiler directives for multiprocessing on distributed shared memory (DSM) systems. With increasing popularity of shared address space architectures, it is essential to understand their performance impact on programs that benefit from shared memory multiprocessing. We present a simple model to characterize the performance of programs that are parallelized using compiler directives for shared memory multiprocessing. We parallelized the sequential implementation of NAS benchmarks using native Fortran77 compiler directives for an Origin2000, which is a DSM system based on a cache-coherent Non Uniform Memory Access (ccNUMA) architecture. We report measurement based performance of these parallelized benchmarks from four perspectives: efficacy of parallelization process; scalability; parallelization overhead; and comparison with hand-parallelized and -optimized version of the same benchmarks. Our results indicate that sequential programs can conveniently be parallelized for DSM systems using compiler directives but realizing performance gains as predicted by the performance model depends primarily on minimizing architecture-specific data locality overhead.

  7. Optical single sideband millimeter-wave signal generation and transmission using 120° hybrid coupler

    NASA Astrophysics Data System (ADS)

    Zheng, Zhiwei; Peng, Miao; Zhou, Hui; Chen, Ming; Jiang, Leyong; Tan, Li; Dai, Xiaoyu; Xiang, Yuanjiang

    2018-03-01

    We propose a novel 60 GHz optical single sideband (OSSB) millimeter-wave (mm-wave) signal generation scheme using a 120° hybrid coupler based on an external integrated Mach-Zehnder modulator (MZM). The proposed scheme improves the bit error ratio (BER) performance by suppressing the +2nd-order sideband. Meanwhile, the transmission distance is extended because only the optical +1st-order sideband is modulated by the 5 Gbit/s baseband signal while the carrier is left blank, which eliminates the walk-off effect caused by fiber dispersion. The simulation results demonstrate that the eye diagrams of the generated 60 GHz OSSB signal remain open and clear after 100 km of standard single-mode fiber (SSMF). In addition, the proposed scheme achieves a 2 dB receiver sensitivity improvement over the conventional 90° hybrid coupler when transmitted over 100 km of SSMF at a BER of 10⁻⁹.

  8. Direct laser written polymer waveguides with out of plane couplers for optical chips

    NASA Astrophysics Data System (ADS)

    Landowski, Alexander; Zepp, Dominik; Wingerter, Sebastian; von Freymann, Georg; Widera, Artur

    2017-10-01

    Optical technologies call for waveguide networks featuring high integration densities, low losses, and simple operation. Here, we present polymer waveguides fabricated from a negative tone photoresist via two-photon-lithography in direct laser writing, and show a detailed parameter study of their performance. Specifically, we produce waveguides featuring bend radii down to 40 μm, insertion losses of the order of 10 dB, and loss coefficients smaller than 0.81 dB mm⁻¹, facilitating high integration densities in writing fields of 300 μm × 300 μm. A novel three-dimensional coupler design allows for coupling control as well as direct observation of outputs in a single field of view through a microscope objective. Finally, we present beam-splitting devices to construct larger optical networks, and we show that the waveguide material is compatible with the integration of quantum emitters.

  9. High performance data transfer

    NASA Astrophysics Data System (ADS)

    Cottrell, R.; Fang, C.; Hanushevsky, A.; Kreuger, W.; Yang, W.

    2017-10-01

    The exponentially increasing need for high speed data transfer is driven by big data and cloud computing, together with the needs of data intensive science, High Performance Computing (HPC), defense, the oil and gas industry, etc. We report on the Zettar ZX software. This has been developed since 2013 to meet these growing needs by providing high performance data transfer and encryption in a scalable, balanced, easy to deploy and use way while minimizing power and space utilization. In collaboration with several commercial vendors, Proofs of Concept (PoC) consisting of clusters have been put together using off-the-shelf components to test the ZX scalability and ability to balance services using multiple cores and links. The PoCs are based on SSD flash storage that is managed by a parallel file system. Each cluster occupies 4 rack units. Using the PoCs, between clusters we have achieved almost 200Gbps memory to memory over two 100Gbps links, and 70Gbps parallel file to parallel file with encryption over a 5000 mile 100Gbps link.
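
    As a rough illustration of the rates quoted in this abstract, the Python sketch below converts a sustained rate into link utilization and into the time needed to move a dataset. The 1 PB dataset size and the function names are assumptions chosen for illustration; they do not come from the report.

```python
def utilization(achieved_gbps: float, link_gbps: float) -> float:
    """Fraction of the nominal link capacity that is actually used."""
    return achieved_gbps / link_gbps


def transfer_time_hours(size_bytes: float, rate_gbps: float) -> float:
    """Hours needed to move size_bytes at a sustained rate given in Gbit/s."""
    bits = size_bytes * 8
    seconds = bits / (rate_gbps * 1e9)
    return seconds / 3600


# Figures quoted in the abstract: 70 Gbps encrypted file-to-file
# transfer over a single 100 Gbps link.
file_util = utilization(70, 100)

# Hypothetical dataset of 1 PB (10**15 bytes); the size is an assumption.
hours = transfer_time_hours(1e15, 70)

print(f"file-to-file utilization: {file_util:.0%}")
print(f"1 PB at 70 Gbps: about {hours:.1f} hours")
```

    The sketch ignores protocol overhead and start-up time, so it gives only a lower bound on the wall-clock transfer time.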

  10. Strip gratings on dielectric substrates as output couplers for submillimeter lasers

    NASA Astrophysics Data System (ADS)

    Veron, D.; Whitbourn, L. B.

    1986-03-01

    This paper describes the use and advantages of metallic strip gratings on dielectric substrates as output couplers for both optically pumped and discharge-excited submillimeter lasers. Formulas are presented for the calculation of transmittance and loss of such couplers, taking account of loss in the strip grating as well as loss and multiple reflections in the substrate. Included are expressions for the phase shifts on reflection and transmission by an arbitrary lossy grid on a plane boundary between two dielectrics, according to a transmission-line model that is applicable for wavelengths in both dielectrics longer than the grid period. In relation to these phase shifts, attention is drawn to an important sign convention. The theory is shown to agree well with measured transmittance of a typical device between 500 and 1600 GHz as well as spot measurements at 891 (337-micron HCN laser), 1540, and 1578 GHz (195- and 190-micron DCN laser). Finally, the theory is used to design a low loss coupler for the low gain 119-micron line of discharge excited H2O.

  11. Low-cost integrated-optic fiber couplers

    NASA Astrophysics Data System (ADS)

    Sheem, Sang K.; Zhang, Feng; Choi, Jong-Ho; Lee, Yong-Woo; Low, Sarah; Lu, Shih-Yau

    1997-04-01

    In an effort to lower the cost of fiber optic couplers, integrated optic channel waveguide circuits are made of a UV-curable polymer using a molding technique, and then a novel fiber-to-channel connecting approach is employed in which UV light radiating from an optical fiber core cures the polymer in the channel, thus accomplishing a 'touchdown' of the core-extension waveguide onto the walls of the channel waveguide.

  12. Parallel integer sorting with medium and fine-scale parallelism

    NASA Technical Reports Server (NTRS)

    Dagum, Leonardo

    1993-01-01

    Two new parallel integer sorting algorithms, queue-sort and barrel-sort, are presented and analyzed in detail. These algorithms do not have optimal parallel complexity, yet they show very good performance in practice. Queue-sort is designed for fine-scale parallel architectures which allow the queueing of multiple messages to the same destination. Barrel-sort is designed for medium-scale parallel architectures with a high message passing overhead. The performance results from the implementation of queue-sort on a Connection Machine CM-2 and barrel-sort on a 128 processor iPSC/860 are given. The two implementations are found to be comparable in performance but not as good as a fully vectorized bucket sort on the Cray YMP.

  13. A practical approach to portability and performance problems on massively parallel supercomputers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Beazley, D.M.; Lomdahl, P.S.

    1994-12-08

    We present an overview of the tactics we have used to achieve a high level of performance while improving portability for a large-scale molecular dynamics code SPaSM. SPaSM was originally implemented in ANSI C with message passing for the Connection Machine 5 (CM-5). In 1993, SPaSM was selected as one of the winners in the IEEE Gordon Bell Prize competition for sustaining 50 Gflops on the 1024 node CM-5 at Los Alamos National Laboratory. Achieving this performance on the CM-5 required rewriting critical sections of code in CDPEAC assembler language. In addition, the code made extensive use of CM-5 parallel I/O and the CMMD message passing library. Given this highly specialized implementation, we describe how we have ported the code to the Cray T3D and high performance workstations. In addition we will describe how it has been possible to do this using a single version of source code that runs on all three platforms without sacrificing any performance. Sound too good to be true? We hope to demonstrate that one can realize both code performance and portability without relying on the latest and greatest prepackaged tool or parallelizing compiler.

  14. Tuning HDF5 subfiling performance on parallel file systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Byna, Suren; Chaarawi, Mohamad; Koziol, Quincey

    Subfiling is a technique used on parallel file systems to reduce locking and contention issues when multiple compute nodes interact with the same storage target node. Subfiling provides a compromise between the single shared file approach, which causes lock contention problems on parallel file systems, and having one file per process, which generates a massive and unmanageable number of files. In this paper, we evaluate and tune the performance of the recently implemented subfiling feature in HDF5. Specifically, we explain the implementation strategy of the subfiling feature in HDF5, provide examples of using the feature, and evaluate and tune the parallel I/O performance of this feature with the parallel file systems of the Cray XC40 system at NERSC (Cori), which include a burst buffer storage and a Lustre disk-based storage. We also evaluate I/O performance on the Cray XC30 system, Edison, at NERSC. Our results show a 1.2X to 6X performance advantage with subfiling compared to writing a single shared HDF5 file. We present our exploration of configurations, such as the number of subfiles and the number of Lustre storage targets for storing files, as optimization parameters to obtain superior I/O performance. Based on this exploration, we discuss recommendations for achieving good I/O performance as well as limitations of the subfiling feature.

  15. National Combustion Code: Parallel Performance

    NASA Technical Reports Server (NTRS)

    Babrauckas, Theresa

    2001-01-01

    This report discusses the National Combustion Code (NCC). The NCC is an integrated system of codes for the design and analysis of combustion systems. The advanced features of the NCC meet designers' requirements for model accuracy and turn-around time. The fundamental features at the inception of the NCC were parallel processing and unstructured mesh. The design and performance of the NCC are discussed.

  16. Suspended mid-infrared fiber-to-chip grating couplers for SiGe waveguides

    NASA Astrophysics Data System (ADS)

    Favreau, Julien; Durantin, Cédric; Fédéli, Jean-Marc; Boutami, Salim; Duan, Guang-Hua

    2016-03-01

    Silicon photonics has gained great importance owing to its applications in optical communications, ranging from short reach to long haul. Originally dedicated to telecom wavelengths, silicon photonics is heading toward circuits handling a broader spectrum, especially in the short and mid-infrared (MIR) range. This trend is due to potential applications in chemical sensing, spectroscopy and defense in the 2-10 μm range. We previously reported the development of a MIR photonic platform based on buried SiGe/Si waveguides with propagation losses between 1 and 2 dB/cm. However, the low index contrast of the platform makes the design of efficient grating couplers very challenging. In order to achieve a high fiber-to-chip efficiency, we propose a novel grating coupler structure in which the grating is locally suspended in air. The grating has been designed with FDTD software. To achieve high efficiency, the suspended structure thicknesses have been jointly optimized with the grating parameters, namely the fill factor, the period and the grating etch depth. Using the Efficient Global Optimization (EGO) method we obtained a configuration where the fiber-to-waveguide efficiency is above 57 %. Moreover, the optical transition between the suspended and the buried SiGe waveguide has been carefully designed using Eigenmode Expansion software. A transition efficiency as high as 86 % is achieved.

  17. Performance Comparison of HPF and MPI Based NAS Parallel Benchmarks

    NASA Technical Reports Server (NTRS)

    Saini, Subhash

    1997-01-01

    Compilers supporting High Performance Fortran (HPF) features first appeared in late 1994 and early 1995 from Applied Parallel Research (APR), Digital Equipment Corporation, and The Portland Group (PGI). IBM introduced an HPF compiler for the IBM RS/6000 SP2 in April of 1996. Over the past two years, these implementations have shown steady improvement in terms of both features and performance. The performance of various hardware/programming model (HPF and MPI) combinations will be compared, based on the latest NAS Parallel Benchmark results, thus providing a cross-machine and cross-model comparison. Specifically, HPF based NPB results will be compared with MPI based NPB results to provide perspective on performance currently obtainable using HPF versus MPI or versus hand-tuned implementations such as those supplied by the hardware vendors. In addition, we will present NPB (Version 1.0) performance results for the following systems: DEC Alpha Server 8400 5/440, Fujitsu CAPP Series (VX, VPP300, and VPP700), HP/Convex Exemplar SPP2000, IBM RS/6000 SP P2SC node (120 MHz), NEC SX-4/32, SGI/CRAY T3E, and SGI Origin2000. We will also present sustained performance per dollar for Class B LU, SP and BT benchmarks.

  18. Conceptual design of a sapphire loaded coupler for superconducting radio-frequency 1.3 GHz cavities

    DOE PAGES

    Xu, Chen; Tantawi, Sami

    2016-02-25

    This paper explores a hybrid mode rf structure that served as a superconducting radio-frequency coupler. This application achieves a reflection S(1,1) varying from 0 to -30 dB and delivers cw power at 7 kW. The coupler has good thermal isolation between the 2 and 300 K sections due to vacuum separation. Only one single hybrid mode can propagate through each section, and no higher order mode is coupled. The analytical and numerical analysis for this coupler is given and the design is optimized. As a result, the coupling mechanism to the cavity is also discussed.

  19. Effect of Electrode Loss on the Dynamic Range of Linearized Directional Coupler Modulators

    DTIC Science & Technology

    2006-02-01

    George A. Brost, Richard Michalak, Paul Payson, and Kevin Magde (AFRL). Abstract: Numerical simulations were used to study the effect of...

  20. Design issues for directional coupler- and MMI-based optical microring resonator filters on InP

    NASA Astrophysics Data System (ADS)

    Themistos, Christos; Kalli, Kyriacos; Komodromos, Michalis; Rajarajan, Muttukrishnan; Rahman, B. M. A.; Grattan, Kenneth T. V.

    2004-08-01

    The characterization and optimization of optical microring resonator-based optical filters on deeply etched GaInAsP-InP waveguides, using the finite element-based beam propagation approach, is presented here. Design issues for directional coupler- and multimode interference coupler-based devices, such as field evolution, optical power, phase, fabrication tolerance and wavelength dependence, have been investigated.

  1. Integrated Fiber-Optic Coupler.

    DTIC Science & Technology

    1987-04-01

    Technical Document 1086, April 1987. Integrated Fiber-Optic Coupler. Personal authors: P.L Pruaal, E.R. Foesuim. ...double-heterostructure diode lasers fabricated on a monolithic GaAs/Si...

  2. A high performance data parallel tensor contraction framework: Application to coupled electro-mechanics

    NASA Astrophysics Data System (ADS)

    Poya, Roman; Gil, Antonio J.; Ortigosa, Rogelio

    2017-07-01

    The paper presents aspects of implementation of a new high performance tensor contraction framework for the numerical analysis of coupled and multi-physics problems on streaming architectures. In addition to explicit SIMD instructions and smart expression templates, the framework introduces domain specific constructs for the tensor cross product and its associated algebra recently rediscovered by Bonet et al. (2015, 2016) in the context of solid mechanics. The two key ingredients of the presented expression template engine are as follows. First, the capability to mathematically transform complex chains of operations to simpler equivalent expressions, while potentially avoiding routes with higher levels of computational complexity and, second, to perform a compile time depth-first or breadth-first search to find the optimal contraction indices of a large tensor network in order to minimise the number of floating point operations. For optimisations of tensor contraction such as loop transformation, loop fusion and data locality optimisations, the framework relies heavily on compile time technologies rather than source-to-source translation or JIT techniques. Every aspect of the framework is examined through relevant performance benchmarks, including the impact of data parallelism on the performance of isomorphic and nonisomorphic tensor products, the FLOP and memory I/O optimality in the evaluation of tensor networks, the compilation cost and memory footprint of the framework and the performance of tensor cross product kernels. The framework is then applied to finite element analysis of coupled electro-mechanical problems to assess the speed-ups achieved in kernel-based numerical integration of complex electroelastic energy functionals. In this context, domain-aware expression templates combined with SIMD instructions are shown to provide a significant speed-up over the classical low-level style programming techniques.

  3. Polarization preserving single mode fiber optic coupler

    NASA Technical Reports Server (NTRS)

    Nelson, M. D.; Goss, W. C.

    1982-01-01

    A technique is described for fabrication of etched single mode fiber optical waveguide couplers which preserve the polarization state to within 0.0001. The coupling ratio is tunable over a broad range (0-9 percent) during fabrication. Back-coupling is less than 0.001, insertion loss is less than 1.5 dB, and coupling ratio thermal coefficient is about 1 percent per degree C.

  4. Inductive coupler for downhole components and method for making same

    DOEpatents

    Hall, David R.; Hall, Jr., H. Tracy; Pixton, David S.; Dahlgren, Scott; Briscoe, Michael A.; Sneddon, Cameron; Fox, Joe

    2006-05-09

    The present invention includes a method of making an inductive coupler for downhole components. The method includes providing an annular housing, preferably made of steel, the housing having a recess. A conductor, preferably an insulated wire, is also provided along with a plurality of generally U-shaped magnetically conducting, electrically insulating (MCEI) segments. Preferably, the MCEI segments comprise ferrite. An assembly is formed by placing the plurality of MCEI segments within the recess in the annular housing. The segments are aligned to form a generally circular trough. A first portion of the conductor is placed within the circular trough. This assembly is consolidated with a meltable polymer which fills spaces between the segments, annular housing and the first portion of the conductor. The invention also includes an inductive coupler including an annular housing having a recess defined by a bottom portion and two opposing side wall portions. At least one side wall portion includes a lip extending toward but not reaching the other side wall portion. A plurality of generally U-shaped MCEI segments, preferably comprised of ferrite, are disposed in the recess and aligned so as to form a circular trough. The coupler further includes a conductor disposed within the circular trough and a polymer filling spaces between the segments, the annular housing and the conductor.

  5. Balanced PIN-TIA photoreceiver with integrated 3 dB fiber coupler for distributed fiber optic sensors

    NASA Astrophysics Data System (ADS)

    Datta, Shubhashish; Rajagopalan, Sruti; Lemke, Shaun; Joshi, Abhay

    2014-06-01

    We report a balanced PIN-TIA photoreceiver integrated with a 3 dB fiber coupler for distributed fiber optic sensors. This detector demonstrates -3 dB bandwidth >15 GHz and coupled conversion gain >65 V/W per photodiode through either input port of the 3 dB coupler, and can be operated at local oscillator power of +17 dBm. The combined common mode rejection of the balanced photoreceiver and the integrated 3 dB coupler is >20 dB. We also present measurement results with various optical stimuli, namely impulses, sinusoids, and pseudo-random sequences, which are relevant for time domain reflectometry, frequency domain reflectometry, and code correlation sensors, respectively.

  6. Rapid indirect trajectory optimization on highly parallel computing architectures

    NASA Astrophysics Data System (ADS)

    Antony, Thomas

    Trajectory optimization is a field which can benefit greatly from the advantages offered by parallel computing. The current state-of-the-art in trajectory optimization focuses on the use of direct optimization methods, such as the pseudo-spectral method. These methods are favored due to their ease of implementation and large convergence regions while indirect methods have largely been ignored in the literature in the past decade except for specific applications in astrodynamics. It has been shown that the shortcomings conventionally associated with indirect methods can be overcome by the use of a continuation method in which complex trajectory solutions are obtained by solving a sequence of progressively difficult optimization problems. High performance computing hardware is trending towards more parallel architectures as opposed to powerful single-core processors. Graphics Processing Units (GPU), which were originally developed for 3D graphics rendering have gained popularity in the past decade as high-performance, programmable parallel processors. The Compute Unified Device Architecture (CUDA) framework, a parallel computing architecture and programming model developed by NVIDIA, is one of the most widely used platforms in GPU computing. GPUs have been applied to a wide range of fields that require the solution of complex, computationally demanding problems. A GPU-accelerated indirect trajectory optimization methodology which uses the multiple shooting method and continuation is developed using the CUDA platform. The various algorithmic optimizations used to exploit the parallelism inherent in the indirect shooting method are described. The resulting rapid optimal control framework enables the construction of high quality optimal trajectories that satisfy problem-specific constraints and fully satisfy the necessary conditions of optimality. 
    The benefits of the framework are highlighted by construction of maximum terminal velocity trajectories for a hypothetical

  7. Parallelized multi-graphics processing unit framework for high-speed Gabor-domain optical coherence microscopy.

    PubMed

    Tankam, Patrice; Santhanam, Anand P; Lee, Kye-Sung; Won, Jungeun; Canavesi, Cristina; Rolland, Jannick P

    2014-07-01

    Gabor-domain optical coherence microscopy (GD-OCM) is a volumetric high-resolution technique capable of acquiring three-dimensional (3-D) skin images with histological resolution. Real-time image processing is needed to enable GD-OCM imaging in a clinical setting. We present a parallelized and scalable multi-graphics processing unit (GPU) computing framework for real-time GD-OCM image processing. A parallelized control mechanism was developed to individually assign computation tasks to each of the GPUs. For each GPU, the optimal number of amplitude-scans (A-scans) to be processed in parallel was selected to maximize GPU memory usage and core throughput. We investigated five computing architectures for computational speed-up in processing 1000×1000 A-scans. The proposed parallelized multi-GPU computing framework enables processing at a computational speed faster than the GD-OCM image acquisition, thereby facilitating high-speed GD-OCM imaging in a clinical setting. Using two parallelized GPUs, the image processing of a 1×1×0.6 mm³ skin sample was performed in about 13 s, and the performance was benchmarked at 6.5 s with four GPUs. This work thus demonstrates that 3-D GD-OCM data may be displayed in real-time to the examiner using parallelized GPU processing.
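
    The two timings quoted in this abstract (about 13 s on two GPUs, 6.5 s on four) can be turned into a speed-up and a relative parallel efficiency. The Python sketch below shows the arithmetic only; the function names are illustrative, and because the abstract gives the two-GPU time as approximate, the results are approximate as well.

```python
def speedup(t_base: float, t_new: float) -> float:
    """Ratio of the baseline run time to the new run time."""
    return t_base / t_new


def relative_efficiency(t_base: float, n_base: int,
                        t_new: float, n_new: int) -> float:
    """Speed-up divided by the factor by which the device count grew.

    A value of 1.0 means the extra devices were used perfectly
    relative to the baseline configuration.
    """
    return speedup(t_base, t_new) / (n_new / n_base)


# Timings quoted in the abstract: ~13 s on two GPUs, 6.5 s on four GPUs.
s = speedup(13.0, 6.5)
e = relative_efficiency(13.0, 2, 6.5, 4)

print(f"speed-up going from 2 to 4 GPUs: {s:.2f}x")
print(f"relative parallel efficiency: {e:.2f}")
```

    Note that this measures scaling from two to four GPUs only; the abstract does not give a single-GPU time, so an absolute efficiency cannot be derived from these numbers.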

  8. Three-Dimensional High-Lift Analysis Using a Parallel Unstructured Multigrid Solver

    NASA Technical Reports Server (NTRS)

    Mavriplis, Dimitri J.

    1998-01-01

    A directional implicit unstructured agglomeration multigrid solver is ported to shared and distributed memory massively parallel machines using the explicit domain-decomposition and message-passing approach. Because the algorithm operates on local implicit lines in the unstructured mesh, special care is required in partitioning the problem for parallel computing. A weighted partitioning strategy is described which avoids breaking the implicit lines across processor boundaries, while incurring minimal additional communication overhead. Good scalability is demonstrated on a 128 processor SGI Origin 2000 machine and on a 512 processor CRAY T3E machine for reasonably fine grids. The feasibility of performing large-scale unstructured grid calculations with the parallel multigrid algorithm is demonstrated by computing the flow over a partial-span flap wing high-lift geometry on a highly resolved grid of 13.5 million points in approximately 4 hours of wall clock time on the CRAY T3E.

  9. Highly scalable parallel processing of extracellular recordings of Multielectrode Arrays.

    PubMed

    Gehring, Tiago V; Vasilaki, Eleni; Giugliano, Michele

    2015-01-01

    Technological advances in Multielectrode Arrays (MEAs) used for multisite, parallel electrophysiological recordings lead to an ever increasing amount of raw data being generated. Arrays with hundreds up to a few thousand electrodes are slowly seeing widespread use, and the expectation is that more sophisticated arrays will become available in the near future. In order to process the large data volumes resulting from MEA recordings, there is a pressing need for new software tools able to process many data channels in parallel. Here we present a new tool for processing MEA data recordings that makes use of new programming paradigms and recent technology developments to unleash the power of modern highly parallel hardware, such as multi-core CPUs with vector instruction sets or GPGPUs. Our tool builds on and complements existing MEA data analysis packages. It shows high scalability and can be used to speed up some performance critical pre-processing steps such as data filtering and spike detection, helping to make the analysis of larger data sets tractable.

  10. Coupling analysis of non-circular-symmetric modes and design of orientation-insensitive few-mode fiber couplers

    NASA Astrophysics Data System (ADS)

    Li, Jiaxiong; Du, Jiangbing; Ma, Lin; Li, Ming-Jun; Jiang, Shoulin; Xu, Xiao; He, Zuyuan

    2017-01-01

    We study the coupling between two identical weakly-coupled few-mode fibers based on coupled-mode theory. The coupling behavior of non-circular-symmetric modes, such as LP11 and LP21, is investigated analytically and numerically. By carefully choosing the fiber core separation and coupler length, we can design orientation-insensitive fiber couplers for non-circular-symmetric modes at arbitrary coupling ratios. Based on the design method, we propose an orientation-insensitive two-mode fiber coupler at 850 nm working as a mode multiplexer/demultiplexer for two-mode transmission using standard single-mode fiber. Within the band from 845 to 855 nm, the insertion losses of LP01 and LP11 modes are less than 0.03 dB and 0.24 dB, respectively. When the two-mode fiber coupler is used as mode demultiplexer, the LP01/LP11 and LP11/LP01 extinction ratios in the separated branches are respectively above 12.6 dB and 21.2 dB. Our design method can be extended to two-mode communication or sensing systems at other wavelengths.
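
    As background for the coupled-mode analysis mentioned in this abstract, the Python sketch below evaluates the textbook result for two identical, phase-matched waveguides, where the fraction of power transferred to the second guide after a length z is sin²(κz). This is the standard two-mode formula, not the authors' orientation-insensitive design procedure, and the coupling coefficient used here is a hypothetical value. A helper also converts the quoted extinction ratios from dB to linear power ratios.

```python
import math


def cross_power(kappa: float, z: float) -> float:
    """Fraction of power coupled into the second of two identical,
    phase-matched waveguides after length z (textbook coupled-mode result)."""
    return math.sin(kappa * z) ** 2


def length_for_ratio(kappa: float, ratio: float) -> float:
    """Shortest coupler length that gives the requested coupling ratio."""
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("coupling ratio must lie in [0, 1]")
    return math.asin(math.sqrt(ratio)) / kappa


def db_to_linear(db: float) -> float:
    """Convert a power ratio in dB to a linear power ratio."""
    return 10 ** (db / 10)


kappa = 0.5  # hypothetical coupling coefficient in rad/mm (an assumption)
half_length = length_for_ratio(kappa, 0.5)  # length of a 50:50 coupler, in mm

print(f"50:50 coupler length: {half_length:.3f} mm")
print(f"coupled fraction at that length: {cross_power(kappa, half_length):.3f}")

# Extinction ratios quoted in the abstract, as linear power ratios.
for er_db in (12.6, 21.2):
    print(f"{er_db} dB corresponds to a power ratio of {db_to_linear(er_db):.1f}")
```

    For non-circular-symmetric modes such as LP11, the coupling coefficient depends on the mode orientation, which is the complication the paper addresses; the single κ above does not capture that.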

  11. Performance Characteristics of the Multi-Zone NAS Parallel Benchmarks

    NASA Technical Reports Server (NTRS)

    Jin, Haoqiang; VanderWijngaart, Rob F.

    2003-01-01

    We describe a new suite of computational benchmarks that models applications featuring multiple levels of parallelism. Such parallelism is often available in realistic flow computations on systems of grids, but had not previously been captured in benchmarks. The new suite, named NPB Multi-Zone, is extended from the NAS Parallel Benchmarks suite, and involves solving the application benchmarks LU, BT and SP on collections of loosely coupled discretization meshes. The solutions on the meshes are updated independently, but after each time step they exchange boundary value information. This strategy provides relatively easily exploitable coarse-grain parallelism between meshes. Three reference implementations are available: one serial, one hybrid using the Message Passing Interface (MPI) and OpenMP, and another hybrid using a shared memory multi-level programming model (SMP+OpenMP). We examine the effectiveness of hybrid parallelization paradigms in these implementations on three different parallel computers. We also use an empirical formula to investigate the performance characteristics of the multi-zone benchmarks.

  12. Multimode Directional Coupler for Utilization of Harmonic Frequencies from TWTAs

    NASA Technical Reports Server (NTRS)

    Simmons, Rainee N.; Wintucky, Edwin G.

    2013-01-01

    A novel waveguide multimode directional coupler (MDC) intended for the measurement and potential utilization of the second and higher order harmonic frequencies from high-power traveling wave tube amplifiers (TWTAs) has been successfully designed, fabricated, and tested. The design is based on the characteristic multiple propagation modes of the electrical and magnetic field components of electromagnetic waves in a rectangular waveguide. The purpose was to create a rugged, easily constructed, more efficient waveguide-based MDC for extraction and exploitation of the second harmonic signal from the RF output of high-power TWTs used for space communications. The application would be a satellite-based beacon source needed for Q-band and V/W-band atmospheric propagation studies. The MDC could function as a CW narrow-band source or as a wideband source for study of atmospheric group delay effects on high-data-rate links. The MDC is fabricated from two sections of waveguide - a primary one for the fundamental frequency and a secondary waveguide for the second harmonic - that are joined together such that the second harmonic higher order modes are selectively coupled via precision-machined slots for propagation in the secondary waveguide. In the TWTA output waveguide port, both the fundamental and the second harmonic signals are present. These signals propagate in the output waveguide as the dominant and higher order modes, respectively. By including an appropriate mode selective waveguide directional coupler, such as the MDC presented here at the output of the TWTA, the power at the second harmonic can be sampled and amplified to the power level needed for atmospheric propagation studies.
The important conclusions from the preliminary test results for the multimode directional coupler are: (1) the second harmonic (Ka-band) can be measured and effectively separated from the fundamental (Ku-band) with no coupling of the latter, (2) power losses in the fundamental frequency

  13. Thermal-Mechanical Study of 3.9 GHz CW Coupler and Cavity for LCLS-II Project

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gonin, Ivan; Harms, Elvin; Khabiboulline, Timergali

    2017-05-01

    The third harmonic system was originally developed by Fermilab for the FLASH facility at DESY and then was adopted and modified by INFN for the XFEL project [1-3]. In contrast to the XFEL project, all cryomodules in the LCLS-II project will operate in CW regime with higher RF average power for 1.3 GHz and 3.9 GHz cavities and couplers. The design of the cavity and fundamental power coupler has been modified to satisfy LCLS-II requirements. In this paper we discuss the results of COMSOL thermal and mechanical analysis of the 3.9 GHz coupler and cavity to verify the proposed modification of the design. For the dressed cavity we present simulations of Lorentz force detuning, helium pressure sensitivity df/dP and major mechanical resonances.

  14. The UPSF code: a metaprogramming-based high-performance automatically parallelized plasma simulation framework

    NASA Astrophysics Data System (ADS)

    Gao, Xiatian; Wang, Xiaogang; Jiang, Binhao

    2017-10-01

    UPSF (Universal Plasma Simulation Framework) is a new plasma simulation code designed for maximum flexibility by using cutting-edge techniques supported by the C++17 standard. Through the use of metaprogramming techniques, UPSF provides arbitrary-dimensional data structures and methods to support various kinds of plasma simulation models, such as Vlasov, particle in cell (PIC), fluid, Fokker-Planck, and their variants and hybrid methods. Through C++ metaprogramming, a single code can be applied to arbitrary-dimensional systems with no loss of performance. UPSF can also automatically parallelize the distributed data structure and accelerate matrix and tensor operations by BLAS. A three-dimensional particle in cell code is developed based on UPSF. Two test cases, Landau damping and Weibel instability for the electrostatic and electromagnetic situations respectively, are presented to show the validation and performance of the UPSF code.

  15. Performance of a 300 Mbps 1:16 serial/parallel optoelectronic receiver module

    NASA Technical Reports Server (NTRS)

    Richard, M. A.; Claspy, P. C.; Bhasin, K. B.; Bendett, M. B.

    1990-01-01

    Optical interconnects are being considered for the high speed distribution of multiplexed control signals in GaAs monolithic microwave integrated circuit (MMIC) based phased array antennas. The performance of a hybrid GaAs optoelectronic integrated circuit (OEIC) is described, as well as its design and fabrication. The OEIC converts a 16-bit serial optical input to a 16 parallel line electrical output using an on-board 1:16 demultiplexer and operates at data rates as high as 300 Mbps. The performance characteristics and potential applications of the device are presented.

  16. Enhancing Application Performance Using Mini-Apps: Comparison of Hybrid Parallel Programming Paradigms

    NASA Technical Reports Server (NTRS)

    Lawson, Gary; Sosonkina, Masha; Baurle, Robert; Hammond, Dana

    2017-01-01

    In many fields, real-world applications for High Performance Computing have already been developed. For these applications to stay up-to-date, new parallel strategies must be explored to yield the best performance; however, restructuring or modifying a real-world application may be daunting depending on the size of the code. In this case, a mini-app may be employed to quickly explore such options without modifying the entire code. In this work, several mini-apps have been created to enhance a real-world application performance, namely the VULCAN code for complex flow analysis developed at the NASA Langley Research Center. These mini-apps explore hybrid parallel programming paradigms with Message Passing Interface (MPI) for distributed memory access and either Shared MPI (SMPI) or OpenMP for shared memory accesses. Performance testing shows that MPI+SMPI yields the best execution performance, while requiring the largest number of code changes. A maximum speedup of 23 was measured for MPI+SMPI, but only 11 was measured for MPI+OpenMP.

  17. Design and Performance of a 1 ms High-Speed Vision Chip with 3D-Stacked 140 GOPS Column-Parallel PEs †.

    PubMed

    Nose, Atsushi; Yamazaki, Tomohiro; Katayama, Hironobu; Uehara, Shuji; Kobayashi, Masatsugu; Shida, Sayaka; Odahara, Masaki; Takamiya, Kenichi; Matsumoto, Shizunori; Miyashita, Leo; Watanabe, Yoshihiro; Izawa, Takashi; Muramatsu, Yoshinori; Nitta, Yoshikazu; Ishikawa, Masatoshi

    2018-04-24

    We have developed a high-speed vision chip using 3D stacking technology to address the increasing demand for high-speed vision chips in diverse applications. The chip comprises a 1/3.2-inch, 1.27 Mpixel, 500 fps (0.31 Mpixel, 1000 fps, 2 × 2 binning) vision chip with 3D-stacked column-parallel Analog-to-Digital Converters (ADCs) and 140 Giga Operation per Second (GOPS) programmable Single Instruction Multiple Data (SIMD) column-parallel PEs for new sensing applications. The 3D-stacked structure and column parallel processing architecture achieve high sensitivity, high resolution, and high-accuracy object positioning.

  18. New NAS Parallel Benchmarks Results

    NASA Technical Reports Server (NTRS)

    Yarrow, Maurice; Saphir, William; VanderWijngaart, Rob; Woo, Alex; Kutler, Paul (Technical Monitor)

    1997-01-01

    NPB2 (NAS (NASA Advanced Supercomputing) Parallel Benchmarks 2) is an implementation, based on Fortran and the MPI (message passing interface) message passing standard, of the original NAS Parallel Benchmark specifications. NPB2 programs are run with little or no tuning, in contrast to NPB vendor implementations, which are highly optimized for specific architectures. NPB2 results complement, rather than replace, NPB results. Because they have not been optimized by vendors, NPB2 implementations approximate the performance a typical user can expect for a portable parallel program on distributed memory parallel computers. Together these results provide an insightful comparison of the real-world performance of high-performance computers. New NPB2 features: New implementation (CG), new workstation class problem sizes, new serial sample versions, more performance statistics.

  19. Performance of a parallel thermal-hydraulics code TEMPEST

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fann, G.I.; Trent, D.S.

    The authors describe the parallelization of the TEMPEST thermal-hydraulics code. The serial version of this code is used for production quality 3-D thermal-hydraulics simulations. Good speedup was obtained with a parallel diagonally preconditioned BiCGStab non-symmetric linear solver, using a spatial domain decomposition approach for the semi-iterative pressure-based and mass-conserved algorithm. The test case used here to illustrate the performance of the BiCGStab solver is a 3-D natural convection problem modeled using finite volume discretization in cylindrical coordinates. The BiCGStab solver replaced the LSOR-ADI method for solving the pressure equation in TEMPEST. BiCGStab also solves the coupled thermal energy equation. Scaling performance for 3 problem sizes (221220 nodes, 358120 nodes, and 701220 nodes) is presented. These problems were run on 2 different parallel machines: IBM-SP and SGI PowerChallenge. The largest problem attains a speedup of 68 on a 128-processor IBM-SP. In real terms, this is over 34 times faster than the fastest serial production time using the LSOR-ADI solver.
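
    As a rough reading aid for the figures quoted in this record, the sketch below computes parallel efficiency as speedup divided by processor count. The function name `parallel_efficiency` is illustrative only and does not come from the TEMPEST code or the report.

```python
def parallel_efficiency(speedup: float, processors: int) -> float:
    """Return speedup divided by processor count (1.0 means ideal scaling)."""
    if processors <= 0:
        raise ValueError("processors must be positive")
    return speedup / processors


# Figures quoted in the abstract: a speedup of 68 on 128 processors.
efficiency = parallel_efficiency(68, 128)
print(f"parallel efficiency: {efficiency:.1%}")
```

    Under this definition, the quoted speedup corresponds to an efficiency of roughly 53 percent on the 128-processor run.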

  20. Highly parallel computation

    NASA Technical Reports Server (NTRS)

    Denning, Peter J.; Tichy, Walter F.

    1990-01-01

    Highly parallel computing architectures are the only means to achieve the computation rates demanded by advanced scientific problems. A decade of research has demonstrated the feasibility of such machines, and current research focuses on identifying the most suitable architectures. The architectures designated as multiple instruction multiple datastream (MIMD) and single instruction multiple datastream (SIMD) have produced the best results to date; neither shows a decisive advantage for most near-homogeneous scientific problems. For scientific problems with many dissimilar parts, more speculative architectures such as neural networks or data flow may be needed.

  1. Parallelized multi–graphics processing unit framework for high-speed Gabor-domain optical coherence microscopy

    PubMed Central

    Tankam, Patrice; Santhanam, Anand P.; Lee, Kye-Sung; Won, Jungeun; Canavesi, Cristina; Rolland, Jannick P.

    2014-01-01

    Gabor-domain optical coherence microscopy (GD-OCM) is a volumetric high-resolution technique capable of acquiring three-dimensional (3-D) skin images with histological resolution. Real-time image processing is needed to enable GD-OCM imaging in a clinical setting. We present a parallelized and scalable multi-graphics processing unit (GPU) computing framework for real-time GD-OCM image processing. A parallelized control mechanism was developed to individually assign computation tasks to each of the GPUs. For each GPU, the optimal number of amplitude-scans (A-scans) to be processed in parallel was selected to maximize GPU memory usage and core throughput. We investigated five computing architectures for computational speed-up in processing 1000×1000 A-scans. The proposed parallelized multi-GPU computing framework enables processing at a computational speed faster than the GD-OCM image acquisition, thereby facilitating high-speed GD-OCM imaging in a clinical setting. Using two parallelized GPUs, the image processing of a 1×1×0.6 mm³ skin sample was performed in about 13 s, and the performance was benchmarked at 6.5 s with four GPUs. This work thus demonstrates that 3-D GD-OCM data may be displayed in real-time to the examiner using parallelized GPU processing. PMID:24695868
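
    The two timings quoted in this record can be turned into a relative scaling figure. The sketch below is only an illustration of that arithmetic: the helper `relative_scaling` is a hypothetical name, and the 13 s value is the approximate figure given in the abstract, so the result is approximate as well.

```python
def relative_scaling(t_base: float, n_base: int, t_new: float, n_new: int):
    """Return (speedup, efficiency) of a run on n_new devices relative to n_base devices."""
    speedup = t_base / t_new                 # how much faster the larger run is
    efficiency = speedup / (n_new / n_base)  # 1.0 means time falls in proportion to device count
    return speedup, efficiency


# Timings quoted in the abstract: about 13 s on two GPUs, 6.5 s on four GPUs.
speedup, efficiency = relative_scaling(13.0, 2, 6.5, 4)
print(f"speedup: {speedup:.2f}x, relative efficiency: {efficiency:.0%}")
```

    With these inputs, doubling the GPU count halves the processing time, which is consistent with near-linear scaling between the two-GPU and four-GPU configurations.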

  2. Highly sensitive magnetic field sensor based on microfiber coupler with magnetic fluid

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luo, Longfeng; Pu, Shengli; Tang, Jiali

    2015-05-11

    A magnetic field sensor using a microfiber coupler (MFC) surrounded with magnetic fluid (MF) is proposed and experimentally demonstrated. As the MFC is strongly sensitive to the surrounding refractive index (RI) and the MF's RI is sensitive to magnetic field, the magnetic field sensing function of the proposed structure is realized. Interrogation of magnetic field strength is achieved by measuring the dip wavelength shift and transmission loss change of the transmission spectrum. The experimental results show that the sensitivity of the sensor is wavelength-dependent. The maximum sensitivity of 191.8 pm/Oe is achieved at a wavelength of around 1537 nm in this work. In addition, a sensitivity of −0.037 dB/Oe is achieved by monitoring variation of the fringe visibility. These results suggest potential applications of the proposed structure in tunable all-in-fiber photonic devices such as magneto-optical modulators, filters, and sensors.

  3. Analysis and design of arrayed waveguide gratings with MMI couplers.

    PubMed

    Munoz, P; Pastor, D; Capmany, J

    2001-09-24

    We present an extension of the AWG model and design procedure described in [1] to incorporate multimode interference (MMI) couplers. For the first time to our knowledge, a closed formula for the passband bandwidth and crosstalk estimation plots are derived.

  4. Scalable parallel communications

    NASA Technical Reports Server (NTRS)

    Maly, K.; Khanna, S.; Overstreet, C. M.; Mukkamala, R.; Zubair, M.; Sekhar, Y. S.; Foudriat, E. C.

    1992-01-01

    Coarse-grain parallelism in networking (that is, the use of multiple protocol processors running replicated software sending over several physical channels) can be used to provide gigabit communications for a single application. Since parallel network performance is highly dependent on real issues such as hardware properties (e.g., memory speeds and cache hit rates), operating system overhead (e.g., interrupt handling), and protocol performance (e.g., effect of timeouts), we have performed detailed simulation studies of both a bus-based multiprocessor workstation node (based on the Sun Galaxy MP multiprocessor) and a distributed-memory parallel computer node (based on the Touchstone DELTA) to evaluate the behavior of coarse-grain parallelism. Our results indicate: (1) coarse-grain parallelism can deliver multiple 100 Mbps with currently available hardware platforms and existing networking protocols (such as Transmission Control Protocol/Internet Protocol (TCP/IP) and parallel Fiber Distributed Data Interface (FDDI) rings); (2) scale-up is near linear in n, the number of protocol processors, and channels (for small n and up to a few hundred Mbps); and (3) since these results are based on existing hardware without specialized devices (except perhaps for some simple modifications of the FDDI boards), this is a low cost solution to providing multiple 100 Mbps on current machines. In addition, from both the performance analysis and the properties of these architectures, we conclude: (1) multiple processors providing identical services and the use of space division multiplexing for the physical channels can provide better reliability than monolithic approaches (it also provides graceful degradation and low-cost load balancing); (2) coarse-grain parallelism supports running several transport protocols in parallel to provide different types of service (for example, one TCP handles small messages for many users, other TCPs running in parallel provide high bandwidth

  5. High performace silicon 2x2 optical switch based on a thermo-optically tunable multimode interference coupler and efficient electrodes.

    PubMed

    Rosa, Álvaro; Gutiérrez, Ana; Brimont, Antoine; Griol, Amadeu; Sanchis, Pablo

    2016-01-11

    Optical switches based on tunable multimode interference (MMI) couplers can simultaneously reduce the footprint and increase the tolerance against fabrication deviations. Here, a compact 2x2 silicon switch based on a thermo-optically tunable MMI structure with a footprint of only 0.005 mm² is proposed and demonstrated. The MMI structure has been optimized using a silica trench acting as a thermal isolator without introducing any substantial loss penalty or crosstalk degradation. Furthermore, the electrode performance has been significantly improved by engineering the heater geometry and using two metallization steps. Thereby, a drastic power consumption reduction of around 90% has been demonstrated, yielding values as low as 24.9 mW. Very fast switching times of only 1.19 µs have also been achieved.

  6. Aspects of the development of ultrabroadband precision directional couplers

    NASA Astrophysics Data System (ADS)

    Kats, B. M.; Larionov, A. I.; Meshchanov, V. P.

    1991-03-01

    The synthesis of ultrabroadband coaxial directional couplers (DCs) with improved characteristics is examined. Precision DCs with operating ranges of 0.6-12.5 and 1.5-18.0 GHz have been developed and experimentally tested. The devices are realized on the basis of coupled coaxial lines of a new type.

  7. Shift-and-invert parallel spectral transformation eigensolver: Massively parallel performance for density-functional based tight-binding

    DOE PAGES

    Zhang, Hong; Zapol, Peter; Dixon, David A.; ...

    2015-11-17

    The Shift-and-invert parallel spectral transformations (SIPs), a computational approach to solve sparse eigenvalue problems, is developed for massively parallel architectures with exceptional parallel scalability and robustness. The capabilities of SIPs are demonstrated by diagonalization of density-functional based tight-binding (DFTB) Hamiltonian and overlap matrices for single-wall metallic carbon nanotubes, diamond nanowires, and bulk diamond crystals. The largest (smallest) example studied is a 128,000 (2000) atom nanotube for which ~330,000 (~5600) eigenvalues and eigenfunctions are obtained in ~190 (~5) seconds when parallelized over 266,144 (16,384) Blue Gene/Q cores. Weak scaling and strong scaling of SIPs are analyzed and the performance of SIPs is compared with other novel methods. Different matrix ordering methods are investigated to reduce the cost of the factorization step, which dominates the time-to-solution at the strong scaling limit. As a result, a parallel implementation of assembling the density matrix from the distributed eigenvectors is demonstrated.

  8. Shift-and-invert parallel spectral transformation eigensolver: Massively parallel performance for density-functional based tight-binding

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Hong; Zapol, Peter; Dixon, David A.

    The Shift-and-invert parallel spectral transformations (SIPs), a computational approach to solve sparse eigenvalue problems, is developed for massively parallel architectures with exceptional parallel scalability and robustness. The capabilities of SIPs are demonstrated by diagonalization of density-functional based tight-binding (DFTB) Hamiltonian and overlap matrices for single-wall metallic carbon nanotubes, diamond nanowires, and bulk diamond crystals. The largest (smallest) example studied is a 128,000 (2000) atom nanotube for which ~330,000 (~5600) eigenvalues and eigenfunctions are obtained in ~190 (~5) seconds when parallelized over 266,144 (16,384) Blue Gene/Q cores. Weak scaling and strong scaling of SIPs are analyzed and the performance of SIPs is compared with other novel methods. Different matrix ordering methods are investigated to reduce the cost of the factorization step, which dominates the time-to-solution at the strong scaling limit. As a result, a parallel implementation of assembling the density matrix from the distributed eigenvectors is demonstrated.

  9. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit

    PubMed Central

    Pronk, Sander; Páll, Szilárd; Schulz, Roland; Larsson, Per; Bjelkmar, Pär; Apostolov, Rossen; Shirts, Michael R.; Smith, Jeremy C.; Kasson, Peter M.; van der Spoel, David; Hess, Berk; Lindahl, Erik

    2013-01-01

    Motivation: Molecular simulation has historically been a low-throughput technique, but faster computers and increasing amounts of genomic and structural data are changing this by enabling large-scale automated simulation of, for instance, many conformers or mutants of biomolecules with or without a range of ligands. At the same time, advances in performance and scaling now make it possible to model complex biomolecular interaction and function in a manner directly testable by experiment. These applications share a need for fast and efficient software that can be deployed on massive scale in clusters, web servers, distributed computing or cloud resources. Results: Here, we present a range of new simulation algorithms and features developed during the past 4 years, leading up to the GROMACS 4.5 software package. The software now automatically handles wide classes of biomolecules, such as proteins, nucleic acids and lipids, and comes with all commonly used force fields for these molecules built-in. GROMACS supports several implicit solvent models, as well as new free-energy algorithms, and the software now uses multithreading for efficient parallelization even on low-end systems, including windows-based workstations. Together with hand-tuned assembly kernels and state-of-the-art parallelization, this provides extremely high performance and cost efficiency for high-throughput as well as massively parallel simulations. Availability: GROMACS is an open source and free software available from http://www.gromacs.org. Contact: erik.lindahl@scilifelab.se Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23407358

  10. Femtosecond laser fabrication of birefringent directional couplers as polarization beam splitters in fused silica.

    PubMed

    Fernandes, Luís A; Grenier, Jason R; Herman, Peter R; Aitchison, J Stewart; Marques, Paulo V S

    2011-06-20

    Integrated polarization beam splitters based on birefringent directional couplers are demonstrated. The devices are fabricated in bulk fused silica glass by femtosecond laser writing (300 fs, 150 nJ at 500 kHz, 522 nm). The birefringence was measured from the spectral splitting of the Bragg grating resonances associated with the vertically and horizontally polarized modes. Polarization splitting directional couplers were designed and demonstrated with 0.5 dB/cm propagation losses and -19 dB and -24 dB extinction ratios for the polarization splitting.

  11. Monolithically integrated self-rolled-up microtube-based vertical coupler for three-dimensional photonic integration

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yu, Xin; Arbabi, Ehsan; Goddard, Lynford L.

    2015-07-20

    We demonstrate a self-rolled-up microtube-based vertical photonic coupler monolithically integrated on top of a ridge waveguide to achieve three-dimensional (3D) photonic integration. The fabrication process is fully compatible with standard planar silicon processing technology. Strong light coupling between the vertical coupler and the ridge waveguide was observed experimentally, which may provide an alternative route for 3D heterogeneous photonic integration. The highest extinction ratio observed in the transmission spectrum passing through the ridge waveguide was 23 dB.

  12. A Multi-Level Parallelization Concept for High-Fidelity Multi-Block Solvers

    NASA Technical Reports Server (NTRS)

    Hatay, Ferhat F.; Jespersen, Dennis C.; Guruswamy, Guru P.; Rizk, Yehia M.; Byun, Chansup; Gee, Ken; VanDalsem, William R. (Technical Monitor)

    1997-01-01

    The integration of high-fidelity Computational Fluid Dynamics (CFD) analysis tools with the industrial design process benefits greatly from the robust implementations that are transportable across a wide range of computer architectures. In the present work, a hybrid domain-decomposition and parallelization concept was developed and implemented into the widely-used NASA multi-block Computational Fluid Dynamics (CFD) packages implemented in ENSAERO and OVERFLOW. The new parallel solver concept, PENS (Parallel Euler Navier-Stokes Solver), employs both fine and coarse granularity in data partitioning as well as data coalescing to obtain the desired load-balance characteristics on the available computer platforms. This multi-level parallelism implementation itself introduces no changes to the numerical results, hence the original fidelity of the packages are identically preserved. The present implementation uses the Message Passing Interface (MPI) library for interprocessor message passing and memory accessing. By choosing an appropriate combination of the available partitioning and coalescing capabilities only during the execution stage, the PENS solver becomes adaptable to different computer architectures from shared-memory to distributed-memory platforms with varying degrees of parallelism. The PENS implementation on the IBM SP2 distributed memory environment at the NASA Ames Research Center obtains 85 percent scalable parallel performance using fine-grain partitioning of single-block CFD domains using up to 128 wide computational nodes. Multi-block CFD simulations of complete aircraft simulations achieve 75 percent perfect load-balanced executions using data coalescing and the two levels of parallelism. SGI PowerChallenge, SGI Origin 2000, and a cluster of workstations are the other platforms where the robustness of the implementation is tested. 
The performance behavior on the other computer platforms with a variety of realistic problems will be included as this on

  13. All silicon waveguide spherical microcavity coupler device.

    PubMed

    Xifré-Pérez, E; Domenech, J D; Fenollosa, R; Muñoz, P; Capmany, J; Meseguer, F

    2011-02-14

    A coupler based on silicon spherical microcavities coupled to silicon waveguides for telecom wavelengths is presented. The light scattered by the microcavity is detected and analyzed as a function of the wavelength. The transmittance signal through the waveguide is strongly attenuated (up to 25 dB) at wavelengths corresponding to the Mie resonances of the microcavity. The coupling between the microcavity and the waveguide is experimentally demonstrated and theoretically modeled with the help of FDTD calculations.

  14. Design and Fabrication of NxN Optical Couplers Based on Organic Polymer Optical Waveguides

    DTIC Science & Technology

    1994-08-01

    10x10 optical coupler utilizing photopolymerizable organic polymers. Background information on the theory of operation of the coupler culminating in a...Channel Waveguides Based on Photopolymerizable Di/Tri Acrylates," in Optoelectronic Interconnects II, Ray T. Chen, John A. Neff, Editors, Proc. SPIE 2153, pp...demonstrated that acrylic polymers can be used to fabricate single-mode optical waveguides. The resins that we have formulated are photopolymerizable

  15. DVS-SOFTWARE: An Effective Tool for Applying Highly Parallelized Hardware To Computational Geophysics

    NASA Astrophysics Data System (ADS)

    Herrera, I.; Herrera, G. S.

    2015-12-01

    Most geophysical systems are macroscopic physical systems. The behavior of such systems is predicted by means of computational models whose basic models are partial differential equations (PDEs) [1]. Due to the enormous size of the discretized version of such PDEs it is necessary to apply highly parallelized supercomputers. For them, at present, the most efficient software is based on non-overlapping domain decomposition methods (DDM). However, a limiting feature of the present state-of-the-art techniques is due to the kind of discretizations used in them. Recently, I. Herrera and co-workers using 'non-overlapping discretizations' have produced the DVS-Software which overcomes this limitation [2]. The DVS-software can be applied to a great variety of geophysical problems and achieves very high parallel efficiencies (90%, or so [3]). It is therefore very suitable for effectively applying the most advanced parallel supercomputers available at present. In a parallel talk, in this AGU Fall Meeting, Graciela Herrera Z. will present how this software is being applied to advance MOD-FLOW. Key Words: Parallel Software for Geophysics, High Performance Computing, HPC, Parallel Computing, Domain Decomposition Methods (DDM). REFERENCES: [1] Herrera, Ismael and George F. Pinder, "Mathematical Modelling in Science and Engineering: An axiomatic approach", John Wiley, 243p., 2012. [2] Herrera, I., de la Cruz L.M. and Rosas-Medina A. "Non Overlapping Discretization Methods for Partial Differential Equations". NUMER METH PART D E, 30: 1427-1454, 2014, DOI 10.1002/num.21852. (Open source) [3] Herrera, I., & Contreras Iván "An Innovative Tool for Effectively Applying Highly Parallelized Software To Problems of Elasticity". Geofísica Internacional, 2015 (In press)

  16. Wideband tunable wavelength-selective coupling in asymmetric side-polished fiber coupler with dispersive interlayer.

    PubMed

    Chen, Nan-Kuang; Lee, Cheng-Ling; Chi, Sien

    2007-12-24

    We demonstrate a tunable, highly wavelength-selective filter based on a 2 x 2 asymmetric side-polished fiber coupler with a dispersive interlayer in one of the coupling arms. The asymmetric fiber coupler is made of two side-polished fibers using identical single-mode fibers, and one of the polished fibers is further chemically etched at the central evanescent coupling region to get closer to the core. An optical liquid with different dispersion characteristics than that of silica fiber is used to fill up the etched hollow, and therefore the propagation constant for the polished fiber with dispersive liquid becomes more dispersive and crosses with that of the other, untreated polished fiber. The location of the cross point and the cross angle between the two propagation constant curves determine the coupling wavelength and coupling bandwidth as well as channel wavelength separation, respectively. The coupling wavelength can be tuned over a range of at least 84 nm (1.326-1.410 µm) under an index variation of 0.004 and with coupling ratios higher than 30 dB.

  17. Bilingual parallel programming

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Foster, I.; Overbeek, R.

    1990-01-01

    Numerous experiments have demonstrated that computationally intensive algorithms support adequate parallelism to exploit the potential of large parallel machines. Yet successful parallel implementations of serious applications are rare. The limiting factor is clearly programming technology. None of the approaches to parallel programming that have been proposed to date -- whether parallelizing compilers, language extensions, or new concurrent languages -- seem to adequately address the central problems of portability, expressiveness, efficiency, and compatibility with existing software. In this paper, we advocate an alternative approach to parallel programming based on what we call bilingual programming. We present evidence that this approach provides an effective solution to parallel programming problems. The key idea in bilingual programming is to construct the upper levels of applications in a high-level language while coding selected low-level components in low-level languages. This approach permits the advantages of a high-level notation (expressiveness, elegance, conciseness) to be obtained without the cost in performance normally associated with high-level approaches. In addition, it provides a natural framework for reusing existing code.

  18. Performance Analysis and Optimization on the UCLA Parallel Atmospheric General Circulation Model Code

    NASA Technical Reports Server (NTRS)

    Lou, John; Ferraro, Robert; Farrara, John; Mechoso, Carlos

    1996-01-01

    An analysis is presented of several factors influencing the performance of a parallel implementation of the UCLA atmospheric general circulation model (AGCM) on massively parallel computer systems. Several modifications to the original parallel AGCM code aimed at improving its numerical efficiency, interprocessor communication cost, load-balance and issues affecting single-node code performance are discussed.

  19. Parallel computing works

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    An account of the Caltech Concurrent Computation Program (C³P), a five year project that focused on answering the question: "Can parallel computers be used to do large-scale scientific computations?" As the title indicates, the question is answered in the affirmative, by implementing numerous scientific applications on real parallel computers and doing computations that produced new scientific results. In the process of doing so, C³P helped design and build several new computers, designed and implemented basic system software, developed algorithms for frequently used mathematical computations on massively parallel machines, devised performance models and measured the performance of many computers, and created a high performance computing facility based exclusively on parallel computers. While the initial focus of C³P was the hypercube architecture developed by C. Seitz, many of the methods developed and lessons learned have been applied successfully on other massively parallel architectures.

  20. Electro-Optic Modulator Based on Organic Planar Waveguide Integrated with Prism Coupler

    NASA Technical Reports Server (NTRS)

    Sarkisov, Sergey S.

    2002-01-01

    The objectives of the project, as they were formulated in the proposal, are the following: (1) Design and development of novel electro-optic modulator using single crystalline film of highly efficient electro-optic organic material integrated with prism coupler; (2) Experimental characterization of the figures-of-merit of the modulator. It is expected to perform with an extinction ratio of 10 dB at a driving signal of 5 V; (3) Conclusions on feasibility of the modulator as an element of data communication systems of future generations. The accomplishments of the project are the following: (1) The design of the electro-optic modulator based on a single crystalline film of organic material NPP has been explored; (2) The evaluation of the figures-of-merit of the electro-optic modulator has been performed; (3) Based on the results of characterization of the figures-of-merit, the conclusion was made that the modulator based on a thin film of NPP is feasible and has a great potential of being used in optic communication with a modulation bandwidth of up to 100 GHz and a driving voltage of the order of 3 to 5 V.

  1. Development of Ultra-Low Noise, High Performance III-V Quantum Well Infrared Photodetectors (QWIPs) for Focal Plane Array Staring Image Sensor Systems

    DTIC Science & Technology

    1993-08-01

    Development of Ultra-Low Noise, High Performance III-V Quantum Well Infrared Photodetectors (QWIPs) for Focal Plane Array Staring Image Sensor Systems...using a 2-D square mesh grating coupler to achieve maximum responsivity for an InGaAs SBTM QWIP, and (iv) performed noise characterization on four...different types of III-V QWIPs and identified their noise sources. Detailed results and accomplishments are discussed in this report.

  2. Extending Automatic Parallelization to Optimize High-Level Abstractions for Multicore

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liao, C; Quinlan, D J; Willcock, J J

    2008-12-12

    Automatic introduction of OpenMP for sequential applications has attracted significant attention recently because of the proliferation of multicore processors and the simplicity of using OpenMP to express parallelism for shared-memory systems. However, most previous research has only focused on C and Fortran applications operating on primitive data types. C++ applications using high-level abstractions, such as STL containers and complex user-defined types, are largely ignored due to the lack of research compilers that are readily able to recognize high-level object-oriented abstractions and leverage their associated semantics. In this paper, we automatically parallelize C++ applications using ROSE, a multiple-language source-to-source compiler infrastructure which preserves the high-level abstractions and gives us access to their semantics. Several representative parallelization candidate kernels are used to explore semantic-aware parallelization strategies for high-level abstractions, combined with extended compiler analyses. Those kernels include an array-based computation loop, a loop with task-level parallelism, and a domain-specific tree traversal. Our work extends the applicability of automatic parallelization to modern applications using high-level abstractions and exposes more opportunities to take advantage of multicore processors.

  3. Theoretical study of nanophotonic directional couplers comprising near-field-coupled metal nanoparticles.

    PubMed

    Holmström, Petter; Yuan, Jun; Qiu, Min; Thylén, Lars; Bratkovsky, Alexander M

    2011-04-11

    The properties of integrated-photonics directional couplers composed of near-field-coupled arrays of metal nanoparticles are analyzed theoretically. It is found that it is possible to generate very compact, submicron length, high field-confinement and functionality devices with very low switch energies. The analysis is carried out for a hypothetical lossless silver to demonstrate the potential of this type of circuits for applications in telecom and interconnects. Employing losses of real silver, standalone devices with the above properties are still feasible in optimized metal nanoparticle structures. © 2011 Optical Society of America

  4. A Novel Multimode Waveguide Coupler for Accurate Power Measurement of Traveling Wave Tube Harmonic Frequencies

    NASA Technical Reports Server (NTRS)

    Wintucky, Edwin G.; Simons, Rainee N.

    2014-01-01

    This paper presents the design, fabrication and test results for a novel waveguide multimode directional coupler (MDC). The coupler fabricated from two dissimilar waveguides is capable of isolating the power at the second harmonic frequency from the fundamental power at the output port of a traveling-wave tube (TWT). In addition to accurate power measurements at harmonic frequencies, a potential application of the MDC is in the design of a beacon source for atmospheric propagation studies at millimeter-wave frequencies.

  5. Evaluation of the power consumption of a high-speed parallel robot

    NASA Astrophysics Data System (ADS)

    Han, Gang; Xie, Fugui; Liu, Xin-Jun

    2018-06-01

    An inverse dynamic model of a high-speed parallel robot is established based on the virtual work principle. With this dynamic model, a new evaluation method is proposed to measure the power consumption of the robot during pick-and-place tasks. The power vector is extended in this method and used to represent the collinear velocity and acceleration of the moving platform. Afterward, several dynamic performance indices, which are homogeneous and possess obvious physical meanings, are proposed. These indices can evaluate the power input and output transmissibility of the robot in a workspace. The distributions of the power input and output transmissibility of the high-speed parallel robot are derived with these indices and clearly illustrated in atlases. Furthermore, a low-power-consumption workspace is selected for the robot.

  6. The OpenMP Implementation of NAS Parallel Benchmarks and its Performance

    NASA Technical Reports Server (NTRS)

    Jin, Hao-Qiang; Frumkin, Michael; Yan, Jerry

    1999-01-01

    As the new ccNUMA architecture became popular in recent years, parallel programming with compiler directives on these machines has evolved to accommodate new needs. In this study, we examine the effectiveness of OpenMP directives for parallelizing the NAS Parallel Benchmarks. Implementation details will be discussed and performance will be compared with the MPI implementation. We have demonstrated that OpenMP can achieve very good results for parallelization on a shared memory system, but effective use of memory and cache is very important.

  7. Synthesizing parallel imaging applications using the CAP (computer-aided parallelization) tool

    NASA Astrophysics Data System (ADS)

    Gennart, Benoit A.; Mazzariol, Marc; Messerli, Vincent; Hersch, Roger D.

    1997-12-01

    Imaging applications such as filtering, image transforms and compression/decompression require vast amounts of computing power when applied to large data sets. These applications would potentially benefit from the use of parallel processing. However, dedicated parallel computers are expensive and their processing power per node lags behind that of the most recent commodity components. Furthermore, developing parallel applications remains a difficult task: writing and debugging the application is difficult (deadlocks), programs may not be portable from one parallel architecture to the other, and performance often comes short of expectations. In order to facilitate the development of parallel applications, we propose the CAP computer-aided parallelization tool which enables application programmers to specify at a high-level of abstraction the flow of data between pipelined-parallel operations. In addition, the CAP tool supports the programmer in developing parallel imaging and storage operations. CAP enables combining efficiently parallel storage access routines and image processing sequential operations. This paper shows how processing and I/O intensive imaging applications must be implemented to take advantage of parallelism and pipelining between data access and processing. This paper's contribution is (1) to show how such implementations can be compactly specified in CAP, and (2) to demonstrate that CAP specified applications achieve the performance of custom parallel code. The paper analyzes theoretically the performance of CAP specified applications and demonstrates the accuracy of the theoretical analysis through experimental measurements.

  8. Distributed and parallel approach for handle and perform huge datasets

    NASA Astrophysics Data System (ADS)

    Konopko, Joanna

    2015-12-01

    Big Data refers to dynamic, large and disparate volumes of data that come from many different sources (tools, machines, sensors, mobile devices) and are uncorrelated with each other. It requires new, innovative and scalable technology to collect, host and analytically process the vast amount of data. A proper architecture is needed for systems that process huge data sets. In this paper, a comparison of distributed and parallel system architectures is presented using the example of the MapReduce (MR) Hadoop platform and a parallel database platform (DBMS). This paper also analyzes the problem of processing and handling valuable information from petabytes of data. Both paradigms, MapReduce and parallel DBMS, are described and compared. A hybrid architecture approach is also proposed that could be used to solve the analyzed problem of storing and processing Big Data.

  9. Parallel Architectures and Parallel Algorithms for Integrated Vision Systems. Ph.D. Thesis

    NASA Technical Reports Server (NTRS)

    Choudhary, Alok Nidhi

    1989-01-01

    Computer vision is regarded as one of the most complex and computationally intensive problems. An integrated vision system (IVS) is a system that uses vision algorithms from all levels of processing to perform for a high level application (e.g., object recognition). An IVS normally involves algorithms from low level, intermediate level, and high level vision. Designing parallel architectures for vision systems is of tremendous interest to researchers. Several issues are addressed in parallel architectures and parallel algorithms for integrated vision systems.

  10. High-performance Chinese multiclass traffic sign detection via coarse-to-fine cascade and parallel support vector machine detectors

    NASA Astrophysics Data System (ADS)

    Chang, Faliang; Liu, Chunsheng

    2017-09-01

    The high variability of sign colors and shapes in uncontrolled environments has made the detection of traffic signs a challenging problem in computer vision. We propose a traffic sign detection (TSD) method based on coarse-to-fine cascade and parallel support vector machine (SVM) detectors to detect Chinese warning and danger traffic signs. First, a region of interest (ROI) extraction method is proposed to extract ROIs using color contrast features in local regions. The ROI extraction can reduce scanning regions and save detection time. For multiclass TSD, we propose a structure that combines a coarse-to-fine cascaded tree with a parallel structure of histogram of oriented gradients (HOG) + SVM detectors. The cascaded tree is designed to detect different types of traffic signs in a coarse-to-fine process. The parallel HOG + SVM detectors are designed to do fine detection of different types of traffic signs. The experiments demonstrate the proposed TSD method can rapidly detect multiclass traffic signs with different colors and shapes in high accuracy.

  11. A parallel-vector algorithm for rapid structural analysis on high-performance computers

    NASA Technical Reports Server (NTRS)

    Storaasli, Olaf O.; Nguyen, Duc T.; Agarwal, Tarun K.

    1990-01-01

    A fast, accurate Choleski method for the solution of symmetric systems of linear equations is presented. This direct method is based on a variable-band storage scheme and takes advantage of column heights to reduce the number of operations in the Choleski factorization. The method employs parallel computation in the outermost DO-loop and vector computation via the 'loop unrolling' technique in the innermost DO-loop. The method avoids computations with zeros outside the column heights, and as an option, zeros inside the band. The close relationship between Choleski and Gauss elimination methods is examined. The minor changes required to convert the Choleski code to a Gauss code to solve non-positive-definite symmetric systems of equations are identified. The results for two large-scale structural analyses performed on supercomputers demonstrate the accuracy and speed of the method.
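
    As an illustration of the column-height idea described in this abstract, the following is a minimal Python sketch, not the authors' code. It assumes dense list-of-lists storage for readability (the paper uses a variable-band scheme), and the function names column_heights and skyline_cholesky are hypothetical. The parallel outermost loop and the unrolled innermost loop are omitted; the sketch only shows how operations on zeros above each column's first nonzero entry can be skipped.

```python
import math


def column_heights(a):
    """For each column j, return the row index of its first nonzero
    entry in the upper triangle (the top of the column 'skyline')."""
    n = len(a)
    first = []
    for j in range(n):
        i = 0
        while i < j and a[i][j] == 0.0:
            i += 1
        first.append(i)
    return first


def skyline_cholesky(a):
    """Factor a symmetric positive-definite matrix as A = U^T U,
    visiting only entries inside the column skyline.

    Dense storage is an assumption made for clarity; a real
    variable-band code would store only the entries below first[j].
    """
    n = len(a)
    first = column_heights(a)
    u = [[0.0] * n for _ in range(n)]
    for j in range(n):                      # one column at a time
        for i in range(first[j], j + 1):    # skip rows above the skyline
            # Products with k < max(first[i], first[j]) are exactly zero.
            lo = max(first[i], first[j])
            s = a[i][j] - sum(u[k][i] * u[k][j] for k in range(lo, i))
            if i == j:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                u[j][j] = math.sqrt(s)
            else:
                u[i][j] = s / u[i][i]
    return u
```

    For a small banded matrix such as [[4, 2, 0], [2, 5, 3], [0, 3, 10]], the entry in row 0 of column 2 lies above the skyline and is never touched, which is the saving the abstract refers to.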

  12. A parallel-vector algorithm for rapid structural analysis on high-performance computers

    NASA Technical Reports Server (NTRS)

    Storaasli, Olaf O.; Nguyen, Duc T.; Agarwal, Tarun K.

    1990-01-01

    A fast, accurate Choleski method for the solution of symmetric systems of linear equations is presented. This direct method is based on a variable-band storage scheme and takes advantage of column heights to reduce the number of operations in the Choleski factorization. The method employs parallel computation in the outermost DO-loop and vector computation via the loop unrolling technique in the innermost DO-loop. The method avoids computations with zeros outside the column heights, and as an option, zeros inside the band. The close relationship between Choleski and Gauss elimination methods is examined. The minor changes required to convert the Choleski code to a Gauss code to solve non-positive-definite symmetric systems of equations are identified. The results for two large scale structural analyses performed on supercomputers demonstrate the accuracy and speed of the method.

  13. High Performance Fortran for Aerospace Applications

    NASA Technical Reports Server (NTRS)

    Mehrotra, Piyush; Zima, Hans; Bushnell, Dennis M. (Technical Monitor)

    2000-01-01

    This paper focuses on the use of High Performance Fortran (HPF) for important classes of algorithms employed in aerospace applications. HPF is a set of Fortran extensions designed to provide users with a high-level interface for programming data parallel scientific applications, while delegating to the compiler/runtime system the task of generating explicitly parallel message-passing programs. We begin by providing a short overview of the HPF language. This is followed by a detailed discussion of the efficient use of HPF for applications involving multiple structured grids such as multiblock and adaptive mesh refinement (AMR) codes as well as unstructured grid codes. We focus on the data structures and computational structures used in these codes and on the high-level strategies that can be expressed in HPF to optimally exploit the parallelism in these algorithms.

  14. A grating coupler with a trapezoidal hole array for perfectly vertical light coupling between optical fibers and waveguides

    NASA Astrophysics Data System (ADS)

    Mizutani, Akio; Eto, Yohei; Kikuta, Hisao

    2017-12-01

    A grating coupler with a trapezoidal hole array was designed and fabricated for perfectly vertical light coupling between a single-mode optical fiber and a silicon waveguide on a silicon-on-insulator (SOI) substrate. The grating coupler with an efficiency of 53% was computationally designed at a 1.1-µm-thick buried oxide (BOX) layer. The grating coupler and silicon waveguide were fabricated on the SOI substrate with a 3.0-µm-thick BOX layer by a single full-etch process. The measured coupling efficiency was 24% for TE-polarized light at 1528 nm wavelength, which was 0.69 times the calculated coupling efficiency for the 3.0-µm-thick BOX layer.

  15. Parallel Performance of a Combustion Chemistry Simulation

    DOE PAGES

    Skinner, Gregg; Eigenmann, Rudolf

    1995-01-01

    We used a description of a combustion simulation's mathematical and computational methods to develop a version for parallel execution. The result was a reasonable performance improvement on small numbers of processors. We applied several important programming techniques, which we describe, in optimizing the application. This work has implications for programming languages, compiler design, and software engineering.

  16. Silicon Nitride Grating Coupler with Flexible Bandwidth Incorporating a Serially Concatenated Multimode Interference Filter

    NASA Astrophysics Data System (ADS)

    Kim, Woo-Ju; Lee, Hak-Soon; Lee, Sang-Shin

    2012-04-01

    A compact silicon nitride grating coupler with flexible bandwidth was demonstrated taking advantage of a basic grating integrated with a serially connected multistage multimode interference (MMI) filter. The spectral response could be tailored by varying the order of the MMI filter, without affecting the basic grating structure. The dependence of the spectral response of the proposed device on the order of the MMI stage was thoroughly investigated. As regards the fabricated grating coupler with a four-stage MMI filter, the observed spectral bandwidth was efficiently altered from 53 to 21 nm in the ~1550 nm spectral band.

  17. On the costs of parallel processing in dual-task performance: The case of lexical processing in word production.

    PubMed

    Paucke, Madlen; Oppermann, Frank; Koch, Iring; Jescheniak, Jörg D

    2015-12-01

    Previous dual-task picture-naming studies suggest that lexical processes require capacity-limited processes and prevent other tasks from being carried out in parallel. However, studies involving the processing of multiple pictures suggest that parallel lexical processing is possible. The present study investigated the specific costs that may arise when such parallel processing occurs. We used a novel dual-task paradigm by presenting 2 visual objects associated with different tasks and manipulating between-task similarity. With high similarity, a picture-naming task (T1) was combined with a phoneme-decision task (T2), so that lexical processes were shared across tasks. With low similarity, picture-naming was combined with a size-decision T2 (nonshared lexical processes). In Experiment 1, we found that a manipulation of lexical processes (lexical frequency of T1 object name) showed an additive propagation with low between-task similarity and an overadditive propagation with high between-task similarity. Experiment 2 replicated this differential forward propagation of the lexical effect and showed that it disappeared with longer stimulus onset asynchronies. Moreover, both experiments showed backward crosstalk, indexed as worse T1 performance with high between-task similarity compared with low similarity. Together, these findings suggest that conditions of high between-task similarity can lead to parallel lexical processing in both tasks, which, however, does not result in benefits but rather in extra performance costs. These costs can be attributed to crosstalk based on the dual-task binding problem arising from parallel processing. Hence, the present study reveals that capacity-limited lexical processing can run in parallel across dual tasks but only at the expense of extraordinarily high costs. (c) 2015 APA, all rights reserved.

  18. Chip-scale integrated optical interconnects: a key enabler for future high-performance computing

    NASA Astrophysics Data System (ADS)

    Haney, Michael; Nair, Rohit; Gu, Tian

    2012-01-01

    High Performance Computing (HPC) systems are putting ever-increasing demands on the throughput efficiency of their interconnection fabrics. In this paper, the limits of conventional metal trace-based inter-chip interconnect fabrics are examined in the context of state-of-the-art HPC systems, which currently operate near the 1 GFLOPS/W level. The analysis suggests that conventional metal trace interconnects will limit performance to approximately 6 GFLOPS/W in larger HPC systems that require many computer chips to be interconnected in parallel processing architectures. As the HPC communications bottlenecks push closer to the processing chips, integrated Optical Interconnect (OI) technology may provide the ultra-high bandwidths needed at the inter- and intra-chip levels. With inter-chip photonic link energies projected to be less than 1 pJ/bit, integrated OI is projected to enable HPC architecture scaling to the 50 GFLOPS/W level and beyond - providing a path to Peta-FLOPS-level HPC within a single rack, and potentially even Exa-FLOPS-level HPC for large systems. A new hybrid integrated chip-scale OI approach is described and evaluated. The concept integrates a high-density polymer waveguide fabric directly on top of a multiple quantum well (MQW) modulator array that is area-bonded to the Silicon computing chip. Grayscale lithography is used to fabricate 5 μm x 5 μm polymer waveguides and associated novel small-footprint total internal reflection-based vertical input/output couplers directly onto a layer containing an array of GaAs MQW devices configured to be either absorption modulators or photodetectors. An external continuous wave optical "power supply" is coupled into the waveguide links. Contrast ratios were measured using a test rider chip in place of a Silicon processing chip. The results suggest that sub-pJ/b chip-scale communication is achievable with this concept. When integrated into high-density integrated optical interconnect fabrics, it could provide

  19. Understanding and Improving High-Performance I/O Subsystems

    NASA Technical Reports Server (NTRS)

    El-Ghazawi, Tarek A.; Frieder, Gideon; Clark, A. James

    1996-01-01

    This research program has been conducted in the framework of the NASA Earth and Space Science (ESS) evaluations led by Dr. Thomas Sterling. In addition to the many important research findings for NASA and the prestigious publications, the program has helped orient the doctoral research program of two students towards parallel input/output in high-performance computing. Further, the experimental results in the case of the MasPar were very useful to MasPar, with whose technical management the P.I. has had many interactions. The contributions of this program are drawn from three experimental studies conducted on different high-performance computing testbeds/platforms, and are therefore presented in 3 different segments as follows: 1. Evaluating the parallel input/output subsystem of NASA high-performance computing testbeds, namely the MasPar MP-1 and MP-2; 2. Characterizing the physical input/output request patterns for NASA ESS applications, which used the Beowulf platform; and 3. Dynamic scheduling techniques for hiding I/O latency in parallel applications such as sparse matrix computations. This study also has been conducted on the Intel Paragon and has also provided an experimental evaluation for the Parallel File System (PFS) and parallel input/output on the Paragon. This report is organized as follows. The summary of findings discusses the results of each of the aforementioned 3 studies. Three appendices, each containing a key scholarly research paper that details the work in one of the studies, are included.

  20. HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks

    PubMed Central

    Azad, Ariful; Ouzounis, Christos A; Kyrpides, Nikos C; Buluç, Aydin

    2018-01-01

    Biological networks capture structural or functional properties of relevant entities such as molecules, proteins or genes. Characteristic examples are gene expression networks or protein–protein interaction networks, which hold information about functional affinities or structural similarities. Such networks have been expanding in size due to increasing scale and abundance of biological data. While various clustering algorithms have been proposed to find highly connected regions, Markov Clustering (MCL) has been one of the most successful approaches to cluster sequence similarity or expression networks. Despite its popularity, MCL’s scalability to cluster large datasets still remains a bottleneck due to high running times and memory demands. Here, we present High-performance MCL (HipMCL), a parallel implementation of the original MCL algorithm that can run on distributed-memory computers. We show that HipMCL can efficiently utilize 2000 compute nodes and cluster a network of ∼70 million nodes with ∼68 billion edges in ∼2.4 h. By exploiting distributed-memory environments, HipMCL clusters large-scale networks several orders of magnitude faster than MCL and enables clustering of even bigger networks. HipMCL is based on MPI and OpenMP and is freely available under a modified BSD license. PMID:29315405
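
    To make the running-time and memory bottleneck mentioned in this abstract more concrete, the following is a small serial Python sketch of the basic MCL iteration (expansion by matrix squaring, then inflation by elementwise powering and column renormalization). It is not HipMCL and not the reference MCL code: the function names, the dense list-of-lists matrix, the fixed iteration count and the default inflation value of 2.0 are all assumptions chosen for illustration.

```python
def normalize_columns(m):
    """Scale every column of m in place so that it sums to 1."""
    n = len(m)
    for j in range(n):
        total = sum(m[i][j] for i in range(n))
        for i in range(n):
            m[i][j] /= total
    return m


def mcl(adjacency, inflation=2.0, iterations=50, eps=1e-9):
    """Cluster a small undirected graph with a dense, serial MCL loop.

    Returns a set of frozensets of node indices. Parameter defaults
    are illustrative assumptions, not values taken from the paper.
    """
    n = len(adjacency)
    # Add self-loops, then build a column-stochastic matrix.
    m = [[float(adjacency[i][j]) + (1.0 if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    normalize_columns(m)
    for _ in range(iterations):
        # Expansion: m <- m * m. This dense product is the O(n^3) step.
        m = [[sum(m[i][k] * m[k][j] for k in range(n))
              for j in range(n)] for i in range(n)]
        # Inflation: raise entries to a power and renormalize columns.
        m = [[v ** inflation for v in row] for row in m]
        normalize_columns(m)
    clusters = set()
    for i in range(n):
        if m[i][i] > eps:  # row i belongs to an attractor node
            clusters.add(frozenset(j for j in range(n) if m[i][j] > eps))
    return clusters
```

    The expansion step multiplies the matrix by itself on every iteration, which suggests why a serial implementation becomes costly in both time and memory for networks with millions of nodes.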

  1. HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks

    DOE PAGES

    Azad, Ariful; Pavlopoulos, Georgios A.; Ouzounis, Christos A.; ...

    2018-01-05

    Biological networks capture structural or functional properties of relevant entities such as molecules, proteins or genes. Characteristic examples are gene expression networks or protein–protein interaction networks, which hold information about functional affinities or structural similarities. Such networks have been expanding in size due to increasing scale and abundance of biological data. While various clustering algorithms have been proposed to find highly connected regions, Markov Clustering (MCL) has been one of the most successful approaches to cluster sequence similarity or expression networks. Despite its popularity, MCL’s scalability to cluster large datasets still remains a bottleneck due to high running times and memory demands. In this paper, we present High-performance MCL (HipMCL), a parallel implementation of the original MCL algorithm that can run on distributed-memory computers. We show that HipMCL can efficiently utilize 2000 compute nodes and cluster a network of ~70 million nodes with ~68 billion edges in ~2.4 h. By exploiting distributed-memory environments, HipMCL clusters large-scale networks several orders of magnitude faster than MCL and enables clustering of even bigger networks. Finally, HipMCL is based on MPI and OpenMP and is freely available under a modified BSD license.

  2. HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Azad, Ariful; Pavlopoulos, Georgios A.; Ouzounis, Christos A.

    Biological networks capture structural or functional properties of relevant entities such as molecules, proteins or genes. Characteristic examples are gene expression networks or protein–protein interaction networks, which hold information about functional affinities or structural similarities. Such networks have been expanding in size due to increasing scale and abundance of biological data. While various clustering algorithms have been proposed to find highly connected regions, Markov Clustering (MCL) has been one of the most successful approaches to cluster sequence similarity or expression networks. Despite its popularity, MCL’s scalability to cluster large datasets still remains a bottleneck due to high running times and memory demands. In this paper, we present High-performance MCL (HipMCL), a parallel implementation of the original MCL algorithm that can run on distributed-memory computers. We show that HipMCL can efficiently utilize 2000 compute nodes and cluster a network of ~70 million nodes with ~68 billion edges in ~2.4 h. By exploiting distributed-memory environments, HipMCL clusters large-scale networks several orders of magnitude faster than MCL and enables clustering of even bigger networks. Finally, HipMCL is based on MPI and OpenMP and is freely available under a modified BSD license.

  3. Straightforward and accurate technique for post-coupler stabilization in drift tube linac structures

    NASA Astrophysics Data System (ADS)

    Khalvati, Mohammad Reza; Ramberger, Suitbert

    2016-04-01

    The axial electric field of Alvarez drift tube linacs (DTLs) is known to be susceptible to variations due to static and dynamic effects like manufacturing tolerances and beam loading. Post-couplers are used to stabilize the accelerating fields of DTLs against tuning errors. Tilt sensitivity and its slope have been introduced as measures for the stability right from the invention of post-couplers but since then the actual stabilization has mostly been done by tedious iteration. In the present article, the local tilt-sensitivity slope TSn' is established as the principal measure for stabilization instead of tilt sensitivity or some visual slope, and its significance is developed on the basis of an equivalent-circuit diagram of the DTL. Experimental and 3D simulation results are used to analyze its behavior and to define a technique for stabilization that allows finding the best post-coupler settings with just four tilt-sensitivity measurements. CERN's Linac4 DTL Tank 2 and Tank 3 have been stabilized successfully using this technique. The final tilt-sensitivity error has been reduced from ±100 %/MHz down to ±3 %/MHz for Tank 2 and down to ±1 %/MHz for Tank 3. Finally, an accurate procedure for tuning the structure using slug tuners is discussed.

  4. Volumetric Imaging and Characterization of Focusing Waveguide Grating Couplers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Katzenmeyer, Aaron Michael; McGuinness, Hayden James Evans; Starbuck, Andrew Lea

    Volumetric imaging of focusing waveguide grating coupler emission with high spatial resolution in the visible (λ = 637.3 nm) is demonstrated using a scanning near-field optical microscope with long z-axis travel range. Stacks of 2-D images recorded at fixed distance from the device are compiled to yield 3-D visualization of the light emission pattern and enable extraction of parameters, such as spot size, angle of emission, and focal height. Measurements of such parameters are not prevalent in the literature yet are necessary for efficacious design and integration. As a result, it is observed that finite-difference time-domain simulations based on fabrication layout files do not perfectly predict in-hand device behavior, underscoring the merit of experimental validation, particularly for critical application.

  5. Volumetric Imaging and Characterization of Focusing Waveguide Grating Couplers

    DOE PAGES

    Katzenmeyer, Aaron Michael; McGuinness, Hayden James Evans; Starbuck, Andrew Lea; ...

    2017-08-29

    Volumetric imaging of focusing waveguide grating coupler emission with high spatial resolution in the visible (λ = 637.3 nm) is demonstrated using a scanning near-field optical microscope with long z-axis travel range. Stacks of 2-D images recorded at fixed distance from the device are compiled to yield 3-D visualization of the light emission pattern and enable extraction of parameters, such as spot size, angle of emission, and focal height. Measurements of such parameters are not prevalent in the literature yet are necessary for efficacious design and integration. As a result, it is observed that finite-difference time-domain simulations based on fabrication layout files do not perfectly predict in-hand device behavior, underscoring the merit of experimental validation, particularly for critical application.

  6. Highly Parallel Alternating Directions Algorithm for Time Dependent Problems

    NASA Astrophysics Data System (ADS)

    Ganzha, M.; Georgiev, K.; Lirkov, I.; Margenov, S.; Paprzycki, M.

    2011-11-01

    In our work, we consider the time dependent Stokes equation on a finite time interval and on a uniform rectangular mesh, written in terms of velocity and pressure. For this problem, a parallel algorithm based on a novel direction splitting approach is developed. Here, the pressure equation is derived from a perturbed form of the continuity equation, in which the incompressibility constraint is penalized in a negative norm induced by the direction splitting. The scheme used in the algorithm is composed of two parts: (i) velocity prediction, and (ii) pressure correction. This is a Crank-Nicolson-type two-stage time integration scheme for two and three dimensional parabolic problems in which the second-order derivative, with respect to each space variable, is treated implicitly while the other variable is made explicit at each time sub-step. To achieve good parallel performance, the solution of the Poisson problem for the pressure correction is replaced by solving a sequence of one-dimensional second order elliptic boundary value problems in each spatial direction. The parallel code is implemented using the standard MPI functions and tested on two modern parallel computer systems. The performed numerical tests demonstrate a good level of parallel efficiency and scalability of the studied direction-splitting-based algorithm.
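
    The one-dimensional solves mentioned in the abstract above can be illustrated with a minimal sketch. It assumes (this is not stated in the abstract) that each one-dimensional problem has the form u - alpha u'' = f with homogeneous Dirichlet boundary conditions, discretized by central differences and solved with the Thomas algorithm. The names solve_tridiagonal, solve_1d_elliptic, and alpha are hypothetical, and the sketch is serial; it does not reproduce the authors' MPI code.

```python
import math


def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm for a tridiagonal system.

    lower[i] multiplies x[i-1] (lower[0] is unused);
    upper[i] multiplies x[i+1] (upper[-1] is unused).
    """
    n = len(diag)
    c = [0.0] * n  # modified upper diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    # Forward elimination
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom if i < n - 1 else 0.0
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    # Back substitution
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def solve_1d_elliptic(f, n, alpha=1.0):
    """Solve u - alpha * u'' = f on (0, 1) with u(0) = u(1) = 0.

    Assumed model problem (hypothetical): n interior points,
    second-order central differences.
    """
    h = 1.0 / (n + 1)
    off = -alpha / h**2
    main = 1.0 + 2.0 * alpha / h**2
    xs = [(i + 1) * h for i in range(n)]
    rhs = [f(x) for x in xs]
    u = solve_tridiagonal([off] * n, [main] * n, [off] * n, rhs)
    return xs, u


if __name__ == "__main__":
    # Manufactured solution u(x) = sin(pi x) gives f = (1 + pi^2) sin(pi x).
    xs, u = solve_1d_elliptic(
        lambda x: (1.0 + math.pi**2) * math.sin(math.pi * x), 99
    )
    err = max(abs(ui - math.sin(math.pi * x)) for x, ui in zip(xs, u))
    print(f"max error: {err:.2e}")
```

    Each tridiagonal solve costs O(n) operations, and the solves along different grid lines are independent of one another, which is one plausible reason such a splitting is attractive for parallel execution.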

  7. Achieving High Performance in Parallel Applications via Kernel-Application Interaction

    DTIC Science & Technology

    1996-04-01

    time systems include airplane autopilot or nuclear power plant control. New complex, parallel soft real-time applications have been generating...to keep as many sheep on the table as possible, and the more powerful the sheep behavior-models and look-ahead, the better the results. General...fact that it provides considerable flexibility when considering the amount of processing power to allocate to a planner. In this experiment we again

  8. Power-ratio tunable dual-wavelength laser using linearly variable Fabry-Perot filter as output coupler.

    PubMed

    Wang, Xiaozhong; Wang, Zhongfa; Bu, Yikun; Chen, Lujian; Cai, Guoxiong; Huang, Wencai; Cai, Zhiping; Chen, Nan

    2016-02-01

    For a linearly variable Fabry-Perot filter, the peak transmission wavelengths change linearly with the transverse position shift of the substrate. Such a Fabry-Perot filter is designed and fabricated and used as an output coupler of a c-cut Nd:YVO4 laser experimentally in this paper to obtain a 1062 and 1083 nm dual-wavelength laser. The peak transmission wavelengths are gradually shifted from 1040.8 to 1070.8 nm. The peak transmission wavelength of the Fabry-Perot filter used as the output coupler for the dual-wavelength laser is 1068 nm and resides between 1062 and 1083 nm, which makes the transmissions of the desired dual wavelengths change in opposite slopes with the transverse shift of the filter. Consequently, powers of the two wavelengths change in opposite directions. A branch power, oppositely tunable 1062 and 1083 nm dual-wavelength laser is successfully demonstrated. Design principles of the linear variable Fabry-Perot filter used as an output coupler are discussed. Advantages of the method are summarized.

  9. Numerical investigation of polarization insensitive two-mode division (De)multiplexer based on an asymmetric directional coupler

    NASA Astrophysics Data System (ADS)

    Truong, Cao Dung; Trinh, M. Tuan; Dang, Hoai Bac; Nguyen, Van Tho

    2017-02-01

    We propose a polarization insensitive two-mode division (de)multiplexer based on a silicon-on-insulator platform operating with a broadband, low insertion and scattering loss, and small crosstalk. By using an asymmetric directional coupler, two-mode (de)multiplexing functions for both polarization TE and TM states can be realized by the numerical simulation. Simulated results using a three dimensional beam propagation method (3D-BPM) incorporated with an effective index method (EIM) show high performance of the device with an operation efficiency above 81.2% (i.e., insertion loss is less than 0.9 dB) in the range of ±5 nm around the central wavelength of 1550 nm. Fabrication tolerances also have proved suitability to current manufacture technologies for the planar waveguides. Besides a low scattering loss of the sidewall roughness and a little influence of dispersion, a small footprint can bring the device to applications of high bitrate and compact on-chip silicon photonic integrated circuits.
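
    The correspondence between the efficiency and insertion-loss figures quoted in the abstract above follows from the standard decibel definition, IL = -10 log10(efficiency). The short sketch below checks that arithmetic; the function names are hypothetical and are not taken from the paper.

```python
import math


def efficiency_to_insertion_loss_db(efficiency):
    """Convert a power efficiency in (0, 1] to an insertion loss in dB."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in (0, 1]")
    return -10.0 * math.log10(efficiency)


def insertion_loss_db_to_efficiency(loss_db):
    """Convert an insertion loss in dB back to a power efficiency."""
    return 10.0 ** (-loss_db / 10.0)


if __name__ == "__main__":
    # 81.2% efficiency corresponds to roughly 0.90 dB of insertion loss.
    print(f"{efficiency_to_insertion_loss_db(0.812):.3f} dB")
```

    With this definition, an efficiency of 81.2% evaluates to about 0.90 dB, in line with the roughly 0.9 dB bound quoted in the abstract.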

  10. Using the Eclipse Parallel Tools Platform to Assist Earth Science Model Development and Optimization on High Performance Computers

    NASA Astrophysics Data System (ADS)

    Alameda, J. C.

    2011-12-01

    Development and optimization of computational science models, particularly on high performance computers, and with the advent of ubiquitous multicore processor systems, practically on every system, has been accomplished with basic software tools, typically, command-line based compilers, debuggers, performance tools that have not changed substantially from the days of serial and early vector computers. However, model complexity, including the complexity added by modern message passing libraries such as MPI, and the need for hybrid code models (such as openMP and MPI) to be able to take full advantage of high performance computers with an increasing core count per shared memory node, has made development and optimization of such codes an increasingly arduous task. Additional architectural developments, such as many-core processors, only complicate the situation further. In this paper, we describe how our NSF-funded project, "SI2-SSI: A Productive and Accessible Development Workbench for HPC Applications Using the Eclipse Parallel Tools Platform" (WHPC) seeks to improve the Eclipse Parallel Tools Platform, an environment designed to support scientific code development targeted at a diverse set of high performance computing systems. Our WHPC project to improve Eclipse PTP takes an application-centric view to improve PTP. We are using a set of scientific applications, each with a variety of challenges, and using PTP to drive further improvements to both the scientific application, as well as to understand shortcomings in Eclipse PTP from an application developer perspective, to drive our list of improvements we seek to make. We are also partnering with performance tool providers, to drive higher quality performance tool integration. We have partnered with the Cactus group at Louisiana State University to improve Eclipse's ability to work with computational frameworks and extremely complex build systems, as well as to develop educational materials to incorporate into

  11. Parallel Computing: Some Activities in High Energy Physics

    NASA Astrophysics Data System (ADS)

    Willers, Ian

    This paper examines some activities in High Energy Physics that utilise parallel computing. The topic includes all computing from the proposed SIMD front end detectors, the farming applications, high-powered RISC processors and the large machines in the computer centers. We start by looking at the motivation behind using parallelism for general purpose computing. The developments around farming are then described from its simplest form to the more complex system in Fermilab. Finally, there is a list of some developments that are happening close to the experiments.

  12. Calibration of Passive Microwave Polarimeters that Use Hybrid Coupler-Based Correlators

    NASA Technical Reports Server (NTRS)

    Piepmeier, J. R.

    2003-01-01

    Four calibration algorithms are studied for microwave polarimeters that use hybrid coupler-based correlators: 1) conventional two-look of hot and cold sources, 2) three looks of hot and cold source combinations, 3) two-look with correlated source, and 4) four-look combining methods 2 and 3. The systematic errors are found to depend on the polarimeter component parameters and accuracy of calibration noise temperatures. A case study radiometer in four different remote sensing scenarios was considered in light of these results. Applications for Ocean surface salinity, Ocean surface winds, and soil moisture were found to be sensitive to different systematic errors. Finally, a standard uncertainty analysis was performed on the four-look calibration algorithm, which was found to be most sensitive to the correlated calibration source.

  13. Message passing interface and multithreading hybrid for parallel molecular docking of large databases on petascale high performance computing machines.

    PubMed

    Zhang, Xiaohua; Wong, Sergio E; Lightstone, Felice C

    2013-04-30

    A mixed parallel scheme that combines message passing interface (MPI) and multithreading was implemented in the AutoDock Vina molecular docking program. The resulting program, named VinaLC, was tested on the petascale high performance computing (HPC) machines at Lawrence Livermore National Laboratory. To exploit the typical cluster-type supercomputers, thousands of docking calculations were dispatched by the master process to run simultaneously on thousands of slave processes, where each docking calculation takes one slave process on one node, and within the node each docking calculation runs via multithreading on multiple CPU cores and shared memory. Input and output of the program and the data handling within the program were carefully designed to deal with large databases and ultimately achieve HPC on a large number of CPU cores. Parallel performance analysis of the VinaLC program shows that the code scales up to more than 15K CPUs with a very low overhead cost of 3.94%. One million flexible compound docking calculations took only 1.4 h to finish on about 15K CPUs. The docking accuracy of VinaLC has been validated against the DUD data set by the re-docking of X-ray ligands and an enrichment study; 64.4% of the top scoring poses have RMSD values under 2.0 Å. The program has been demonstrated to have good enrichment performance on 70% of the targets in the DUD data set. An analysis of the enrichment factors calculated at various percentages of the screening database indicates VinaLC has very good early recovery of actives. Copyright © 2013 Wiley Periodicals, Inc.

  14. Parallel microscope-based fluorescence, absorbance and time-of-flight mass spectrometry detection for high performance liquid chromatography and determination of glucosamine in urine.

    PubMed

    Xiong, Bo; Wang, Ling-Ling; Li, Qiong; Nie, Yu-Ting; Cheng, Shuang-Shuang; Zhang, Hui; Sun, Ren-Qiang; Wang, Yu-Jiao; Zhou, Hong-Bin

    2015-11-01

    A parallel microscope-based laser-induced fluorescence (LIF), ultraviolet-visible absorbance (UV) and time-of-flight mass spectrometry (TOF-MS) detection for high performance liquid chromatography (HPLC) was achieved and used to determine glucosamine in urine. First, a reliable and convenient LIF detection was developed based on an inverted microscope and corresponding modulations. Parallel HPLC-LIF/UV/TOF-MS detection was developed by the combination of the preceding microscope-based LIF detection and HPLC coupled with UV and TOF-MS. The proposed setup, due to its parallel scheme, was free of the influence of photobleaching in LIF detection. Rhodamine B, glutamic acid and glucosamine have been determined to evaluate its performance. Moreover, the proposed strategy was used to determine the glucosamine in urine, and subsequent results suggested that glucosamine, which is widely used in the prevention of bone arthritis, was metabolized to urine within 4 h. Furthermore, its concentration in urine decreased to 5.4 mM at 12 h. Efficient glucosamine detection was achieved based on a sensitive quantification (LIF), a universal detection (UV) and structural characterizations (TOF-MS). This application indicated that the proposed strategy was sensitive, universal and versatile, and it was capable of improved analysis, especially for analytes with low concentrations in complex samples, compared with conventional HPLC-UV/TOF-MS. Copyright © 2015 Elsevier B.V. All rights reserved.

  15. Open | SpeedShop: An Open Source Infrastructure for Parallel Performance Analysis

    DOE PAGES

    Schulz, Martin; Galarowicz, Jim; Maghrak, Don; ...

    2008-01-01

    Over the last decades a large number of performance tools has been developed to analyze and optimize high performance applications. Their acceptance by end users, however, has been slow: each tool alone is often limited in scope and comes with widely varying interfaces and workflow constraints, requiring different changes in the often complex build and execution infrastructure of the target application. We started the Open | SpeedShop project about 3 years ago to overcome these limitations and provide efficient, easy to apply, and integrated performance analysis for parallel systems. Open | SpeedShop has two different faces: it provides an interoperable tool set covering the most common analysis steps as well as a comprehensive plugin infrastructure for building new tools. In both cases, the tools can be deployed to large scale parallel applications using DPCL/Dyninst for distributed binary instrumentation. Further, all tools developed within or on top of Open | SpeedShop are accessible through multiple fully equivalent interfaces including an easy-to-use GUI as well as an interactive command line interface reducing the usage threshold for those tools.

  16. The NAS parallel benchmarks

    NASA Technical Reports Server (NTRS)

    Bailey, David (Editor); Barton, John (Editor); Lasinski, Thomas (Editor); Simon, Horst (Editor)

    1993-01-01

    A new set of benchmarks was developed for the performance evaluation of highly parallel supercomputers. These benchmarks consist of a set of kernels, the 'Parallel Kernels,' and a simulated application benchmark. Together they mimic the computation and data movement characteristics of large scale computational fluid dynamics (CFD) applications. The principal distinguishing feature of these benchmarks is their 'pencil and paper' specification - all details of these benchmarks are specified only algorithmically. In this way many of the difficulties associated with conventional benchmarking approaches on highly parallel systems are avoided.

  17. Interferometer using a 3 × 3 coupler and Faraday mirrors

    NASA Astrophysics Data System (ADS)

    Breguet, J.; Gisin, N.

    1995-06-01

    A new interferometric setup using a 3 × 3 coupler and two Faraday mirrors is presented. It has the advantages of being built only with passive components, of freedom from the polarization fading problem, and of operation with a LED. It is well suited for sensing time-dependent signals and does not depend on reciprocal or nonreciprocal constant perturbations.

  18. Ultra-low loss fully-etched grating couplers for perfectly vertical coupling compatible with DUV lithography tools

    NASA Astrophysics Data System (ADS)

    Dabos, G.; Pleros, N.; Tsiokos, D.

    2016-03-01

    Hybrid integration of VCSELs onto silicon-on-insulator (SOI) substrates has emerged as an attractive approach for bridging the gap between cost-effective and energy-efficient directly modulated laser sources and silicon-based PICs by leveraging flip-chip (FC) bonding techniques and silicon grating couplers (GCs). In this context, silicon GCs should comply with the process requirements imposed by the complementary-metal-oxide-semiconductor manufacturing tools while addressing in parallel the challenges originating from the perfectly vertical incidence. Firstly, fully etched GCs compatible with deep-ultraviolet lithography tools offering high coupling efficiencies are imperatively needed to maintain low fabrication cost. Secondly, GC's tolerance to VCSEL bonding misalignment errors is a prerequisite for practical deployment. Finally, a major challenge originating from the perfectly vertical coupling scheme is the minimization of the direct back-reflection to the VCSEL's outgoing facet which may destabilize its operation. Motivated by the above challenges, we used numerical simulation tools to design an ultra-low loss, bidirectional VCSEL-to-SOI optical coupling scheme for either TE or TM polarization, based on low-cost fully etched GCs with a Si-layer of 340 nm without employing bottom reflectors or optimizing the buried-oxide layer. Comprehensive 2D Finite-Difference-Time-Domain simulations have been performed. The reported GC layout remains fully compatible with the back-end-of-line (BEOL) stack associated with the 3D integration technology exploiting all the inter-metal-dielectric (IMD) layers of the CMOS fab. Simulation results predicted, for the first time in fully etched structures, a coupling loss as low as 0.87 dB at 1548 nm and 1.47 dB at 1560 nm with a minimum direct back-reflection of -27.4 dB and -14.2 dB for TE and TM polarization, respectively.

  19. Fabrication tolerant chalcogenide mid-infrared multimode interference coupler design with applications for Bracewell nulling interferometry.

    PubMed

    Goldsmith, Harry-Dean Kenchington; Cvetojevic, Nick; Ireland, Michael; Madden, Stephen

    2017-02-20

    Understanding exoplanet formation and finding potentially habitable exoplanets is vital to an enhanced understanding of the universe. The use of nulling interferometry to strongly attenuate the central star's light provides the opportunity to see objects closer to the star than ever before. Given that exoplanets are usually warm, the 4 µm Mid-Infrared region is advantageous for such observations. The key performance parameters for a nulling interferometer are the extinction ratio it can attain and how well that is maintained across the operational bandwidth. Both parameters depend on the design and fabrication accuracy of the subcomponents and their wavelength dependence. Via detailed simulation it is shown in this paper that a planar chalcogenide photonic chip, consisting of three highly fabrication tolerant multimode interference couplers, can exceed an extinction ratio of 60 dB in double nulling operation and up to 40 dB for a single nulling operation across a wavelength window of 3.9 to 4.2 µm. This provides a beam combiner with sufficient performance, in theory, to image exoplanets.

  20. Refractive index sensor based on a polymer fiber directional coupler for low index sensing.

    PubMed

    Lee, Kwang Jo; Liu, Xiaoqi; Vuillemin, Nelly; Lwin, Richard; Leon-Saval, Sergio G; Argyros, Alexander; Kuhlmey, Boris T

    2014-07-14

    We propose, numerically analyze and experimentally demonstrate a novel refractive index sensor specialized for low index sensing. The device is based on a directional coupler architecture implemented in a single microstructured polymer optical fiber incorporating two waveguides within it: a single-mode core and a satellite waveguide consisting of a hollow high-index ring. This hollow channel is filled with fluid and the refractive index of the fluid is detected through changes to the wavelength at which resonant coupling occurs between the two waveguides. The sensor design was optimized for both higher sensitivity and lower detection limit, with simulations and experiments demonstrating a sensitivity exceeding 1.4 × 10^3 nm per refractive index unit. Simulations indicate a detection limit of ~2 × 10^-6 refractive index units is achievable. We also numerically investigate the performance for refractive index changes localized at the surface of the holes, a case of particular importance for biosensing.

  1. Parallel performance of TORT on the CRAY J90: Model and measurement

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Barnett, A.; Azmy, Y.Y.

    1997-10-01

    A limitation on the parallel performance of TORT on the CRAY J90 is the amount of extra work introduced by the multitasking algorithm itself. The extra work beyond that of the serial version of the code, called overhead, arises from the synchronization of the parallel tasks and the accumulation of results by the master task. The goal of recent updates to TORT was to reduce the time consumed by these activities. To help understand which components of the multitasking algorithm contribute significantly to the overhead, a parallel performance model was constructed and compared to measurements of actual timings of the code.

  2. Evaluating the performance of the particle finite element method in parallel architectures

    NASA Astrophysics Data System (ADS)

    Gimenez, Juan M.; Nigro, Norberto M.; Idelsohn, Sergio R.

    2014-05-01

    This paper presents a high performance implementation for the particle-mesh based method called particle finite element method two (PFEM-2). It consists of a material derivative based formulation of the equations with a hybrid spatial discretization which uses an Eulerian mesh and Lagrangian particles. The main aim of PFEM-2 is to solve transport equations as fast as possible while keeping some level of accuracy. The method was found to be competitive with classical Eulerian alternatives for these targets, even in their range of optimal application. To evaluate the goodness of the method with large simulations, it is imperative to use parallel environments. Parallel strategies for the Finite Element Method have been widely studied and many libraries can be used to solve the Eulerian stages of PFEM-2. However, Lagrangian stages, such as streamline integration, must be developed considering the parallel strategy selected. The main drawback of PFEM-2 is the large amount of memory needed, which limits its application to large problems with only one computer. Therefore, a distributed-memory implementation is urgently needed. Unlike a shared-memory approach, using domain decomposition the memory is automatically isolated, thus avoiding race conditions; however, new issues appear due to data distribution over the processes. Thus, a domain decomposition strategy for both particle and mesh is adopted, which minimizes the communication between processes. Finally, performance analyses on multicore and multinode architectures are presented. The Courant-Friedrichs-Lewy number used influences the efficiency of the parallelization and, in some cases, a weighted partitioning can be used to improve the speed-up. However, the total CPU time for the cases presented is lower than that obtained when using classical Eulerian strategies.

  3. Operation of high power converters in parallel

    NASA Technical Reports Server (NTRS)

    Decker, D. K.; Inouye, L. Y.

    1993-01-01

    High power converters that are used in space power subsystems are limited in power handling capability due to component and thermal limitations. For applications, such as Space Station Freedom, where multi-kilowatts of power must be delivered to user loads, parallel operation of converters becomes an attractive option when considering overall power subsystem topologies. TRW developed three different unequal power sharing approaches for parallel operation of converters. These approaches, known as droop, master-slave, and proportional adjustment, are discussed and test results are presented.

  4. Analysis of dual coupler nested coupled cavities.

    PubMed

    Adib, George A; Sabry, Yasser M; Khalil, Diaa

    2017-12-01

    Coupled ring resonators are now forming the basic building blocks in several optical systems serving different applications. In many of these applications, a small full width at half maximum is required, along with a large free spectral range. In this work, a configuration of passive coupled cavities constituting dual coupler nested cavities is proposed. A theoretical study of the configuration is presented allowing us to obtain analytical expressions of its different spectral characteristics. The transfer function of the configuration is also used to generate design curves while comparing these results with analytical expressions. Finally, the configuration is compared with other coupled cavity configurations.

  5. Waveguide Multimode Directional Coupler for Harvesting Harmonic Power from the Output of Traveling-Wave Tube Amplifiers

    NASA Technical Reports Server (NTRS)

    Simons, Rainee N.; Wintucky, Edwin G.

    2017-01-01

    This paper presents the design, fabrication, and test results for a novel waveguide multimode directional coupler (MDC). The coupler fabricated from dissimilar frequency band waveguides, is capable of isolating power at the 2nd harmonic frequency from the fundamental power at the output port of traveling-wave tube amplifiers. Test results from proof-of-concept demonstrations are presented for Ku/Ka-band and Ka/E-band MDCs, which demonstrate sufficient power in the 2nd harmonic for a space borne beacon source for mm-wave atmospheric propagation studies.

  6. Fast Face-Recognition Optical Parallel Correlator Using High Accuracy Correlation Filter

    NASA Astrophysics Data System (ADS)

    Watanabe, Eriko; Kodate, Kashiko

    2005-11-01

    We designed and fabricated a fully automatic fast face recognition optical parallel correlator [E. Watanabe and K. Kodate: Appl. Opt. 44 (2005) 5666] based on the VanderLugt principle. The implementation of an as-yet unattained ultra high-speed system was aided by reconfiguring the system to make it suitable for easier parallel processing, as well as by composing a higher accuracy correlation filter and high-speed ferroelectric liquid crystal-spatial light modulator (FLC-SLM). In running trial experiments using this system (dubbed FARCO), we succeeded in acquiring remarkably low error rates of 1.3% for false match rate (FMR) and 2.6% for false non-match rate (FNMR). Given the results of our experiments, the aim of this paper is to examine methods of designing correlation filters and arranging database image arrays for even faster parallel correlation, underlining the issues of calculation technique, quantization bit rate, pixel size and shift from optical axis. The correlation filter has proved its excellent performance and higher precision than classical correlation and joint transform correlator (JTC). Moreover, arrangement of multi-object reference images leads to 10-channel correlation signals, as sharply marked as those of a single channel. This experimental result demonstrates great potential for achieving a processing speed of 10000 faces/s.

  7. A task-based parallelism and vectorized approach to 3D Method of Characteristics (MOC) reactor simulation for high performance computing architectures

    NASA Astrophysics Data System (ADS)

    Tramm, John R.; Gunow, Geoffrey; He, Tim; Smith, Kord S.; Forget, Benoit; Siegel, Andrew R.

    2016-05-01

    In this study we present and analyze a formulation of the 3D Method of Characteristics (MOC) technique applied to the simulation of full core nuclear reactors. Key features of the algorithm include a task-based parallelism model that allows independent MOC tracks to be assigned to threads dynamically, ensuring load balancing, and a wide vectorizable inner loop that takes advantage of modern SIMD computer architectures. The algorithm is implemented in a set of highly optimized proxy applications in order to investigate its performance characteristics on CPU, GPU, and Intel Xeon Phi architectures. Speed, power, and hardware cost efficiencies are compared. Additionally, performance bottlenecks are identified for each architecture in order to determine the prospects for continued scalability of the algorithm on next generation HPC architectures.

  8. Performance of parallel computation using CUDA for solving the one-dimensional elasticity equations

    NASA Astrophysics Data System (ADS)

    Darmawan, J. B. B.; Mungkasi, S.

    2017-01-01

    In this paper, we investigate the performance of parallel computation in solving the one-dimensional elasticity equations. Elasticity equations are commonly used in engineering science, and solving them quickly and efficiently is desirable. Therefore, we propose the use of parallel computation. Our parallel computation uses NVIDIA's CUDA. Our research results show that parallel computation using CUDA has a great advantage and is powerful when the computation is of large scale.

  9. Arbitrary-ratio power splitter based on nonlinear multimode interference coupler

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tajaldini, Mehdi; Young Researchers and Elite Club, Baft Branch, Islamic Azad University, Baft; Jafri, Mohd Zubir Mat

    2015-04-24

    We propose an ultra-compact multimode interference (MMI) power splitter based on nonlinear effects from simulations using nonlinear modal propagation analysis (NMPA) in cooperation with the finite difference method (FDM) to access free choice of splitting ratio. Conventional multimode interference power splitters can only obtain a few discrete ratios. The power splitting ratio may be adjusted continuously while the input set power is varied by a tunable laser. In fact, using an ultra-compact MMI with a simple structure that is launched by a tunable nonlinear input solves the problem of arbitrary-ratio splitting in integrated photonics circuits. Silicon on insulator (SOI) is used as the offered material due to the high contrast refractive index and centrosymmetric properties. The high-resolution images at the end of the multimode waveguide in the simulated power splitter have a high power balance, whereas access to a free choice of splitting ratio is not possible under the linear regime in the proposed length range except changes in the dimension for any ratio. The compact dimensions and ideal performance of the device are established according to optimized parameters. The proposed regime can be extended to the design of M×N arbitrary power splitters ratio for programmable logic devices in all optical digital signal processing. The results of this study indicate that nonlinear modal propagation analysis solves the miniaturization problem for all-optical devices based on MMI couplers to achieve multiple functions in a compact planar integrated circuit and also overcomes the limitations of previously proposed methods for nonlinear MMI.

  10. Silicon nitride directional coupler interferometer for surface sensing

    NASA Astrophysics Data System (ADS)

    Okubo, Kyohei; Uchiyamada, Ken; Asakawa, Kiyoshi; Suzuki, Hiroaki

    2017-01-01

    A silicon nitride directional coupler (DC) used to create a biosensing device is presented. The DC detects changes in the refractive index of the cladding (nclad) as changes in the relative output intensity. The DC length (L), nclad-dependent sensitivities of the DC, and preferred dimensions of the single-mode DC waveguides are obtained through numerical simulations. The performance of the DC is evaluated through end-fire coupling measurements. The intensities measured after varying the nclad using air, water, and glycerol solutions agree well with the fitting for a wide range of L values between 60 and 600 μm, i.e., corresponding to 6 to 60 times the coupling length. The bulk refractive index sensitivity was investigated using glycerol solutions of different concentrations and was found to be 18.9 optical intensity units per refractive index unit (OIU/RIU). Biotin/streptavidin bindings were detected with a sensitivity of 60 OIU/RIU and a detection limit of 0.13 μM, suggesting the feasibility of the DC for immunosensing.
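
    The dependence of the relative output intensity on the coupler length can be sketched with the textbook coupled-mode result for an ideal, lossless, symmetric directional coupler, in which the cross-port power fraction is sin^2(pi L / (2 Lc)). This model is an assumption made for illustration and is not stated in the abstract; the coupling length of 10 μm is inferred from the abstract's statement that 60 to 600 μm corresponds to 6 to 60 coupling lengths. The function names are hypothetical.

```python
import math


def cross_port_fraction(length_um, coupling_length_um):
    """Fraction of input power in the cross port of an ideal coupler.

    Assumes a lossless, symmetric directional coupler (textbook
    coupled-mode theory); full transfer occurs at one coupling length.
    """
    phase = math.pi * length_um / (2.0 * coupling_length_um)
    return math.sin(phase) ** 2


def bar_port_fraction(length_um, coupling_length_um):
    """Fraction of input power remaining in the input (bar) waveguide."""
    return 1.0 - cross_port_fraction(length_um, coupling_length_um)


if __name__ == "__main__":
    coupling_length_um = 10.0  # inferred: 60 um corresponds to 6 coupling lengths
    for length_um in (60.0, 65.0, 600.0):
        frac = cross_port_fraction(length_um, coupling_length_um)
        print(f"L = {length_um:5.1f} um -> cross-port fraction {frac:.3f}")
```

    In this idealized picture, a change in the cladding index shifts the coupling length, and the resulting phase change grows in proportion to L, which is consistent with the use of couplers many coupling lengths long for sensing.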

  11. Highly fault-tolerant parallel computation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Spielman, D.A.

    We re-introduce the coded model of fault-tolerant computation in which the input and output of a computational device are treated as words in an error-correcting code. A computational device correctly computes a function in the coded model if its input and output, once decoded, are a valid input and output of the function. In the coded model, it is reasonable to hope to simulate all computational devices by devices whose size is greater by a constant factor but which are exponentially reliable even if each of their components can fail with some constant probability. We consider fine-grained parallel computations in which each processor has a constant probability of producing the wrong output at each time step. We show that any parallel computation that runs for time t on w processors can be performed reliably on a faulty machine in the coded model using w log^{O(1)} w processors and time t log^{O(1)} w. The failure probability of the computation will be at most t · exp(-w^{1/4}). The codes used to communicate with our fault-tolerant machines are generalized Reed-Solomon codes and can thus be encoded and decoded in O(n log^{O(1)} n) sequential time and are independent of the machine they are used to communicate with. We also show how coded computation can be used to self-correct many linear functions in parallel with arbitrarily small overhead.

  12. The NAS parallel benchmarks

    NASA Technical Reports Server (NTRS)

    Bailey, D. H.; Barszcz, E.; Barton, J. T.; Carter, R. L.; Lasinski, T. A.; Browning, D. S.; Dagum, L.; Fatoohi, R. A.; Frederickson, P. O.; Schreiber, R. S.

    1991-01-01

    A new set of benchmarks has been developed for the performance evaluation of highly parallel supercomputers in the framework of the NASA Ames Numerical Aerodynamic Simulation (NAS) Program. These consist of five 'parallel kernel' benchmarks and three 'simulated application' benchmarks. Together they mimic the computation and data movement characteristics of large-scale computational fluid dynamics applications. The principal distinguishing feature of these benchmarks is their 'pencil and paper' specification: all details of these benchmarks are specified only algorithmically. In this way many of the difficulties associated with conventional benchmarking approaches on highly parallel systems are avoided.

  13. Parallel Microcracks-based Ultrasensitive and Highly Stretchable Strain Sensors.

    PubMed

    Amjadi, Morteza; Turan, Mehmet; Clementson, Cameron P; Sitti, Metin

    2016-03-02

    There is an increasing demand for flexible, skin-attachable, and wearable strain sensors due to their various potential applications. However, achieving strain sensors with both high sensitivity and high stretchability is still a grand challenge. Here, we propose highly sensitive and stretchable strain sensors based on the reversible microcrack formation in composite thin films. Controllable parallel microcracks are generated in graphite thin films coated on elastomer films. Sensors made of graphite thin films with short microcracks possess high gauge factors (maximum value of 522.6) and stretchability (ε ≥ 50%), whereas sensors with long microcracks show ultrahigh sensitivity (maximum value of 11,344) with limited stretchability (ε ≤ 50%). We demonstrate the high performance strain sensing of our sensors in both small and large strain sensing applications such as human physiological activity recognition, human body large motion capturing, vibration detection, pressure sensing, and soft robotics.

  14. RISC Processors and High Performance Computing

    NASA Technical Reports Server (NTRS)

    Saini, Subhash; Bailey, David H.; Lasinski, T. A. (Technical Monitor)

    1995-01-01

    In this tutorial, we will discuss the top five current RISC microprocessors: the IBM Power2, which is used in the IBM RS6000/590 workstation and in the IBM SP2 parallel supercomputer; the DEC Alpha, which is in the DEC Alpha workstation and in the Cray T3D; the MIPS R8000, which is used in the SGI Power Challenge; the HP PA-RISC 7100, which is used in the HP 700 series workstations and in the Convex Exemplar; and the Cray proprietary processor, which is used in the new Cray J916. The architecture of these microprocessors will first be presented. The effective performance of these processors will then be compared, both by citing standard benchmarks and in the context of implementing real applications. In the process, different programming models such as data parallel (CM Fortran and HPF) and message passing (PVM and MPI) will be introduced and compared. The latest NAS Parallel Benchmark (NPB) absolute performance and performance per dollar figures will be presented. The next generation of the NPB will also be described. The tutorial will conclude with a discussion of general trends in the field of high performance computing, including likely future developments in hardware and software technology, and the relative roles of vector supercomputers, tightly coupled parallel computers, and clusters of workstations. This tutorial will provide a unique cross-machine comparison not available elsewhere.

  15. Development Of A Parallel Performance Model For The THOR Neutral Particle Transport Code

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yessayan, Raffi; Azmy, Yousry; Schunert, Sebastian

    The THOR neutral particle transport code enables simulation of complex geometries for various problems from reactor simulations to nuclear non-proliferation. It is undergoing a thorough V&V requiring computational efficiency. This has motivated various improvements including angular parallelization, outer iteration acceleration, and development of peripheral tools. For guiding future improvements to the code’s efficiency, better characterization of its parallel performance is useful. A parallel performance model (PPM) can be used to evaluate the benefits of modifications and to identify performance bottlenecks. Using INL’s Falcon HPC, the PPM development incorporates an evaluation of network communication behavior over heterogeneous links and a functional characterization of the per-cell/angle/group runtime of each major code component. After evaluating several possible sources of variability, this resulted in a communication model and a parallel portion model. The former’s accuracy is bounded by the variability of communication on Falcon while the latter has an error on the order of 1%.

  16. Petascale turbulence simulation using a highly parallel fast multipole method on GPUs

    NASA Astrophysics Data System (ADS)

    Yokota, Rio; Barba, L. A.; Narumi, Tetsu; Yasuoka, Kenji

    2013-03-01

    This paper reports large-scale direct numerical simulations of homogeneous-isotropic fluid turbulence, achieving sustained performance of 1.08 petaflop/s on GPU hardware using single precision. The simulations use a vortex particle method to solve the Navier-Stokes equations, with a highly parallel fast multipole method (FMM) as numerical engine, and match the current record in mesh size for this application, a cube of 4096^3 computational points solved with a spectral method. The standard numerical approach used in this field is the pseudo-spectral method, relying on the FFT algorithm as the numerical engine. The particle-based simulations presented in this paper quantitatively match the kinetic energy spectrum obtained with a pseudo-spectral method, using a trusted code. In terms of parallel performance, weak scaling results show the FMM-based vortex method achieving 74% parallel efficiency on 4096 processes (one GPU per MPI process, 3 GPUs per node of the TSUBAME-2.0 system). The FFT-based spectral method is able to achieve just 14% parallel efficiency on the same number of MPI processes (using only CPU cores), due to the all-to-all communication pattern of the FFT algorithm. The calculation time for one time step was 108 s for the vortex method and 154 s for the spectral method, under these conditions. Computing with 69 billion particles, this work exceeds by an order of magnitude the largest vortex-method calculations to date.
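
    The figures quoted above can be cross-checked with a few lines of arithmetic. The sketch below uses the usual definition of weak-scaling parallel efficiency (reference time divided by time at scale, with the per-process problem size held fixed). The function name and the timings passed to it are hypothetical; only the 4096-cubed grid size and the 108 s and 154 s step times come from the abstract.

```python
def weak_scaling_efficiency(t_reference: float, t_at_scale: float) -> float:
    """Weak-scaling efficiency: per-process work is fixed, so ideal time is flat."""
    return t_reference / t_at_scale


# A cube of 4096 points per side, consistent with "69 billion particles".
n_points = 4096 ** 3
print(n_points)

# Ratio of the reported per-step times (spectral / vortex method).
step_time_ratio = 154.0 / 108.0
print(round(step_time_ratio, 2))

# Hypothetical timings: a run that slows from 80 s to 108 s when scaled out
# would have roughly the 74% efficiency quoted for the FMM-based method.
print(round(weak_scaling_efficiency(80.0, 108.0), 2))
```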

  17. Multi-petascale highly efficient parallel supercomputer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Asaad, Sameh; Bellofatto, Ralph E.; Blocksome, Michael A.

    A Multi-Petascale Highly Efficient Parallel Supercomputer of 100 petaflop-scale includes node architectures based upon System-On-a-Chip technology, where each processing node comprises a single Application Specific Integrated Circuit (ASIC). The ASIC nodes are interconnected by a five-dimensional torus network that maximizes the throughput of packet communications between nodes and minimizes latency. The network implements a collective network and a global asynchronous network that provides global barrier and notification functions. The node design includes a list-based prefetcher. The memory system implements transaction memory, thread level speculation, and a multiversioning cache that improves soft error rate at the same time and supports DMA functionality allowing for parallel processing message-passing.

  18. Efficient Parallel Kernel Solvers for Computational Fluid Dynamics Applications

    NASA Technical Reports Server (NTRS)

    Sun, Xian-He

    1997-01-01

    Distributed-memory parallel computers dominate today's parallel computing arena. These machines, such as Intel Paragon, IBM SP2, and Cray Origin2000, have successfully delivered high performance computing power for solving some of the so-called "grand-challenge" problems. Despite initial success, parallel machines have not been widely accepted in production engineering environments due to the complexity of parallel programming. On a parallel computing system, a task has to be partitioned and distributed appropriately among processors to reduce communication cost and to attain load balance. More importantly, even with careful partitioning and mapping, the performance of an algorithm may still be unsatisfactory, since conventional sequential algorithms may be serial in nature and may not be implemented efficiently on parallel machines. In many cases, new algorithms have to be introduced to increase parallel performance. In order to achieve optimal performance, in addition to partitioning and mapping, a careful performance study should be conducted for a given application to find a good algorithm-machine combination. This process, however, is usually painful and elusive. The goal of this project is to design and develop efficient parallel algorithms for highly accurate Computational Fluid Dynamics (CFD) simulations and other engineering applications. The work plan is 1) developing highly accurate parallel numerical algorithms, 2) conducting preliminary testing to verify the effectiveness and potential of these algorithms, 3) incorporating newly developed algorithms into actual simulation packages. The work plan has been well achieved. Two highly accurate, efficient Poisson solvers have been developed and tested based on two different approaches: (1) adopting a mathematical geometry which has a better capacity to describe the fluid, (2) using a compact scheme to gain high order accuracy in numerical discretization. The previously developed Parallel Diagonal Dominant (PDD) algorithm…

  19. Structural Directed Growth of Ultrathin Parallel Birnessite on β-MnO2 for High-Performance Asymmetric Supercapacitors.

    PubMed

    Zhu, Shijin; Li, Li; Liu, Jiabin; Wang, Hongtao; Wang, Tian; Zhang, Yuxin; Zhang, Lili; Ruoff, Rodney S; Dong, Fan

    2018-02-27

    Two-dimensional birnessite has attracted attention for electrochemical energy storage because of the presence of redox active Mn4+/Mn3+ ions and spacious interlayer channels available for ion diffusion. However, current strategies are largely limited to enhancing the electrical conductivity of birnessite. One key limitation affecting the electrochemical properties of birnessite is the poor utilization of the MnO6 unit. Here, we assemble a β-MnO2/birnessite core-shell structure that exploits the exposed crystal face of β-MnO2 as the core and ultrathin birnessite sheets that have the structural advantage to enhance the utilization efficiency of the Mn from the bulk. Our birnessite, which has sheets parallel to each other, is found to have an unusual crystal structure with interlayer spacing, Mn(III)/Mn(IV) ratio and the content of the balancing cations differing from that of the common birnessite. The substrate directed growth mechanism is carefully investigated. The as-prepared core-shell nanostructures enhance the exposed surface area of birnessite and achieve high electrochemical performances (for example, 657 F g^-1 in 1 M Na2SO4 electrolyte based on the weight of parallel birnessite) and excellent rate capability over a potential window of up to 1.2 V. This strategy opens avenues for fundamental studies of birnessite and its properties and suggests the possibility of its use in energy storage and other applications. The potential window of an asymmetric supercapacitor that was assembled with this material can be enlarged to 2.2 V (in aqueous electrolyte) with a good cycling ability.

  20. Knowledge-Based Parallel Performance Technology for Scientific Application Competitiveness Final Report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Malony, Allen D; Shende, Sameer

    The primary goal of the University of Oregon's DOE "competitiveness" project was to create performance technology that embodies and supports knowledge of performance data, analysis, and diagnosis in parallel performance problem solving. The target of our development activities was the TAU Performance System and the technology accomplishments reported in this and prior reports have all been incorporated in the TAU open software distribution. In addition, the project has been committed to maintaining strong interactions with the DOE SciDAC Performance Engineering Research Institute (PERI) and Center for Technology for Advanced Scientific Component Software (TASCS). This collaboration has proved valuable for translation of our knowledge-based performance techniques to parallel application development and performance engineering practice. Our outreach has also extended to the DOE Advanced CompuTational Software (ACTS) collection and project. Throughout the project we have participated in the PERI and TASCS meetings, as well as the ACTS annual workshops.

  1. Genetic algorithm based task reordering to improve the performance of batch scheduled massively parallel scientific applications

    DOE PAGES

    Sankaran, Ramanan; Angel, Jordan; Brown, W. Michael

    2015-04-08

    The growth in size of networked high performance computers along with novel accelerator-based node architectures has further emphasized the importance of communication efficiency in high performance computing. The world's largest high performance computers are usually operated as shared user facilities due to the costs of acquisition and operation. Applications are scheduled for execution in a shared environment and are placed on nodes that are not necessarily contiguous on the interconnect. Furthermore, the placement of tasks on the nodes allocated by the scheduler is sub-optimal, leading to performance loss and variability. Here, we investigate the impact of task placement on the performance of two massively parallel application codes on the Titan supercomputer, a turbulent combustion flow solver (S3D) and a molecular dynamics code (LAMMPS). Benchmark studies show a significant deviation from ideal weak scaling and variability in performance. The inter-task communication distance was determined to be one of the significant contributors to the performance degradation and variability. A genetic algorithm-based parallel optimization technique was used to optimize the task ordering. This technique provides an improved placement of the tasks on the nodes, taking into account the application's communication topology and the system interconnect topology. As a result, application benchmarks after task reordering through genetic algorithm show a significant improvement in performance and reduction in variability, therefore enabling the applications to achieve better time to solution and scalability on Titan during production.
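
    The idea of reordering tasks to shorten inter-task communication distance can be sketched with a toy genetic algorithm. Everything below is an illustrative assumption rather than the authors' method: nodes sit on a 1-D ring instead of Titan's interconnect, the cost is the summed ring distance between communicating tasks, and the operators (order crossover, swap mutation, elitist selection) are generic textbook choices.

```python
import random


def comm_cost(placement, edges):
    """Sum of ring distances between communicating tasks.

    placement[task] is the node index (0..n-1) hosting that task.
    """
    n = len(placement)
    total = 0
    for a, b in edges:
        d = abs(placement[a] - placement[b])
        total += min(d, n - d)
    return total


def order_crossover(p1, p2, rng):
    """Copy a slice from p1, then fill the gaps in p2's order (keeps a permutation)."""
    n = len(p1)
    i, j = sorted(rng.sample(range(n), 2))
    child = [None] * n
    child[i:j] = p1[i:j]
    used = set(p1[i:j])
    fill = iter(g for g in p2 if g not in used)
    return [g if g is not None else next(fill) for g in child]


def swap_mutation(placement, rng):
    """Exchange the nodes of two randomly chosen tasks."""
    child = list(placement)
    i, j = rng.sample(range(len(child)), 2)
    child[i], child[j] = child[j], child[i]
    return child


def optimize(edges, initial, generations=200, pop_size=30, seed=0):
    """Elitist GA: the best half survives, so the best cost never gets worse."""
    rng = random.Random(seed)
    n = len(initial)
    pop = [list(initial)] + [rng.sample(range(n), n) for _ in range(pop_size - 1)]
    for _ in range(generations):
        pop.sort(key=lambda p: comm_cost(p, edges))
        elite = pop[: pop_size // 2]
        children = [
            swap_mutation(order_crossover(rng.choice(elite), rng.choice(elite), rng), rng)
            for _ in range(pop_size - len(elite))
        ]
        pop = elite + children
    return min(pop, key=lambda p: comm_cost(p, edges))


# Toy problem: 12 tasks that each talk to their neighbor, scattered over the ring.
n_tasks = 12
edges = [(i, (i + 1) % n_tasks) for i in range(n_tasks)]
initial = random.Random(1).sample(range(n_tasks), n_tasks)
best = optimize(edges, initial)
print(comm_cost(initial, edges), "->", comm_cost(best, edges))
```

    Keeping the scheduler-provided placement in the initial population means the result can never be worse than the starting point, which is a convenient property for a production setting.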

  2. Performance analysis and material dependence of micro holographic optical elements as couplers for fiber optic communication

    NASA Astrophysics Data System (ADS)

    Ambadiyil, Sajan; Prasannan, G.; Sathyan, Jithesh; Ajith Kumar, P. T.

    2005-01-01

    Holographic Optical Elements (HOEs) are gaining much importance and finding newer and better applications in areas of optical fiber communication and optical information processing systems. In contrast to conventional HOEs, optical communication and information systems require smaller and efficient elements of desired characteristics and transfer functions. Such Micro Holographic Optical Elements (MHOEs) can either be an HOE, recorded with two narrow beams of laser light or a segment cut from a larger HOE (SHOEs), and recorded in the conventional manner. In this study, micro holographic couplers, having specific focusing and diffraction characteristics were recorded in different holographic recording media such as silver halide and dichromated gelatin. Wavelength response of the elements was tested at 633 nm and 442 nm. Variation in diffraction efficiency/coupling factor, and insertion loss of the elements were studied. The paper reports in detail about the above results and related design considerations.

  3. Machine Learning Based Online Performance Prediction for Runtime Parallelization and Task Scheduling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, J; Ma, X; Singh, K

    2008-10-09

    With the emerging many-core paradigm, parallel programming must extend beyond its traditional realm of scientific applications. Converting existing sequential applications as well as developing next-generation software requires assistance from hardware, compilers and runtime systems to exploit parallelism transparently within applications. These systems must decompose applications into tasks that can be executed in parallel and then schedule those tasks to minimize load imbalance. However, many systems lack a priori knowledge about the execution time of all tasks to perform effective load balancing with low scheduling overhead. In this paper, we approach this fundamental problem using machine learning techniques first to generate performance models for all tasks and then applying those models to perform automatic performance prediction across program executions. We also extend an existing scheduling algorithm to use generated task cost estimates for online task partitioning and scheduling. We implement the above techniques in the pR framework, which transparently parallelizes scripts in the popular R language, and evaluate their performance and overhead with both a real-world application and a large number of synthetic representative test scripts. Our experimental results show that our proposed approach significantly improves task partitioning and scheduling, with maximum improvements of 21.8%, 40.3% and 22.1% and average improvements of 15.9%, 16.9% and 4.2% for LMM (a real R application) and synthetic test cases with independent and dependent tasks, respectively.
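
    How predicted task costs can drive load balancing is easy to show with a generic greedy scheduler. The sketch below uses longest-processing-time-first assignment for independent tasks; it is a stand-in chosen for illustration, not the scheduling algorithm extended in the pR framework, and the task names and cost values are hypothetical.

```python
import heapq


def schedule(predicted_costs, n_workers):
    """Greedy longest-processing-time-first assignment of independent tasks.

    predicted_costs maps task name -> estimated run time (for example,
    the output of a learned performance model). Returns the per-worker
    task lists and the resulting makespan.
    """
    heap = [(0.0, w) for w in range(n_workers)]  # (current load, worker id)
    assignment = {w: [] for w in range(n_workers)}
    for task, cost in sorted(predicted_costs.items(), key=lambda kv: -kv[1]):
        load, worker = heapq.heappop(heap)  # least-loaded worker so far
        assignment[worker].append(task)
        heapq.heappush(heap, (load + cost, worker))
    makespan = max(load for load, _ in heap)
    return assignment, makespan


# Hypothetical cost estimates for five independent tasks.
costs = {"a": 5, "b": 4, "c": 3, "d": 3, "e": 3}
assignment, makespan = schedule(costs, n_workers=2)
print(assignment, makespan)
```

    The greedy rule is cheap to run online, which matters when scheduling overhead must stay low, but it is a heuristic and does not always find the optimal split.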

  4. Design and fabrication of three-dimensional polymer mode multiplexer based on asymmetric waveguide couplers

    NASA Astrophysics Data System (ADS)

    He, Guobing; Gao, Yang; Xu, Yan; Ji, Lanting; Sun, Xiaoqiang; Wang, Xibin; Yi, Yunji; Chen, Changming; Wang, Fei; Zhang, Daming; Wu, Yuanda

    2018-05-01

    A polymer mode multiplexer based on asymmetric couplers is theoretically designed and experimentally demonstrated. The proposed X-junction coupler is formed by waveguides overlapped with different crossing angles in the vertical direction. A beam propagation method is adopted to optimize the dimensional parameters of the mode multiplexer to convert the LP01 mode of two lower waveguides to the LP11a and LP21a modes of the upper waveguide. Ultraviolet lithography and wet chemical etching are used in the fabrication process. A conversion ratio over 98% for both the LP11a and LP21a modes in the wavelength range from 1530 to 1570 nm is experimentally demonstrated. This mode multiplexer has potential in broadband mode-division multiplexing transmission systems.

  5. Teaching RLC Parallel Circuits in High-School Physics Class

    ERIC Educational Resources Information Center

    Simon, Alpár

    2015-01-01

    This paper will try to give an alternative treatment of the subject "parallel RLC circuits" and "resonance in parallel RLC circuits" from the Physics curricula for the XIth grade from Romanian high-schools, with an emphasis on practical type circuits and their possible applications, and intends to be an aid for both Physics…
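
    For readers who want a concrete instance of resonance in a parallel RLC circuit, the sketch below evaluates the standard ideal-circuit relations: the resonant frequency is 1 / (2 pi sqrt(L C)), and the impedance magnitude peaks at R there. The component values and function names are hypothetical and are not taken from the paper.

```python
import math


def resonant_frequency_hz(inductance_h: float, capacitance_f: float) -> float:
    """Resonant frequency of an ideal parallel RLC circuit."""
    return 1.0 / (2.0 * math.pi * math.sqrt(inductance_h * capacitance_f))


def impedance(resistance_ohm, inductance_h, capacitance_f, frequency_hz):
    """Complex impedance of R, L and C connected in parallel."""
    omega = 2.0 * math.pi * frequency_hz
    admittance = complex(
        1.0 / resistance_ohm,
        omega * capacitance_f - 1.0 / (omega * inductance_h),
    )
    return 1.0 / admittance


# Hypothetical classroom values: 1 kOhm, 10 mH, 100 nF.
R, L, C = 1000.0, 10e-3, 100e-9
f0 = resonant_frequency_hz(L, C)
for f in (0.5 * f0, f0, 2.0 * f0):
    print(round(f, 1), round(abs(impedance(R, L, C, f)), 1))
```

    At resonance the inductive and capacitive branch currents cancel, so the source sees only the resistor, and the impedance falls off on either side of that frequency.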

  6. The FORCE - A highly portable parallel programming language

    NASA Technical Reports Server (NTRS)

    Jordan, Harry F.; Benten, Muhammad S.; Alaghband, Gita; Jakob, Ruediger

    1989-01-01

    This paper explains why the FORCE parallel programming language is easily portable among six different shared-memory multiprocessors, and how a two-level macro preprocessor makes it possible to hide low-level machine dependencies and to build machine-independent high-level constructs on top of them. These FORCE constructs make it possible to write portable parallel programs largely independent of the number of processes and the specific shared-memory multiprocessor executing them.

  7. The FORCE: A highly portable parallel programming language

    NASA Technical Reports Server (NTRS)

    Jordan, Harry F.; Benten, Muhammad S.; Alaghband, Gita; Jakob, Ruediger

    1989-01-01

    Here, it is explained why the FORCE parallel programming language is easily portable among six different shared-memory multiprocessors, and how a two-level macro preprocessor makes it possible to hide low level machine dependencies and to build machine-independent high level constructs on top of them. These FORCE constructs make it possible to write portable parallel programs largely independent of the number of processes and the specific shared memory multiprocessor executing them.

  8. Experimental Study on the Flexural Performance of Parallel Strand Bamboo Beams

    PubMed Central

    Zhou, Aiping; Bian, Yuling

    2014-01-01

    Searching for materials to provide proper housing with lower emissions and low energy use has become an urgent demand with the ever-growing population. Bamboo has gained a reputation as an ecofriendly, highly renewable source of material. Parallel Strand Bamboo (PSB) is a new biocomposite made of bamboo strips whose performance is superior to that of wood products. It has attracted considerable interest as a sustainable alternative to more traditional building materials. But the study of the mechanical performance of PSB as a construction material is still inadequate. Also, the structural behavior of PSB is not as well understood as that of conventional construction materials, which makes it difficult to predict the performance of PSB structural members. To address this, 4-point bending experiments for PSB beams were carried out. The flexural performance, mode of failure in bending, and the damage mechanism of PSB beams were investigated in this paper. PMID:24701141

  9. Parallelization of NAS Benchmarks for Shared Memory Multiprocessors

    NASA Technical Reports Server (NTRS)

    Waheed, Abdul; Yan, Jerry C.; Saini, Subhash (Technical Monitor)

    1998-01-01

    This paper presents our experiences of parallelizing the sequential implementation of NAS benchmarks using compiler directives on SGI Origin2000 distributed shared memory (DSM) system. Porting existing applications to new high performance parallel and distributed computing platforms is a challenging task. Ideally, a user develops a sequential version of the application, leaving the task of porting to new generations of high performance computing systems to parallelization tools and compilers. Due to the simplicity of programming shared-memory multiprocessors, compiler developers have provided various facilities to allow the users to exploit parallelism. Native compilers on SGI Origin2000 support multiprocessing directives to allow users to exploit loop-level parallelism in their programs. Additionally, supporting tools can accomplish this process automatically and present the results of parallelization to the users. We experimented with these compiler directives and supporting tools by parallelizing sequential implementation of NAS benchmarks. Results reported in this paper indicate that with minimal effort, the performance gain is comparable with the hand-parallelized, carefully optimized, message-passing implementations of the same benchmarks.

  10. Miniature mechanical transfer optical coupler

    DOEpatents

    Abel, Philip [Overland Park, KS; Watterson, Carl [Kansas City, MO

    2011-02-15

    A miniature mechanical transfer (MT) optical coupler ("MMTOC") for optically connecting a first plurality of optical fibers with at least one other plurality of optical fibers. The MMTOC may comprise a beam splitting element, a plurality of collimating lenses, and a plurality of alignment elements. The MMTOC may optically couple a first plurality of fibers disposed in a plurality of ferrules of a first MT connector with a second plurality of fibers disposed in a plurality of ferrules of a second MT connector and a third plurality of fibers disposed in a plurality of ferrules of a third MT connector. The beam splitting element may allow a portion of each beam of light from the first plurality of fibers to pass through to the second plurality of fibers and simultaneously reflect another portion of each beam of light from the first plurality of fibers to the third plurality of fibers.

  11. On the photonic implementation of universal quantum gates, bell states preparation circuit and quantum LDPC encoders and decoders based on directional couplers and HNLF.

    PubMed

    Djordjevic, Ivan B

    2010-04-12

    The Bell states preparation circuit is a basic circuit required in quantum teleportation. We describe how to implement it in all-fiber technology. The basic building blocks for its implementation are directional couplers and highly nonlinear optical fiber (HNLF). Because quantum information processing is based on delicate superposition states, it is sensitive to quantum errors. In order to enable fault-tolerant quantum computing, the use of quantum error correction is unavoidable. We show how to implement in all-fiber technology encoders and decoders for sparse-graph quantum codes, and provide an illustrative example to demonstrate this implementation. We also show that an arbitrary set of universal quantum gates can be implemented based on directional couplers and HNLFs.

  12. Bi-directional triplexer with butterfly MMI coupler using SU-8 polymer waveguides

    NASA Astrophysics Data System (ADS)

    Mareš, David; Jeřábek, Vítězslav; Prajzler, Václav

    2015-01-01

    We report the design of a bi-directional planar optical multiplex/demultiplex filter (triplexer) for the optical part of a planar hybrid WDM bi-directional transceiver in fiber-to-the-home (FTTH) PON applications. The triplex lightwave circuit is based on Epoxy Novolak Resin SU-8 waveguides on a silica-on-silicon substrate with a Polymethylmethacrylate cladding layer. The triplexer is comprised of a linear butterfly concept of multimode interference (MMI) coupler separating downstream optical signals of 1490 nm and 1550 nm. For the upstream channel of 1310 nm, an additional directional coupler (DC) is used to add an optical signal of 1310 nm propagating in the opposite direction. The optical triplexer was designed and optimized using the beam propagation method. The insertion losses, crosstalk attenuation, and extinction ratio for all three inputs/outputs were investigated. The intended triplexer was designed using the parameters of the separated DC and MMI filter to approximate the idealized direct connection of both devices.

  13. Performance Metrics for Monitoring Parallel Program Executions

    NASA Technical Reports Server (NTRS)

    Sarukkai, Sekkar R.; Gotwais, Jacob K.; Yan, Jerry; Lum, Henry, Jr. (Technical Monitor)

    1994-01-01

    Existing tools for debugging performance of parallel programs either provide graphical representations of program execution or profiles of program executions. However, for performance debugging tools to be useful, such information has to be augmented with information that highlights the cause of poor program performance. Identifying the cause of poor performance requires not only determining the significance of various performance problems on the execution time of the program, but also considering the effect of interprocessor communications of individual source level data structures. In this paper, we present a suite of normalized indices which provide a convenient mechanism for focusing on a region of code with poor performance and highlight the cause of the problem in terms of processors, procedures and data structure interactions. All the indices are generated from trace files augmented with data structure information. Further, we show with the help of examples from the NAS benchmark suite that the indices help in detecting potential causes of poor performance, based on augmented execution traces obtained by monitoring the program.

  14. Evaluating the performance of parallel subsurface simulators: An illustrative example with PFLOTRAN

    PubMed Central

    Hammond, G E; Lichtner, P C; Mills, R T

    2014-01-01

    To better inform the subsurface scientist on the expected performance of parallel simulators, this work investigates performance of the reactive multiphase flow and multicomponent biogeochemical transport code PFLOTRAN as it is applied to several realistic modeling scenarios run on the Jaguar supercomputer. After a brief introduction to the code's parallel layout and code design, PFLOTRAN's parallel performance (measured through strong and weak scalability analyses) is evaluated in the context of conceptual model layout, software and algorithmic design, and known hardware limitations. PFLOTRAN scales well (with regard to strong scaling) for three realistic problem scenarios: (1) in situ leaching of copper from a mineral ore deposit within a 5-spot flow regime, (2) transient flow and solute transport within a regional doublet, and (3) a real-world problem involving uranium surface complexation within a heterogeneous and extremely dynamic variably saturated flow field. Weak scalability is discussed in detail for the regional doublet problem, and several difficulties with its interpretation are noted. PMID:25506097

  15. Evaluating the performance of parallel subsurface simulators: An illustrative example with PFLOTRAN.

    PubMed

    Hammond, G E; Lichtner, P C; Mills, R T

    2014-01-01

    To better inform the subsurface scientist on the expected performance of parallel simulators, this work investigates performance of the reactive multiphase flow and multicomponent biogeochemical transport code PFLOTRAN as it is applied to several realistic modeling scenarios run on the Jaguar supercomputer. After a brief introduction to the code's parallel layout and code design, PFLOTRAN's parallel performance (measured through strong and weak scalability analyses) is evaluated in the context of conceptual model layout, software and algorithmic design, and known hardware limitations. PFLOTRAN scales well (with regard to strong scaling) for three realistic problem scenarios: (1) in situ leaching of copper from a mineral ore deposit within a 5-spot flow regime, (2) transient flow and solute transport within a regional doublet, and (3) a real-world problem involving uranium surface complexation within a heterogeneous and extremely dynamic variably saturated flow field. Weak scalability is discussed in detail for the regional doublet problem, and several difficulties with its interpretation are noted.

  16. Adapting high-level language programs for parallel processing using data flow

    NASA Technical Reports Server (NTRS)

    Standley, Hilda M.

    1988-01-01

    EASY-FLOW, a very high-level data flow language, is introduced for the purpose of adapting programs written in a conventional high-level language to a parallel environment. The level of parallelism provided is of the large-grained variety in which parallel activities take place between subprograms or processes. A program written in EASY-FLOW is a set of subprogram calls as units, structured by iteration, branching, and distribution constructs. A data flow graph may be deduced from an EASY-FLOW program.

  17. Self-aligned grating couplers on template-stripped metal pyramids via nanostencil lithography

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Klemme, Daniel J.; Johnson, Timothy W.; Mohr, Daniel A.

    2016-05-23

    We combine nanostencil lithography and template stripping to create self-aligned patterns about the apex of ultrasmooth metal pyramids with high throughput. Three-dimensional patterns such as spiral and asymmetric linear gratings, which can couple incident light into a hot spot at the tip, are presented as examples of this fabrication method. Computer simulations demonstrate that spiral and linear diffraction grating patterns are both effective at coupling light to the tip. The self-aligned stencil lithography technique can be useful for integrating plasmonic couplers with sharp metallic tips for applications such as near-field optical spectroscopy, tip-based optical trapping, plasmonic sensing, and heat-assisted magnetic recording.

  18. High order parallel numerical schemes for solving incompressible flows

    NASA Technical Reports Server (NTRS)

    Lin, Avi; Milner, Edward J.; Liou, May-Fun; Belch, Richard A.

    1992-01-01

    The use of parallel computers for numerically solving flow fields has gained much importance in recent years. This paper introduces a new high order numerical scheme for computational fluid dynamics (CFD) specifically designed for parallel computational environments. A distributed MIMD system gives the flexibility of treating different elements of the governing equations with totally different numerical schemes in different regions of the flow field. The parallel decomposition of the governing operator to be solved is the primary parallel split. The primary parallel split was studied using a hypercube like architecture having clusters of shared memory processors at each node. The approach is demonstrated using examples of simple steady state incompressible flows. Future studies should investigate the secondary split because, depending on the numerical scheme that each of the processors applies and the nature of the flow in the specific subdomain, it may be possible for a processor to seek better, or higher order, schemes for its particular subcase.

  19. Routing performance analysis and optimization within a massively parallel computer

    DOEpatents

    Archer, Charles Jens; Peters, Amanda; Pinnow, Kurt Walter; Swartz, Brent Allen

    2013-04-16

    An apparatus, program product and method optimize the operation of a massively parallel computer system by, in part, receiving actual performance data concerning an application executed by the plurality of interconnected nodes, and analyzing the actual performance data to identify an actual performance pattern. A desired performance pattern may be determined for the application, and an algorithm may be selected from among a plurality of algorithms stored within a memory, the algorithm being configured to achieve the desired performance pattern based on the actual performance data.

  20. Optical interconnection networks for high-performance computing systems

    NASA Astrophysics Data System (ADS)

    Biberman, Aleksandr; Bergman, Keren

    2012-04-01

    Enabled by silicon photonic technology, optical interconnection networks have the potential to be a key disruptive technology in computing and communication industries. The enduring pursuit of performance gains in computing, combined with stringent power constraints, has fostered the ever-growing computational parallelism associated with chip multiprocessors, memory systems, high-performance computing systems and data centers. Sustaining these parallelism growths introduces unique challenges for on- and off-chip communications, shifting the focus toward novel and fundamentally different communication approaches. Chip-scale photonic interconnection networks, enabled by high-performance silicon photonic devices, offer unprecedented bandwidth scalability with reduced power consumption. We demonstrate that the silicon photonic platforms have already produced all the high-performance photonic devices required to realize these types of networks. Through extensive empirical characterization in much of our work, we demonstrate such feasibility of waveguides, modulators, switches and photodetectors. We also demonstrate systems that simultaneously combine many functionalities to achieve more complex building blocks. We propose novel silicon photonic devices, subsystems, network topologies and architectures to enable unprecedented performance of these photonic interconnection networks. Furthermore, the advantages of photonic interconnection networks extend far beyond the chip, offering advanced communication environments for memory systems, high-performance computing systems, and data centers.

  1. An FPGA-based High Speed Parallel Signal Processing System for Adaptive Optics Testbed

    NASA Astrophysics Data System (ADS)

    Kim, H.; Choi, Y.; Yang, Y.

    In this paper, a state-of-the-art FPGA (Field Programmable Gate Array) based high speed parallel signal processing system (SPS) for an adaptive optics (AO) testbed with a 1 kHz wavefront error (WFE) correction frequency is reported. The AO system consists of a Shack-Hartmann sensor (SHS), a deformable mirror (DM), a tip-tilt sensor (TTS), a tip-tilt mirror (TTM) and an FPGA-based high performance SPS to correct wavefront aberrations. The SHS is composed of 400 subapertures and the DM of 277 actuators with Fried geometry, requiring an SPS with high speed parallel computing capability. In this study, the target WFE correction speed is 1 kHz; therefore, it requires massive parallel computing capabilities as well as strict hard real-time constraints on measurements from sensors, matrix computation latency for correction algorithms, and output of control signals for actuators. To meet these requirements, an FPGA-based real-time SPS with parallel computing capabilities is proposed. In particular, the SPS is made up of a National Instruments (NI) real-time computer and five FPGA boards based on the state-of-the-art Xilinx Kintex 7 FPGA. Programming is done in NI's LabVIEW environment, providing flexibility when applying different algorithms for WFE correction. It also provides a faster programming and debugging environment than conventional ones. One of the five FPGAs is assigned to measure the TTS and calculate control signals for the TTM, while the remaining four are used to receive the SHS signal and calculate slopes for each subaperture and the correction signal for the DM. With these parallel processing capabilities of the SPS, the overall closed-loop WFE correction speed of 1 kHz has been achieved. System requirements, architecture and implementation issues are described; furthermore, experimental results are also given.

  2. Variable temperature, variable-gap Otto prism coupler for use in a vacuum environment

    NASA Astrophysics Data System (ADS)

    Cairns, G. F.; O'Prey, S. M.; Dawson, P.

    2000-11-01

    The field of surface polariton physics really took off with the prism coupling techniques developed by Kretschmann and Raether, and by Otto. This article reports on the construction and operation of a rotatable, in vacuo, variable temperature, Otto coupler with a coupling gap that can be varied by remote control. The specific design attributes of the system offer additional advantages to those of standard Otto systems of (i) temperature variation (ambient to 85 K), and (ii) the use of a valuable, additional reference point, namely the gap-independent reflectance at the Brewster angle at any given, fixed temperature. The instrument is placed firmly in a historical context of developments in the field. The efficacy of the coupler is demonstrated by sample attenuated total reflectance results on films of platinum, niobium, and yttrium barium copper oxide and on aluminum/gallium arsenide (Al/GaAs) Schottky diode structures.

  3. "Hot-wire" microfluidic flowmeter based on a microfiber coupler.

    PubMed

    Yan, Shao-Cheng; Liu, Zeng-Yong; Li, Cheng; Ge, Shi-Jun; Xu, Fei; Lu, Yan-Qing

    2016-12-15

    Using an optical microfiber coupler (MC), we present a microfluidic platform for strong direct or indirect light-liquid interaction by wrapping a MC around a functionalized capillary. The light propagating in the MC and the liquid flowing in the capillary can be combined and divorced smoothly, keeping a long-distance interaction without the conflict of input and output coupling. Using this approach, we experimentally demonstrate a "hot-wire" microfluidic flowmeter based on a gold-integrated helical MC device. The microfluid inside the glass channel takes away the heat, then cools the MC and shifts the resonant wavelength. Due to the long-distance interaction and high temperature sensitivity, the proposed microfluidic flowmeter shows an ultrahigh flow rate sensitivity of 2.183 nm/(μl/s) at a flow rate of 1 μl/s. The minimum detectable change of the flow rate is around 9 nl/s at 1 μl/s.
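    The two figures quoted in this abstract can be related by simple arithmetic. The sketch below assumes the response is locally linear around 1 μl/s; the resulting wavelength resolution is an inference from the quoted numbers, not a value stated in the abstract, and the names are hypothetical.

```python
# Figures quoted in the abstract.
SENSITIVITY_NM_PER_UL_S = 2.183   # resonant-wavelength shift per unit flow rate
MIN_FLOW_CHANGE_UL_S = 9e-3       # 9 nl/s expressed in microlitres per second


def wavelength_shift_nm(delta_flow_ul_s, sensitivity=SENSITIVITY_NM_PER_UL_S):
    """Wavelength shift for a small flow-rate change (linear assumption)."""
    return sensitivity * delta_flow_ul_s


# Wavelength resolution implied by the minimum detectable flow change.
implied_resolution_nm = wavelength_shift_nm(MIN_FLOW_CHANGE_UL_S)
print(f"implied wavelength resolution: {implied_resolution_nm:.4f} nm")
```

    Under that assumption the two quoted values are consistent with a spectral resolution of roughly 0.02 nm.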

  4. A Programming Model Performance Study Using the NAS Parallel Benchmarks

    DOE PAGES

    Shan, Hongzhang; Blagojević, Filip; Min, Seung-Jai; ...

    2010-01-01

    Harnessing the power of multicore platforms is challenging due to the additional levels of parallelism present. In this paper we use the NAS Parallel Benchmarks to study three programming models, MPI, OpenMP and PGAS, to understand their performance and memory usage characteristics on current multicore architectures. To understand these characteristics we use the Integrated Performance Monitoring tool and other ways to measure communication versus computation time, as well as the fraction of the run time spent in OpenMP. The benchmarks are run on two different Cray XT5 systems and an Infiniband cluster. Our results show that in general the three programming models exhibit very similar performance characteristics. In a few cases, OpenMP is significantly faster because it explicitly avoids communication. For these particular cases, we were able to re-write the UPC versions and achieve equal performance to OpenMP. Using OpenMP was also the most advantageous in terms of memory usage. Also we compare performance differences between the two Cray systems, which have quad-core and hex-core processors. We show that at scale the performance is almost always slower on the hex-core system because of increased contention for network resources.

  5. Two-dimensional Ag/SiO2 and Cu/SiO2 nanocomposite surface-relief grating couplers and their vertical input coupling properties

    NASA Astrophysics Data System (ADS)

    Wang, Jun; Mu, Xiaoyu; Wang, Gang; Liu, Changlong

    2017-11-01

    By etching two SiO2 optical waveguide slabs separately implanted with 90 keV Ag ions and 60 keV Cu ions at the same dose of 6 × 10^16 cm^-2, two-dimensional Ag/SiO2 and Cu/SiO2 nanocomposite surface-relief grating couplers with 600-nm periodicity and 100-nm thickness were fabricated, and their structural and vertical input coupling properties were investigated. Experimental results revealed that the two couplers could convert light beams at wavelengths of 620-880 nm into guided waves with different efficiencies, highlighting the special importance of metal nanoparticles (NPs). Further discussions also revealed that owing to the introduction of periodically distributed metal NPs, the periodical phase modification of the transmitted beam was enhanced drastically, and the nanocomposite veins could behave as efficient light scatterers. As a result, the two couplers were much larger in coupling efficiency than the NP-free one with identical morphological parameters. The above findings may be useful to construct thin and short but efficient surface-relief grating couplers on glass optical waveguides.

  6. Analysis and performance of paralleling circuits for modular inverter-converter systems

    NASA Technical Reports Server (NTRS)

    Birchenough, A. G.; Gourash, F.

    1972-01-01

    As part of a modular inverter-converter development program, control techniques were developed to provide load sharing among paralleled inverters or converters. An analysis of the requirements of paralleling circuits and a discussion of the circuits developed and their performance are included in this report. The current sharing was within 5.6 percent of rated-load current for the ac modules and 7.4 percent for the dc modules for an initial output voltage unbalance of 5 volts.

  7. Kinematic Analysis and Performance Evaluation of Novel PRS Parallel Mechanism

    NASA Astrophysics Data System (ADS)

    Balaji, K.; Khan, B. Shahul Hamid

    2018-02-01

    In this paper, novel 3 DoF (Degree of Freedom) PRS (Prismatic-Revolute-Spherical) type parallel mechanisms are designed and presented. The combination of straight and arc type linkages for a 3 DoF parallel mechanism is introduced for the first time. The performance of the mechanisms is evaluated based on indices such as Minimum Singular Value (MSV), Condition Number (CN), Local Conditioning Index (LCI), Kinematic Configuration Index (KCI) and Global Conditioning Index (GCI). The overall reachable workspace of all mechanisms is presented. The kinematic measure, dexterity measure and workspace analysis for all the mechanisms have been evaluated and compared.

  8. Template based parallel checkpointing in a massively parallel computer system

    DOEpatents

    Archer, Charles Jens [Rochester, MN]; Inglett, Todd Alan [Rochester, MN]

    2009-01-13

    A method and apparatus for a template based parallel checkpoint save for a massively parallel super computer system using a parallel variation of the rsync protocol, and network broadcast. In preferred embodiments, the checkpoint data for each node is compared to a template checkpoint file that resides in the storage and that was previously produced. Embodiments herein greatly decrease the amount of data that must be transmitted and stored for faster checkpointing and increased efficiency of the computer system. Embodiments are directed to a parallel computer system with nodes arranged in a cluster with a high speed interconnect that can perform broadcast communication. The checkpoint contains a set of actual small data blocks with their corresponding checksums from all nodes in the system. The data blocks may be compressed using conventional non-lossy data compression algorithms to further reduce the overall checkpoint size.
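    The core idea of comparing per-node checkpoint data against a stored template, and keeping only the blocks that differ, can be sketched as follows. This is a simplified illustration and not the patented method: it compares fixed-offset blocks with SHA-256, whereas the rsync protocol uses rolling checksums, and all names and the tiny block size are hypothetical.

```python
import hashlib

BLOCK_SIZE = 4  # bytes; deliberately tiny for illustration


def block_checksums(data, block_size=BLOCK_SIZE):
    """Checksum of every fixed-size block of the template checkpoint."""
    return [hashlib.sha256(data[i:i + block_size]).hexdigest()
            for i in range(0, len(data), block_size)]


def checkpoint_delta(node_data, template_sums, block_size=BLOCK_SIZE):
    """Return {block index: bytes} for blocks that differ from the template."""
    delta = {}
    for index, start in enumerate(range(0, len(node_data), block_size)):
        block = node_data[start:start + block_size]
        digest = hashlib.sha256(block).hexdigest()
        if index >= len(template_sums) or template_sums[index] != digest:
            delta[index] = block
    return delta


def restore(template, delta, length, block_size=BLOCK_SIZE):
    """Rebuild a node's checkpoint from the template plus its stored delta."""
    blocks = [template[i:i + block_size] for i in range(0, length, block_size)]
    for index, block in delta.items():
        blocks[index] = block
    return b"".join(blocks)[:length]


template = b"AAAABBBBCCCCDDDD"
node = b"AAAAXXXXCCCCDDDD"
delta = checkpoint_delta(node, block_checksums(template))
print(delta)                                        # only the changed block
print(restore(template, delta, len(node)) == node)  # round-trip check
```

    Only the differing block needs to be transmitted and stored for this node, which is the source of the data reduction the abstract describes.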

  9. Parallel high-precision orbit propagation using the modified Picard-Chebyshev method

    NASA Astrophysics Data System (ADS)

    Koblick, Darin C.

    2012-03-01

    The modified Picard-Chebyshev method, when run in parallel, is thought to be more accurate and faster than the most efficient sequential numerical integration techniques when applied to orbit propagation problems. Previous experiments have shown that the modified Picard-Chebyshev method can have up to a one order of magnitude speedup over the 12th order Runge-Kutta-Nystrom method. For this study, the evaluation of the accuracy and computational time of the modified Picard-Chebyshev method, using the Java Astrodynamics Toolkit high-precision force model, is conducted to assess its runtime performance. Simulation results of the modified Picard-Chebyshev method, implemented in MATLAB and the MATLAB Parallel Computing Toolbox, are compared against the most efficient first and second order Ordinary Differential Equation (ODE) solvers. A total of six processors were used to assess the runtime performance of the modified Picard-Chebyshev method. It was found that for all orbit propagation test cases, where the gravity model was simulated to be of higher degree and order (above 225 to increase computational overhead), the modified Picard-Chebyshev method was faster, by as much as a factor of two, than the other ODE solvers which were tested.

  10. Performance Evaluation and Modeling Techniques for Parallel Processors. Ph.D. Thesis

    NASA Technical Reports Server (NTRS)

    Dimpsey, Robert Tod

    1992-01-01

    In practice, the performance evaluation of supercomputers is still substantially driven by single-point estimates of metrics (e.g., MFLOPS) obtained by running characteristic benchmarks or workloads. With the rapid increase in the use of time-shared multiprogramming in these systems, such measurements are clearly inadequate. This is because multiprogramming and system overhead, as well as other degradations in performance due to time varying characteristics of workloads, are not taken into account. In multiprogrammed environments, multiple jobs and users can dramatically increase the amount of system overhead and degrade the performance of the machine. Performance techniques, such as benchmarking, which characterize performance on a dedicated machine ignore this major component of true computer performance. Due to the complexity of analysis, there has been little work done in analyzing, modeling, and predicting the performance of applications in multiprogrammed environments. This is especially true for parallel processors, where the costs and benefits of multi-user workloads are exacerbated. While some may claim that the issue of multiprogramming is not a viable one in the supercomputer market, experience shows otherwise. Even in recent massively parallel machines, multiprogramming is a key component. It has even been claimed that a partial cause of the demise of the CM2 was the fact that it did not efficiently support time-sharing. In the same paper, Gordon Bell postulates that multicomputers will evolve to multiprocessors in order to support efficient multiprogramming. Therefore, it is clear that parallel processors of the future will be required to offer the user a time-shared environment with reasonable response times for the applications. In this type of environment, the most important performance metric is the completion or response time of a given application. However, there are few evaluation efforts addressing this issue.

  11. STS-119 EVA 3 GAT S1 Truss Flex Hose Rotary Coupler (FHRC) P-Clamp Release

    NASA Image and Video Library

    2009-03-23

    S119-E-007110 (23 March 2009) --- Astronaut Joseph Acaba, STS-119 mission specialist, participates in the mission's third scheduled session of extravehicular activity (EVA) as construction and maintenance continue on the International Space Station. During the six-hour, 27-minute spacewalk, Acaba and Richard Arnold (out of frame), mission specialist, helped robotic arm operators relocate the Crew Equipment Translation Aid (CETA) cart from the Port 1 to Starboard 1 truss segment, installed a new coupler on the CETA cart, lubricated snares on the "B" end of the space station's robotic arm and performed a few "get ahead" tasks.

  12. STS-119 EVA 3 GAT S1 Truss Flex Hose Rotary Coupler (FHRC) P-Clamp Release

    NASA Image and Video Library

    2009-03-23

    S119-E-007119 (23 March 2009) --- Astronaut Joseph Acaba, STS-119 mission specialist, participates in the mission's third scheduled session of extravehicular activity (EVA) as construction and maintenance continue on the International Space Station. During the six-hour, 27-minute spacewalk, Acaba and Richard Arnold (out of frame), mission specialist, helped robotic arm operators relocate the Crew Equipment Translation Aid (CETA) cart from the Port 1 to Starboard 1 truss segment, installed a new coupler on the CETA cart, lubricated snares on the "B" end of the space station's robotic arm and performed a few "get ahead" tasks.

  13. New design of a triplexer using ring resonator integrated with directional coupler based on photonic crystals

    NASA Astrophysics Data System (ADS)

    Wu, Yaw-Dong; Shih, Tien-Tsorng; Lee, Jian-Jang

    2009-11-01

    In this paper, we propose the design of a directional coupler integrated with a ring resonator based on two-dimensional photonic crystals (2D PCs) to develop a triplexer filter. It can be widely used as a wavelength-selective multiplexer-demultiplexer element of the fiber access network in fiber-to-the-home (FTTH) communication systems. The directional coupler is chosen to separate the wavelengths of 1490 nm and 1310 nm. The ring resonator separates the wavelength of 1550 nm. The transmission efficiency is larger than 90%. In addition, the total size of the proposed triplexer is only 19 μm × 12 μm. We present simulation results using the finite-difference time-domain (FDTD) method for the proposed structure.

  14. Method and apparatus for fabrication of high gradient insulators with parallel surface conductors spaced less than one millimeter apart

    DOEpatents

    Sanders, David M.; Decker, Derek E.

    1999-01-01

    Optical patterns and lithographic techniques are used as part of a process to embed parallel and evenly spaced conductors in the non-planar surfaces of an insulator to produce high gradient insulators. The approach extends the size that high gradient insulating structures can be fabricated as well as improves the performance of those insulators by reducing the scale of the alternating parallel lines of insulator and conductor along the surface. This fabrication approach also substantially decreases the cost required to produce high gradient insulators.

  15. Rectangular pulsed LD pumped saturable output coupler (SOC) Q-switched microchip laser

    NASA Astrophysics Data System (ADS)

    Wang, Yan-biao; Wang, Sha; Feng, Guo-ying; Zhou, Shou-huan

    2017-02-01

    We experimentally studied a saturable output coupler (SOC) passively Q-switched Nd:YVO4 transmission microchip laser pumped by a continuous wave (CW) LD and by a rectangular pulsed LD. We demonstrated that the SOC passively Q-switched Nd:YVO4 transmission microchip laser pumped by a highly stabilized narrow bandwidth pulsed LD has a much lower timing jitter than when pumped by a CW LD, especially in the low output frequency regime. By changing the pump beam size in the rectangular pulsed pump scheme, the output frequency can be varied from 333.3 kHz to 71.4 kHz, while the relative timing jitter decreases from 0.09865% to 0.03115% accordingly. Additionally, the microchip laser has good output power stability, with power fluctuation below 2%.

  16. Stiffness modeling of compliant parallel mechanisms and applications in the performance analysis of a decoupled parallel compliant stage

    NASA Astrophysics Data System (ADS)

    Jiang, Yao; Li, Tie-Min; Wang, Li-Ping

    2015-09-01

    This paper investigates the stiffness modeling of compliant parallel mechanism (CPM) based on the matrix method. First, the general compliance matrix of a serial flexure chain is derived. The stiffness modeling of CPMs is next discussed in detail, considering the relative positions of the applied load and the selected displacement output point. The derived stiffness models have simple and explicit forms, and the input, output, and coupling stiffness matrices of the CPM can easily be obtained. The proposed analytical model is applied to the stiffness modeling and performance analysis of an XY parallel compliant stage with input and output decoupling characteristics. Then, the key geometrical parameters of the stage are optimized to obtain the minimum input decoupling degree. Finally, a prototype of the compliant stage is developed and its input axial stiffness, coupling characteristics, positioning resolution, and circular contouring performance are tested. The results demonstrate the excellent performance of the compliant stage and verify the effectiveness of the proposed theoretical model. The general stiffness models provided in this paper will be helpful for performance analysis, especially in determining coupling characteristics, and the structure optimization of the CPM.

  17. Performance bounds on parallel self-initiating discrete-event

    NASA Technical Reports Server (NTRS)

    Nicol, David M.

    1990-01-01

    The use is considered of massively parallel architectures to execute discrete-event simulations of what is termed self-initiating models. A logical process in a self-initiating model schedules its own state re-evaluation times, independently of any other logical process, and sends its new state to other logical processes following the re-evaluation. The interest is in the effects of that communication on synchronization. The performance is considered of various synchronization protocols by deriving upper and lower bounds on optimal performance, upper bounds on Time Warp's performance, and lower bounds on the performance of a new conservative protocol. The analysis of Time Warp includes the overhead costs of state-saving and rollback. The analysis points out sufficient conditions for the conservative protocol to outperform Time Warp. The analysis also quantifies the sensitivity of performance to message fan-out, lookahead ability, and the probability distributions underlying the simulation.

  18. Implementation and performance of FDPS: a framework for developing parallel particle simulation codes

    NASA Astrophysics Data System (ADS)

    Iwasawa, Masaki; Tanikawa, Ataru; Hosono, Natsuki; Nitadori, Keigo; Muranushi, Takayuki; Makino, Junichiro

    2016-08-01

    We present the basic idea, implementation, measured performance, and performance model of FDPS (Framework for Developing Particle Simulators). FDPS is an application-development framework which helps researchers to develop simulation programs using particle methods for large-scale distributed-memory parallel supercomputers. A particle-based simulation program for distributed-memory parallel computers needs to perform domain decomposition, exchange of particles which are not in the domain of each computing node, and gathering of the particle information in other nodes which are necessary for interaction calculation. Also, even if distributed-memory parallel computers are not used, in order to reduce the amount of computation, algorithms such as the Barnes-Hut tree algorithm or the Fast Multipole Method should be used in the case of long-range interactions. For short-range interactions, some methods to limit the calculation to neighbor particles are required. FDPS provides all of these functions which are necessary for efficient parallel execution of particle-based simulations as "templates," which are independent of the actual data structure of particles and the functional form of the particle-particle interaction. By using FDPS, researchers can write their programs with the amount of work necessary to write a simple, sequential and unoptimized program of O(N^2) calculation cost, and yet the program, once compiled with FDPS, will run efficiently on large-scale parallel supercomputers. A simple gravitational N-body program can be written in around 120 lines. We report the actual performance of these programs and the performance model. The weak scaling performance is very good, and almost linear speed-up was obtained for up to the full system of the K computer. The minimum calculation time per timestep is in the range of 30 ms (N = 10^7) to 300 ms (N = 10^9). These are currently limited by the time for the calculation of the domain decomposition and communication
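    For orientation, the kind of "simple, sequential and unoptimized" O(N^2) gravitational kernel that the abstract uses as its point of comparison might look like the sketch below. This is not FDPS code (FDPS is a separate framework with its own API); the function name, the units (G = 1) and the softening parameter are assumptions made for illustration.

```python
import math


def direct_sum_accelerations(positions, masses, G=1.0, softening=1e-3):
    """Pairwise gravitational accelerations by direct summation, O(N^2)."""
    n = len(positions)
    acc = [[0.0, 0.0, 0.0] for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            # Vector from particle i to particle j.
            dx = [positions[j][k] - positions[i][k] for k in range(3)]
            r2 = sum(d * d for d in dx) + softening ** 2
            inv_r3 = 1.0 / (r2 * math.sqrt(r2))
            for k in range(3):
                acc[i][k] += G * masses[j] * dx[k] * inv_r3
    return acc


# Two unit masses one length unit apart attract each other symmetrically.
pos = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]]
print(direct_sum_accelerations(pos, [1.0, 1.0], softening=0.0))
```

    The double loop visits every ordered pair of particles, which is where the quadratic cost comes from; tree or multipole methods, as named in the abstract, reduce that cost for long-range forces.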

  19. A Parallel Rendering Algorithm for MIMD Architectures

    NASA Technical Reports Server (NTRS)

    Crockett, Thomas W.; Orloff, Tobias

    1991-01-01

    Applications such as animation and scientific visualization demand high performance rendering of complex three dimensional scenes. To deliver the necessary rendering rates, highly parallel hardware architectures are required. The challenge is then to design algorithms and software which effectively use the hardware parallelism. A rendering algorithm targeted to distributed memory MIMD architectures is described. For maximum performance, the algorithm exploits both object-level and pixel-level parallelism. The behavior of the algorithm is examined both analytically and experimentally. Its performance for large numbers of processors is found to be limited primarily by communication overheads. An experimental implementation for the Intel iPSC/860 shows increasing performance from 1 to 128 processors across a wide range of scene complexities. It is shown that minimal modifications to the algorithm will adapt it for use on shared memory architectures as well.

  20. Spatial mode filters realized with multimode interference couplers

    NASA Astrophysics Data System (ADS)

    Leuthold, J.; Hess, R.; Eckner, J.; Besse, P. A.; Melchior, H.

    1996-06-01

    Spatial mode filters based on multimode interference couplers (MMI's) that offer the possibility of splitting off antisymmetric from symmetric modes are presented, and realizations of these filters in InGaAsP / InP are demonstrated. Measured suppression of the antisymmetric first-order modes at the output for the symmetric mode is better than 18 dB. Such MMI's are useful for monolithically integrating mode filters with all-optical devices, which are controlled through an antisymmetric first-order mode. The filtering out of optical control signals is necessary for cascading all-optical devices. Another application is the improvement of on-off ratios in optical switches.

  1. Performance of a parallel code for the Euler equations on hypercube computers

    NASA Technical Reports Server (NTRS)

    Barszcz, Eric; Chan, Tony F.; Jesperson, Dennis C.; Tuminaro, Raymond S.

    1990-01-01

    The performance of hypercubes was evaluated on a computational fluid dynamics problem, and the parallel environment issues that must be addressed were considered, such as algorithm changes, implementation choices, programming effort, and programming environment. The evaluation focuses on a widely used fluid dynamics code, FLO52, which solves the two dimensional steady Euler equations describing flow around the airfoil. The code development experience is described, including interacting with the operating system, utilizing the message-passing communication system, and code modifications necessary to increase parallel efficiency. Results from two hypercube parallel computers (a 16-node iPSC/2, and a 512-node NCUBE/ten) are discussed and compared. In addition, a mathematical model of the execution time was developed as a function of several machine and algorithm parameters. This model accurately predicts the actual run times obtained and is used to explore the performance of the code in interesting but yet physically realizable regions of the parameter space. Based on this model, predictions about future hypercubes are made.

  2. Efficient parallel architecture for highly coupled real-time linear system applications

    NASA Technical Reports Server (NTRS)

    Carroll, Chester C.; Homaifar, Abdollah; Barua, Soumavo

    1988-01-01

    A systematic procedure is developed for exploiting the parallel constructs of computation in a highly coupled, linear system application. An overall top-down design approach is adopted. Differential equations governing the application under consideration are partitioned into subtasks on the basis of a data flow analysis. The interconnected task units constitute a task graph which has to be computed in every update interval. Multiprocessing concepts utilizing parallel integration algorithms are then applied for efficient task graph execution. A simple scheduling routine is developed to handle task allocation while in the multiprocessor mode. Results of simulation and scheduling are compared on the basis of standard performance indices. Processor timing diagrams are developed on the basis of program output accruing to an optimal set of processors. Basic architectural attributes for implementing the system are discussed together with suggestions for processing element design. Emphasis is placed on flexible architectures capable of accommodating widely varying application specifics.

  3. Performance analysis of three dimensional integral equation computations on a massively parallel computer. M.S. Thesis

    NASA Technical Reports Server (NTRS)

    Logan, Terry G.

    1994-01-01

    The purpose of this study is to investigate the performance of integral equation computations using the numerical source field-panel method in a massively parallel processing (MPP) environment. A comparative study of the computational performance of the MPP CM-5 computer and the conventional Cray-YMP supercomputer for a three-dimensional flow problem is made. A serial FORTRAN code is converted into a parallel CM-FORTRAN code. Some performance results are obtained on the CM-5 with 32, 62, 128 nodes along with those on the Cray-YMP with a single processor. The comparison of the performance indicates that the parallel CM-FORTRAN code nears or outperforms the equivalent serial FORTRAN code for some cases.

  4. High-Performance Computation of Distributed-Memory Parallel 3D Voronoi and Delaunay Tessellation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peterka, Tom; Morozov, Dmitriy; Phillips, Carolyn

    2014-11-14

    Computing a Voronoi or Delaunay tessellation from a set of points is a core part of the analysis of many simulated and measured datasets: N-body simulations, molecular dynamics codes, and LIDAR point clouds are just a few examples. Such computational geometry methods are common in data analysis and visualization; but as the scale of simulations and observations surpasses billions of particles, the existing serial and shared-memory algorithms no longer suffice. A distributed-memory scalable parallel algorithm is the only feasible approach. The primary contribution of this paper is a new parallel Delaunay and Voronoi tessellation algorithm that automatically determines which neighbor points need to be exchanged among the subdomains of a spatial decomposition. Other contributions include periodic and wall boundary conditions, comparison of our method using two popular serial libraries, and application to numerous science datasets.

  5. High Performance Parallel Analysis of Coupled Problems for Aircraft Propulsion

    NASA Technical Reports Server (NTRS)

    Felippa, C. A.; Farhat, C.; Lanteri, S.; Maman, N.; Piperno, S.; Gumaste, U.

    1994-01-01

    In order to predict the dynamic response of a flexible structure in a fluid flow, the equations of motion of the structure and the fluid must be solved simultaneously. In this paper, we present several partitioned procedures for time-integrating this coupled problem and discuss their merits in terms of accuracy, stability, heterogeneous computing, I/O transfers, subcycling, and parallel processing. All theoretical results are derived for a one-dimensional piston model problem with a compressible flow, because the complete three-dimensional aeroelastic problem is difficult to analyze mathematically. However, the insight gained from the analysis of the coupled piston problem and the conclusions drawn from its numerical investigation are confirmed with the numerical simulation of the two-dimensional transient aeroelastic response of a flexible panel in a transonic nonlinear Euler flow regime.

  6. A high performance parallel computing architecture for robust image features

    NASA Astrophysics Data System (ADS)

    Zhou, Renyan; Liu, Leibo; Wei, Shaojun

    2014-03-01

    A parallel architecture for image feature detection and description is proposed in this article. The major component of this architecture is a 2D cellular network composed of simple reprogrammable processors, enabling the Hessian Blob Detector and Haar Response Calculation, which are the most computing-intensive stages of the Speeded Up Robust Features (SURF) algorithm. Combining this 2D cellular network and dedicated hardware for SURF descriptors, this architecture achieves real-time image feature detection with minimal software in the host processor. A prototype FPGA implementation of the proposed architecture achieves 1318.9 GOPS general pixel processing @ 100 MHz clock and achieves up to 118 fps in VGA (640 × 480) image feature detection. The proposed architecture is stand-alone and scalable, so it can easily be migrated to a VLSI implementation.

  7. Progress on H5Part: A Portable High Performance Parallel Data Interface for Electromagnetics Simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Adelmann, Andreas; Gsell, Achim; Oswald, Benedikt

    Significant problems facing all experimental and computational sciences arise from growing data size and complexity. Common to all these problems is the need to perform efficient data I/O on diverse computer architectures. In our scientific application, the largest parallel particle simulations generate vast quantities of six-dimensional data. A single simulation run can produce an aggregate data size of up to several TB. Motivated by the need to address data I/O and access challenges, we have implemented H5Part, an open source data I/O API that simplifies the use of the Hierarchical Data Format v5 library (HDF5). HDF5 is an industry standard for high performance, cross-platform data storage and retrieval that runs on all contemporary architectures from large parallel supercomputers to laptops. H5Part, which is oriented to the needs of the particle physics and cosmology communities, provides support for parallel storage and retrieval of particles, structured and in the future unstructured meshes. In this paper, we describe recent work focusing on I/O support for particles and structured meshes and provide data showing performance on modern supercomputer architectures like the IBM POWER 5.

  8. High output lamp with high brightness

    DOEpatents

    Kirkpatrick, Douglas A.; Bass, Gary K.; Copsey, Jesse F.; Garber, Jr., William E.; Kwong, Vincent H.; Levin, Izrail; MacLennan, Donald A.; Roy, Robert J.; Steiner, Paul E.; Tsai, Peter; Turner, Brian P.

    2002-01-01

    An ultra bright, low wattage inductively coupled electrodeless aperture lamp is powered by a solid state RF source in the range of several tens to several hundreds of watts at various frequencies in the range of 400 to 900 MHz. Numerous novel lamp circuits and components are disclosed including a wedding ring shaped coil having one axial and one radial lead, a high accuracy capacitor stack, a high thermal conductivity aperture cup and various other aperture bulb configurations, a coaxial capacitor arrangement, and an integrated coil and capacitor assembly. Numerous novel RF circuits are also disclosed including a high power oscillator circuit with reduced complexity resonant pole configuration, parallel RF power FET transistors with soft gate switching, a continuously variable frequency tuning circuit, a six port directional coupler, an impedance switching RF source, and an RF source with controlled frequency-load characteristics. Numerous novel RF control methods are disclosed including controlled adjustment of the operating frequency to find a resonant frequency and reduce reflected RF power, controlled switching of an impedance switched lamp system, active power control and active gate bias control.

  9. Terahertz master-oscillator power-amplifier quantum cascade laser with a grating coupler of extremely low reflectivity.

    PubMed

    Zhu, Huan; Zhu, Haiqing; Wang, Fangfang; Chang, Gaolei; Yu, Chenren; Yan, Quan; Chen, Jianxin; Li, Lianhe; Davies, A Giles; Linfield, Edmund H; Tang, Zhou; Chen, Pingping; Lu, Wei; Xu, Gangyi; He, Li

    2018-01-22

    A terahertz master-oscillator power-amplifier quantum cascade laser (THz-MOPA-QCL) is demonstrated where a grating coupler is employed to efficiently extract the THz radiation. By maximizing the group velocity and eliminating the scattering of the THz wave in the grating coupler, the residual reflectivity is reduced to the order of 10⁻³. A buried DFB grating and a tapered preamplifier are proposed to improve the seed power and to reduce the gain saturation, respectively. The THz-MOPA-QCL exhibits single-mode emission, a single-lobed beam with a narrow divergence angle of 18° × 16°, and a pulsed output power of 136 mW at 20 K, which is 36 times that of a second-order DFB laser from the same material.

  10. Tunable negative-tap photonic microwave filter based on a cladding-mode coupler and an optically injected laser of large detuning.

    PubMed

    Chan, Sze-Chun; Liu, Qing; Wang, Zhu; Chiang, Kin Seng

    2011-06-20

    A tunable negative-tap photonic microwave filter using a cladding-mode coupler together with optical injection locking of large wavelength detuning is demonstrated. Continuous and precise tunability of the filter is realized by physically sliding a pair of bare fibers inside the cladding-mode coupler. Signal inversion for the negative tap is achieved by optical injection locking of a single-mode semiconductor laser. To couple light into and out of the cladding-mode coupler, a pair of matching long-period fiber gratings is employed. The large bandwidth of the gratings requires injection locking of an exceptionally large wavelength detuning that has never been demonstrated before. Experimentally, injection locking with wavelength detuning as large as 27 nm was achieved, which corresponded to locking the 36-th side mode. Microwave filtering with a free-spectral range tunable from 88.6 MHz to 1.57 GHz and a notch depth larger than 35 dB was obtained.
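
    The free-spectral range (FSR) quoted above can be related to a tap delay under a simplifying assumption that is not taken from the paper: an idealized two-tap filter with equal-magnitude taps, one positive and one negative, separated by a delay tau. In that model the response is H(f) = 1 - exp(-j 2 pi f tau), the notches sit at multiples of 1/tau, and the FSR equals 1/tau. The function names below are hypothetical.

```python
import cmath
import math

def delay_for_fsr(fsr_hz):
    """Tap delay (seconds) giving the requested FSR in the idealized two-tap model."""
    return 1.0 / fsr_hz

def two_tap_response(freq_hz, delay_s):
    """Complex response of an ideal two-tap filter with taps +1 and -1."""
    return 1.0 - cmath.exp(-2j * math.pi * freq_hz * delay_s)

def magnitude_db(freq_hz, delay_s, floor=1e-12):
    """Magnitude in dB, clamped at `floor` so ideal notches do not give log(0)."""
    mag = abs(two_tap_response(freq_hz, delay_s))
    return 20.0 * math.log10(max(mag, floor))

# Delays implied by the two ends of the reported tuning range (assumed model).
tau_slow = delay_for_fsr(88.6e6)   # roughly 11.3 ns
tau_fast = delay_for_fsr(1.57e9)   # roughly 0.64 ns

# The negative tap puts a notch at DC; the passband peak lies at half the FSR.
peak = abs(two_tap_response(0.5 / tau_fast, tau_fast))
```

    In this simplified picture, sliding the fibers changes the delay difference by roughly 10.7 ns between the two ends of the tuning range. A real filter has unequal tap weights and a finite notch depth, which this sketch does not model.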

  11. HVI Ballistic Performance Characterization of Non-Parallel Walls

    NASA Technical Reports Server (NTRS)

    Bohl, William; Miller, Joshua; Christiansen, Eric

    2012-01-01

    The Double-Wall, "Whipple" Shield [1] has been the subject of many hypervelocity impact studies and has proven to be an effective shield system for Micro-Meteoroid and Orbital Debris (MMOD) impacts for spacecraft. The US modules of the International Space Station (ISS), with their "bumper shields" offset from their pressure holding rear walls provide good examples of effective on-orbit use of the double wall shield. The concentric cylinder shield configuration with its large radius of curvature relative to separation distance is easily and effectively represented for testing and analysis as a system of two parallel plates. The parallel plate double wall configuration has been heavily tested and characterized for shield performance for normal and oblique impacts for the ISS and other programs. The double wall shield and principally similar Stuffed Whipple Shield are very common shield types for MMOD protection. However, in some locations with many spacecraft designs, the rear wall cannot be modeled as being parallel or concentric with the outer bumper wall. As represented in Figure 1, there is an included angle between the two walls. And, with a cylindrical outer wall, the effective included angle constantly changes. This complicates assessment of critical spacecraft components located within outer spacecraft walls when using software tools such as NASA's BumperII. In addition, the validity of the risk assessment comes into question when using the standard double wall shield equations, especially since verification testing of every set of double wall included angles is impossible.

  12. Design of a highly parallel board-level-interconnection with 320 Gbps capacity

    NASA Astrophysics Data System (ADS)

    Lohmann, U.; Jahns, J.; Limmer, S.; Fey, D.; Bauer, H.

    2012-01-01

    A parallel board-level interconnection design is presented consisting of 32 channels, each operating at 10 Gbps. The hardware uses available optoelectronic components (VCSEL, TIA, pin-diodes) and a combination of planar-integrated free-space optics, fiber-bundles and available MEMS-components, like the DMD™ from Texas Instruments. As a specific feature, we present a new modular inter-board interconnect, realized by 3D fiber-matrix connectors. The performance of the interconnect is evaluated with regard to optical properties and power consumption. Finally, we discuss the application of the interconnect for strongly distributed system architectures, as, for example, in high performance embedded computing systems and data centers.

  13. Parallel computation with the force

    NASA Technical Reports Server (NTRS)

    Jordan, H. F.

    1985-01-01

    A methodology, called the force, supports the construction of programs to be executed in parallel by a force of processes. The number of processes in the force is unspecified, but potentially very large. The force idea is embodied in a set of macros which produce multiprocessor FORTRAN code and has been studied on two shared memory multiprocessors of fairly different character. The method has simplified the writing of highly parallel programs within a limited class of parallel algorithms and is being extended to cover a broader class. The individual parallel constructs which comprise the force methodology are discussed. Of central concern are their semantics, implementation on different architectures and performance implications.

  14. Productive High Performance Parallel Programming with Auto-tuned Domain-Specific Embedded Languages

    DTIC Science & Technology

    2013-01-02

    Compilation JVM Java Virtual Machine KB Kilobyte KDT Knowledge Discovery Toolbox LAPACK Linear Algebra Package LLVM Low-Level Virtual Machine LOC Lines...different starting points. Leo Meyerovich also helped solidify some of the ideas here in discussions during Par Lab retreats. I would also like to thank...multi-timestep computations by blocking in both time and space. 88 Implementation Output Approx DSL Type Language Language Parallelism LoC Graphite

  15. Directions in parallel programming: HPF, shared virtual memory and object parallelism in pC++

    NASA Technical Reports Server (NTRS)

    Bodin, Francois; Priol, Thierry; Mehrotra, Piyush; Gannon, Dennis

    1994-01-01

    Fortran and C++ are the dominant programming languages used in scientific computation. Consequently, extensions to these languages are the most popular for programming massively parallel computers. We discuss two such approaches to parallel Fortran and one approach to C++. The High Performance Fortran Forum has designed HPF with the intent of supporting data parallelism on Fortran 90 applications. HPF works by asking the user to help the compiler distribute and align the data structures with the distributed memory modules in the system. Fortran-S takes a different approach in which the data distribution is managed by the operating system and the user provides annotations to indicate parallel control regions. In the case of C++, we look at pC++ which is based on a concurrent aggregate parallel model.

  16. Parallel-In-Time For Moving Meshes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Falgout, R. D.; Manteuffel, T. A.; Southworth, B.

    2016-02-04

    With steadily growing computational resources available, scientists must develop effective ways to utilize the increased resources. High performance, highly parallel software has become a standard. However, until recent years, parallelism has focused primarily on the spatial domain. When solving a space-time partial differential equation (PDE), this leads to a sequential bottleneck in the temporal dimension, particularly when taking a large number of time steps. The XBraid parallel-in-time library was developed as a practical way to add temporal parallelism to existing sequential codes with only minor modifications. In this work, a rezoning-type moving mesh is applied to a diffusion problem and formulated in a parallel-in-time framework. Tests and scaling studies are run using XBraid and demonstrate excellent results for the simple model problem considered herein.

  17. Compact cantilever couplers for low-loss fiber coupling to silicon photonic integrated circuits.

    PubMed

    Wood, Michael; Sun, Peng; Reano, Ronald M

    2012-01-02

    We demonstrate coupling from tapered optical fibers to 450 nm by 250 nm silicon strip waveguides using compact cantilever couplers. The couplers consist of silicon inverse width tapers embedded within silicon dioxide cantilevers. Finite difference time domain simulations are used to design the length of the silicon inverse width taper to as short as 6.5 μm for a cantilever width of 2 μm. Modeling of various strip waveguide taper profiles shows reduced coupling losses for a quadratic taper profile. Infrared measurements of fabricated devices demonstrate average coupling losses of 0.62 dB per connection for the quasi-TE mode and 0.50 dB per connection for the quasi-TM mode across the optical telecommunications C band. In the wavelength range from 1477 nm to 1580 nm, coupling losses for both polarizations are less than 1 dB per connection. The compact, broadband, and low-loss coupling scheme enables direct access to photonic integrated circuits on an entire chip surface without the need for dicing or cleaving the chip.
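
    The coupling losses above are given in decibels. As a quick worked conversion, the standard relation T = 10^(-L/10) turns a loss L in dB into the fraction of optical power transmitted. The helper name below is hypothetical; the loss figures are the ones quoted in the abstract.

```python
def db_loss_to_transmission(loss_db):
    """Fraction of optical power transmitted for a given loss in dB."""
    return 10 ** (-loss_db / 10)

# Loss figures quoted in the abstract, per fiber-to-chip connection.
t_te = db_loss_to_transmission(0.62)   # quasi-TE mode, about 0.867
t_tm = db_loss_to_transmission(0.50)   # quasi-TM mode, about 0.891
t_1db = db_loss_to_transmission(1.0)   # 1 dB bound, about 0.794

# Losses in dB add, so a path with two connections multiplies the fractions.
t_te_two_connections = t_te * t_te
```

    On this reading, 0.62 dB per connection corresponds to roughly 87% of the power being coupled, and the "less than 1 dB" bound corresponds to more than about 79%.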

  18. A Comprehensive Investigation and Coupler Design for Higher-Order Modes in the BNL Energy Recovery Linear Accelerator

    NASA Astrophysics Data System (ADS)

    Marques, Carlos

    A next generation Energy Recovery Linac (ERL) is under development in the Collider-Accelerator Department at Brookhaven National Laboratory (BNL). This ERL uses a superconducting radio frequency (SRF) cavity to produce an electric field gradient ideal for accelerating charged particles. As with many accelerators, higher-order modes (HOMs) can be induced by a beam of charged particles traversing the linear accelerator cavity. The excitation of these modes can result in problematic single and multi-bunch effects and also produce undesirable heat loads to the cryogenic system. Understanding HOM prevalence and structure inside the accelerator cavity is crucial for devising a procedure for extracting HOM power and promoting excellent beam quality. In this work, a method was created to identify and characterize HOMs using a perturbation technique on a copper (Cu) cavity prototype of the BNL3 linac and a double lambda/4 crab cavity. Both analyses and correlation between simulated and measured results are shown. A coaxial to dual-ridge waveguide HOM coupler was designed, constructed and implemented to extract power from HOMs while simultaneously rendering the fundamental mode evanescent for the BNL3 cavity. A full description of the design is given along with a simulated analysis of its performance. Comparison between previous HOM coupler designs as well as correspondence between simulation and measurement is also given.

  19. Life-cycle costs of high-performance cells

    NASA Technical Reports Server (NTRS)

    Daniel, R.; Burger, D.; Reiter, L.

    1985-01-01

    A life cycle cost analysis of high efficiency cells was presented. Although high efficiency cells produce more power, they also cost more to make and are more susceptible to array hot-spot heating. Three different computer analysis programs were used: SAMICS (solar array manufacturing industry costing standards), PVARRAY (an array failure mode/degradation simulator), and LCP (lifetime cost and performance). The high efficiency cell modules were found to be more economical in this study, but parallel redundancy is recommended.

  20. Design of microwave antenna system on planar Yagi-Uda elements and microstrip coupler

    NASA Astrophysics Data System (ADS)

    Petrovnin, K. V.; Latypov, R. R.

    2017-11-01

    This paper presents the results of calculation, electromagnetic modelling and measurement of a manufactured antenna system based on planar Yagi-Uda elements and a microstrip coupler. The system has sum and difference modes. Its center frequency is 1532 MHz with a 96 MHz bandwidth. The gain is 8 dB in the main lobe direction (in-phase mode) and 5 dB (antiphase mode).

  1. Performance analysis of parallel branch and bound search with the hypercube architecture

    NASA Technical Reports Server (NTRS)

    Mraz, Richard T.

    1987-01-01

    With the availability of commercial parallel computers, researchers are examining new classes of problems which might benefit from parallel computing. This paper presents results of an investigation of the class of search intensive problems. The specific problem discussed is the Least-Cost Branch and Bound search method of deadline job scheduling. The object-oriented design methodology was used to map the problem into a parallel solution. While the initial design was good for a prototype, the best performance resulted from fine-tuning the algorithm for a specific computer. The experiments analyze the computation time, the speed up over a VAX 11/785, and the load balance of the problem when using a loosely coupled multiprocessor system based on the hypercube architecture.

  2. Automatic Generation of Directive-Based Parallel Programs for Shared Memory Parallel Systems

    NASA Technical Reports Server (NTRS)

    Jin, Hao-Qiang; Yan, Jerry; Frumkin, Michael

    2000-01-01

    The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. As great progress was made in hardware and software technologies, performance of parallel programs with compiler directives has demonstrated large improvement. The introduction of OpenMP directives, the industrial standard for shared-memory programming, has minimized the issue of portability. Due to its ease of programming and its good performance, the technique has become very popular. In this study, we have extended CAPTools, a computer-aided parallelization toolkit, to automatically generate directive-based, OpenMP, parallel programs. We outline techniques used in the implementation of the tool and present test results on the NAS parallel benchmarks and ARC3D, a CFD application. This work demonstrates the great potential of using computer-aided tools to quickly port parallel programs and also achieve good performance.

  3. Effects of coupler height mismatch on the structural integrity of railroad tank car stub sills.

    DOT National Transportation Integrated Search

    2001-12-01

    This project evaluated the safety implications of coupler height mismatches on the integrity of railroad tank car stub sills, through a series of static and impact tests. The test car was a loaded tank car instrumented with strain gages at critical l...

  4. Computational performance of a smoothed particle hydrodynamics simulation for shared-memory parallel computing

    NASA Astrophysics Data System (ADS)

    Nishiura, Daisuke; Furuichi, Mikito; Sakaguchi, Hide

    2015-09-01

    The computational performance of a smoothed particle hydrodynamics (SPH) simulation is investigated for three types of current shared-memory parallel computer devices: many integrated core (MIC) processors, graphics processing units (GPUs), and multi-core CPUs. We are especially interested in efficient shared-memory allocation methods for each chipset, because the efficient data access patterns differ between compute unified device architecture (CUDA) programming for GPUs and OpenMP programming for MIC processors and multi-core CPUs. We first introduce several parallel implementation techniques for the SPH code, and then examine these on our target computer architectures to determine the most effective algorithms for each processor unit. In addition, we evaluate the effective computing performance and power efficiency of the SPH simulation on each architecture, as these are critical metrics for overall performance in a multi-device environment. In our benchmark test, the GPU is found to produce the best arithmetic performance as a standalone device unit, and gives the most efficient power consumption. The multi-core CPU obtains the most effective computing performance. The computational speed of the MIC processor on Xeon Phi approached that of two Xeon CPUs. This indicates that using MICs is an attractive choice for existing SPH codes on multi-core CPUs parallelized by OpenMP, as it gains computational acceleration without the need for significant changes to the source code.

  5. Characterizing parallel file-access patterns on a large-scale multiprocessor

    NASA Technical Reports Server (NTRS)

    Purakayastha, A.; Ellis, Carla; Kotz, David; Nieuwejaar, Nils; Best, Michael L.

    1995-01-01

    High-performance parallel file systems are needed to satisfy tremendous I/O requirements of parallel scientific applications. The design of such high-performance parallel file systems depends on a comprehensive understanding of the expected workload, but so far there have been very few usage studies of multiprocessor file systems. This paper is part of the CHARISMA project, which intends to fill this void by measuring real file-system workloads on various production parallel machines. In particular, we present results from the CM-5 at the National Center for Supercomputing Applications. Our results are unique because we collect information about nearly every individual I/O request from the mix of jobs running on the machine. Analysis of the traces leads to various recommendations for parallel file-system design.

  6. High speed ultra-broadband amplitude modulators with ultrahigh extinction >65 dB.

    PubMed

    Liu, S; Cai, H; DeRose, C T; Davids, P; Pomerene, A; Starbuck, A L; Trotter, D C; Camacho, R; Urayama, J; Lentine, A

    2017-05-15

    We experimentally demonstrate ultrahigh extinction ratio (>65 dB) amplitude modulators (AMs) that can be electrically tuned to operate across a broad spectral range of 160 nm from 1480 - 1640 nm and 95 nm from 1280 - 1375 nm. Our on-chip AMs employ one extra coupler compared with conventional Mach-Zehnder interferometers (MZI), thus forming a cascaded MZI (CMZI) structure. Either directional or adiabatic couplers are used to compose the CMZI AMs and experimental comparisons are made between these two different structures. We investigate the performance of CMZI AMs under extreme conditions such as using 95:5 split ratio couplers and unbalanced waveguide losses. Electro-optic phase shifters are also integrated in the CMZI AMs for high-speed operation. Finally, we investigate the output optical phase when the amplitude is modulated, which provides valuable information when both amplitude and phase are to be controlled. Our demonstration not only paves the road to applications such as quantum information processing that require high extinction ratio AMs but also significantly alleviates the tight fabrication tolerance needed for large-scale integrated photonics.

  7. Integrated optical dipole trap for cold neutral atoms with an optical waveguide coupler

    NASA Astrophysics Data System (ADS)

    Lee, J.; Park, D. H.; Mittal, S.; Dagenais, M.; Rolston, S. L.

    2013-04-01

    An integrated optical dipole trap uses two-color (red and blue-detuned) traveling evanescent wave fields for trapping cold neutral atoms. To achieve longitudinal confinement, we propose using an integrated optical waveguide coupler, which provides a potential gradient along the beam propagation direction sufficient to confine atoms. This integrated optical dipole trap can support an atomic ensemble with a large optical depth due to its small mode area. Its quasi-TE0 waveguide mode has an advantage over the HE11 mode of a nanofiber, with little inhomogeneous Zeeman broadening at the trapping region. The longitudinal confinement eliminates the need for a one dimensional optical lattice, reducing collisional blockaded atomic loading, potentially producing larger ensembles. The waveguide trap allows for scalability and integrability with nano-fabrication technology. We analyze the potential performance of such integrated atom traps.

  8. High-performance computational fluid dynamics: a custom-code approach

    NASA Astrophysics Data System (ADS)

    Fannon, James; Loiseau, Jean-Christophe; Valluri, Prashant; Bethune, Iain; Náraigh, Lennon Ó.

    2016-07-01

    We introduce a modified and simplified version of the pre-existing fully parallelized three-dimensional Navier-Stokes flow solver known as TPLS. We demonstrate how the simplified version can be used as a pedagogical tool for the study of computational fluid dynamics (CFD) and parallel computing. TPLS is at its heart a two-phase flow solver, and uses calls to a range of external libraries to accelerate its performance. However, in the present context we narrow the focus of the study to basic hydrodynamics and parallel computing techniques, and the code is therefore simplified and modified to simulate pressure-driven single-phase flow in a channel, using only relatively simple Fortran 90 code with MPI parallelization, but no calls to any other external libraries. The modified code is analysed in order to both validate its accuracy and investigate its scalability up to 1000 CPU cores. Simulations are performed for several benchmark cases in pressure-driven channel flow, including a turbulent simulation, wherein the turbulence is incorporated via the large-eddy simulation technique. The work may be of use to advanced undergraduate and graduate students as an introductory study in CFD, while also providing insight for those interested in more general aspects of high-performance computing.

  9. High Performance Proactive Digital Forensics

    NASA Astrophysics Data System (ADS)

    Alharbi, Soltan; Moa, Belaid; Weber-Jahnke, Jens; Traore, Issa

    2012-10-01

    With the increase in the number of digital crimes and in their sophistication, High Performance Computing (HPC) is becoming a must in Digital Forensics (DF). According to the FBI annual report, the size of data processed during the 2010 fiscal year reached 3,086 TB (compared to 2,334 TB in 2009) and the number of agencies that requested Regional Computer Forensics Laboratory assistance increased from 689 in 2009 to 722 in 2010. Since most investigation tools are both I/O and CPU bound, the next-generation DF tools are required to be distributed and offer HPC capabilities. The need for HPC is even more evident in investigating crimes on clouds or when proactive DF analysis and on-site investigation, requiring semi-real time processing, are performed. Although overcoming the performance challenge is a major goal in DF, as far as we know, there is almost no research on HPC-DF except for a few papers. As such, in this work, we extend our work on the need for a proactive system and present a high performance automated proactive digital forensic system. The most expensive phase of the system, namely proactive analysis and detection, uses a parallel extension of the iterative z algorithm. It also implements new parallel information-based outlier detection algorithms to proactively and forensically handle suspicious activities. To analyse a large number of targets and events and continuously do so (to capture the dynamics of the system), we rely on a multi-resolution approach to explore the digital forensic space. A data set from the Honeynet Forensic Challenge in 2001 is used to evaluate the system from DF and HPC perspectives.

  10. High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL

    PubMed Central

    Stone, John E.; Messmer, Peter; Sisneros, Robert; Schulten, Klaus

    2016-01-01

    Large scale molecular dynamics simulations produce terabytes of data that is impractical to transfer to remote facilities. It is therefore necessary to perform visualization tasks in-situ as the data are generated, or by running interactive remote visualization sessions and batch analyses co-located with direct access to high performance storage systems. A significant challenge for deploying visualization software within clouds, clusters, and supercomputers involves the operating system software required to initialize and manage graphics acceleration hardware. Recently, it has become possible for applications to use the Embedded-system Graphics Library (EGL) to eliminate the requirement for windowing system software on compute nodes, thereby eliminating a significant obstacle to broader use of high performance visualization applications. We outline the potential benefits of this approach in the context of visualization applications used in the cloud, on commodity clusters, and supercomputers. We discuss the implementation of EGL support in VMD, a widely used molecular visualization application, and we outline benefits of the approach for molecular visualization tasks on petascale computers, clouds, and remote visualization servers. We then provide a brief evaluation of the use of EGL in VMD, with tests using developmental graphics drivers on conventional workstations and on Amazon EC2 G2 GPU-accelerated cloud instance types. We expect that the techniques described here will be of broad benefit to many other visualization applications. PMID:27747137

  11. High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL.

    PubMed

    Stone, John E; Messmer, Peter; Sisneros, Robert; Schulten, Klaus

    2016-05-01

    Large scale molecular dynamics simulations produce terabytes of data that is impractical to transfer to remote facilities. It is therefore necessary to perform visualization tasks in-situ as the data are generated, or by running interactive remote visualization sessions and batch analyses co-located with direct access to high performance storage systems. A significant challenge for deploying visualization software within clouds, clusters, and supercomputers involves the operating system software required to initialize and manage graphics acceleration hardware. Recently, it has become possible for applications to use the Embedded-system Graphics Library (EGL) to eliminate the requirement for windowing system software on compute nodes, thereby eliminating a significant obstacle to broader use of high performance visualization applications. We outline the potential benefits of this approach in the context of visualization applications used in the cloud, on commodity clusters, and supercomputers. We discuss the implementation of EGL support in VMD, a widely used molecular visualization application, and we outline benefits of the approach for molecular visualization tasks on petascale computers, clouds, and remote visualization servers. We then provide a brief evaluation of the use of EGL in VMD, with tests using developmental graphics drivers on conventional workstations and on Amazon EC2 G2 GPU-accelerated cloud instance types. We expect that the techniques described here will be of broad benefit to many other visualization applications.

  12. High speed parallel spectral-domain OCT using spectrally encoded line-field illumination

    NASA Astrophysics Data System (ADS)

    Lee, Kye-Sung; Hur, Hwan; Bae, Ji Yong; Kim, I. Jong; Kim, Dong Uk; Nam, Ki-Hwan; Kim, Geon-Hee; Chang, Ki Soo

    2018-01-01

    We report parallel spectral-domain optical coherence tomography (OCT) at 500 000 A-scan/s. This is the highest-speed spectral-domain (SD) OCT system using a single line camera. Spectrally encoded line-field scanning is proposed to increase the imaging speed in SD-OCT effectively, and the tradeoff between speed, depth range, and sensitivity is demonstrated. We show that three imaging modes of 125k, 250k, and 500k A-scan/s can be simply switched according to the sample to be imaged considering the depth range and sensitivity. To demonstrate the biological imaging performance of the high-speed imaging modes of the spectrally encoded line-field OCT system, human skin and a whole leaf were imaged at the speed of 250k and 500k A-scan/s, respectively. In addition, there is no sensitivity dependence in the B-scan direction, which is implicit in line-field parallel OCT using line focusing of a Gaussian beam with a cylindrical lens.

  13. A Queue Simulation Tool for a High Performance Scientific Computing Center

    NASA Technical Reports Server (NTRS)

    Spear, Carrie; McGalliard, James

    2007-01-01

    The NASA Center for Computational Sciences (NCCS) at the Goddard Space Flight Center provides high performance highly parallel processors, mass storage, and supporting infrastructure to a community of computational Earth and space scientists. Long running (days) and highly parallel (hundreds of CPUs) jobs are common in the workload. NCCS management structures batch queues and allocates resources to optimize system use and prioritize workloads. NCCS technical staff use a locally developed discrete event simulation tool to model the impacts of evolving workloads, potential system upgrades, alternative queue structures and resource allocation policies.
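
    The abstract does not describe how the NCCS tool works internally, so the following is only a minimal sketch of the general technique, a discrete event simulation of a batch queue. It assumes a strict first-come-first-served policy on a single pool of CPUs; the function name, the job tuple layout, and the policy itself are illustrative assumptions, not features of the NCCS simulator.

```python
import heapq


def simulate_fcfs(jobs, total_cpus):
    """Simulate a strict first-come-first-served batch queue.

    jobs: list of (submit_time, cpus, runtime) tuples (hypothetical workload).
    Returns the wait time of each job, in the same order as `jobs`.
    """
    order = sorted(range(len(jobs)), key=lambda i: jobs[i][0])
    running = []            # min-heap of (finish_time, cpus) for running jobs
    free = total_cpus       # CPUs currently idle
    clock = 0.0             # simulation time; never moves backwards
    waits = [0.0] * len(jobs)
    for i in order:
        submit, cpus, runtime = jobs[i]
        if cpus > total_cpus:
            raise ValueError("job requests more CPUs than the machine has")
        clock = max(clock, submit)
        # Event: release CPUs of every job that has finished by now.
        while running and running[0][0] <= clock:
            _, released = heapq.heappop(running)
            free += released
        # Not enough CPUs yet: jump the clock to the next completion events.
        while free < cpus:
            finish, released = heapq.heappop(running)
            clock = max(clock, finish)
            free += released
        waits[i] = clock - submit
        free -= cpus
        heapq.heappush(running, (clock + runtime, cpus))
    return waits
```

    Changing `total_cpus` or the job list and comparing the resulting wait times is the kind of what-if question (system upgrades, evolving workloads) that such a simulator is meant to answer; alternative queue structures would require replacing the FCFS rule.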

  14. Incorporating Parallel Computing into the Goddard Earth Observing System Data Assimilation System (GEOS DAS)

    NASA Technical Reports Server (NTRS)

    Larson, Jay W.

    1998-01-01

    Atmospheric data assimilation is a method of combining actual observations with model forecasts to produce a more accurate description of the earth system than the observations or forecast alone can provide. The output of data assimilation, sometimes called the analysis, consists of regular, gridded datasets of observed and unobserved variables. Analysis plays a key role in numerical weather prediction and is becoming increasingly important for climate research. These applications, and the need for timely validation of scientific enhancements to the data assimilation system, pose computational demands that are best met by distributed parallel software. The mission of the NASA Data Assimilation Office (DAO) is to provide datasets for climate research and to support NASA satellite and aircraft missions. The system used to create these datasets is the Goddard Earth Observing System Data Assimilation System (GEOS DAS). The core components of the GEOS DAS are: the GEOS General Circulation Model (GCM), the Physical-space Statistical Analysis System (PSAS), the Observer, the on-line Quality Control (QC) system, the Coupler (which feeds analysis increments back to the GCM), and an I/O package for processing the large amounts of data the system produces (which will be described in another presentation in this session). The discussion will center on the following issues: the computational complexity for the whole GEOS DAS, assessment of the performance of the individual elements of GEOS DAS, and parallelization strategy for some of the components of the system.

  15. The path toward HEP High Performance Computing

    NASA Astrophysics Data System (ADS)

    Apostolakis, John; Brun, René; Carminati, Federico; Gheata, Andrei; Wenzel, Sandro

    2014-06-01

    High Energy Physics code has been known for making poor use of high performance computing architectures. Efforts in optimising HEP code on vector and RISC architectures have yielded limited results, and recent studies have shown that, on modern architectures, it achieves between 10% and 50% of peak performance. Although several successful attempts have been made to port selected codes to GPUs, no major HEP code suite has a "High Performance" implementation. With LHC undergoing a major upgrade and a number of challenging experiments on the drawing board, HEP can no longer neglect the less-than-optimal performance of its code and has to try to make the best use of the hardware. This activity is one of the foci of the SFT group at CERN, which hosts, among others, the Root and Geant4 projects. The activity of the experiments is shared and coordinated via a Concurrency Forum, where the experience in optimising HEP code is presented and discussed. Another activity is the Geant-V project, centred on the development of a high-performance prototype for particle transport. Achieving a good concurrency level on the emerging parallel architectures without a complete redesign of the framework can only be done by parallelizing at event level, or with a much larger effort at track level. Apart from the shareable data structures, this typically implies a multiplication factor in terms of memory consumption compared to the single threaded version, together with sub-optimal handling of event processing tails. Besides this, the low level instruction pipelining of modern processors cannot be used efficiently to speed up the program. We have implemented a framework that allows scheduling vectors of particles to an arbitrary number of computing resources in a fine grain parallel approach. The talk will review the current optimisation activities within the SFT group with a particular emphasis on the development perspectives towards a simulation framework able to profit best from

  16. [Series: Medical Applications of the PHITS Code (2): Acceleration by Parallel Computing].

    PubMed

    Furuta, Takuya; Sato, Tatsuhiko

    2015-01-01

    Time-consuming Monte Carlo dose calculation has become feasible owing to the development of computer technology. However, the recent development is due to the emergence of multi-core high performance computers. Therefore, parallel computing has become key to achieving good performance of software programs. The Monte Carlo simulation code PHITS contains two parallel computing functions: distributed-memory parallelization using the Message Passing Interface (MPI) protocol and shared-memory parallelization using Open Multi-Processing (OpenMP) directives. Users can choose between the two functions according to their needs. This paper explains the two functions along with their advantages and disadvantages. Some test applications are also provided to show their performance on a typical multi-core high performance workstation.

  17. Analytical modeling and analysis of magnetic field and torque for novel axial flux eddy current couplers with PM excitation

    NASA Astrophysics Data System (ADS)

    Li, Zhao; Wang, Dazhi; Zheng, Di; Yu, Linxin

    2017-10-01

    Rotational permanent magnet eddy current couplers are promising devices for torque and speed transmission without any mechanical contact. In this study, flux-concentration disk-type permanent magnet eddy current couplers with double conductor rotor are investigated. Given the drawback of the accurate three-dimensional finite element method, this paper proposes a mixed two-dimensional analytical modeling approach. Based on this approach, the closed-form expressions of magnetic field, eddy current, electromagnetic force and torque for such devices are obtained. Finally, a three-dimensional finite element method is employed to validate the analytical results. Besides, a prototype is manufactured and tested for the torque-speed characteristic.

  18. Parallel Performance Optimizations on Unstructured Mesh-based Simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sarje, Abhinav; Song, Sukhyun; Jacobsen, Douglas

    2015-01-01

    This paper addresses two key parallelization challenges in the unstructured mesh-based ocean modeling code, MPAS-Ocean, which uses a mesh based on Voronoi tessellations: (1) load imbalance across processes, and (2) unstructured data access patterns, that inhibit intra- and inter-node performance. Our work analyzes the load imbalance due to naive partitioning of the mesh, and develops methods to generate mesh partitioning with better load balance and reduced communication. Furthermore, we present methods that minimize both inter- and intranode data movement and maximize data reuse. Our techniques include predictive ordering of data elements for higher cache efficiency, as well as communication reduction approaches. We present detailed performance data when running on thousands of cores using the Cray XC30 supercomputer and show that our optimization strategies can exceed the original performance by over 2×. Additionally, many of these solutions can be broadly applied to a wide variety of unstructured grid-based computations.

  19. Parallel performance optimizations on unstructured mesh-based simulations

    DOE PAGES

    Sarje, Abhinav; Song, Sukhyun; Jacobsen, Douglas; ...

    2015-06-01

    This paper addresses two key parallelization challenges in the unstructured mesh-based ocean modeling code, MPAS-Ocean, which uses a mesh based on Voronoi tessellations: (1) load imbalance across processes, and (2) unstructured data access patterns, that inhibit intra- and inter-node performance. Our work analyzes the load imbalance due to naive partitioning of the mesh, and develops methods to generate mesh partitioning with better load balance and reduced communication. Furthermore, we present methods that minimize both inter- and intranode data movement and maximize data reuse. Our techniques include predictive ordering of data elements for higher cache efficiency, as well as communication reduction approaches. We present detailed performance data when running on thousands of cores using the Cray XC30 supercomputer and show that our optimization strategies can exceed the original performance by over 2×. Additionally, many of these solutions can be broadly applied to a wide variety of unstructured grid-based computations.

  20. Connectionist Models and Parallelism in High Level Vision.

    DTIC Science & Technology

    1985-01-01

    GRANT NUMBER(s) Jerome A. Feldman N00014-82-K-0193 9. PERFORMING ORGANIZATION NAME AND ADDRESS 10. PROGRAM ELEMENT, PROJECT, TASK Computer Science...Connectionist Models 2.1 Background and Overview Computer science is just beginning to look seriously at parallel computation: it may turn out that...the chair. The program includes intermediate level networks that compute more complex joints and ones that compute parallelograms in the image. These

  1. Parallel Implementation of a High Order Implicit Collocation Method for the Heat Equation

    NASA Technical Reports Server (NTRS)

    Kouatchou, Jules; Halem, Milton (Technical Monitor)

    2000-01-01

    We combine a high order compact finite difference approximation and collocation techniques to numerically solve the two dimensional heat equation. The resulting method is implicit and can be parallelized with a strategy that allows parallelization across both time and space. We compare the parallel implementation of the new method with a classical implicit method, namely the Crank-Nicolson method, where the parallelization is done across space only. Numerical experiments are carried out on the SGI Origin 2000.
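
    For readers unfamiliar with the baseline named in this abstract, the sketch below shows one Crank-Nicolson time step. It is deliberately simplified and is not the authors' code: it assumes the one dimensional heat equation u_t = alpha u_xx with zero Dirichlet boundaries (the paper treats the two dimensional case), it is serial, and the function names are illustrative. With r = alpha*dt/dx^2, each step solves the tridiagonal system (I - (r/2) D) u_new = (I + (r/2) D) u_old, where D is the second-difference operator.

```python
import math


def thomas(a, b, c, d):
    """Solve a tridiagonal system (a: sub-, b: main, c: super-diagonal, d: rhs)."""
    n = len(d)
    cp = [0.0] * n
    dp = [0.0] * n
    cp[0] = c[0] / b[0]
    dp[0] = d[0] / b[0]
    for i in range(1, n):                      # forward elimination
        m = b[i] - a[i] * cp[i - 1]
        cp[i] = c[i] / m
        dp[i] = (d[i] - a[i] * dp[i - 1]) / m
    x = [0.0] * n
    x[-1] = dp[-1]
    for i in range(n - 2, -1, -1):             # back substitution
        x[i] = dp[i] - cp[i] * x[i + 1]
    return x


def crank_nicolson_step(u, r):
    """Advance interior values u by one time step; r = alpha * dt / dx**2."""
    n = len(u)
    rhs = []
    for i in range(n):
        left = u[i - 1] if i > 0 else 0.0      # zero Dirichlet boundary
        right = u[i + 1] if i < n - 1 else 0.0
        rhs.append(0.5 * r * left + (1.0 - r) * u[i] + 0.5 * r * right)
    off = [-0.5 * r] * n
    diag = [1.0 + r] * n
    return thomas(off, diag, off, rhs)
```

    Because every step requires a linear solve that couples all grid points, the classical scheme is naturally parallelized across space only, which is the contrast the abstract draws with the new method.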

  2. An efficient implementation of 3D high-resolution imaging for large-scale seismic data with GPU/CPU heterogeneous parallel computing

    NASA Astrophysics Data System (ADS)

    Xu, Jincheng; Liu, Wei; Wang, Jin; Liu, Linong; Zhang, Jianfeng

    2018-02-01

    De-absorption pre-stack time migration (QPSTM) compensates for the absorption and dispersion of seismic waves by introducing an effective Q parameter, thereby making it an effective tool for 3D, high-resolution imaging of seismic data. Although the optimal aperture obtained via stationary-phase migration reduces the computational cost of 3D QPSTM and yields 3D stationary-phase QPSTM, the associated computational efficiency is still the main problem in the processing of 3D, high-resolution images for real large-scale seismic data. In the current paper, we propose a division method for large-scale, 3D seismic data to optimize the performance of stationary-phase QPSTM on clusters of graphics processing units (GPU). Then, we design an imaging point parallel strategy to achieve an optimal parallel computing performance. Afterward, we adopt an asynchronous double buffering scheme for multi-stream to perform the GPU/CPU parallel computing. Moreover, several key optimization strategies of computation and storage based on the compute unified device architecture (CUDA) are adopted to accelerate the 3D stationary-phase QPSTM algorithm. Compared with the initial GPU code, the implementation of the key optimization steps, including thread optimization, shared memory optimization, register optimization and special function units (SFU), greatly improves the efficiency. A numerical example employing real large-scale, 3D seismic data shows that our scheme is nearly 80 times faster than the CPU-QPSTM algorithm. Our GPU/CPU heterogeneous parallel computing framework significantly reduces the computational cost and facilitates 3D high-resolution imaging for large-scale seismic data.

  3. Low-loss multimode interference couplers for terahertz waves

    NASA Astrophysics Data System (ADS)

    Themistos, Christos; Kalli, Kyriacos; Komodromos, Michael; Markides, Christos; Quadir, Anita; Rahman, B. M. Azizur; Grattan, Kenneth T. V.

    2012-04-01

    The terahertz (THz) frequency region of the electromagnetic spectrum is located between the traditional microwave spectrum and the optical frequencies, and offers significant scientific and technological potential in many fields, such as sensing, imaging and spectroscopy. Waveguiding in this intermediate spectral region is a major challenge. Amongst the various THz waveguides suggested, metal-clad plasmonic waveguides, and specifically hollow core structures coated with insulating material, are the most promising low-loss waveguides used in both active and passive devices. Optical power splitters are important components in the design of optoelectronic systems and optical communication networks such as Mach-Zehnder interferometric switches, polarization splitters and polarization scramblers. Several designs for the implementation of 3 dB power splitters have been proposed in the past, such as the directional coupler-based approach, the Y-junction-based devices and the MMI-based approach. In the present paper a novel MMI-based 3 dB THz wave splitter is implemented using gold/polystyrene (PS) coated hollow glass rectangular waveguides. The H-field FEM based full-vector formulation is used here to calculate the complex propagation characteristics of the waveguide structure, and the finite element beam propagation method (FE-BPM) and finite difference time domain (FDTD) approach are used to demonstrate the performance of the proposed 3 dB splitter.

  4. Suppressing the crosstalk between racetrack resonators by grating assisted couplers for WDM sensing

    NASA Astrophysics Data System (ADS)

    Zhang, Xuezhi; Jiang, Junfeng; Liu, Kun; Yu, Zhe; Feng, Ming; Chen, Wenjie; Liu, Tiegen

    2017-12-01

    We propose a sensor based on uniform racetrack resonators for bio-chemical WDM sensing. The sensing channels are assigned by grating assisted contra-directional couplers. Each resonator occupies only one sensing channel. The crosstalk between sensing channels can be suppressed by aligning the center coupling wavelength of one resonator with the weak coupling wavelength of the others. Based on the simulation results obtained from the transfer matrix method, the sensing channel gap can be reduced down to 2 FSRs (~1.5 nm) of the resonator. The total crosstalk can be as low as 2.5 × 10⁻² dB in a sensor with 23 channels covering the whole C band. This high-throughput sensor will be very important for analyzing a wide range of analytes, such as organic compounds or biological materials.

  5. Performance Analysis of Multilevel Parallel Applications on Shared Memory Architectures

    NASA Technical Reports Server (NTRS)

    Jost, Gabriele; Jin, Haoqiang; Labarta, Jesus; Gimenez, Judit; Caubet, Jordi; Biegel, Bryan A. (Technical Monitor)

    2002-01-01

    In this paper we describe how to apply powerful performance analysis techniques to understand the behavior of multilevel parallel applications. We use the Paraver/OMPItrace performance analysis system for our study. This system consists of two major components: The OMPItrace dynamic instrumentation mechanism, which allows the tracing of processes and threads and the Paraver graphical user interface for inspection and analyses of the generated traces. We describe how to use the system to conduct a detailed comparative study of a benchmark code implemented in five different programming paradigms applicable for shared memory

  6. Parallel File System I/O Performance Testing On LANL Clusters

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wiens, Isaac Christian; Green, Jennifer Kathleen

    2016-08-18

    These are slides from a presentation on parallel file system I/O performance testing on LANL clusters. I/O is a known bottleneck for HPC applications. Performance optimization of I/O is often required. This summer project entailed integrating IOR under Pavilion and automating the results analysis. The slides cover the following topics: scope of the work, tools utilized, IOR-Pavilion test workflow, build script, IOR parameters, how parameters are passed to IOR, *run_ior: functionality, Python IOR-Output Parser, Splunk data format, Splunk dashboard and features, and future work.

  7. Parallel Application Performance on Two Generations of Intel Xeon HPC Platforms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chang, Christopher H.; Long, Hai; Sides, Scott

    2015-10-15

    Two next-generation node configurations hosting the Haswell microarchitecture were tested with a suite of microbenchmarks and application examples, and compared with a current Ivy Bridge production node on NREL's Peregrine high-performance computing cluster. A primary conclusion from this study is that the additional cores are of little value to individual task performance: limitations to application parallelism, or resource contention among concurrently running but independent tasks, limit effective utilization of these added cores. Hyperthreading generally impacts throughput negatively, but can improve performance in the absence of detailed attention to runtime workflow configuration. The observations offer some guidance to procurement of future HPC systems at NREL. First, raw core count must be balanced with available resources, particularly memory bandwidth. Balance-of-system will determine value more than processor capability alone. Second, hyperthreading continues to be largely irrelevant to the workloads that are commonly seen, and were tested here, at NREL. Finally, perhaps the most impactful enhancement to productivity might occur through enabling multiple concurrent jobs per node. Given the right type and size of workload, more may be achieved by doing many slow things at once than fast things in order.

  8. RISC Processors and High Performance Computing

    NASA Technical Reports Server (NTRS)

    Bailey, David H.; Saini, Subhash; Craw, James M. (Technical Monitor)

    1995-01-01

    This tutorial will discuss the top five RISC microprocessors and the parallel systems in which they are used. It will provide a unique cross-machine comparison not available elsewhere. The effective performance of these processors will be compared by citing standard benchmarks in the context of real applications. The latest NAS Parallel Benchmarks, both absolute performance and performance per dollar, will be listed. The next generation of the NPB will be described. The tutorial will conclude with a discussion of future directions in the field. Technology Transfer Considerations: All of these computer systems are commercially available internationally. Information about these processors is available in the public domain, mostly from the vendors themselves. The NAS Parallel Benchmarks and their results have been previously approved numerous times for public release, beginning back in 1991.

  9. Optoelectronic associative recall using motionless-head parallel readout optical disk

    NASA Astrophysics Data System (ADS)

    Marchand, P. J.; Krishnamoorthy, A. V.; Ambs, P.; Esener, S. C.

    1990-12-01

    High data rates, low retrieval times, and simple implementation are presently shown to be obtainable by means of a motionless-head 2D parallel-readout system for optical disks. Since the optical disk obviates mechanical head motions for access, focusing, and tracking, addressing is performed exclusively through the disk's rotation. Attention is given to a high-performance associative memory system configuration which employs a parallel readout disk.

  10. Large-scale three-dimensional phase-field simulations for phase coarsening at ultrahigh volume fraction on high-performance architectures

    NASA Astrophysics Data System (ADS)

    Yan, Hui; Wang, K. G.; Jones, Jim E.

    2016-06-01

    A parallel algorithm for large-scale three-dimensional phase-field simulations of phase coarsening is developed and implemented on high-performance architectures. From the large-scale simulations, a new kinetics in phase coarsening in the region of ultrahigh volume fraction is found. The parallel implementation is capable of harnessing the greater computer power available from high-performance architectures. The parallelized code enables an increase in three-dimensional simulation system size up to a 512³ grid cube. Through the parallelized code, practical runtime can be achieved for three-dimensional large-scale simulations, and the statistical significance of the results from these high resolution parallel simulations is greatly improved over that obtainable from serial simulations. A detailed performance analysis on speed-up and scalability is presented, showing good scalability which improves with increasing problem size. In addition, a model for prediction of runtime is developed, which shows good agreement with actual runtime from numerical tests.

  11. Modelling parallel programs and multiprocessor architectures with AXE

    NASA Technical Reports Server (NTRS)

    Yan, Jerry C.; Fineman, Charles E.

    1991-01-01

    AXE, An Experimental Environment for Parallel Systems, was designed to model and simulate parallel systems at the process level. It provides an integrated environment for specifying computation models, multiprocessor architectures, data collection, and performance visualization. AXE is being used at NASA-Ames for developing resource management strategies, parallel problem formulation, multiprocessor architectures, and operating system issues related to the High Performance Computing and Communications Program. AXE's simple, structured user-interface enables the user to model parallel programs and machines precisely and efficiently. Its quick turn-around time keeps the user interested and productive. AXE models multicomputers. The user may easily modify various architectural parameters including the number of sites, connection topologies, and overhead for operating system activities. Parallel computations in AXE are represented as collections of autonomous computing objects known as players. Their use and behavior are described. Performance data of the multiprocessor model can be observed on a color screen. These include CPU and message routing bottlenecks, and the dynamic status of the software.

  12. A facetless regrowth-free single mode laser based on MMI couplers

    NASA Astrophysics Data System (ADS)

    Caro, Ludovic; Kelly, Niall P.; Dernaika, Mohamad; Shayesteh, Maryam; Morrissey, Padraic E.; Alexander, Justin K.; Peters, Frank H.

    2017-09-01

    This paper presents a facetless, tunable laser operating near 1575 nm, as well as a theoretical model predicting spectral features of the laser. The lasers were fabricated without regrowth or advanced lithography techniques, and are based on MMI couplers and etched facets. Coarse vernier tuning was achieved over a range of 25 nm, while fine, thermal tuning was also demonstrated over a range of 1.5 nm. SMSR values of 25 dB and higher were observed, with a measured laser linewidth of 600 kHz.

  13. Distributed Parallel Processing and Dynamic Load Balancing Techniques for Multidisciplinary High Speed Aircraft Design

    NASA Technical Reports Server (NTRS)

    Krasteva, Denitza T.

    1998-01-01

    Multidisciplinary design optimization (MDO) for large-scale engineering problems poses many challenges (e.g., the design of an efficient concurrent paradigm for global optimization based on disciplinary analyses, expensive computations over vast data sets, etc.). This work focuses on the application of distributed schemes for massively parallel architectures to MDO problems, as a tool for reducing computation time and solving larger problems. The specific problem considered here is configuration optimization of a high speed civil transport (HSCT), and the efficient parallelization of the embedded paradigm for reasonable design space identification. Two distributed dynamic load balancing techniques (random polling and global round robin with message combining) and two necessary termination detection schemes (global task count and token passing) were implemented and evaluated in terms of effectiveness and scalability to large problem sizes and a thousand processors. The effect of certain parameters on execution time was also inspected. Empirical results demonstrated stable performance and effectiveness for all schemes, and the parametric study showed that the selected algorithmic parameters have a negligible effect on performance.

  14. A NEW CONCEPT FOR HIGH POWER RF COUPLING BETWEEN WAVEGUIDES AND RESONANT RF CAVITIES

    DOE PAGES

    Xu, Chen; Ben-Zvi, Ilan; Wang, Haipeng; ...

    2017-01-01

    Microwave engineering of high average-power (hundreds of kilowatts) devices often involves a transition from a waveguide to a device, typically a resonant cavity. This is a basic operation, which finds use in various application areas of significance to science and industry. At relatively low frequencies, L-band and below, it is convenient, sometimes essential, to couple the power between the waveguide and the cavity through a coaxial antenna, forming a power coupler. Power flow to the cavity in the fundamental mode leads to a Fundamental Power Coupler (FPC). High-order mode power generated in the cavity by a particle beam leads to a high-order mode power damper. Coupling a cryogenic device, such as a superconducting cavity, to a room temperature power source (or damp) leads to additional constraints and challenges. We propose a new approach to this problem, wherein the coax line element is operated in a TE11 mode rather than the conventional TEM mode. We will show that this method leads to a significant increase in the power handling capability of the coupler as well as a few other advantages. As a result, we describe the mode converter from the waveguide to the TE11 coax line, outline the characteristics and performance limits of the coupler and provide a detailed worked out example in the challenging area of coupling to a superconducting accelerator cavity.

  15. A NEW CONCEPT FOR HIGH POWER RF COUPLING BETWEEN WAVEGUIDES AND RESONANT RF CAVITIES

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xu, Chen; Ben-Zvi, Ilan; Wang, Haipeng

    Microwave engineering of high average-power (hundreds of kilowatts) devices often involves a transition from a waveguide to a device, typically a resonant cavity. This is a basic operation, which finds use in various application areas of significance to science and industry. At relatively low frequencies, L-band and below, it is convenient, sometimes essential, to couple the power between the waveguide and the cavity through a coaxial antenna, forming a power coupler. Power flow to the cavity in the fundamental mode leads to a Fundamental Power Coupler (FPC). High-order mode power generated in the cavity by a particle beam leads to a high-order mode power damper. Coupling a cryogenic device, such as a superconducting cavity, to a room temperature power source (or damp) leads to additional constraints and challenges. We propose a new approach to this problem, wherein the coax line element is operated in a TE11 mode rather than the conventional TEM mode. We will show that this method leads to a significant increase in the power handling capability of the coupler as well as a few other advantages. As a result, we describe the mode converter from the waveguide to the TE11 coax line, outline the characteristics and performance limits of the coupler and provide a detailed worked out example in the challenging area of coupling to a superconducting accelerator cavity.

  16. SISYPHUS: A high performance seismic inversion factory

    NASA Astrophysics Data System (ADS)

    Gokhberg, Alexey; Simutė, Saulė; Boehm, Christian; Fichtner, Andreas

    2016-04-01

    In recent years, massively parallel high performance computers have become the standard instruments for solving forward and inverse problems in seismology. The software packages dedicated to forward and inverse waveform modelling and specially designed for such computers (SPECFEM3D, SES3D) have become mature and widely available. These packages achieve significant computational performance and provide researchers with an opportunity to solve problems of bigger size at higher resolution within a shorter time. However, a typical seismic inversion process contains various activities that are beyond the common solver functionality. They include management of information on seismic events and stations, 3D models, observed and synthetic seismograms, pre-processing of the observed signals, computation of misfits and adjoint sources, minimization of misfits, and process workflow management. These activities are time consuming, seldom sufficiently automated, and therefore represent a bottleneck that can substantially offset performance benefits provided by even the most powerful modern supercomputers. Furthermore, a typical system architecture of modern supercomputing platforms is oriented towards the maximum computational performance and provides limited standard facilities for automation of the supporting activities. We present a prototype solution that automates all aspects of the seismic inversion process and is tuned for the modern massively parallel high performance computing systems. We address several major aspects of the solution architecture, which include (1) design of an inversion state database for tracing all relevant aspects of the entire solution process, (2) design of an extensible workflow management framework, (3) integration with wave propagation solvers, (4) integration with optimization packages, (5) computation of misfits and adjoint sources, and (6) process monitoring. The inversion state database represents a hierarchical structure with

  17. Kalman Filter Tracking on Parallel Architectures

    NASA Astrophysics Data System (ADS)

    Cerati, Giuseppe; Elmer, Peter; Krutelyov, Slava; Lantz, Steven; Lefebvre, Matthieu; McDermott, Kevin; Riley, Daniel; Tadel, Matevž; Wittich, Peter; Würthwein, Frank; Yagil, Avi

    2016-11-01

    Power density constraints are limiting the performance improvements of modern CPUs. To address this we have seen the introduction of lower-power, multi-core processors such as GPGPU, ARM and Intel MIC. In order to achieve the theoretical performance gains of these processors, it will be necessary to parallelize algorithms to exploit larger numbers of lightweight cores and specialized functions like large vector units. Track finding and fitting is one of the most computationally challenging problems for event reconstruction in particle physics. At the High-Luminosity Large Hadron Collider (HL-LHC), for example, this will be by far the dominant problem. The need for greater parallelism has driven investigations of very different track finding techniques such as Cellular Automata or Hough Transforms. The most common track finding techniques in use today, however, are those based on a Kalman filter approach. Significant experience has been accumulated with these techniques on real tracking detector systems, both in the trigger and offline. They are known to provide high physics performance, are robust, and are in use today at the LHC. Given the utility of the Kalman filter in track finding, we have begun to port these algorithms to parallel architectures, namely Intel Xeon and Xeon Phi. We report here on our progress towards an end-to-end track reconstruction algorithm fully exploiting vectorization and parallelization techniques in a simplified experimental environment.
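
    As a rough illustration of why Kalman-filter tracking maps onto wide vector units, the sketch below applies one measurement update to many independent tracks in a single batched call. It is not the authors' implementation: the function name `kalman_update_batch`, the two-component state (position, velocity), and the toy inputs are all assumptions made for this example.

```python
import numpy as np

def kalman_update_batch(x, P, z, H, R):
    """Apply one Kalman measurement update to n independent tracks at once.

    x: (n, d) states, P: (n, d, d) covariances, z: (n, m) measurements,
    H: (m, d) measurement matrix, R: (m, m) measurement noise covariance.
    """
    y = z - x @ H.T                        # innovation, shape (n, m)
    S = H @ P @ H.T + R                    # innovation covariance, (n, m, m)
    K = P @ H.T @ np.linalg.inv(S)         # Kalman gain, (n, d, m)
    x_new = x + (K @ y[..., None])[..., 0]
    P_new = (np.eye(x.shape[1]) - K @ H) @ P
    return x_new, P_new

# Hypothetical toy input: 3 tracks, state = (position, velocity),
# and only the position is measured.
n = 3
x0 = np.zeros((n, 2))
P0 = np.tile(np.eye(2), (n, 1, 1))
H = np.array([[1.0, 0.0]])
R = np.array([[1.0]])
z = np.array([[2.0], [4.0], [6.0]])
x1, P1 = kalman_update_batch(x0, P0, z, H, R)
```

    Every track goes through the same small matrix operations, so the leading array axis is the natural place to vectorize; with unit prior variance and unit measurement noise each position estimate moves halfway to its measurement.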

  18. The Galley Parallel File System

    NASA Technical Reports Server (NTRS)

    Nieuwejaar, Nils; Kotz, David

    1996-01-01

    Most current multiprocessor file systems are designed to use multiple disks in parallel, using the high aggregate bandwidth to meet the growing I/O requirements of parallel scientific applications. Many multiprocessor file systems provide applications with a conventional Unix-like interface, allowing the application to access multiple disks transparently. This interface conceals the parallelism within the file system, increasing the ease of programmability, but making it difficult or impossible for sophisticated programmers and libraries to use knowledge about their I/O needs to exploit that parallelism. In addition to providing an insufficient interface, most current multiprocessor file systems are optimized for a different workload than they are being asked to support. We introduce Galley, a new parallel file system that is intended to efficiently support realistic scientific multiprocessor workloads. We discuss Galley's file structure and application interface, as well as the performance advantages offered by that interface.

  19. Parallel Computation of Ocean-Atmosphere-Wave Coupled Storm Surge Model

    NASA Astrophysics Data System (ADS)

    Kim, K.; Yamashita, T.

    2003-12-01

    been made the parallel codes by SPMD methods. The wave-current interface model was developed by defining the wave breaking stresses. We also developed a coupling program to collect and distribute the exchanged data within the parallel system. All models and the coupler execute at the same time, each performing its own computation and passing data to the others. MPMD programming was used to couple the models: the coupler and each model form separate process groups that compute within their own group and exchange messages through the global group. Data are exchanged every 60 seconds of model time, which is the least common multiple of the time steps of the atmosphere, wave, and ocean models. The model was applied to a storm surge simulation in the Yatsushiro Sea, where a numerical model that did not include the wave breaking stress could not reproduce the observed maximum surge height. The simulation that includes the wave breaking stress effects is confirmed to reproduce the observed maximum height of 450 cm at Matsuai.
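
    The 60-second exchange interval is described as a least common multiple across the three models. The short sketch below shows that calculation under assumed values: the abstract does not state the individual time steps, so the 12 s, 20 s, and 30 s figures and the helper name `coupling_interval` are hypothetical.

```python
from functools import reduce
from math import gcd

def coupling_interval(steps_s):
    """Smallest interval (s) that is a whole number of steps for every model."""
    return reduce(lambda a, b: a * b // gcd(a, b), steps_s)

# Hypothetical time steps in seconds; the abstract gives only the 60 s result.
steps = {"atmosphere": 12, "wave": 20, "ocean": 30}
interval = coupling_interval(steps.values())

# Number of steps each model takes between two data exchanges.
steps_per_exchange = {name: interval // dt for name, dt in steps.items()}
```

    Choosing the least common multiple means every model reaches the exchange point exactly at the end of one of its own steps, so no model has to interpolate in time before handing data to the coupler.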

  20. File-access characteristics of parallel scientific workloads

    NASA Technical Reports Server (NTRS)

    Nieuwejaar, Nils; Kotz, David; Purakayastha, Apratim; Best, Michael; Ellis, Carla Schlatter

    1995-01-01

    Phenomenal improvements in the computational performance of multiprocessors have not been matched by comparable gains in I/O system performance. This imbalance has resulted in I/O becoming a significant bottleneck for many scientific applications. One key to overcoming this bottleneck is improving the performance of parallel file systems. The design of a high-performance parallel file system requires a comprehensive understanding of the expected workload. Unfortunately, until recently, no general workload studies of parallel file systems had been conducted. The goal of the CHARISMA project was to remedy this problem by characterizing the behavior of several production workloads, on different machines, at the level of individual reads and writes. The first set of results from the CHARISMA project describes the workloads observed on an Intel iPSC/860 and a Thinking Machines CM-5. This paper compares and contrasts these two workloads to understand their essential similarities and differences, isolating common trends and platform-dependent variances. Using this comparison, we are able to gain more insight into the general principles that should guide parallel file-system design.

  1. An efficient parallel algorithm for matrix-vector multiplication

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hendrickson, B.; Leland, R.; Plimpton, S.

    The multiplication of a vector by a matrix is the kernel computation of many algorithms in scientific computation. A fast parallel algorithm for this calculation is therefore necessary if one is to make full use of the new generation of parallel supercomputers. This paper presents a high performance, parallel matrix-vector multiplication algorithm that is particularly well suited to hypercube multiprocessors. For an n x n matrix on p processors, the communication cost of this algorithm is O(n/√p + log(p)), independent of the matrix sparsity pattern. The performance of the algorithm is demonstrated by employing it as the kernel in the well-known NAS conjugate gradient benchmark, where a run time of 6.09 seconds was observed. This is the best published performance on this benchmark achieved to date using a massively parallel supercomputer.
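
    To show where an n/√p communication term can come from, the sketch below simulates a two-dimensional block decomposition on a √p-by-√p logical processor grid. It is a serial illustration only, not the hypercube algorithm of the paper; the function name `block_matvec` and the assumption that √p divides n are choices made for this example.

```python
import numpy as np

def block_matvec(A, x, q):
    """Simulate y = A @ x on a q-by-q logical processor grid (p = q*q)."""
    n = A.shape[0]
    b = n // q                             # block size; assumes q divides n
    y = np.zeros(n)
    for i in range(q):                     # processor row
        for j in range(q):                 # processor column
            # Processor (i, j) owns one b-by-b block and needs only the
            # b = n/sqrt(p) entries of x that match its block column.
            A_ij = A[i * b:(i + 1) * b, j * b:(j + 1) * b]
            x_j = x[j * b:(j + 1) * b]
            # Accumulating over j stands in for the reduction along a
            # processor row that a real implementation would communicate.
            y[i * b:(i + 1) * b] += A_ij @ x_j
    return y

rng = np.random.default_rng(0)
A = rng.standard_normal((8, 8))
x = rng.standard_normal(8)
y = block_matvec(A, x, q=2)                # p = 4 simulated processors
```

    Each simulated processor touches a vector segment of length n/√p regardless of how many nonzeros its block holds, which is consistent with a communication cost that does not depend on the sparsity pattern.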

  2. A Domain Decomposition Parallelization of the Fast Marching Method

    NASA Technical Reports Server (NTRS)

    Herrmann, M.

    2003-01-01

    In this paper, the first domain decomposition parallelization of the Fast Marching Method for level sets is presented. Parallel speedup is demonstrated in both the optimal and non-optimal domain decomposition cases. The parallel performance of the proposed method depends strongly on separately load balancing the number of nodes on each side of the interface. A load imbalance of nodes on either side of the domain leads to an increase in communication and rollback operations. Furthermore, the amount of inter-domain communication can be reduced by aligning the inter-domain boundaries with the interface normal vectors. In the case of optimal load balancing and aligned inter-domain boundaries, the proposed parallel FMM algorithm is highly efficient, reaching efficiency factors of up to 0.98. Future work will focus on the extension of the proposed parallel algorithm to higher order accuracy. Also, to further enhance parallel performance, the coupling of the domain decomposition parallelization to the G_0-based parallelization will be investigated.

  3. Data parallel sorting for particle simulation

    NASA Technical Reports Server (NTRS)

    Dagum, Leonardo

    1992-01-01

    Sorting on a parallel architecture is a communications intensive event which can incur a high penalty in applications where it is required. In the case of particle simulation, only integer sorting is necessary, and sequential implementations easily attain the minimum performance bound of O(N) for N particles. Parallel implementations, however, have to cope with the parallel sorting problem which, in addition to incurring a heavy communications cost, can make the minimum performance bound difficult to attain. This paper demonstrates how the sorting problem in a particle simulation can be reduced to a merging problem, and describes an efficient data parallel algorithm to solve this merging problem in a particle simulation. The new algorithm is shown to be optimal under conditions usual for particle simulation, and its fieldwise implementation on the Connection Machine is analyzed in detail. The new algorithm is about four times faster than a fieldwise implementation of radix sort on the Connection Machine.
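
    The O(N) sequential bound mentioned above is what a counting sort on integer cell indices achieves. The sketch below shows that sequential baseline only; it is not the data parallel merging algorithm of the paper, and the function name `sort_particles_by_cell` and the sample cell indices are assumptions for illustration.

```python
def sort_particles_by_cell(cells, num_cells):
    """Return particle indices ordered by cell index in O(N + num_cells) time."""
    counts = [0] * (num_cells + 1)
    for c in cells:
        counts[c + 1] += 1                 # histogram, shifted by one slot
    for k in range(num_cells):
        counts[k + 1] += counts[k]         # counts[k] = first output slot of cell k
    order = [0] * len(cells)
    for i, c in enumerate(cells):
        order[counts[c]] = i               # place particle i in its cell's next slot
        counts[c] += 1
    return order

cells = [2, 0, 1, 2, 0]                    # hypothetical cell index of each particle
order = sort_particles_by_cell(cells, num_cells=3)
sorted_cells = [cells[i] for i in order]
```

    The sort is stable, so particles that share a cell keep their original relative order.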

  4. Parallel Algorithms for the Exascale Era

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Robey, Robert W.

    New parallel algorithms are needed to reach the Exascale level of parallelism with millions of cores. We look at some of the research developed by students in projects at LANL. The research blends ideas from the early days of computing while weaving in the fresh approach brought by students new to the field of high performance computing. We look at reproducibility of global sums and why it is important to parallel computing. Next we look at how the concept of hashing has led to the development of more scalable algorithms suitable for next-generation parallel computers. Nearly all of this work has been done by undergraduates and published in leading scientific journals.
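
    Reproducibility of global sums matters because floating-point addition is not associative: a parallel reduction that combines partial results in a different order can return a different answer. The sketch below demonstrates the effect with three hand-picked values and contrasts it with Python's exactly rounded `math.fsum`. It illustrates the problem only and is not the technique developed in the LANL projects; the helper `left_to_right_sum` is written out explicitly so that the accumulation order is unambiguous.

```python
import math

def left_to_right_sum(xs):
    """Plain floating-point accumulation in the given order."""
    total = 0.0
    for v in xs:
        total += v
    return total

values = [1e16, 1.0, -1e16]
reordered = [1e16, -1e16, 1.0]             # same numbers, different order

naive_a = left_to_right_sum(values)        # the 1.0 is lost when added to 1e16
naive_b = left_to_right_sum(reordered)     # the large terms cancel first
exact_a = math.fsum(values)                # exactly rounded, order independent
exact_b = math.fsum(reordered)
```

    Here the two naive sums disagree (0.0 versus 1.0) while both `math.fsum` results are 1.0, which is the kind of order independence a reproducible global sum has to provide across processor counts.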

  5. A highly parallel multigrid-like method for the solution of the Euler equations

    NASA Technical Reports Server (NTRS)

    Tuminaro, Ray S.

    1989-01-01

    We consider a highly parallel multigrid-like method for the solution of the two dimensional steady Euler equations. The new method, introduced as filtering multigrid, is similar to a standard multigrid scheme in that convergence on the finest grid is accelerated by iterations on coarser grids. In the filtering method, however, additional fine grid subproblems are processed concurrently with coarse grid computations to further accelerate convergence. These additional problems are obtained by splitting the residual into a smooth and an oscillatory component. The smooth component is then used to form a coarse grid problem (similar to standard multigrid) while the oscillatory component is used for a fine grid subproblem. The primary advantage in the filtering approach is that fewer iterations are required and that most of the additional work per iteration can be performed in parallel with the standard coarse grid computations. We generalize the filtering algorithm to a version suitable for nonlinear problems. We emphasize that this generalization is conceptually straight-forward and relatively easy to implement. In particular, no explicit linearization (e.g., formation of Jacobians) needs to be performed (similar to the FAS multigrid approach). We illustrate the nonlinear version by applying it to the Euler equations, and presenting numerical results. Finally, a performance evaluation is made based on execution time models and convergence information obtained from numerical experiments.

  6. Performing an allreduce operation on a plurality of compute nodes of a parallel computer

    DOEpatents

    Faraj, Ahmad [Rochester, MN]

    2012-04-17

    Methods, apparatus, and products are disclosed for performing an allreduce operation on a plurality of compute nodes of a parallel computer. Each compute node includes at least two processing cores. Each processing core has contribution data for the allreduce operation. Performing an allreduce operation on a plurality of compute nodes of a parallel computer includes: establishing one or more logical rings among the compute nodes, each logical ring including at least one processing core from each compute node; performing, for each logical ring, a global allreduce operation using the contribution data for the processing cores included in that logical ring, yielding a global allreduce result for each processing core included in that logical ring; and performing, for each compute node, a local allreduce operation using the global allreduce results for each processing core on that compute node.
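
    The two-phase structure described in this abstract can be sketched as a small serial simulation. This is a minimal illustration under assumptions: summation is used as the reduction operator, every node is taken to have the same number of cores, ring r is taken to link core r of each node, and the message passing around each ring is not modelled. The function name `hierarchical_allreduce` is hypothetical.

```python
def hierarchical_allreduce(contrib):
    """contrib[node][core] -> per-core results after the two-phase allreduce."""
    num_nodes = len(contrib)
    cores = len(contrib[0])                # assumes the same core count per node
    # Phase 1: logical ring r links core r of every node. Each ring reduces
    # its members' contributions and every member receives the ring result.
    ring_result = [sum(contrib[n][r] for n in range(num_nodes))
                   for r in range(cores)]
    # Phase 2: on each node, a local allreduce combines the ring results
    # held by that node's cores.
    node_result = sum(ring_result)
    return [[node_result] * cores for _ in range(num_nodes)]

contrib = [[1, 2], [3, 4], [5, 6]]         # hypothetical: 3 nodes, 2 cores each
result = hierarchical_allreduce(contrib)
```

    After both phases every core holds the sum of all contributions (21 in this toy input), with the inter-node traffic split across as many rings as there are cores per node.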

  7. Performance and Scalability of the NAS Parallel Benchmarks in Java

    NASA Technical Reports Server (NTRS)

    Frumkin, Michael A.; Schultz, Matthew; Jin, Haoqiang; Yan, Jerry; Biegel, Bryan A. (Technical Monitor)

    2002-01-01

    Several features make Java an attractive choice for scientific applications. In order to gauge the applicability of Java to Computational Fluid Dynamics (CFD), we have implemented the NAS (NASA Advanced Supercomputing) Parallel Benchmarks in Java. The performance and scalability of the benchmarks point out the areas where improvement in Java compiler technology and in Java thread implementation would position Java closer to Fortran in the competition for scientific applications.

  8. High figure of merit ultra-compact 3-channel parallel-connected photonic crystal mini-hexagonal-H1 defect microcavity sensor array

    NASA Astrophysics Data System (ADS)

    Wang, Chunhong; Sun, Fujun; Fu, Zhongyuan; Ding, Zhaoxiang; Wang, Chao; Zhou, Jian; Wang, Jiawen; Tian, Huiping

    2017-08-01

    In this paper, a photonic crystal (PhC) butt-coupled mini-hexagonal-H1 defect (MHHD) microcavity sensor is proposed. The MHHD microcavity is designed by introducing six mini-holes into the initial H1 defect region. Further, based on a well-designed 1×3 PhC beam splitter and three optimal MHHD microcavity sensors with different lattice constants (a), a 3-channel parallel-connected PhC sensor array on monolithic silicon on insulator (SOI) is proposed. Finite-difference time-domain (FDTD) simulations are performed to demonstrate the high performance of our structures. As the results show, the quality factor (Q) of our optimal MHHD microcavity exceeds 7×10^4, while the sensitivity (S) reaches up to 233 nm/RIU (RIU = refractive index unit). Thus, a figure of merit (FOM) >10^4 is obtained for the sensor, which is two orders of magnitude higher than that of previous butt-coupled sensors [1-4]. As for the 3-channel parallel-connected PhC MHHD microcavity sensor array, the FOMs of the three independent MHHD microcavity sensors are 8071, 8250 and 8250, respectively. In addition, the total footprint of the proposed 3-channel parallel-connected PhC sensor array is an ultra-compact 12.5 μm × 31 μm (width × length). Therefore, the proposed high FOM sensor array is an ideal platform for realizing ultra-compact highly parallel refractive index (RI) sensing.

  9. Parallel simulation today

    NASA Technical Reports Server (NTRS)

    Nicol, David; Fujimoto, Richard

    1992-01-01

    This paper surveys topics that presently define the state of the art in parallel simulation. Included in the tutorial are discussions on new protocols, mathematical performance analysis, time parallelism, hardware support for parallel simulation, load balancing algorithms, and dynamic memory management for optimistic synchronization.

  10. Novel mono-static arrangement of the ASDEX Upgrade high field side reflectometers compatible with electron cyclotron resonance heating stray radiation.

    PubMed

    Silva, A; Varela, P; Meneses, L; Manso, M

    2012-10-01

    The ASDEX Upgrade frequency modulated continuous wave broadband reflectometer system uses a mono-static antenna configuration with in-vessel hog-horns and 3 dB directional couplers. The operation of the new electron cyclotron resonance heating (ECRH) launcher and the start of collective Thomson scattering experiments caused several events where the fragile dummy loads inside the high field side directional couplers were damaged, due to excessive power resulting from the ECRH stray fields. In this paper, we present a non-conventional application of the existing three-port directional coupler that hardens the system to the ECRH stray fields and at the same time generates the necessary reference signal. Electromagnetic simulations and laboratory tests were performed to validate the proposed solution and are compared with the in-vessel calibration tests.

  11. A Review of Lightweight Thread Approaches for High Performance Computing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Castello, Adrian; Pena, Antonio J.; Seo, Sangmin

    High-level, directive-based solutions are becoming the programming models (PMs) of the multi/many-core architectures. Several solutions relying on operating system (OS) threads work perfectly well with a moderate number of cores. However, exascale systems will spawn hundreds of thousands of threads in order to exploit their massively parallel architectures, and thus conventional OS threads are too heavy for that purpose. Several lightweight thread (LWT) libraries have recently appeared offering lighter mechanisms to tackle massive concurrency. In order to examine the suitability of LWTs in high-level runtimes, we develop a set of microbenchmarks consisting of commonly found patterns in current parallel codes. Moreover, we study the semantics offered by some LWT libraries in order to expose the similarities between different LWT application programming interfaces. This study reveals that a reduced set of LWT functions can be sufficient to cover the common parallel code patterns and that those LWT libraries perform better than OS threads-based solutions in cases where task and nested parallelism are becoming more popular with new architectures.

  12. Microfluidic integration of parallel solid-phase liquid chromatography.

    PubMed

    Huft, Jens; Haynes, Charles A; Hansen, Carl L

    2013-03-05

    We report the development of a fully integrated microfluidic chromatography system based on a recently developed column geometry that allows for robust packing of high-performance separation columns in poly(dimethylsiloxane) microfluidic devices having integrated valves made by multilayer soft lithography (MSL). The combination of parallel high-performance separation columns and on-chip plumbing was used to achieve a fully integrated system for on-chip chromatography, including all steps of automated sample loading, programmable gradient generation, separation, fluorescent detection, and sample recovery. We demonstrate this system in the separation of fluorescently labeled DNA and parallel purification of reverse transcription polymerase chain reaction (RT-PCR) amplified variable regions of mouse immunoglobulin genes using a strong anion exchange (AEX) resin. Parallel sample recovery in an immiscible oil stream offers the advantage of low sample dilution and high recovery rates. The ability to perform nucleic acid size selection and recovery on subnanogram samples of DNA holds promise for on-chip genomics applications including sequencing library preparation, cloning, and sample fractionation for diagnostics.

  13. Monolithic optofluidic mode coupler for broadband thermo- and piezo-optical characterization of liquids.

    PubMed

    Pumpe, Sebastian; Chemnitz, Mario; Kobelke, Jens; Schmidt, Markus A

    2017-09-18

    We present a monolithic fiber device that enables investigation of the thermo- and piezo-optical properties of liquids using straightforward broadband transmission measurements. The device is a directional mode coupler consisting of a multi-mode liquid core and a single-mode glass core with pronounced coupling resonances whose wavelengths strongly depend on the operation temperature. We demonstrated the functionality and flexibility of our device for carbon disulfide, extending the current knowledge of the thermo-optic coefficient by 200 nm at 20 °C and uniquely for high temperatures. Moreover, our device allows measuring the piezo-optic coefficient of carbon disulfide, confirming results first obtained by Röntgen in 1891. Finally, we applied our approach to obtain the dispersion of the thermo-optic coefficients of benzene and tetrachloroethylene between 450 and 800 nm, whereas no data was available for the latter so far.

  14. Femtosecond laser inscription of asymmetric directional couplers for in-fiber optical taps and fiber cladding photonics.

    PubMed

    Grenier, Jason R; Fernandes, Luís A; Herman, Peter R

    2015-06-29

    Precise alignment of femtosecond laser tracks in standard single mode optical fiber is shown to enable controllable optical tapping of the fiber core waveguide light with fiber cladding photonic circuits. Asymmetric directional couplers are presented with tunable coupling ratios up to 62% and bandwidths up to 300 nm at telecommunication wavelengths. Real-time fiber monitoring during laser writing permitted a means of controlling the coupler length to compensate for micron-scale alignment errors and to facilitate tailored design of coupling ratio, spectral bandwidth and polarization properties. Laser induced waveguide birefringence was harnessed for polarization dependent coupling that led to the formation of in-fiber polarization-selective taps with 32 dB extinction ratio. This technology enables the interconnection of light propagating in pre-existing waveguides with laser-formed devices, thereby opening a new practical direction for the three-dimensional integration of optical devices in the cladding of optical fibers and planar lightwave circuits.

  15. Linear static structural and vibration analysis on high-performance computers

    NASA Technical Reports Server (NTRS)

    Baddourah, M. A.; Storaasli, O. O.; Bostic, S. W.

    1993-01-01

    Parallel computers offer the opportunity to significantly reduce the computation time necessary to analyze large-scale aerospace structures. This paper presents algorithms developed for and implemented on massively-parallel computers, hereafter referred to as Scalable High-Performance Computers (SHPC), for the most computationally intensive tasks involved in structural analysis, namely, generation and assembly of system matrices, solution of systems of equations and calculation of the eigenvalues and eigenvectors. Results on SHPC are presented for large-scale structural problems (i.e. models for High-Speed Civil Transport). The goal of this research is to develop a new, efficient technique which extends structural analysis to SHPC and makes large-scale structural analyses tractable.

  16. Optical loss analysis and parameter optimization for fan-shaped single-polarization grating coupler at wavelength of 1.3 µm band

    NASA Astrophysics Data System (ADS)

    Ushida, Jun; Tokushima, Masatoshi; Sobu, Yohei; Shimura, Daisuke; Yashiki, Kenichiro; Takahashi, Shigeki; Kurata, Kazuhiko

    2018-05-01

    Fan-shaped grating couplers (F-GCs) can be smaller than straight ones but are less efficient in general in coupling to single-mode fibers. To find a small F-GC with sufficiently high fiber-coupling characteristics, we numerically compared the dependencies of coupling efficiencies on wavelengths, the starting width of gratings, and misalignment distances among 25, 45, and 60° tapered angles of fan shape by using the three-dimensional finite-difference time domain method. An F-GC with a tapered angle of 25° exhibited the highest performance for all dependencies. The optical loss origins of F-GCs were discussed in terms of the electric field structures in them and scattering at the joint between the fan-shaped slab and channel waveguide. We fabricated an optimized 25° F-GC by using ArF photolithography, which almost exactly reproduced the optical coupling efficiency and radiation angle characteristics that were numerically expected.

  17. Improved CDMA Performance Using Parallel Interference Cancellation

    NASA Technical Reports Server (NTRS)

    Simon, Marvin; Divsalar, Dariush

    1995-01-01

    This report considers a general parallel interference cancellation scheme that significantly reduces the degradation effect of user interference but with a lesser implementation complexity than the maximum-likelihood technique. The scheme operates on the fact that parallel processing simultaneously removes from each user the interference produced by the remaining users accessing the channel in an amount proportional to their reliability. The parallel processing can be done in multiple stages. The proposed scheme uses tentative decision devices with different optimum thresholds at the multiple stages to produce the most reliably received data for generation and cancellation of user interference. The 1-stage interference cancellation is analyzed for three types of tentative decision devices, namely, hard, null zone, and soft decision, and two types of user power distribution, namely, equal and unequal powers. Simulation results are given for a multitude of different situations, in particular, those cases for which the analysis is too complex.
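
    A single cancellation stage with hard tentative decisions can be sketched as follows. This is a minimal, noiseless illustration under assumptions: a synchronous channel, known user amplitudes and cross-correlations, and a scalar `weight` standing in for partial cancellation. The function name `pic_stage`, the correlation matrix, and the amplitudes are hypothetical, and the optimum thresholds and null-zone or soft decision devices analyzed in the report are not reproduced.

```python
import numpy as np

def pic_stage(y, R, amplitudes, tentative, weight=1.0):
    """One parallel interference cancellation stage with hard decisions.

    y: matched-filter outputs, R: signature cross-correlation matrix,
    amplitudes: received amplitude per user, tentative: previous bit decisions.
    """
    est = amplitudes * tentative           # estimated signal of every user
    # Interference seen by user k: sum over j != k of R[k, j] * est[j],
    # computed for all users simultaneously.
    interference = R @ est - np.diag(R) * est
    return np.sign(y - weight * interference)

# Hypothetical 3-user example with unequal powers: user 1 is much stronger.
R = np.array([[1.0, 0.3, 0.2],
              [0.3, 1.0, 0.1],
              [0.2, 0.1, 1.0]])
amplitudes = np.array([1.0, 5.0, 1.0])
bits = np.array([1.0, -1.0, 1.0])
y = R @ (amplitudes * bits)                # noiseless matched-filter outputs
tentative = np.sign(y)                     # conventional detector decisions
final = pic_stage(y, R, amplitudes, tentative)
```

    In this toy case the strong user pushes the conventional detector into an error on user 0, and subtracting the estimated interference for all users at once recovers the transmitted bits.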

  18. Novel ultra-short and ultra-broadband polarization beam splitter based on a bent directional coupler.

    PubMed

    Dai, Daoxin; Bowers, John E

    2011-09-12

    A novel ultra-short polarization beam splitter (PBS) based on a bent directional coupler is proposed by utilizing the evanescent coupling between two bent optical waveguides with different core widths. For the bent directional coupler, there is a significant phase-mismatch for TE polarization while the phase-matching condition is satisfied for TM polarization. Therefore, the TM polarized light can be coupled from the narrow input waveguide to the adjacent wide waveguide while the TE polarization goes through the coupling region without significant coupling. An ultra-short (<10 μm-long) PBS is designed based on silicon-on-insulator nanowires and the length of the bent coupling region is as small as 4.5 μm while the gap width is chosen as 200 nm (large enough to simplify the fabrication). Numerical simulations show that the present PBS has a good fabrication tolerance for the variation of the waveguide width (more than ± 60 nm) and a very broad band (~200 nm) for an extinction ratio of >10 dB.

  19. MLP: A Parallel Programming Alternative to MPI for New Shared Memory Parallel Systems

    NASA Technical Reports Server (NTRS)

    Taft, James R.

    1999-01-01

    Recent developments at the NASA Ames Research Center's NAS Division have demonstrated that the new generation of NUMA based Symmetric Multi-Processing systems (SMPs), such as the Silicon Graphics Origin 2000, can successfully execute legacy vector oriented CFD production codes at sustained rates far exceeding processing rates possible on dedicated 16 CPU Cray C90 systems. This high level of performance is achieved via shared memory based Multi-Level Parallelism (MLP). This programming approach, developed at NAS and outlined below, is distinct from the message passing paradigm of MPI. It offers parallelism at both the fine and coarse grained level, with communication latencies that are approximately 50-100 times lower than typical MPI implementations on the same platform. Such latency reductions offer the promise of performance scaling to very large CPU counts. The method draws on, but is also distinct from, the newly defined OpenMP specification, which uses compiler directives to support a limited subset of multi-level parallel operations. The NAS MLP method is general, and applicable to a large class of NASA CFD codes.

  20. Parallel Reconstruction Using Null Operations (PRUNO)

    PubMed Central

    Zhang, Jian; Liu, Chunlei; Moseley, Michael E.

    2011-01-01

    A novel iterative k-space data-driven technique, namely Parallel Reconstruction Using Null Operations (PRUNO), is presented for parallel imaging reconstruction. In PRUNO, both data calibration and image reconstruction are formulated into linear algebra problems based on a generalized system model. An optimal data calibration strategy is demonstrated by using Singular Value Decomposition (SVD). An iterative conjugate-gradient approach is proposed to efficiently solve for missing k-space samples during reconstruction. With its generalized formulation and precise mathematical model, PRUNO reconstruction yields good accuracy, flexibility, and stability. Both computer simulation and in vivo studies have shown that PRUNO produces much better reconstruction quality than autocalibrating partially parallel acquisition (GRAPPA), especially under high accelerating rates. With the aid of PRUNO reconstruction, ultra high accelerating parallel imaging can be performed with decent image quality. For example, we have done successful PRUNO reconstruction at a reduction factor of 6 (effective factor of 4.44) with 8 coils and only a few autocalibration signal (ACS) lines. PMID:21604290

  1. Parallelizing serial code for a distributed processing environment with an application to high frequency electromagnetic scattering

    NASA Astrophysics Data System (ADS)

    Work, Paul R.

    1991-12-01

    This thesis investigates the parallelization of existing serial programs in computational electromagnetics for use in a parallel environment. Existing algorithms for calculating the radar cross section of an object are covered, and a ray-tracing code is chosen for implementation on a parallel machine. Current parallel architectures are introduced and a suitable parallel machine is selected for the implementation of the chosen ray-tracing algorithm. The standard techniques for the parallelization of serial codes are discussed, including load balancing and decomposition considerations, and appropriate methods for the parallelization effort are selected. A load balancing algorithm is modified to increase the efficiency of the application, and a high level design of the structure of the serial program is presented. A detailed design of the modifications for the parallel implementation is also included, with both the high level and the detailed design specified in a high level design language called UNITY. The correctness of the design is proven using UNITY and standard logic operations. The theoretical and empirical results show that it is possible to achieve an efficient parallel application for a serial computational electromagnetic program where the characteristics of the algorithm and the target architecture critically influence the development of such an implementation.

  2. Photonic generation of ultra-wide-band doublet pulse through monolithic integration of tapered directional coupler and quantum well waveguide.

    PubMed

    Kuo, Yu-Zheng; Wu, Jui-Pin; Wu, Tsu-Hsiu; Chiu, Yi-Jen

    2012-10-22

    We proposed and demonstrated a novel scheme of photonic ultra-wide-band (UWB) doublet pulse generation based on monolithic integration of a tapered optical-direction coupler (TODC) and a multiple-quantum-well (MQW) waveguide. The TODC is formed by a top tapered MQW waveguide vertically integrated with an underlying passive waveguide. Through simultaneous field-driven optical index and absorption change in the MQW, the partial optical coupling in the TODC can be used to obtain a valley-shaped optical transmission against voltage. Therefore, a doublet-enveloped optical pulse can be realized by high-speed and high-efficiency conversion of an input electrical pulse. By just adjusting the bias through the MQW, a 1530 nm photonic UWB doublet optical pulse with 75-ps pulse width, below -41.3 dBm power, 125% fractional bandwidth, and 7.5 GHz of -10 dB bandwidth has been demonstrated, fitting the FCC requirement (3.1 GHz~10.6 GHz). Doublet-pulse data transmission in optical fiber is also performed for further characterization, exhibiting a successful 1.25 Gb/s error-free transmission. This suggests that such an optoelectronic integration template can be applied to photonic UWB generation in fiber-based communications.

  3. Parallel Scaling Characteristics of Selected NERSC User Project Codes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Skinner, David; Verdier, Francesca; Anand, Harsh

    This report documents parallel scaling characteristics of NERSC user project codes between Fiscal Year 2003 and the first half of Fiscal Year 2004 (Oct 2002-March 2004). The codes analyzed cover 60% of all the CPU hours delivered during that time frame on seaborg, a 6080 CPU IBM SP and the largest parallel computer at NERSC. The scale in terms of concurrency and problem size of the workload is analyzed. Drawing on batch queue logs, performance data and feedback from researchers we detail the motivations, benefits, and challenges of implementing highly parallel scientific codes on current NERSC High Performance Computing systems. An evaluation and outlook of the NERSC workload for Allocation Year 2005 is presented.

  4. Techniques and Tools for Performance Tuning of Parallel and Distributed Scientific Applications

    NASA Technical Reports Server (NTRS)

    Sarukkai, Sekhar R.; VanderWijngaart, Rob F.; Castagnera, Karen (Technical Monitor)

    1994-01-01

    Performance degradation in scientific computing on parallel and distributed computer systems can be caused by numerous factors. In this half-day tutorial we explain the important methodological issues involved in obtaining codes that have good performance potential. Then we discuss the possible obstacles to realizing that potential on contemporary hardware platforms, and give an overview of the software tools currently available for identifying performance bottlenecks. Finally, some realistic examples are used to illustrate the actual use and utility of such tools.

  5. Silicon photonics for high-performance interconnection networks

    NASA Astrophysics Data System (ADS)

    Biberman, Aleksandr

    2011-12-01

    We assert in the course of this work that silicon photonics has the potential to be a key disruptive technology in computing and communication industries. The enduring pursuit of performance gains in computing, combined with stringent power constraints, has fostered the ever-growing computational parallelism associated with chip multiprocessors, memory systems, high-performance computing systems, and data centers. Sustaining these parallelism growths introduces unique challenges for on- and off-chip communications, shifting the focus toward novel and fundamentally different communication approaches. This work showcases that chip-scale photonic interconnection networks, enabled by high-performance silicon photonic devices, enable unprecedented bandwidth scalability with reduced power consumption. We demonstrate that the silicon photonic platforms have already produced all the high-performance photonic devices required to realize these types of networks. Through extensive empirical characterization in much of this work, we demonstrate such feasibility of waveguides, modulators, switches, and photodetectors. We also demonstrate systems that simultaneously combine many functionalities to achieve more complex building blocks. Furthermore, we leverage the unique properties of available silicon photonic materials to create novel silicon photonic devices, subsystems, network topologies, and architectures to enable unprecedented performance of these photonic interconnection networks and computing systems. We show that the advantages of photonic interconnection networks extend far beyond the chip, offering advanced communication environments for memory systems, high-performance computing systems, and data centers. Furthermore, we explore the immense potential of all-optical functionalities implemented using parametric processing in the silicon platform, demonstrating unique methods that have the ability to revolutionize computation and communication. Silicon photonics

  6. High performance computing and communications: Advancing the frontiers of information technology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    NONE

    1997-12-31

    This report, which supplements the President's Fiscal Year 1997 Budget, describes the interagency High Performance Computing and Communications (HPCC) Program. The HPCC Program will celebrate its fifth anniversary in October 1996 with an impressive array of accomplishments to its credit. Over its five-year history, the HPCC Program has focused on developing high performance computing and communications technologies that can be applied to computation-intensive applications. Major highlights for FY 1996: (1) High performance computing systems enable practical solutions to complex problems with accuracies not possible five years ago; (2) HPCC-funded research in very large scale networking techniques has been instrumental in the evolution of the Internet, which continues exponential growth in size, speed, and availability of information; (3) The combination of hardware capability measured in gigaflop/s, networking technology measured in gigabit/s, and new computational science techniques for modeling phenomena has demonstrated that very large scale accurate scientific calculations can be executed across heterogeneous parallel processing systems located thousands of miles apart; (4) Federal investments in HPCC software R and D support researchers who pioneered the development of parallel languages and compilers, high performance mathematical, engineering, and scientific libraries, and software tools--technologies that allow scientists to use powerful parallel systems to focus on Federal agency mission applications; and (5) HPCC support for virtual environments has enabled the development of immersive technologies, where researchers can explore and manipulate multi-dimensional scientific and engineering problems. Educational programs fostered by the HPCC Program have brought into classrooms new science and engineering curricula designed to teach computational science. This document contains a small sample of the significant HPCC Program accomplishments in FY 1996.

  7. Web Based Parallel Programming Workshop for Undergraduate Education.

    ERIC Educational Resources Information Center

    Marcus, Robert L.; Robertson, Douglass

    Central State University (Ohio), under a contract with Nichols Research Corporation, has developed a World Wide Web based workshop on high performance computing entitled "IBM SP2 Parallel Programming Workshop." The research is part of the DoD (Department of Defense) High Performance Computing Modernization Program. The research…

  8. Medical applications for high-performance computers in SKIF-GRID network.

    PubMed

    Zhuchkov, Alexey; Tverdokhlebov, Nikolay

    2009-01-01

    The paper presents a set of software services for massive mammography image processing using high-performance parallel computers of the SKIF family, which are linked into a service-oriented grid network. Experience with implementing a prototype system in two medical institutions is also described.

  9. Parallel processing of genomics data

    NASA Astrophysics Data System (ADS)

    Agapito, Giuseppe; Guzzi, Pietro Hiram; Cannataro, Mario

    2016-10-01

    The availability of high-throughput experimental platforms for the analysis of biological samples, such as mass spectrometry, microarrays and Next Generation Sequencing, has made it possible to analyze a whole genome in a single experiment. Such platforms produce an enormous volume of data per experiment, so the analysis of this flow of data poses several challenges in terms of data storage, preprocessing, and analysis. To face those issues, efficient, possibly parallel, bioinformatics software needs to be used to preprocess and analyze data, for instance to highlight genetic variation associated with complex diseases. In this paper we present a parallel algorithm for the preprocessing and statistical analysis of genomics data that is able to handle high-dimensional data and yields good response times. The proposed system is able to find statistically significant biological markers that discriminate classes of patients who respond to drugs in different ways. Experiments performed on real and synthetic genomic datasets show good speed-up and scalability.

  10. The language parallel Pascal and other aspects of the massively parallel processor

    NASA Technical Reports Server (NTRS)

    Reeves, A. P.; Bruner, J. D.

    1982-01-01

    A high level language for the Massively Parallel Processor (MPP) was designed. This language, called Parallel Pascal, is described in detail. A description of the language design, a description of the intermediate language, Parallel P-Code, and details for the MPP implementation are included. Formal descriptions of Parallel Pascal and Parallel P-Code are given. A compiler was developed which converts programs in Parallel Pascal into the intermediate Parallel P-Code language. The code generator to complete the compiler for the MPP is being developed independently. A Parallel Pascal to Pascal translator was also developed. The architecture design for a VLSI version of the MPP was completed with a description of fault tolerant interconnection networks. The memory arrangement aspects of the MPP are discussed and a survey of other high level languages is given.

  11. Xyce parallel electronic simulator : users' guide.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mei, Ting; Rankin, Eric Lamont; Thornquist, Heidi K.

    2011-05-01

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: (1) Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). Note that this includes support for most popular parallel and serial computers; (2) Improved performance for all numerical kernels (e.g., time integrator, nonlinear and linear solvers) through state-of-the-art algorithms and novel techniques; (3) Device models which are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only); and (4) Object-oriented code design and implementation using modern coding practices that ensure that the Xyce Parallel Electronic Simulator will be maintainable and extensible far into the future. Xyce is a parallel code in the most general sense of the phrase - a message passing parallel implementation - which allows it to run efficiently on the widest possible number of computing platforms. These include serial, shared-memory and distributed-memory parallel as well as heterogeneous platforms. Careful attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. The development of Xyce provides a platform for computational research and development aimed specifically at the needs of the Laboratory. With Xyce, Sandia has an 'in-house' capability with which both new electrical (e.g., device model development) and algorithmic (e.g., faster time-integration methods, parallel solver algorithms) research and development can be performed. As a result, Xyce is a

  12. Massively parallel processor computer

    NASA Technical Reports Server (NTRS)

    Fung, L. W. (Inventor)

    1983-01-01

    An apparatus for processing multidimensional data with strong spatial characteristics, such as raw image data, characterized by a large number of parallel data streams in an ordered array is described. It comprises a large number (e.g., 16,384 in a 128 x 128 array) of parallel processing elements operating simultaneously and independently on single bit slices of a corresponding array of incoming data streams under control of a single set of instructions. Each of the processing elements comprises a bidirectional data bus in communication with a register for storing single bit slices together with a random access memory unit and associated circuitry, including a binary counter/shift register device, for performing logical and arithmetical computations on the bit slices, and an I/O unit for interfacing the bidirectional data bus with the data stream source. The massively parallel processor architecture enables very high speed processing of large amounts of ordered parallel data, including spatial translation by shifting or sliding of bits vertically or horizontally to neighboring processing elements.
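    The bit-slice, single-instruction style of processing described in this record can be illustrated with a small sketch. It is not the patented hardware design: the grid size, the function names (`shift_east`, `to_planes`, `from_planes`, `bit_serial_add`) and the choice of addition as the example operation are all assumptions made for illustration. Each cell of a plane stands for one processing element holding a single bit, and every step applies the same operation to all cells.

```python
def shift_east(plane, fill=0):
    """Every element takes the bit held by its western neighbour."""
    return [[fill] + row[:-1] for row in plane]


def to_planes(grid, nbits):
    """Split a grid of small integers into bit planes, least significant first."""
    return [[[(v >> b) & 1 for v in row] for row in grid] for b in range(nbits)]


def from_planes(planes):
    """Reassemble integers from a list of bit planes (least significant first)."""
    rows, cols = len(planes[0]), len(planes[0][0])
    return [
        [sum(planes[b][r][c] << b for b in range(len(planes))) for c in range(cols)]
        for r in range(rows)
    ]


def bit_serial_add(a_planes, b_planes):
    """Add two grids one bit plane at a time, with a per-element carry bit."""
    rows, cols = len(a_planes[0]), len(a_planes[0][0])
    carry = [[0] * cols for _ in range(rows)]
    out = []
    for a, b in zip(a_planes, b_planes):
        # The same full-adder logic is applied to every element in lockstep.
        s = [[a[r][c] ^ b[r][c] ^ carry[r][c] for c in range(cols)]
             for r in range(rows)]
        carry = [[(a[r][c] & b[r][c]) | (carry[r][c] & (a[r][c] ^ b[r][c]))
                  for c in range(cols)] for r in range(rows)]
        out.append(s)
    out.append(carry)  # final carry becomes the most significant plane
    return out


grid = [[1, 2], [3, 4]]
planes = to_planes(grid, nbits=3)
shifted = [shift_east(p) for p in planes]          # spatial translation by one column
total = from_planes(bit_serial_add(planes, shifted))
print(total)
```

    In this sketch each element adds its own value to the value slid in from its western neighbour, which combines the two operations the abstract names: bitwise arithmetic on bit slices and spatial translation between neighbouring elements.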

  13. Incremental Parallelization of Non-Data-Parallel Programs Using the Charon Message-Passing Library

    NASA Technical Reports Server (NTRS)

    VanderWijngaart, Rob F.

    2000-01-01

    Message passing is among the most popular techniques for parallelizing scientific programs on distributed-memory architectures. The reasons for its success are wide availability (MPI), efficiency, and full tuning control provided to the programmer. A major drawback, however, is that incremental parallelization, as offered by compiler directives, is not generally possible, because all data structures have to be changed throughout the program simultaneously. Charon remedies this situation through mappings between distributed and non-distributed data. It allows breaking up the parallelization into small steps, guaranteeing correctness at every stage. Several tools are available to help convert legacy codes into high-performance message-passing programs. They usually target data-parallel applications, whose loops carrying most of the work can be distributed among all processors without much dependency analysis. Others do a full dependency analysis and then convert the code virtually automatically. Even more toolkits are available that aid construction from scratch of message passing programs. None, however, allows piecemeal translation of codes with complex data dependencies (i.e. non-data-parallel programs) into message passing codes. The Charon library (available in both C and Fortran) provides incremental parallelization capabilities by linking legacy code arrays with distributed arrays. During the conversion process, non-distributed and distributed arrays exist side by side, and simple mapping functions allow the programmer to switch between the two in any location in the program. Charon also provides wrapper functions that leave the structure of the legacy code intact, but that allow execution on truly distributed data. Finally, the library provides a rich set of communication functions that support virtually all patterns of remote data demands in realistic structured grid scientific programs, including transposition, nearest-neighbor communication, pipelining
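    The idea of switching between non-distributed and distributed views of the same array can be sketched as follows. This is not the Charon API: the names `block_bounds`, `distribute`, `gather`, `legacy_running_sum` and `converted_scale` are hypothetical, and the "processes" are simulated by a list of blocks in one Python process rather than by real message passing.

```python
def block_bounds(n, nprocs, rank):
    """Half-open index range [start, stop) owned by `rank` in a block distribution."""
    base, extra = divmod(n, nprocs)
    start = rank * base + min(rank, extra)
    size = base + (1 if rank < extra else 0)
    return start, start + size


def distribute(global_array, nprocs):
    """Map a non-distributed array to one block per (simulated) process."""
    n = len(global_array)
    return [global_array[slice(*block_bounds(n, nprocs, r))] for r in range(nprocs)]


def gather(blocks):
    """Map the distributed blocks back to a single non-distributed array."""
    return [x for block in blocks for x in block]


def legacy_running_sum(a):
    """Unconverted legacy step: a loop-carried dependency over the whole array."""
    out, total = [], 0
    for x in a:
        total += x
        out.append(total)
    return out


def converted_scale(block, factor):
    """Already-converted step: touches only the local block."""
    return [factor * x for x in block]


def pipeline(data, nprocs):
    blocks = distribute(data, nprocs)
    blocks = [converted_scale(b, 2) for b in blocks]  # stands in for per-process work
    data = gather(blocks)                              # back to the legacy layout
    return legacy_running_sum(data)                    # legacy code runs unchanged


print(pipeline([1, 2, 3, 4, 5], nprocs=2))
```

    Because the mapping functions can be called at any point, one step can be converted at a time while the remaining legacy steps keep operating on the non-distributed array, and the result can be checked against the serial program after every conversion.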

  14. New Methods for Rotation Sensing by Using a Two-Coupler Fiber-Optic Ring Resonator

    NASA Astrophysics Data System (ADS)

    Seraji, Faramarz E.

    1993-04-01

    This paper presents a theoretical analysis of new methods for rotation sensing by using a two-coupler type fiber-optic ring resonator. It is shown that in the proposed methods a resonance spike can be generated whose amplitude gives a direct measure of the rotation rates. The approaches are simple and have a major advantage of not using a closed-loop to control the operating points for resonance.

  15. Scalable Parallel Density-based Clustering and Applications

    NASA Astrophysics Data System (ADS)

    Patwary, Mostofa Ali

    2014-04-01

    Recently, density-based clustering algorithms (DBSCAN and OPTICS) have received significant attention from the scientific community due to their unique capability of discovering arbitrarily shaped clusters and eliminating noise data. These algorithms have several applications that require high performance computing, including finding halos and subhalos (clusters) in massive cosmology data in astrophysics, analyzing satellite images, X-ray crystallography, and anomaly detection. However, parallelization of these algorithms is extremely challenging, as they exhibit an inherently sequential data access order and an unbalanced workload, resulting in low parallel efficiency. To break the data access sequentiality and to achieve high parallelism, we develop new parallel algorithms, both for DBSCAN and OPTICS, designed using graph algorithmic techniques. For example, our parallel DBSCAN algorithm exploits the similarities between DBSCAN and computing connected components. Using datasets containing up to a billion floating point numbers, we show that our parallel density-based clustering algorithms significantly outperform the existing algorithms, achieving speedups up to 27.5 on 40 cores on shared memory architecture and speedups up to 5,765 using 8,192 cores on distributed memory architecture. In our experiments, we found that while achieving the scalability, our algorithms produce clustering results with comparable quality to the classical algorithms.
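    The similarity between DBSCAN and connected components mentioned in this record can be illustrated with a short serial sketch. It is not the author's parallel algorithm: the union-find structure, the brute-force neighbour search and the function name `dbscan_union_find` are assumptions chosen for brevity. Core points that lie within `eps` of each other are merged into one component, and border points then take the label of a neighbouring core point.

```python
from math import dist


def find(parent, i):
    """Return the root of i, halving paths along the way."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def union(parent, a, b):
    """Merge the components of a and b; the smaller root index wins."""
    ra, rb = find(parent, a), find(parent, b)
    if ra != rb:
        parent[max(ra, rb)] = min(ra, rb)


def dbscan_union_find(points, eps, min_pts):
    n = len(points)
    # Brute-force O(n^2) neighbour lists; each point counts as its own neighbour.
    neighbours = [
        [j for j in range(n) if dist(points[i], points[j]) <= eps]
        for i in range(n)
    ]
    core = [len(nb) >= min_pts for nb in neighbours]

    # Connected components over core points only. The unions can be applied
    # in any order, which is what removes the sequential access order.
    parent = list(range(n))
    for i in range(n):
        if core[i]:
            for j in neighbours[i]:
                if core[j]:
                    union(parent, i, j)

    labels = [-1] * n  # -1 marks noise
    for i in range(n):
        if core[i]:
            labels[i] = find(parent, i)
    # Border points join the cluster of the first core neighbour found.
    for i in range(n):
        if not core[i]:
            for j in neighbours[i]:
                if core[j]:
                    labels[i] = find(parent, j)
                    break
    return labels


points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (50, 50)]
print(dbscan_union_find(points, eps=1.5, min_pts=3))
```

    Cluster labels here are simply the smallest point index in each component. For the reported figures, a speedup of 27.5 on 40 cores corresponds to a parallel efficiency of about 69%, and 5,765 on 8,192 cores to about 70%.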

  16. All-optical switch using optically controlled two mode interference coupler.

    PubMed

    Sahu, Partha Pratim

    2012-05-10

    In this paper, we have introduced an optically controlled two-mode interference (OTMI) coupler with a silicon core and GaAsInP cladding as an all-optical switch. By taking advantage of refractive index modulation produced by launching an optical pulse into the cladding region of the TMI waveguide, we have shown optically controlled switching operation. We have studied the optical pulse-controlled coupling characteristics of the proposed device by using a simple mathematical model on the basis of sinusoidal modes. The device length is less than that of previous work. It is also seen that the cross talk of the OTMI switch is not significantly increased with fabrication tolerances (±δw) in comparison with previous work.

  17. A design methodology for portable software on parallel computers

    NASA Technical Reports Server (NTRS)

    Nicol, David M.; Miller, Keith W.; Chrisman, Dan A.

    1993-01-01

    This final report for research that was supported by grant number NAG-1-995 documents our progress in addressing two difficulties in parallel programming. The first difficulty is developing software that will execute quickly on a parallel computer. The second difficulty is transporting software between dissimilar parallel computers. In general, we expect that more hardware-specific information will be included in software designs for parallel computers than in designs for sequential computers. This inclusion is an instance of portability being sacrificed for high performance. New parallel computers are being introduced frequently. Trying to keep one's software on the current high performance hardware, a software developer almost continually faces yet another expensive software transportation. The problem of the proposed research is to create a design methodology that helps designers to more precisely control both portability and hardware-specific programming details. The proposed research emphasizes programming for scientific applications. We completed our study of the parallelizability of a subsystem of the NASA Earth Radiation Budget Experiment (ERBE) data processing system. This work is summarized in section two. A more detailed description is provided in Appendix A ('Programming Practices to Support Eventual Parallelism'). Mr. Chrisman, a graduate student, wrote and successfully defended a Ph.D. dissertation proposal which describes our research associated with the issues of software portability and high performance. The list of research tasks is specified in the proposal. The proposal 'A Design Methodology for Portable Software on Parallel Computers' is summarized in section three and is provided in its entirety in Appendix B. We are currently studying a proposed subsystem of the NASA Clouds and the Earth's Radiant Energy System (CERES) data processing system. This software is the proof-of-concept for the Ph.D. dissertation.
We have implemented and measured

  18. Misalignment tolerant efficient inverse taper coupler for silicon waveguide

    NASA Astrophysics Data System (ADS)

    Wang, Peng; Michael, Aron; Kwok, Chee Yee; Chen, Ssu-Han

    2015-12-01

    This paper describes efficient coupling from a fiber to a submicron silicon waveguide based on an inversely tapered silicon waveguide embedded in a SiO2 waveguide that is suspended in air. The inverse taper waveguide consists of a 50 µm long and 240 nm thick silicon section that linearly tapers in width from 500 nm to 120 nm and is embedded in SiO2. The SiO2 waveguide is 6 µm wide and 10 µm long. The simulation results show that the coupling loss of this new approach is 2.7 dB, including the interface loss at the input and output. The tolerance to fiber misalignment at the input of the coupler is 2 µm in both horizontal and vertical directions for only 1.5 dB additional loss.
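    The decibel figures quoted in this record can be converted to power transmission fractions with the standard relation T = 10^(-L/10). The sketch below is only arithmetic on the two reported numbers; the function name `db_loss_to_transmission` is an assumption, and treating the misalignment penalty as additive in dB is the usual convention rather than something the abstract states.

```python
def db_loss_to_transmission(loss_db):
    """Fraction of optical power transmitted for a given loss in dB."""
    return 10 ** (-loss_db / 10)


coupling_loss_db = 2.7          # reported simulated coupling loss
misalignment_penalty_db = 1.5   # reported additional loss at 2 um misalignment

aligned = db_loss_to_transmission(coupling_loss_db)
misaligned = db_loss_to_transmission(coupling_loss_db + misalignment_penalty_db)

print(f"aligned:    {aligned:.3f}")     # about 0.537 of the input power
print(f"misaligned: {misaligned:.3f}")  # about 0.380 of the input power
```

    Read this way, roughly 54% of the input power is coupled when the fiber is aligned, and roughly 38% at the stated misalignment limit.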

  19. All-optical universal logic gates on nonlinear multimode interference coupler using tunable input intensity

    NASA Astrophysics Data System (ADS)

    Tajaldini, Mehdi; Jafri, Mohd Zubir Mat

    2015-04-01

    The Nonlinear Modal Propagation Analysis (NMPA) method has revealed significant features of nonlinear multimode interference (MMI) couplers with compact dimensions when light is launched near the threshold of nonlinearity. Moreover, NMPA has the potential to allow the study of nonlinear MMI based on modal interference, in order to explore the phenomena that arise from the nature of the multimode region. A previously proposed all-optical switch based on NMPA has proved its capability to achieve all-optical gates. All-optical gates have attracted increasing attention due to their practical utility in all-optical signal processing networks and systems. Nonlinear multimode interference devices could be applied as universal all-optical gates owing to the significant features that NMPA reveals in them. In this paper, we present a novel ultra-compact MMI coupler based on the NMPA method that operates at low intensity compared with previous reports, both as a novel design method and as a potential application for optical NAND and NOR universal gates on a single structure for Boolean logic signal processing devices, and we optimize their application by studying the contrast ratio between ON and OFF as a function of output width. We have applied NMPA to several applications in which miniaturization at low nonlinear intensities is the main purpose.

  20. Exact cancellation of emittance growth due to coupled transverse dynamics in solenoids and rf couplers

    NASA Astrophysics Data System (ADS)

    Dowell, David H.; Zhou, Feng; Schmerge, John

    2018-01-01

    Weak, rotated magnetic and radio frequency quadrupole fields in electron guns and injectors can couple the beam's horizontal with vertical motion, introduce correlations between otherwise orthogonal transverse momenta, and reduce the beam brightness. This paper discusses two important sources of coupled transverse dynamics common to most electron injectors. The first is quadrupole focusing followed by beam rotation in a solenoid, and the second coupling comes from a skewed high-power rf coupler or cavity port which has a rotated rf quadrupole field. It is shown that a dc quadrupole field can correct for both types of couplings and exactly cancel their emittance growths. The degree of cancellation of the rf skew quadrupole emittance is limited by the electron bunch length. Analytic expressions are derived and compared with emittance simulations and measurements.