581 Hits in 7.1 sec

Data structure design for GPU based heterogeneous systems

Jens Breitbart
2009 2009 International Conference on High Performance Computing & Simulation  
in performance compared to the static grid based implementation.  ...  Our experiments show that with a slower CPU the complex algorithm for creating the dynamic grid becomes the bottleneck and the increased simulation performance at the GPU thereby does not provide an increase  ...  The implementation uses OpenMP and splits all agents equally among the threads. A thread calculates both the simulation and modification substage for the agents associated with.  ... 
doi:10.1109/hpcsim.2009.5192780 dblp:conf/ieeehpcs/Breitbart09 fatcat:wwyrvubcgjdjnpihtmay4brt5e

BioFVM-X: An MPI+OpenMP 3-D Simulator for Biological Systems [chapter]

Gaurav Saxena, Miguel Ponce-de-Leon, Arnau Montagud, David Vicente Dorca, Alfonso Valencia
2021 Lecture Notes in Computer Science  
We present BioFVM-X: an enhanced version of BioFVM capable of running on multiple nodes.  ...  Currently, BioFVM is only shared-memory parallelized using OpenMP, greatly limiting the execution of large-scale jobs in HPC clusters.  ...  Note that it is generally very difficult for an MPI+OpenMP implementation to outperform the pure OpenMP implementation on a single node, as is the case of Fig. 2 , due to the additional memory footprint  ... 
doi:10.1007/978-3-030-85633-5_18 fatcat:wwioyaaojbhchohiwf43xrxt5q

OmniRPC: A Grid RPC Facility for Cluster and Global Computing in OpenMP [chapter]

Mitsuhisa Sato, Motonari Hirano, Yoshio Tanaka, Satoshi Sekiguchi
2001 Lecture Notes in Computer Science  
We use the POSIX thread implementation of the Omni OpenMP compiler which allows multi-threaded execution of OpenMP programs by POSIX threads even in a single processor.  ...  We propose to use OpenMP as an easy-to-use and simple programming environment for the multi-threaded client of OmniRPC.  ...  We have developed Omni OpenMP compiler[4], which is a free and open-source, portable implementation of OpenMP.  ... 
doi:10.1007/3-540-44587-0_12 fatcat:ldnlnnzukzhvjfiihpjemgmpra

Survey of Methodologies, Approaches, and Challenges in Parallel Programming Using High-Performance Computing Systems

Paweł Czarnul, Jerzy Proficz, Krzysztof Drypczewski
2020 Scientific Programming  
implementation. e Spark results were an order of magnitude slower than OpenMP/MPI ones.  ...  An MPI implementation can be combined with multithreaded APIs such as OpenMP or Pthreads, a CUDAaware MPI implementation allows using device pointers in MPI calls Requires combining with APIs such as OpenCL  ... 
doi:10.1155/2020/4176794 fatcat:j52aegknyrdxzg2nopk73g3uly

Comparative study of performance of parallel Alpha Beta Pruning for different architectures [article]

Shubhendra Pal Singhal, M.Sridevi
2019 arXiv   pre-print
This paper focuses on the various possible implementations of the Alpha Beta pruning algorithms and gives an insight of what algorithm can be used for parallelism.  ...  Parallelizing Alpha Beta pruning for the GPUs specific architectures like mesh(CUDA) etc. or shared memory model(OpenMP) helps in the reduction of the computational time.  ...  thread from the pool of forerun created free threads.  ... 
arXiv:1908.11660v2 fatcat:5rytfrbmczdbbit4kadxo3agwm

D1.2 Initial Agent-Based Model

Project Consortium Members
2020 Zenodo  
This deliverable describes the agent-based model currently used in the Life Science use case of INFORE, the parameters used in its simulation and its HPC implementation.  ...  Thus, by integrating a center-based agent model into a multiscale model (MSM) allows us to study different aspects crucial for the development and growth of tumors.  ...  each MPI process spawns 24 OpenMP threads.  ... 
doi:10.5281/zenodo.4034022 fatcat:a4prlxmlpvdyznhlvxqfzhjfyq

Analysis of Threading Libraries for High Performance Computing

Adrian Castello, Rafael Mayo, Sangmin Seo, Pavan Balaji, Enrique S. Quintana-Orti, Antonio J. Pena
2020 IEEE transactions on computers  
using threading libraries and OpenMP implementations.  ...  Lightweight thread (LWT) libraries emerged as an alternative offering lighter mechanisms to tackle the massive concurrency of current hardware.  ...  Pthreads API Pthreads [31] offers three PMs that differ in how the threads are bound and which thread is in control. An important agent in these PMs is the kernel scheduled entity (KSE).  ... 
doi:10.1109/tc.2020.2970706 fatcat:3yeebkavkbh5dj36hr4feqochm

Performance Analysis of Shared-Memory Parallel Applications Using Performance Properties [chapter]

Karl Fürlinger, Michael Gerndt
2005 Lecture Notes in Computer Science  
We describe a data model for summary (profiling) data of OpenMP code and present performance properties that are based on this data model.  ...  We evaluate the usability of the properties with on several example codes using our OpenMP profiler ompP to derive the necessary profiling data.  ...  Using lock-free data structures would be an idea. Related Work Several approaches for automating the process of performance analysis have been developed.  ... 
doi:10.1007/11557654_70 fatcat:d6xv7tnnand2hkntyjunxvzoky

Frameworks for Multi-core Architectures: A Comprehensive Evaluation Using 2D/3D Image Registration [chapter]

Richard Membarth, Frank Hannig, Jürgen Teich, Mario Körner, Wieland Eckert
2011 Lecture Notes in Computer Science  
In this paper, we present five such frameworks for parallelization on shared memory multi-core architectures, namely OpenMP, Cilk++, Threading Building Blocks, RapidMind, and OpenCL.  ...  In an empirical study, a fine-grained data parallel and a coarse-grained task parallel parallelization approach are used to evaluate and estimate different aspects like usability, performance, and overhead  ...  It can be seen that in particular OpenMP and RapidMind have an enormous initialization overhead when only few threads are used.  ... 
doi:10.1007/978-3-642-19137-4_6 fatcat:guuyi6d3ercqvbgyfs3v5p5lya

AOmpLib: An Aspect Library for Large-Scale Multi-core Parallel Programming

Bruno Medeiros, Joao L. Sobral
2013 2013 42nd International Conference on Parallel Processing  
The library is coded in AspectJ and provides a set of parallel programming abstractions that mimics the OpenMP standard.  ...  This paper introduces an aspect-oriented library aimed to support efficient execution of Java applications on multi-core systems.  ...  However, it is cumbersome to develop parallel applications with Java threads as it generates more verbose programs than an OpenMP-like implementation would generate (as it was shown in section 2).  ... 
doi:10.1109/icpp.2013.36 dblp:conf/icpp/MedeirosS13 fatcat:sqqgds2zyffylg6fw6iznvfl5i

Race Condition Detection Algorithms

2019 International Journal of Engineering and Advanced Technology  
OpenMP is an Application programming interface for shared programming model. It is a construct based model which works on fork join parallelism.  ...  In any shared programming model, variables are shared by multiple threads in the program to execute different tasks by different threads.  ...  OpenMP is a shared memory programming model in which a task is implemented by a group of threads to achieve parallelism.  ... 
doi:10.35940/ijeat.b2696.129219 fatcat:5g2hdoakyfgj5lcykrrxq6plpq

Task inefficiency patterns for a wave equation solver [article]

Holger Schulz and Gonzalo Brito Gadeschi and Oleksandr Rudyy and Tobias Weinzierl
2021 arXiv   pre-print
Finally, we propose extensions to the OpenMP standard.  ...  Task-based programming with OpenMP 5.0 is a prominent candidate to accomplish this goal.  ...  An initial high-density peak in the domain serves as a causal agent for spreading waves.  ... 
arXiv:2105.12739v2 fatcat:yohhvfbydrenhhxpnl7a4rvbkq

OpenMP Implementation in the Characterization of a Urban Growth Model Cellular Automaton

Alvaro Peraza, René Rodríguez, Wenseslao Plata
2018 International Journal of Advanced Computer Science and Applications  
This paper presents the implementation of a parallelization strategy using the OpenMP library, while developing a simulation tool based on a cellular automaton (CA) to run urban growth simulations.  ...  The characterization of an urban growth model CA is shown and it consists of a digitization process of the land use in order to get all the necessary elements for the CA to work.  ...  The library OpenMP (Open Multi-Processing), widely used in parallel programming, helps to implement a parallel strategy called fork-join.  ... 
doi:10.14569/ijacsa.2018.090179 fatcat:n4a3cuhucnhyjmkzuq3gf3vtyi

Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints

Srinivas Sridharan, James Dinan, Dhiraj D. Kalamkar
2014 SC14: International Conference for High Performance Computing, Networking, Storage and Analysis  
Our approach uses proxy processes to isolate threads in an MPI job, eliminating threading overheads within the MPI library and allowing threads to achieve process-like communication performance.  ...  We evaluate the performance advantages of our implementation through several benchmarks and kernels.  ...  MPI Endpoints Extension MPI 2.0 [15] defined the interaction between MPI and threads in terms of levels of thread safety provided by an MPI implementation.  ... 
doi:10.1109/sc.2014.45 dblp:conf/sc/SridharanDK14 fatcat:itfodo3lxnejjnnvdlytcidtgm

GPU Computing in Economics [chapter]

Eric M. Aldrich
2014 Handbook of Computational Economics  
"Thrust/OpenMP" and "Thrust/CUDA" refer to the Thrust implementation, using the separate backends for OpenMP (on the Quad-Core Xeon CPU) and CUDA (on the Tesla C2075).  ...  Thrust As mentioned above, Thrust is a free template library that can be called within C/C++ and which provides an alternate interface to GPU hardware.  ... 
doi:10.1016/b978-0-444-52980-0.00010-4 fatcat:fo3oqwpadvdixch2chdworgjji
« Previous Showing results 1 — 15 out of 581 results