Filters








55 Hits in 3.6 sec

Performance and portability of accelerated lattice Boltzmann applications with OpenACC

Enrico Calore, Alessandro Gabbana, Jiri Kraus, Sebastiano Fabio Schifano, Raffaele Tripiccione
2016 Concurrency and Computation  
We also asses the performance impact associated to portable programming, and the actual portability and performance-portability of OpenACC-based applications across several state-of-the- art architectures  ...  We then benchmark the code on a variety of processors, including traditional CPUs and GPUs, and make accurate performance comparisons with other GPU implementations of the same algorithm using CUDA and  ...  This work was done in the framework of the COKA, COSA and Suma projects of INFN.  ... 
doi:10.1002/cpe.3862 fatcat:r5t72w47j5elfarytq64hfua7e

Design and optimization of a portable LQCD Monte Carlo code using OpenACC

Claudio Bonati, Simone Coscetti, Massimo D'Elia, Michele Mesiti, Francesco Negro, Enrico Calore, Sebastiano Fabio Schifano, Giorgio Silvi, Raffaele Tripiccione
2017 International Journal of Modern Physics C  
We also measure performance, evaluating the computing efficiency of our OpenACC code on several architectures, comparing with GPU-specific implementations and showing that a good level of performance-portability  ...  In this scenario, code portability (and performance portability) become necessary for easy maintainability of applications; this is very relevant in scientific computing where code changes are very frequent  ...  We thank the INFN Computing Center in Pisa for providing us with the development framework, and Università degli Studi di Ferrara and INFN-Ferrara for the access to the COKA GPU cluster.  ... 
doi:10.1142/s0129183117500632 fatcat:gmxustlynnglreeda63e4jm4ci

Optimization of lattice Boltzmann simulations on heterogeneous computers

E Calore, A Gabbana, SF Schifano, R Tripiccione
2017 The international journal of high performance computing applications  
In this paper we consider exactly this problem for a class of applications based on Lattice Boltzmann Methods, widely used in computational fluid-dynamics.  ...  Our goal is to develop just one program, portable and able to run efficiently on several different combinations of hosts and accelerators.  ...  Declaration of Conflicting Interests The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.  ... 
doi:10.1177/1094342017703771 fatcat:ml4n5ulsk5hmnjeq6fbvm2yyfy

A Portable OpenCL Lattice Boltzmann Code for Multi- and Many-core Processor Architectures

Enrico Calore, Sebastiano Fabio Schifano, Raffaele Tripiccione
2014 Procedia Computer Science  
The architecture of high performance computing systems is becoming more and more heterogeneous, as accelerators play an increasingly important role alongside traditional CPUs.  ...  We analyze its performance on several different state-of-the-art processors: NVIDIA GPUs and Intel Xeon-Phi many-core accelerators, as well as more traditional Ivy Bridge and Opteron multi-core commodity  ...  Acknowledgements This work was done in the framework of the COKA and Suma projects, supported by INFN.  ... 
doi:10.1016/j.procs.2014.05.004 fatcat:exrfaxm7nnhjdlectgdoop5cci

Accelerating Hydrocodes with OpenACC, OpenCL and CUDA

J. A. Herdman, W. P. Gaudin, S. McIntosh-Smith, M. Boulton, D. A. Beckingsale, A. C. Mallinson, S. A. Jarvis
2012 2012 SC Companion: High Performance Computing, Networking Storage and Analysis  
We find that OpenACC is an extremely viable programming model for accelerator devices, improving programmer productivity and achieving better performance than OpenCL and CUDA.  ...  , and portability using a recently developed Lagrangian-Eulerian explicit hydrodynamics mini-application.  ...  The authors would like to express their thanks to Cray, in particular Alistair Hart of the Cray European Exascale Research Initiative, for their help with OpenACC and also to John Pennycook of the University  ... 
doi:10.1109/sc.companion.2012.66 dblp:conf/sc/HerdmanGMBBMJ12 fatcat:hu77tqzljrgjfbdcwsoonjz5yu

Portable multi-node LQCD Monte Carlo simulations using OpenACC

Claudio Bonati, Enrico Calore, Massimo D'Elia, Michele Mesiti, Francesco Negro, Francesco Sanfilippo, Sebastiano Fabio Schifano, Giorgio Silvi, Raffaele Tripiccione
2018 International Journal of Modern Physics C  
The work focuses mainly on GPUs, which offer a significantly high level of performances for this application, but also compares with results measured on other processors.  ...  This paper describes a state-of-the-art parallel Lattice QCD Monte Carlo code for staggered fermions, purposely designed to be portable across different computer architectures, including GPUs and commodity  ...  OpenACC is becoming increasingly popular among several scientific communities for coding many lattice-based applications to run mainly on GPU accelerators, including Lattice Boltzmann Methods 16, 17,  ... 
doi:10.1142/s0129183118500109 fatcat:j27dndc2mfd53jylsgrca3dbsa

Accelerating a C++ CFD Code with OpenACC

Jiri Kraus, Michael Schlottke, Andrew Adinetz, Dirk Pleiter
2014 2014 First Workshop on Accelerator Programming using Directives  
To utilize the higher performance and energy efficiency of these accelerators, application developers need to rewrite at least parts of their codes.  ...  Using OpenACC directives permitted us to incrementally accelerate ZFS, focusing on the parts of the program that are relevant for the problem at hand.  ...  ACKNOWLEDGEMENTS This work has been carried out in the scope of the NVIDIA Application Lab at Jülich in collaboration with the JARA-HPC SimLab Fluids & Solids Engineering and the Institute of Aerodynamics  ... 
doi:10.1109/waccpd.2014.11 dblp:conf/sc/KrausSAP14 fatcat:jn42zcof7bczhl4uajlpljblue

Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: programming productivity, performance, and energy consumption [article]

Suejb Memeti and Lu Li and Sabri Pllana and Joanna Kolodziej and Christoph Kessler
2017 arXiv   pre-print
In this paper, we study empirically the characteristics of OpenMP, OpenACC, OpenCL, and CUDA with respect to programming productivity, performance, and energy.  ...  Such nodes may comprise general purpose CPUs and accelerators (such as, GPU, or Intel Xeon Phi) that provide high performance with suitable energy-consumption characteristics.  ...  Acknowledgment This article is based upon work from COST Action IC1406 High-Performance Modelling and Simulation for Big Data Applications (cHiPSet), supported by COST (European Cooperation in Science  ... 
arXiv:1704.05316v1 fatcat:lax3kghaxnanxixklx3haavlxa

targetDP: an Abstraction of Lattice Based Parallelism with Portable Performance

Alan Gray, Kevin Stratford
2014 2014 IEEE Intl Conf on High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS)  
We present CPU and GPU performance results for a benchmark taken from the lattice Boltzmann application that motivated this work.  ...  These demonstrate not only performance portability, but also the optimisation resulting from the intelligent exposure of ILP.  ...  with and without accelerators.  ... 
doi:10.1109/hpcc.2014.212 dblp:conf/hpcc/GrayS14 fatcat:mq56eib7jjbb5devymdrovqn3i

Parallel Reservoir Simulation with OpenACC and Domain Decomposition

Zhijiang Kang, Ze Deng, Wei Han, Dongmei Zhang
2018 Algorithms  
In order to address the problems, we propose a parallel method with OpenACC to accelerate serial code and reduce the time and effort during porting an application to GPU.  ...  The experimental results indicate that (1) the proposed GPU-aided approach can outperform the CPU-based one up to about two times, meanwhile with the help of OpenACC, the workload of the transplant code  ...  Boltzmann (LB) [18] methods and so on.  ... 
doi:10.3390/a11120213 fatcat:swrmqhkzujc3fjvozi3jckd7rm

SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance [chapter]

Guido Juckeland, William Brantley, Sunita Chandrasekaran, Barbara Chapman, Shuai Che, Mathew Colgrove, Huiyu Feng, Alexander Grund, Robert Henschel, Wen-Mei W. Hwu, Huian Li, Matthias S. Müller (+12 others)
2015 Lecture Notes in Computer Science  
The new benchmark comprises two suites of applications written in OpenCL and OpenACC and measures the performance of accelerators with respect to a reference platform.  ...  The SPEC High Performance Group (HPG) has developed a set of performance metrics to evaluate the performance and power consumption of accelerators for various science applications.  ...  SPEC R , SPEC ACCEL TM , SPEC CPU TM , SPEC MPI R , and SPEC OMP R are registered trademarks of the Standard Performance Evaluation Corporation (SPEC).  ... 
doi:10.1007/978-3-319-17248-4_3 fatcat:wcdquz4gqffsrihtu3olf5nuty

A Lightweight Approach to Performance Portability with targetDP [article]

Alan Gray, Kevin Stratford
2016 arXiv   pre-print
We demonstrate the effectiveness of our pragmatic approach by presenting performance results for a complex fluid application (with which the model was co-designed), plus a separate lattice QCD particle  ...  The concept of performance portability across such architectures, as well as traditional CPUs, is vital for the application programmer.  ...  Department of Energy under Contract No. DE-AC05-00OR22725, and the United Kingdom EPSRC funded ARCHER service. We thank Daniel Holmes and Arno Proeme for providing constructive feedback on this paper.  ... 
arXiv:1609.01479v2 fatcat:n2a5rzfgqfdbdn3fui6srewuva

Accelerated application development: The ORNL Titan experience

Wayne Joubert, Rick Archibald, Mark Berrill, W. Michael Brown, Markus Eisenbach, Ray Grout, Jeff Larkin, John Levesque, Bronson Messer, Matt Norman, Bobby Philip, Ramanan Sankaran (+2 others)
2015 Computers & electrical engineering  
The use of computational accelerators such as NVIDIA GPUs and Intel Xeon Phi processors is now widespread in the high performance computing community, with many applications delivering impressive performance  ...  However, programming these systems for high performance, performance portability and software maintainability has been a challenge.  ...  Acknowledgments The CAAR project was a large-scale effort requiring the participation of many individuals.  ... 
doi:10.1016/j.compeleceng.2015.04.008 fatcat:ir4nljoznvfvrbg3vdcjtggxc4

A lightweight approach to performance portability with targetDP

Alan Gray, Kevin Stratford
2016 The international journal of high performance computing applications  
We demonstrate the effectiveness of our pragmatic approach by presenting performance results for a complex fluid application (with which the model was codesigned), plus a separate lattice QCD particle  ...  The concept of performance portability across such architectures, as well as traditional CPUs, is vital for the application programmer.  ...  Department of Energy under Contract No. DE-AC05-00OR22725, and the United Kingdom EPSRC funded ARCHER service. We thank Daniel Holmes and Arno Proeme for providing constructive feedback on this paper.  ... 
doi:10.1177/1094342016682071 fatcat:7qo4qg4ernhkbczsk2wpozdfae

D7.2.2 Exploitation of HPC Tools and Techniques

Michael Lysaght, Bjorn Lindi, Vit Vondrak, John Donners, Marc Tajchman
2014 Zenodo  
The objective of PRACE-3IP Work Package 7 (WP7) 'Application Enabling and Support' is to provide applications enabling support for HPC applications codes which are important for European researchers to  ...  For a more detailed description of each of the exploitation projects summarised here, we refer the reader to the PRACE-3IP whitepaper associated with each of the 17 projects.  ...  with Accelerators using OpenACC and OpenCL Authors: A.  ... 
doi:10.5281/zenodo.6575525 fatcat:5y3cjsculrdejllndosbjpcgiq
« Previous Showing results 1 — 15 out of 55 results