638 Hits in 8.3 sec

Mapping parallel programs to heterogeneous CPU/GPU architectures using a Monte Carlo Tree Search

Mehdi Goli, John McCall, Christopher Brown, Vladimir Janjic, Kevin Hammond
2013 2013 IEEE Congress on Evolutionary Computation  
In this paper we describe a new technique that derives, automatically, optimal mappings for an application onto a heterogeneous architecture, using a Monte Carlo Tree Search algorithm.  ...  Additionally, with this increasing heterogeneity comes increasing complexity: not only does the programmer have to worry about where and how to express the parallelism, they must also express an efficient  ...  However, this is not the case for heterogeneous (CPU/GPU) architecture. There it is desirable to take advantage of GPU processing for suitable components when possible.  ... 
doi:10.1109/cec.2013.6557926 dblp:conf/cec/GoliMBJH13 fatcat:3srkgbeitfcdzk2i5zm7yu5a7u

Performance Models for CPU-GPU Data Transfers

B. van Werkhoven, J. Maassen, F.J. Seinstra, H.E. Bal
2014 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing  
Overlapping GPU computation with CPU-GPU communication can reduce the costs of moving data.  ...  Several different techniques exist for transferring data to and from GPU memory and for overlapping those transfers with GPU computation. It is currently not known when to apply which method.  ...  We apply our performance models to two different kernels from the Parallel Ocean Program [20] .  ... 
doi:10.1109/ccgrid.2014.16 dblp:conf/ccgrid/WerkhovenMSB14 fatcat:mqpnadib35evlkjbfgc6hgisui

Design Patterns for Sparse-Matrix Computations on Hybrid CPU/GPU Platforms

Valeria Cardellini, Salvatore Filippone, Damian W.I. Rouson
2014 Scientific Programming  
We demonstrate how to use design patterns to implement an interface for sparse matrix computations on NVIDIA GPUs starting from PSBLAS, an existing sparse matrix library, and from existing sets of GPU  ...  We apply object-oriented software design patterns to develop code for scientific software involving sparse matrices.  ...  Acknowledgements We gratefully acknowledge the support coming from "HPC Grant 2011 on GPU clusters" provided by the "Consorzio interuniversitario per le applicazioni di supercalcolo per università e ricerca  ... 
doi:10.1155/2014/469753 fatcat:jorjixkn3rf5zoulm6msvk6xlm

Enhanced Energy Efficiency with the Actor Model on Heterogeneous Architectures [chapter]

Yaroslav Hayduk, Anita Sobe, Pascal Felber
2016 Lecture Notes in Computer Science  
For the first three implemen-tations of k-means (CPU/GPU), we chose a default data set from the STAMP benchmark with 65,536 input rows and 16 clusters.  ...  for each input point); (4) per-cluster member count structure (holds the number of points assigned to each cluster).  ... 
doi:10.1007/978-3-319-39577-7_1 fatcat:5gojhzjtgfdp7nne3mdknrqh44

OpenABLext: An automatic code generation framework for agent‐based simulations on CPU‐GPU‐FPGA heterogeneous platforms

Jiajian Xiao, Philipp Andelfinger, Wentong Cai, Paul Richmond, Alois Knoll, David Eckhoff
2020 Concurrency and Computation  
We illustrate how co-execution can be used to further lower execution times. OpenABLext can be seen as an enabler to tap the computing power of heterogeneous hardware platforms for ABS.  ...  However, in heterogeneous hardware environments, it can become increasingly difficult to find viable partitions of the simulation and provide implementations for different hardware devices.  ...  license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes.  ... 
doi:10.1002/cpe.5807 fatcat:7hclxt7eozcnhffsgrqbrxxtqy

Database Architectures for Modern Hardware (Dagstuhl Seminar 18251)

Peter A. Boncz, Goetz Graefe, Binsheng He, Kai-Uwe Sattler, Michael Wagner
2019 Dagstuhl Reports  
both the software and hardware sides and to foster cross-cutting architectural discussions.  ...  Based on the broad consensus that this rethinking requires expertise from different research disciplines, the goal of this seminar was to bring together researchers and practitioners from these areas representing  ...  Aim of this work group was to develop a universal system architecture for integrating heterogeneous computing resources such as CPUs, GPUs and FPGAs into a database management system.  ... 
doi:10.4230/dagrep.8.6.63 dblp:journals/dagstuhl-reports/BonczGHS18 fatcat:iepn4bmjavgdvegsns7huv72re

Towards an Energy-Aware Framework for Application Development and Execution in Heterogeneous Parallel Architectures [chapter]

Karim Djemame, Richard Kavanagh, Vasilios Kelefouras, Adrià Aguilà, Jorge Ejarque, Rosa M. Badia, David García Pérez, Clara Pezuela, Jean-Christophe Deprez, Lotfi Guedria, Renaud De Landtsheer, Yiannis Georgiou
2018 Hardware Accelerators in Data Centers  
This leads to a new cross-layer programming approach for heterogeneous parallel hardware architectures featuring software and hardware modelling.  ...  operation for Heterogeneous Parallel Hardware (HPA) environments.  ...  CPU, GPU, heterogeneous CPU+GPU chips, FPGA and heterogeneous multi-processor clusters all of which with various memory hierarchies, size and access performance properties.  ... 
doi:10.1007/978-3-319-92792-3_7 fatcat:hjiue3alirhvxmdwcsz7amlese

GPU-Based Embedded Intelligence Architectures and Applications

Li Minn Ang, Kah Phooi Seng
2021 Electronics  
This paper present contributions to the state-of-the art for graphics processing unit (GPU-based) embedded intelligence (EI) research for architectures and applications.  ...  This paper aims to give useful insights for the research area and motivate researchers towards the development of GPU-based EI for practical deployment and applications.  ...  The authors in [60] proposed an event-based and time-driven SNN simulator for a hybrid CPU-GPU platform.  ... 
doi:10.3390/electronics10080952 fatcat:paubm2sevbhixi2in63ayflmti

Cloud architecture for plant phenotyping research

Olivier Debauche, Sidi Ahmed Mahmoudi, Nicolas De Cock, Saïd Mahmoudi, Pierre Manneback, Frédéric Lebeau
2020 Concurrency and Computation  
Cloud architectures offer means to store a wide range of huge and heterogeneous data. In addition, it hosts a large quantity of specific models and softwares to process these data.  ...  KEYWORDS cloud architecture, digital phenotyping, lambda architecture, plant phenotyping, research application hosting platform INTRODUCTION With the grow of global population, the need for crop production  ...  The authors would also like to thank Mr Adriano Guttadauria for his technical support and for setting up all the electronic and informatic systems necessary for carrying out this cloud architecture.  ... 
doi:10.1002/cpe.5661 fatcat:d56lx4zndndbdmvlnajnoikdpa

Edge Intelligence: Architectures, Challenges, and Applications [article]

Dianlei Xu, Tong Li, Yong Li, Xiang Su, Sasu Tarkoma, Tao Jiang, Jon Crowcroft, Pan Hui
2020 arXiv   pre-print
Although recently emerged, spanning the period from 2011 to now, this field of research has shown explosive growth over the past five years.  ...  For each category, we elaborate, compare and analyse the literature from the perspectives of adopted techniques, objectives, performance, advantages and drawbacks, etc.  ...  ., CPU, GPU, memory, and network, which makes it impossible to be available anytime and anywhere for end users.  ... 
arXiv:2003.12172v2 fatcat:xbrylsvb7bey5idirunacux6pe

Many-Task Computing on Many-Core Architectures

Pedro Valero-Lara, Poornima Nookala, Fernando L. Pelayo, Johan Jansson, Serapheim Dimitropoulos, Ioan Raicu
2016 Scalable Computing : Practice and Experience  
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as cluster, grids, cloud and supercomputers, but it is not so popular in shared memory parallel processors.  ...  In this paper, authors present what are those programming mechanisms to take advantages of such massively parallel features for the particular target of MTC.  ...  After that, in [40] , it is proposed a new heterogeneous (CPU-GPU) scheduler in which groups of independent blocks of tasks were efficiently managed to fully use CPU-GPU and reduce the overhead of memory  ... 
doi:10.12694/scpe.v17i1.1148 fatcat:tv2rk556ujdgtbvai4vxggezuy

TANGO: Transparent heterogeneous hardware Architecture deployment for eNergy Gain in Operation [article]

Karim Djemame and Django Armstrong and Richard Kavanagh and Jean-Christophe Deprez and Ana Juan Ferrer and David Garcia Perez and Rosa Badia and Raul Sirvent and Jorge Ejarque and Yiannis Georgiou
2016 arXiv   pre-print
The paper is concerned with the issue of how software systems actually use Heterogeneous Parallel Architectures (HPAs), with the goal of optimizing power consumption on these resources.  ...  It argues the need for novel methods and tools to support software developers aiming to optimise power consumption resulting from designing, developing, deploying and running software on HPAs, while maintaining  ...  Acknowledgments This work is partly supported by the European Commission under H2020-ICT-20152 contract 687584 -Transparent heterogeneous hardware Architecture deployment for eNergy Gain in Operation (  ... 
arXiv:1603.01407v1 fatcat:3yjffrybxfbondmjgq5vy5fjd4

Energy Efficient Computing Systems: Architectures, Abstractions and Modeling to Techniques and Standards [article]

Rajeev Muralidhar and Renata Borovica-Gajic and Rajkumar Buyya
2020 arXiv   pre-print
We have now entered the era of domain-specific architectures for new workloads like AI and ML.  ...  parallelism (ILP) and the end of Dennard's scaling drove the industry towards multi-core chips.  ...  Fig 1, referenced from [96] shows [96] , [57] , [63] The nature of computing systems has thus changed across the spectrum of devices, from being pure compute-based to being a mixture of CPUs, GPUs  ... 
arXiv:2007.09976v2 fatcat:enrfj2qgerhyteapwykxcb5pni

Hardware Architectures for Real-Time Medical Imaging

Eduardo Alcaín, Pedro R. Fernández, Rubén Nieto, Antonio S. Montemayor, Jaime Vilas, Adrian Galiana-Bordera, Pedro Miguel Martinez-Girones, Carmen Prieto-de-la-Lastra, Borja Rodriguez-Vila, Marina Bonet, Cristina Rodriguez-Sanchez, Imene Yahyaoui (+4 others)
2021 Electronics  
This paper focuses on the evolution and the application of different hardware architectures (namely, CPU, GPU, DSP, FPGA, and ASIC) in medical imaging through various specific examples and discussing different  ...  The main purpose is to provide a general introduction to hardware acceleration techniques for medical imaging researchers and developers who need to accelerate their implementations.  ...  The FPGAs have evolved from the homogeneous structure to a more heterogeneous architecture with a wide range of specific blocks.  ... 
doi:10.3390/electronics10243118 fatcat:ek4yfnw23naijkmrzhqwf5olqe

Runtime Resource Management in Heterogeneous System Architectures: The SAVE Approach

Gianluca C. Durelli, Marcello Pogliani, Antonio Miele, Christian Plessl, Heinrich Riebler, Marco D. Santambrogio, Gavin Vaz, Cristiana Bolchini
2014 2014 IEEE International Symposium on Parallel and Distributed Processing with Applications  
The SAVE project will develop a Heterogeneous System Architecture that will decide at runtime to execute task on the appropriate kind of resources, based on the current requirements.  ...  This scenario has propelled an interest towards self-adaptive systems that dynamically reorganize the use of system resources to optimize for a given goal.  ...  It specifically searches for RTCS: main components and interaction with the Orchestrator and with the virtualized CPU, GPU and DFE resources. Fig. 2. Structure of the two-layer Orchestrator.  ... 
doi:10.1109/ispa.2014.27 dblp:conf/ispa/DurelliPMPRSVB14 fatcat:ccsfd3f2tnhijged767ljiww2i
« Previous Showing results 1 — 15 out of 638 results