On the characterization of OpenCL dwarfs on fixed and reconfigurable platforms

Konstantinos Krommydas, Wu-chun Feng, Muhsen Owaida, Christos D. Antonopoulos, Nikolaos Bellas
2014 2014 IEEE 25th International Conference on Application-Specific Systems, Architectures and Processors  
Using OpenDwarfs, we characterize a diverse set of fixed and reconfigurable parallel platforms: multicore CPUs, discrete and integrated GPUs, Intel Xeon Phi coprocessor, as well as a FPGA.  ...  interplay between dwarfs' patterns and the underlying hardware architecture of modern parallel platforms.  ...  Greek national funds through the Operational Program Education and Lifelong Learning of the National Strategic Reference Framework (NSRF) -Research Funding Program.  ... 
doi:10.1109/asap.2014.6868650 dblp:conf/asap/KrommydasFOAB14 fatcat:2jiwfdlfgjakroxde2n37y734y

OpenDwarfs: Characterization of Dwarf-Based Benchmarks on Fixed and Reconfigurable Architectures

Konstantinos Krommydas, Wu-chun Feng, Christos D. Antonopoulos, Nikolaos Bellas
2015 Journal of Signal Processing Systems  
Using OpenDwarfs, we characterize a diverse set of modern fixed and reconfigurable parallel platforms: multicore CPUs, discrete and integrated GPUs, Intel Xeon Phi co-processor, as well as a FPGA.  ...  interplay between dwarfs' patterns and the underlying hardware architecture of modern parallel platforms.  ...  Our latest piece of work [12] provided an extensive characterization of OpenDwarfs on fixed and reconfigurable target architectures.  ... 
doi:10.1007/s11265-015-1051-z fatcat:ifnbayv26zdttgeovidgjqtoue

Bridging the Performance-Programmability Gap for FPGAs via OpenCL: A Case Study with OpenDwarfs

Konstantinos Krommydas, Ahmed E. Helal, Anshuman Verma, Wu-Chun Feng
2016 2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)  
To improve the performance of OpenCL kernels on FPGAs, and thus, bridge the performance-programmability gap, we identify general techniques to optimize OpenCL kernels for FPGAs under device-specific hardware  ...  To address this lack of programmability of FPGAs, OpenCL provides an easy-to-use and portable programming model for CPUs, GPUs, APUs, and now, FPGAs.  ...  Figure 2 shows the performance of the OpenDwarfs on both the fixed and reconfigurable architectures.  ... 
doi:10.1109/fccm.2016.56 dblp:conf/fccm/KrommydasHVF16 fatcat:btrd7srlovgt3a2jdujkezbkwq

ASAP 2014 program

2014 2014 IEEE 25th International Conference on Application-Specific Systems, Architectures and Processors  
On the Characterization of OpenCL Dwarfs on Fixed and Reconfigurable Parallel Platforms 86 Edoardo Paone, Gianluca Palermo, Vittorio Zaccaria, Cristina Silvano and Davide Gadioli.  ...  A Reconfigurable Network-on-chip for Heterogeneous Many-core CMPs in the Dark Silicon Era 15 Simon Pontié and Paolo Maistri.  ... 
doi:10.1109/asap.2014.6868622 fatcat:qhye3xi6yzaoxfngx3ta64t4ku

Table of contents

2014 2014 IEEE 25th International Conference on Application-Specific Systems, Architectures and Processors  
(Ruan de Clercq, Frank Piessens, Dries Schellekens, Ingrid Verbauwhede) [Search] Regular Papers 153 On the Characterization of OpenCL Dwarfs on Fixed and Reconfigurable Platforms (Konstantinos  ...  Rakhmatov) Short Papers 57 RNS Modular Multiplication Through Reduced Base Extensions (Karim Bigou, Arnaud Tisserand) 63 On the Computation of the Reciprocal of Floating Point Expansions Using  ... 
doi:10.1109/asap.2014.6868611 fatcat:7bu4iqnpebaslcsnd4lwtcfcei

Exploration of Performance and Energy Trade-offs for Heterogeneous Multicore Architectures [article]

Anastasiia Butko, Florent Bruguier, David Novo, Abdoulaye Gamatié, Gilles Sassatelli
2019 arXiv   pre-print
This study further provides insights on the impact of workload nature on performance/energy trade-off and draws recommendations concerning suitable architecture configurations.  ...  We demonstrate that varying the level of heterogeneity as well as the different core ratio can lead to up to 2.3x gains in energy efficiency and up to 1.5x in performance.  ...  ACKNOWLEDGMENT The research leading to these results has received funding from the European Community's H2020 Program under the Mont-Blanc 3 Project (, grant agreement no. 671697  ... 
arXiv:1902.02343v1 fatcat:a5qfw3mtongqxnk2vnvbtgxpci

MGSim + MGMark: A Framework for Multi-GPU System Research [article]

Yifan Sun, Trinayan Baruah, Saiful A. Mojumder, Shi Dong, Rafael Ubal, Xiang Gong, Shane Treadway, Yuhui Bao, Vincent Zhao, José L. Abellán, John Kim, Ajay Joshi, David Kaeli
2018 arXiv   pre-print
The advent of such systems raises a number of design challenges, including the GPU microarchitecture, multi-GPU interconnect fabrics, runtime libraries and associated programming models.  ...  The rapidly growing popularity and scale of data-parallel workloads demand a corresponding increase in raw computational power of GPUs (Graphics Processing Units).  ...  On the other hand, training a DNN may require a multi-terabyte dataset [19], dwarfing the GPU memory capacity of a single GPU.  ... 
arXiv:1811.02884v3 fatcat:uqzjyera75dnnnpfeduess7qtq

Analytical Cost Metrics : Days of Future Past [article]

Nirmal Prajapati, Sanjay Rajopadhye, Hristo Djidjev
2018 arXiv   pre-print
With Moore's law driving the evolution of hardware platforms towards exascale, the dominant performance metric (time efficiency) has now expanded to also incorporate power/energy efficiency.  ...  Scientists and researchers are continuously investing in tuning the performance of extreme-scale computational problems.  ...  on these platforms.  ... 
arXiv:1802.01957v1 fatcat:r6lajnt75zb4xahkznt5gb4wx4

Performance modeling of heterogeneous systems

Jan Christian Meyer, Anne Cathrine Elster
2010 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)  
We describe a design to implement the BSPLib programming interface, combining threads and message-passing parallelism to achieve overlap on commodity cluster platforms, implementing its one-sided communication  ...  We augment and validate the cost model of one adapted synchronization algorithm with the corresponding bandwidth requirement, completing a framework for modeling BSPLib program performance.  ...  I also extend my gratitude to co-advisors Lasse Natvig and Jørn Aslak Amundsen, who have offered very helpful feedback and advice along the way.  ... 
doi:10.1109/ipdpsw.2010.5470682 dblp:conf/ipps/MeyerE10 fatcat:klz3oiau65fxxeshmomyeu47ny

A Manifesto for Future Generation Cloud Computing

Rajkumar Buyya, Marco A. S. Netto, Adel Nadjaran Toosi, Maria Alejandra Rodriguez, Ignacio M. Llorente, Sabrina De Capitani Di Vimercati, Pierangela Samarati, Dejan Milojicic, Carlos Varela, Rami Bahsoon, Marcos Dias De Assuncao, Satish Narayana Srirama (+13 others)
2018 ACM Computing Surveys  
ACKNOWLEDGMENTS We thank anonymous reviewers, Sartaj Sahni (Editor-in-Chief) and Antonio Corradi (Associate Editor) for their constructive suggestions and guidance on improving the content and quality  ...  We also thank Adam Wierman (California Institute of Technology), Shigeru Imai (Rensselaer Polytechnic Institute) and Arash Shaghaghi (University of New South Wales, Sydney) for their comments and suggestions  ...  This quantity is bound to grow many-fold, and may dwarf the size of data present on the public WWW, enterprises and mobile Clouds.  ... 
doi:10.1145/3241737 fatcat:bgb4qjtm5zgcbbtyy6x6anfeju

Eurolab-4-HPC Long-Term Vision on High-Performance Computing [article]

Theo Ungerer, Paul Carpenter
2018 arXiv   pre-print
The proposal on research topics is derived from the report and discussions within the road mapping expert group.  ...  Because of the long-term perspective and its speculative nature, the authors started with an assessment of future computing technologies that could influence HPC hardware and software.  ...  CUDA, OpenCL, OpenACC, and OpenMP 4.0, and consolidation on an open, vendor-neutral and widely used standard is needed [12].  ... 
arXiv:1807.04521v1 fatcat:5neetrgubjhnvcajcktpkohrzq

InterNoC: Unified Deterministic Communication For Distributed NoC-based Many-Core

Eleftherios Kyriakakis, Jens Sparso, Martin Schoeberl
2019 Zenodo  
Acknowledgments This work is part of a project that has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 779882.  ...  We present an Integer Linear Programming (ILP) based scheduler on a real life drone application, that minimizes energy consumption, guarantees timing and offers security.  ...  The task may be scheduled using fixed priority, therefore it may be characterized by parameter priority Pi.  ... 
doi:10.5281/zenodo.5851207 fatcat:rpvcg7lr6vfvheufpbfxl2kpv4

Interlanguages and synchronic models of computation [article]

Alexander Victor Berka
2010 arXiv   pre-print
The devices of one a-Ram model, called the Synchronic A-Ram, are fully connected and simpler than FPGA LUTs.  ...  An interstring linked with an abstract machine environment, shares sub-expressions, transfers data, and spatially allocates resources for the parallel evaluation of dataflow.  ...  The architectures presented in this chapter however, need fast connectivity on the inter and intra chip scale, and a very large number of fixed and reconfigurable links, with small amounts of data being  ... 
arXiv:1005.5183v1 fatcat:eiayu7sttbfslblhdoj3ggwnwu

GPGPU-based Gaussian Filtering for Surface Metrological Data Processing

Yang Su, Zhijie Xu, Xiangqian Jiang
2008 2008 12th International Conference Information Visualisation  
that the power, ubiquity and low cost of GPUs makes them an ideal alternative platform for high-performance computing.  ...  He was willing to take a chance on my research from the beginning, and has always pushed me to fill in that one last detail to elevate the level of my thinking and my work.  ...  • Syntax and Semantics for the Adaptable Program At the end of 2008, Apple, AMD and Nvidia have jointly released the Open Computing Language (OpenCL) as a future programming model and platform for  ... 
doi:10.1109/iv.2008.14 dblp:conf/iv/SuXJ08 fatcat:lpagxjxstjbj5lcdumztlpgolu

Portuguese SKA White Book [article]

Domingos Barbosa, Sonia Antón, João Paulo Barraca, Miguel Bergano, Alexandre C. M. Correia, Dalmiro Maia, Valério A. R. M. Ribeiro
2020 arXiv   pre-print
This white book stems from the contributions presented at the Portuguese SKA Days, held on the 6th and 7th February 2018 with the presence of the SKA Deputy Director General Alistair McPherson and the  ...  The meeting was very successful in providing a detailed overview of the SKA status, vision and goals and describes most of the Portuguese contributions to science, technology and the related industry aspirations  ...  support about the international interest on the Azores VLBI cluster.  ... 
arXiv:2005.01140v1 fatcat:gijmylrmvfcj3i7dt3wbsr6vhm