6,012 Hits in 10.2 sec

Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms

Hartwig Anzt, Vincent Heuveline, Jose I. Aliaga, Maribel Castillo, Juan C. Fernandez, Rafael Mayo, Enrique S. Quintana-Orti
2011 2011 International Green Computing Conference and Workshops  
In this paper we analyze the computational and power performance of iterative linear solvers applied to sparse systems arising in several scientific applications.  ...  We then apply techniques that set the (multi-core processor in the) host system to a low-consuming state for the time that the GPU is executing.  ...  The authors from the Universidad Jaume I were supported by project CICYT TIN2008-06570-C04-01 and FEDER.  ... 
doi:10.1109/igcc.2011.6008594 dblp:conf/green/AnztHACFMQ11 fatcat:j5xnjer6qnegrewxmbo7nqjzba

Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors

Hartwig Anzt, Maribel Castillo, Juan C. Fernández, Vincent Heuveline, Francisco D. Igual, Rafael Mayo, Enrique S. Quintana-Ortí
2011 Computer Science - Research and Development  
In this paper, we analyze the power consumption of different GPU-accelerated iterative solver implementations enhanced with energy-saving techniques.  ...  Specifically, while conducting kernel calls on the graphics accelerator, we manually set the host system to a power-efficient idle-wait status so as to leverage dynamic voltage and frequency control.  ...  The authors from the Karlsruhe Institute of Technology (KIT) thank the Landesstiftung Baden Württemberg for its financial support in the framework of the project "Multiscale Ensemble forecasting on HPC-systems  ... 
doi:10.1007/s00450-011-0195-8 fatcat:po7yyo347nbihagg2hh77beeqi

Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems

Hartwig Anzt, Vincent Heuveline, Bjorn Rocker, Maribel Castillo, Juan C. Fern´ndez, Rafael Mayo, Enrique S. Quintana-Orti
2011 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum  
The time and energy consumption of individual parts of the algorithm are monitored as well, enabling a deeper insight and the possibility of optimizing the energy consumption of the code on a general-purpose  ...  This paper presents a detailed analysis of a mixed precision iterative refinement solver applied to a linear system obtained from the 2D discretization of a fluid flow problem.  ...  The authors from the Universidad Jaume I were supported by project CICYT TIN2008-06570-C04-01 and FEDER.  ... 
doi:10.1109/ipdps.2011.226 dblp:conf/ipps/AnztHRCFMQ11 fatcat:z4ny7kpujzffnpgrrv3nmwvbvq

Energy-Aware High Performance Computing [chapter]

Martin Wlotzka, Vincent Heuveline, Manuel F. Dolz, M. Reza Heidari, Thomas Ludwig, A. Cristiano I. Malossi, Enrique S. Quintana-Orti
2017 ICT - Energy Concepts for Energy Efficiency and Sustainability  
First, we present energy-aware runtimes on shared memory multi-core platforms for the Conjugate Gradient method.  ...  Recently, considerable performance gains resulted predominantly from developments in multi-core, many-core and accelerator technology.  ...  General-purpose multi-core servers: energy-aware runtimes for (sparse) linear algebra Linear algebra operations and, in particular, sparse linear systems are a fundamental building block in many scientific  ... 
doi:10.5772/66404 fatcat:bpzz2exlibczfe4jjkaia7gn7q

Evaluating fault tolerance on asymmetric multicore systems-on-chip using iso-metrics

Charalampos Chalios, Enrique S. Quintana-Ortí, Dimitrios S. Nikolopoulos, Sandra Catalán
2016 IET Computers & Digital Techniques  
platform, and state-of-the-art implementations of the CG solver.  ...  The end of Dennard scaling has promoted low power consumption into a firstorder concern for computing systems.  ...  Experimental setup The CG method The CG method is a key algorithm for the numerical solution of linear systems of the form Ax = b, where A є R nxn is symmetric positive definite (SPD) and sparse, b є  ... 
doi:10.1049/iet-cdt.2015.0056 fatcat:6y37fnozqrgbvkzgn53t5r5rgi

Improving the energy efficiency of sparse linear system solvers on multicore and manycore systems

H. Anzt, E. S. Quintana-Orti
2014 Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences  
improve the energy performance of sparse linear system solvers, without negatively impacting their performance.  ...  For the particular domain of sparse linear algebra, we analyse the energy efficiency of a broad collection of hardware architectures and investigate how algorithmic and implementation modifications can  ...  Quintana-Ortí was supported by project CICYT TIN2011-23283 of the Ministerio de Ciencia e Innovación and FEDER and the EU Project FP7 318793 'EXA2Green'.  ... 
doi:10.1098/rsta.2013.0279 pmid:24842036 fatcat:kw7cnmvzrff6pmihhqenl53uwm

Characterizing the efficiency of multicore and manycore processors for the solution of sparse linear systems

José I. Aliaga, María Barreda, Ernesto Dufrechou, Pablo Ezzatti, Enrique S. Quintana-Ortí
2015 Computer Science - Research and Development  
We analyze the efficiency of servers equipped with state-of-the-art general-purpose multicore processors as well as platforms based on accelerators such as graphics processing units (GPUs) and the Intel  ...  linear systems that comprises numerical kernels and data access patterns analogous to those of HPCG.  ...  Acknowledgements This work was supported by the CI-CYT project TIN2011-23283 of MINECO and FEDER, and the EU Project FP7 318793 "EXA2GREEN".  ... 
doi:10.1007/s00450-015-0299-7 fatcat:grbuldgrjvdlplsj2vp4iibp6i

Multi Objective Optimization of HPC Kernels for Performance, Power, and Energy [chapter]

Prasanna Balaprakash, Ananta Tiwari, Stefan M. Wild
2014 Lecture Notes in Computer Science  
The Intel Xeon Phi’s Many Integrated Core (MIC) architec- ture serves as a platform that allows us to explore the tradeoffs among concur- rency, power, and performance on nodes with many simple cores, a  ...  It assembles a sparse linear system from a steady-state heat conduction problem on a brick-shaped domain of linear, 8-node hex elements.  ... 
doi:10.1007/978-3-319-10214-6_12 fatcat:y2vttotb25g27k53r5ft4jiwgy

Performance Evaluation and Optimization of HPCG benchmark on CPU + MIC platform

Qingyi Pan, Xiaoying Wang
2016 International Journal of Hybrid Information Technology  
Through the analysis of HPCG source code and optimization for CPU + MIC platforms, practical significance and the value of further research is put forward.  ...  In the paper, we used the CPU + MIC heterogeneous computing platforms, and successfully ported HPCG to the platform.  ...  Finally, the power consumption of the HPC system is 2839.52W in total.  ... 
doi:10.14257/ijhit.2016.9.11.21 fatcat:cxpkuzahj5grhhlrajt6t2hm3y

An Embedded Programmable Processor for Compressive Sensing Applications

Mehdi Safarpour, Ilkka Hautala, Olli Silven
2018 2018 IEEE Nordic Circuits and Systems Conference (NORCAS): NORCHIP and International Symposium of System-on-Chip (SoC)  
The flexible hardware design implemented on an FPGA achieves up to 7.80Ksample/s recovery at a power dissipation of 42μJ/sample and beats both ARM and NIOS in total power consumption.  ...  An application specific programmable processor is designed based on the analysis of a set of greedy recovery Compressive Sensing (CS) algorithms.  ...  ACKNOWLEDGMENTS The support of the Academy of Finland for the ICONICAL project is gratefully acknowledged.  ... 
doi:10.1109/norchip.2018.8573494 dblp:conf/norchip/SafarpourHS18 fatcat:eer6qc4sf5hlnojvzt5f3nomyq

Sparse matrix factorization in the implicit finite element method on petascale architecture

Seid Koric, Anshul Gupta
2016 Computer Methods in Applied Mechanics and Engineering  
The performance of the massively parallel direct multifrontal solver Watson Sparse Matrix Package (WSMP) for solving large sparse systems of linear equations arising in implicit finite element method on  ...  unstructured (free) meshes in solid mechanics was evaluated on one of the most powerful supercomputers currently available to the open science community-the sustained petascale high performance computing  ...  Blue Waters is supported by the National Science Foundation (award numbers OCI 07-25070 and ACI-1238993) and the state of Illinois.  ... 
doi:10.1016/j.cma.2016.01.011 fatcat:7fvypjoinrcmnigv3hldkqfcxe

Energy-efficient algebra kernels in FPGA for High Performance Computing

Federico Favaro, Ernesto Dufrechou, Pablo Ezzatti, Juan Pablo Oliver
2021 Journal of Computer Science and Technology  
Specifically, we compare the behavior of fine-tuned kernels in a multi-core CPU processor and HLS implementations on FPGAs.  ...  The dissemination of multi-core architectures and the later irruption of massively parallel devices, led to a revolution in High-Performance Computing (HPC) platforms in the last decades.  ...  Acknowledgements We acknowledge the ANII -MPG Independent Research Groups: "Efficient Hetergenous Computing" with the CSC group.  ... 
doi:10.24215/16666038.21.e09 fatcat:mdmtw5cxrbhktn22mtl42xsp3a

Accelerating advanced preconditioning methods on hybrid architectures

Ernesto Dufrechou
2021 CLEI Electronic Journal  
Many problems, in diverse areas of science and engineering, involve the solution of largescale sparse systems of linear equations.  ...  This dissertation targets the use of GPUs to enhance the performance of the solution of sparse linear systems using iterative methods complemented with state-of-the-art preconditioned techniques.  ...  Acknowledgments Special thanks to the advisors of the thesis, Dr. Pablo Ezzatti (Universidad de la República, Uruguay) and Dr. Enrique Quintana-Ortí (Universitat Politècnica de València, Spain).  ... 
doi:10.19153/cleiej.24.1.6 doaj:cf900516b6334e27afbe4102fa203079 fatcat:ohhmcgyrl5hgfaib7hb2rdyhom

Sunway supercomputer architecture towards exascale computing: analysis and practice

Jiangang Gao, Fang Zheng, Fengbin Qi, Yajun Ding, Hongliang Li, Hongsheng Lu, Wangquan He, Hongmei Wei, Lifeng Jin, Xin Liu, Daoyong Gong, Fei Wang (+5 others)
2021 Science China Information Sciences  
Moreover, this paper proposes the Sunway computer architecture towards exascale computing in which the many-core processor, network chipset and software system are all domestically-designed.  ...  Then, the major challenges of exascale supercomputer, such as scalability, power consumption, data movement, programming and availability, are thoroughly analyzed, and the corresponding appropriate solutions  ...  Secondly, L0 Cache is employed to reduce the power consumption of instruction fetching and decoding through multi-mode iterative optimization and benefits evaluation.  ... 
doi:10.1007/s11432-020-3104-7 fatcat:ocmhnpa2dng2lhqhldgbcdfw2a

GPU-based N-1 Static Security Analysis Algorithm with Preconditioned Conjugate Gradient Method

Meng Fu, Gan Zhou, Jiahao Zhao, Yanjun Feng, Huan He, Kai Liang
2020 IEEE Access  
First, a shared preconditioner is selected to accelerate preprocessing of the iterative method for fast decoupled power flow (FDPF) in N-1 SSA.  ...  Case studies on a practical 10828-bus system show that the GPU-based N-1 SSA algorithm with the batch-PCG solver is 4.90 times faster than a sequential algorithm on an 8-core CPU.  ...  Power flow calculation imposes a computational burden that is identical to the one involved in solving sparse linear system of equations (SLSE).  ... 
doi:10.1109/access.2020.3004713 fatcat:k6wbjj6elngvfi2r376fudpibq
« Previous Showing results 1 — 15 out of 6,012 results