Optimizing bandwidth and power of graphics memory with hybrid memory technologies and adaptive data migration

Jishen Zhao, Yuan Xie
2012 Proceedings of the International Conference on Computer-Aided Design - ICCAD '12  
In this paper, we propose a hybrid graphics memory architecture with different memory technologies (DRAM, STT-RAM, and RRAM), to improve the memory bandwidth and reduce the power consumption.  ...  In addition, we present an adaptive data migration mechanism that exploits various memory access patterns of GPGPU applications for further memory power reduction.  ...  Both memory bandwidth and power consumption are improved with our hybrid graphics memory design. Emerging memory technologies STT-RAM is the latest generation of magnetic RAM (MRAM) [17].  ... 
doi:10.1145/2429384.2429400 dblp:conf/iccad/ZhaoX12 fatcat:rfw4i3a4qrdc3oeqa75q2g22gi

A survey of power management techniques for phase change memory

Sparsh Mittal
2016 International Journal of Computer Aided Engineering and Technology  
Since PCM (phase change memory) provides high-density, good scalability and non-volatile data storage, it has received significant amount of attention in recent years.  ...  The aim of this paper is to provide insights to researchers into working of PCM power-management techniques and also motivate them to propose even better techniques for designing future 'green' PCM-based  ...  They model the problem of data migration as a shortest path problem. Also, they propose an approach to find the optimal data migration path with minimal cost for both dirty data and clean data.  ... 
doi:10.1504/ijcaet.2016.10000092 fatcat:gnowq3m4jvfuhbqx2jhc27y27m

A Survey Of Techniques for Architecting DRAM Caches

Sparsh Mittal, Jeffrey S. Vetter
2016 IEEE Transactions on Parallel and Distributed Systems  
Recent trends of increasing core-count and memory/bandwidth-wall have led to major overhauls in chip architecture.  ...  In face of increasing cache capacity demands, researchers have now explored DRAM, which was conventionally considered synonymous to main memory, for designing large last level caches.  ...  [12] optimize bandwidth by using adaptive page allocation in DRAM, which is different from Meza et al. [32] and Gulur et al. [21] who use adaptive fetch granularity to optimize bandwidth.  ... 
doi:10.1109/tpds.2015.2461155 fatcat:tqg5hgv64bfnbf6m5c6v4mh5sa

Memory and Storage System Design with Nonvolatile Memory Technologies

Jishen Zhao, Cong Xu, Ping Chi, Yuan Xie
2015 IPSJ Transactions on System LSI Design Methodology  
and energy efficiency provided by traditional memory technologies.  ...  The memory hierarchy is becoming a fundamental performance and energy bottleneck, due to the widening gap between the increasing bandwidth and energy demands of modern applications and the limited performance  ...  [56] proposed an adaptive block placement and migration policy used in an SRAM/STT-MRAM hybrid last-level cache.  ... 
doi:10.2197/ipsjtsldm.8.2 fatcat:hjmdxg6wgzblpess3ejbdqa2m4

Energy Saving Techniques for Phase Change Memory (PCM) [article]

Sparsh Mittal
2013 arXiv   pre-print
Towards this, researchers have proposed use of non-volatile memory, such as phase change memory (PCM), which has low read latency and power; and nearly zero leakage power.  ...  However, the write latency and power of PCM are very high and this, along with limited write endurance of PCM present significant challenges in enabling wide-spread adoption of PCM.  ...  They model the problem of data migration as a shortest path problem. Also, they propose an approach to find the optimal data migration path with minimal cost for both dirty data and clean data.  ... 
arXiv:1309.3785v1 fatcat:w4ehvlzitjgfnkhzlals3abvoq

GPU Parallel Computation in Bioinspired Algorithms: A Review [chapter]

M. G. Arenas, G. Romero, A. M. Mora, P. A. Castillo, J. J. Merelo
2012 Studies in Computational Intelligence  
Advances in the video gaming industry have led to the production of low-cost, high-performance graphics processing units that possess more memory bandwidth and computational capability than central processing  ...  , particularly in the fields of computational biology and bioinformatics.  ...  P08-TIC-03928 projects, and the Jaén University UJA-08-16-30 project.  ... 
doi:10.1007/978-3-642-30154-4_6 fatcat:hs6jd4uvavcfxl7e374fjcufg4

Conceptual and Technical Challenges for High Performance Computing [article]

Claude Tadonki
2020 arXiv   pre-print
The first step is clearly the method, which is a conjunction of modelling with specific considerations (hypothesis, simplifications, constraints, to name a few) and a corresponding algorithm, which could  ...  The present note will discuss the aforementioned points, interleaved with commented contributions from the literature and our personal views.  ...  To achieve good performance and scalability in a Cloud environment with geographically distributed datacenters, data migration should be prevented at the best (through tasks migration instead, processing-migration  ... 
arXiv:2010.02769v1 fatcat:jdndyb3w6ncqjbriax25u6zofe

Adaptation Strategies in Multiprocessors System on Chip [chapter]

Remi Busseuil, Gabriel Marchesan Almeida, Luciano Ost, Sameer Varyani, Gilles Sassatelli, Michel Robert
2012 IFIP Advances in Information and Communication Technology  
Memory organization is a crucial criterion in MPSoC performance optimization, as memory access latency of remote data increases exponentially with respect to the number of cores.  ...  This chapter explores three different adaptable mechanisms, and shows their benefits: frequency scaling, task migration techniques and memory organization.  ...  two task migration mechanisms into an RTL-modeled NoC-based homogeneous MPSoC, leading to accurate results, and (iii) a hybrid memory architecture that can be employed to optimize the performance of task  ... 
doi:10.1007/978-3-642-28566-0_10 fatcat:lqxg2ujikndgjjvnxrli6x5cza

A Review on Virtual Machine Positioning and Consolidation Strategies for Energy Efficiency in Cloud Data Centers

Nahuru Ado Sabongari, Abdulsalam Ya'u, Souley Boukari, Badamasi Ja'afaru, Muhammad Auwal, Haruna Chiroma
2020 International Journal of Advanced Computer Science and Applications  
Strategies for positioning and transformation of VM maintain their usefulness as a roadmap to maximum consolidation. The latest techniques do complex restructuring, thus optimizing VM's positioning.  ...  The paper provides a detailed state-of-the-art strategies for VM positioning and consolidation that help improve energy efficiency in cloud data centers.  ...  The method does not consider network bandwidth Heuristic algorithm [35] Power Optimization Multi-Objective CPU and Memory Ant Colony System (ACS) Total energy consumption The method  ... 
doi:10.14569/ijacsa.2020.0110687 fatcat:zju2e6czezfhhn2kxn3iayk77m

Integration Challenges and Tradeoffs for Terascale Architectures

Mani Azimi
2007 Intel Technology Journal  
However, the realization of tera-scale architecture is challenged by on-die power dissipation, wire delays, off-chip memory bandwidth, process variations, and higher failure rates.  ...  Limited off-chip memory bandwidth requires innovations in the cache hierarchy, memory subsystem, and coherence protocol.  ...  ACKNOWLEDGMENTS We thank Yatin Hoskote, Dennis Brzezinski, David James, Mani Ayyar, Ching-Tsun Chou, Rama Menon, Saikat (Roy) Saharoy, Theodore Tabe, Hari Thantry, and Jianping (Jane) Xu for their contributions  ... 
doi:10.1535/itj.1103.01 fatcat:2foqn3s4nrexxezbqvkjpvxcqy

Cloud-Based Augmentation for Mobile Devices: Motivation, Taxonomies, and Open Challenges

Saeid Abolfazli, Zohreh Sanaei, Ejaz Ahmed, Abdullah Gani, Rajkumar Buyya
2014 IEEE Communications Surveys and Tutorials  
Augmented mobile devices envision to perform extensive computations and to store big data beyond their intrinsic capabilities with least footprint and vulnerability.  ...  We critically analyze the state-of-the-art CMA approaches and classify them into four groups of distant fixed, proximate fixed, proximate mobile, and hybrid to present a taxonomy.  ...  Unlike CloneCloud, VEE aims to reduce latency by migrating the segment of data stack explicitly created and owned by the application to the VM instead of copying the entire memory; cloning the entire memory  ... 
doi:10.1109/surv.2013.070813.00285 fatcat:77v7cyzdwrbonll45npxl4loxa

Cloud-Based Augmentation for Mobile Devices: Motivation, Taxonomies, and Open Challenges

Saeid Abolfazli, Zohreh Sanaei, Ejaz Ahmad, Abdullah Gani, Rajkumar Buyya
2014 Figshare  
Augmented mobile devices envision to perform extensive computations and to store big data beyond their intrinsic capabilities with least footprint and vulnerability.  ...  We critically analyze the state-of-the-art CMA approaches and classify them into four groups of distant fixed, proximate fixed, proximate mobile, and hybrid to present a taxonomy.  ...  Unlike CloneCloud, VEE aims to reduce latency by migrating the segment of data stack explicitly created and owned by the application to the VM instead of copying the entire memory; cloning the entire memory  ... 
doi:10.6084/m9.figshare.1038330 fatcat:y3ybv3eujjbu3o45pbh3tgid4u

High-Performance Energy-Efficient Multicore Embedded Computing

A. Munir, S. Ranka, A. Gordon-Ross
2012 IEEE Transactions on Parallel and Distributed Systems  
This paper outlines typical requirements of embedded applications and discusses state-of-the-art hardware/software high-performance energy-efficient embedded computing (HPEEC) techniques that help meeting  ...  With Moore's law supplying billions of transistors on-chip, embedded systems are undergoing a transition from single-core to multicore to exploit this high-transistor density for high performance.  ...  Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSERC and the NSF.  ... 
doi:10.1109/tpds.2011.214 fatcat:vagqmojdsjevvc2u2ewqrcjjpq

Sunway supercomputer architecture towards exascale computing: analysis and practice

Jiangang Gao, Fang Zheng, Fengbin Qi, Yajun Ding, Hongliang Li, Hongsheng Lu, Wangquan He, Hongmei Wei, Lifeng Jin, Xin Liu, Daoyong Gong, Fei Wang (+5 others)
2021 Science China Information Sciences  
Then, the major challenges of exascale supercomputer, such as scalability, power consumption, data movement, programming and availability, are thoroughly analyzed, and the corresponding appropriate solutions  ...  The technology roadmap of Sunway supercomputer will hold the comprehensive design methods for the architecture, including the processor, interconnect network, assembly structure, power supply, cooling  ...  ) of the above topics: (1) the applications with regular computing and data migration; (2) the applications with irregular computing and data migration.  ... 
doi:10.1007/s11432-020-3104-7 fatcat:ocmhnpa2dng2lhqhldgbcdfw2a

Comparative Study of Live Virtual Machine Migration Techniques in Cloud

Gulshan Soni, Mala Kalra
2013 International Journal of Computer Applications  
Live migration of virtual machines has been a powerful tool to facilitate system maintenance, load balancing, fault tolerance, and power saving, especially in clusters or data centers.  ...  Effective migration of virtual machine requires the movement of storage, memory, process states and network connectivity.  ...  Large total Adaptive Rate Limiting approach[3] working set and showed OS 60ms. migration time migration built on top of the Xen VMM. adaptive Memory compression based 1.  ... 
doi:10.5120/14643-2919 fatcat:6ti5qk2vgjfkljxm2fjybq3za4