A cache topology-aware multi-query scheduler for multicore architectures
2014
2014 IEEE International Symposium on Workload Characterization (IISWC)
In Figure 1, we give the high-level view of our automated approach to cache topology-aware query scheduling. ...
Fig. 1: High-level sketch of our cache topology-aware query scheduling approach. ...
doi:10.1109/iiswc.2014.6983047
dblp:conf/iiswc/OrhanDYKO14
fatcat:4koxvgoggrgvth4fxikeo4mr4u
Cache Hierarchy-Aware Query Mapping on Emerging Multicore Architectures
2017
IEEE transactions on computers
Our proposed scheme distributes a given batch of queries across the cores of a target multicore architecture based on the affinity relations among the queries. ...
Each domain affinity in this context corresponds to a cache structure bounded by a particular level of the cache hierarchy. ...
Acknowledgments: A preliminary 2-page version of this paper appears in the Proceedings of the 2014 IEEE International Symposium on Workload Characterization (IISWC) [27]. This work has been done when U. ...
doi:10.1109/tc.2016.2605682
fatcat:fdfe4mhddrhyfk4isdwak2tkd4
A User-Level NUMA-Aware Scheduler for Optimizing Virtual Machine Performance
[chapter]
2013
Lecture Notes in Computer Science
Experimental results show that our NUMA-aware virtual machine scheduling algorithm is able to improve VM performance by up to 23.4% compared with the default CFS (Completely Fair Scheduler) scheduler used ...
In this paper, we propose a "Best NUMA Node" based virtual machine scheduling algorithm and implement it in a user-level scheduler that can periodically adjust the placement of VMs running on NUMA systems ...
[14] presented a method that allows the guest OS to query the VMM via para-virtualized hypercalls about the NUMA topology. ...
doi:10.1007/978-3-642-45293-2_3
fatcat:4pg43s4yy5gb7fv5ilsn75eqfi
Efficient Query Processing on Many-core Architectures
2016
Proceedings of the 2016 International Conference on Management of Data - SIGMOD '16
In PhiDB, we apply Xeon Phi aware optimizations on query operators to exploit hardware features of Xeon Phi, and design a heuristic algorithm to schedule the concurrent execution of query operators for ...
With the trend towards many-core architectures, query operator optimizations, and efficient query scheduling on such many-core architectures remain as challenging issues. ...
Introduction: Computer architectures have been evolving from multicore processors to many-core processors with emerging architectural features. ...
doi:10.1145/2882903.2899407
dblp:conf/sigmod/ChengHLLHG16
fatcat:ocxssv5zpjbdfo5golx23gople
Locality Aware Task Scheduling in Parallel Data Stream Processing
[chapter]
2015
Studies in Computational Intelligence
Today's multiprocessor systems and CPU architectures have become quite complex, which makes task scheduling a challenging problem. ...
In this paper, we propose a novel task scheduling strategy for parallel data stream systems that reflects many technical issues of the current hardware. ...
The multi-query tests (Figure 8) emphasize the benefits of NUMA-aware scheduling along with NUMA-aware memory allocation. ...
doi:10.1007/978-3-319-10422-5_35
fatcat:56g7pv2m3faybftgshjicfxbrm
TACO: A Scheduling Scheme for Parallel Applications on Multicore Architectures
2014
Scientific Programming
Additionally, TACO is conceptually compatible with contention-aware scheduling strategies. We find that topology-awareness increases performance for all evaluated workloads. ...
We propose TACO, a topology-aware scheduling scheme that combines equipartitioning and coscheduling, which does not suffer from the drawbacks of the individual concepts. ...
Basis: In order to create a scheduling scheme for modern multicore architectures, we combine the ideas of equipartitioning and coscheduling. ...
doi:10.1155/2014/423084
fatcat:6cuvj4r4ovhrvml3hy2yvpqf2q
Toward Efficient In-memory Data Analytics on NUMA Systems
[article]
2020
arXiv
pre-print
A key drawback of NUMA architectures is that many existing software solutions are not aware of the underlying NUMA topology and thus do not take full advantage of the hardware. ...
Data analytics systems commonly utilize in-memory query processing techniques to achieve better throughput and lower latency. ...
Acknowledgements: We would like to thank Kenneth Kent and Aaron Graham from IBM CASA and Serguei Vassiliev and Kaizaad Bilimorya from Compute Canada for providing access to Machine B and Machine C, respectively ...
arXiv:1908.01860v3
fatcat:3ri4vadygzce5ao5dslmakn7zm
Databases and hardware
2015
Proceedings of the VLDB Endowment
Fast query and transaction processing is the goal of 40 years of database research and the reason of existence for many new database system architectures. ...
the interaction between the database software and underlying hardware and show that, as application and microarchitecture roadmaps evolve, the effort of maintaining smooth collaboration blossoms into a ...
Moreover, ADDICT exploits the aggregate L1 instruction cache capacity of the multicore architectures. ...
doi:10.14778/2824032.2824142
fatcat:ugl7ujexjng57fqbebzt3oecre
ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures
2010
International journal of parallel programming
Our runtime, which is based on a multi-level thread scheduler combined with a NUMA-aware memory manager, converts this information into scheduling hints related to thread-memory affinity issues. ...
Exploiting the full computational power of current hierarchical multiprocessor machines requires a very careful distribution of threads and data among the underlying non-uniform architecture so as to avoid ...
Therefore, we introduce a multi-level thread scheduler combined with a NUMA-aware memory manager. ...
doi:10.1007/s10766-010-0136-3
fatcat:g2vajlq53ba4xeq2xjkfdriesy
2020 Index IEEE Transactions on Parallel and Distributed Systems Vol. 31
2021
IEEE Transactions on Parallel and Distributed Systems
., +, TPDS Jan. 2019 79-92
Resource-Aware Scheduling for Dependable Multicore Real-Time Systems:
Utilization Bound and Partitioning Algorithm. ...
., +, TPDS July 2019 1449-1463 Dependency-Aware Network Adaptive Scheduling of Data-Intensive Paral-Efficient Data Placement and Replication for QoS-Aware Approximate Query Evaluation of Big Data Analytics ...
doi:10.1109/tpds.2020.3033655
fatcat:cpeatdjlpzhqdersvsk5nmzjkm
Topology-aware equipartitioning with coscheduling on multicore systems
2013
2013 IEEE 6th International Workshop on Multi-/Many-core Computing Systems (MuCoCoS)
We propose a topology-aware scheduling scheme that combines equipartitioning and coscheduling. ...
Combined with the ever increasing complexity of multicore architectures, this results in a scheduling problem that is different from what it has been, because features such as non-uniform memory access ...
Basis: In order to create an equipartitioning scheme for modern multicore architectures, we combine the idea of equipartitioning with coscheduling. ...
doi:10.1109/mucocos.2013.6633602
fatcat:7bezaxuktfb7tdmbwtk36vf5ba
2014 Index IEEE Transactions on Parallel and Distributed Systems Vol. 25
2015
IEEE Transactions on Parallel and Distributed Systems
., +, TPDS June 2014 1638-1647 CPU Scheduling for Power/Energy Management on Multicore Processors Using Cache Miss and Context Switch Data. Datta, A. ...
Feliu, Josue, +,
TPDS March 2014 581-590
., +, TPDS Aug. 2014 2840-2850 Energy and Network Aware Workload Management for Sustainable Data Centers with Thermal Storage. 2030-2042 Hyperbolic Utilization Bounds for Rate Monotonic Scheduling ...
doi:10.1109/tpds.2014.2371591
fatcat:qxyljogalrbfficryqjowgv3je
A PetriNet mechanism for OLAP in NUMA
2017
Proceedings of the 13th International Workshop on Data Management on New Hardware - DAMON '17
CCS Concepts: • Computer systems organization → Multicore architectures; • Information systems → Data management systems. Keywords: Multi-core CPUs; OLAP; Abstract Model; NUMA ...
In this paper we present a PetriNet mechanism that represents the load of the database workers for dynamically computing and allocating the local optimum number of CPU cores to tackle such load. ...
MCC-DB classifies queries into cache-sensitive and cache-insensitive to feed the query execution scheduler. ...
doi:10.1145/3076113.3076121
dblp:conf/damon/DominicoAM17
fatcat:6ovugfeibjbbfl5g6mhgchvfti
Abstracting Multi-Core Topologies with MCTOP
2017
Proceedings of the Twelfth European Conference on Computer Systems - EuroSys '17
These libraries offer a topology representation of multi-cores, as well as a companion interface for placing threads (and data). ...
We illustrate several such policies through four examples: (i-ii) thread placement in OpenMP and in a MapReduce library, (iii) a topology-aware mergesort algorithm, as well as (iv) automatic backoff schemes ...
Acknowledgments: We wish to thank our shepherd, Jean-Pierre Lozi, and the anonymous reviewers for their fruitful comments on improving the paper. ...
doi:10.1145/3064176.3064194
dblp:conf/eurosys/ChatzopoulosG0T17
fatcat:xoxdudvpx5dcxph5lnkokoe2yy
NUMA-Aware DGEMM Based on 64-Bit ARMv8 Multicore Processors Architecture
2021
Electronics
This poses a challenge to develop high-performance DGEMM on multi-NUMA architecture. We present a NUMA-aware method to reduce the number of cross-die and cross-chip memory access events. ...
We have implemented NUMA-aware DGEMM in the OpenBLAS and evaluated it on a dual-socket server with 48-core processors based on the Kunpeng920 architecture. ...
Acknowledgments: The authors thank ZhiGuang Chen and Nong Xiao for their guidance and the server provided by Pengcheng Labs.
Conflicts of Interest: The authors declare no conflict of interest. ...
doi:10.3390/electronics10161984
fatcat:mkevjicswjfpzcdrtsfarp2dnq