
Simple but effective techniques for NUMA memory management

W. Bolosky, R. Fitzgerald, M. Scott
1989 Proceedings of the twelfth ACM symposium on Operating systems principles - SOSP '89  
It also suggests that the greatest leverage for further performance improvement lies in reducing false sharing, which occurs when the same page contains objects that would best be placed in different memories  ...  Our experience indicates that even very simple automatic strategies can produce nearly optimal page placement.  ...  , to Bob Marinelli for early help in bringing up the ACE, to the entire Mach crew at CMU for stimulating and lively discussion regarding the relationship of Mach, the pmap interface and NUMA machines, and  ... 
doi:10.1145/74850.74854 dblp:conf/sosp/BoloskyFS89 fatcat:ditm7jm6djgvzlsyulpcmq6yry

Simple but effective techniques for NUMA memory management

W. Bolosky, R. Fitzgerald, M. Scott
1989 ACM SIGOPS Operating Systems Review  
It also suggests that the greatest leverage for further performance improvement lies in reducing false sharing, which occurs when the same page contains objects that would best be placed in different memories  ...  Our experience indicates that even very simple automatic strategies can produce nearly optimal page placement.  ...  , to Bob Marinelli for early help in bringing up the ACE, to the entire Mach crew at CMU for stimulating and lively discussion regarding the relationship of Mach, the pmap interface and NUMA machines, and  ... 
doi:10.1145/74851.74854 fatcat:bhqjesruzvb37hbzvdzamx7ara

Challenges of memory management on modern NUMA systems

Fabien Gaud, Baptiste Lepers, Justin Funston, Mohammad Dashti, Alexandra Fedorova, Vivien Quéma, Renaud Lachaize, Mark Roth
2015 Communications of the ACM  
Acknowledgments We thank Oracle Labs and the British Columbia Innovation Council for funding this work.  ...  All three of these techniques have been analyzed individually in prior studies, but Carrefour combines them into a novel algorithm that is effective for modern NUMA systems.  ...  NUMA Memory Placement Strategies The results in Figure 2 and the table motivate a NUMA memory-management algorithm that places importance on congestion management, rather than focusing solely on reducing  ... 
doi:10.1145/2814328 fatcat:2toixgaz3zg3zb47dwmmy3rufi

Graph partitioning applied to DAG scheduling to reduce NUMA effects

Isaac Sánchez Barrera, Eduard Ayguadé, Marc Casas, Jesús Labarta, Miquel Moretó, Mateo Valero
2018 Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming - PPoPP '18  
In this context, techniques to manage and mitigate non-uniform memory access (NUMA) effects consist in migrating threads, memory pages or both and are typically applied by the system software.  ...  We propose techniques at the runtime system level to reduce NUMA effects on parallel applications. We leverage runtime system metadata in terms of a task dependency graph.  ...  Techniques at the OS level for mitigating these effects include migrating threads, memory pages or both [2, 3, 8].  ... 
doi:10.1145/3178487.3178535 dblp:conf/ppopp/BarreraCMALV18 fatcat:ir7l2elumrgmzem2n2eld6qa5m

Message Passing or Shared Memory: Evaluating the Delegation Abstraction for Multicores [chapter]

Irina Calciu, Dave Dice, Tim Harris, Maurice Herlihy, Alex Kogan, Virendra Marathe, Mark Moir
2013 Lecture Notes in Computer Science  
memory access (NUMA).  ...  Even for small multi-core systems, it has become harder and harder to support a simple shared memory abstraction: processors access some memory regions more quickly than others, a phenomenon called non-uniform  ...  Acknowledgments We are grateful to Nir Shavit for useful input and feedback and to Bill Bridge and Garret Swart who suggested this research direction.  ... 
doi:10.1007/978-3-319-03850-6_7 fatcat:3skftjebjng2pi4gyh5ymtkzfq

Locality-Aware Task Scheduling and Data Distribution for OpenMP Programs on NUMA Systems and Manycore Processors

Ananya Muddukrishna, Peter A. Jonsson, Mats Brorsson
2015 Scientific Programming  
Locality-aware scheduling, in conjunction with or as a replacement for existing scheduling, is necessary to minimize NUMA effects and sustain performance.  ...  We present a data distribution and locality-aware scheduling technique for task-based OpenMP programs executing on NUMA systems and manycore processors.  ...  from NUMA effects [1].  ... 
doi:10.1155/2015/981759 fatcat:r6gun4qwdfbezg4nnx63zmo7nm

Comparing the effectiveness of fine-grain memory caching against page migration/replication in reducing traffic in DSM clusters

An-Chow Lai, Babak Falsafi
2000 Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architectures - SPAA '00  
In this paper, we compare and contrast two techniques to improve capacity/conflict miss traffic in CC-NUMA DSM clusters.  ...  R-NUMA optimizes read-write accesses to any page by allowing a processor to cache that page in its main memory.  ...  Techniques to reduce remote memory accesses can be more effective in DSMs with longer remote miss latencies.  ... 
doi:10.1145/341800.341811 dblp:conf/spaa/LaiF00 fatcat:xpdbdm5wmfcctazvqt2h476h5u

NUMA policies and their relation to memory architecture

William J. Bolosky, Michael L. Scott, Robert P. Fitzgerald, Robert J. Fowler, Alan L. Cox
1991 ACM SIGOPS Operating Systems Review  
Our results indicate that a good NUMA policy must be chosen to match its machine, and confirm that such policies can be both simple and effective.  ...  We have used this information to explore the relationship between kernel-based NUMA management policies and multiprocessor memory architecture.  ...  require different policies for high-quality NUMA management.  ... 
doi:10.1145/106974.106994 fatcat:54vlt7bddzd6nkcoit7qlgd6wm

NUMA policies and their relation to memory architecture

William J. Bolosky, Michael L. Scott, Robert P. Fitzgerald, Robert J. Fowler, Alan L. Cox
1991 Proceedings of the fourth international conference on Architectural support for programming languages and operating systems - ASPLOS-IV  
Our results indicate that a good NUMA policy must be chosen to match its machine, and confirm that such policies can be both simple and effective.  ...  We have used this information to explore the relationship between kernel-based NUMA management policies and multiprocessor memory architecture.  ...  require different policies for high-quality NUMA management.  ... 
doi:10.1145/106972.106994 dblp:conf/asplos/BoloskySFFC91 fatcat:cf3yu7o5cbczvnannrxmuryyya

NUMA policies and their relation to memory architecture

William J. Bolosky, Michael L. Scott, Robert P. Fitzgerald, Robert J. Fowler, Alan L. Cox
1991 SIGARCH Computer Architecture News  
Our results indicate that a good NUMA policy must be chosen to match its machine, and confirm that such policies can be both simple and effective.  ...  We have used this information to explore the relationship between kernel-based NUMA management policies and multiprocessor memory architecture.  ...  require different policies for high-quality NUMA management.  ... 
doi:10.1145/106975.106994 fatcat:h64q7qubqncobofb7zqq75f6eu

NUMA policies and their relation to memory architecture

William J. Bolosky, Michael L. Scott, Robert P. Fitzgerald, Robert J. Fowler, Alan L. Cox
1991 SIGPLAN notices  
Our results indicate that a good NUMA policy must be chosen to match its machine, and confirm that such policies can be both simple and effective.  ...  We have used this information to explore the relationship between kernel-based NUMA management policies and multiprocessor memory architecture.  ...  require different policies for high-quality NUMA management.  ... 
doi:10.1145/106973.106994 fatcat:gycerao7sjdc3o2jkn6pdudg2q

Optimizing Traffic in DSM Clusters: Fine-Grain Memory Caching versus Page Migration/Replication

An-Chow Lai, Babak Falsafi
2002 Theory of Computing Systems  
In this paper, we compare and contrast two techniques to improve capacity/conflict miss traffic in CC-NUMA DSM clusters.  ...  R-NUMA optimizes read-write accesses to any page by allowing a processor to cache that page in its main memory.  ...  Techniques to reduce remote memory accesses can be more effective in DSMs with longer remote miss latencies.  ... 
doi:10.1007/s00224-002-1054-6 fatcat:dc7x2u6svzh55of3u2s3s6reim

Explicit Management of Memory Hierarchy [chapter]

Jarek Nieplocha, Robert Harrison, Ian Foster
1997 Advances in High Performance Computing  
The hardware in a particular system may support a shared memory or message passing programming model, but these factors affect only the relative costs of local and remote accesses, not the system's fundamental  ...  Non-Uniform Memory Access (NUMA) characteristics.  ...  Conclusions and Future Work We have described techniques and tools that support explicit but high-level management of data movement in memory hierarchies.  ... 
doi:10.1007/978-94-011-5514-4_11 fatcat:6yywywktgfawzlqtiwuxjnuch4

Reducing Data Movement on Large Shared Memory Systems by Exploiting Computation Dependencies

Isaac Sánchez Barrera, Miquel Moretó, Eduard Ayguadé, Jesús Labarta, Mateo Valero, Marc Casas
2018 Proceedings of the 2018 International Conference on Supercomputing - ICS '18  
In this context, techniques to manage and mitigate non-uniform memory access (NUMA) effects consist in migrating threads, memory pages or both and are generally applied by the system software.  ...  We propose techniques at the runtime system level to further mitigate the impact of NUMA effects on parallel applications' performance.  ...  In order to mitigate NUMA effects, techniques for migrating threads, memory pages or both already exist [15, 16, 39].  ... 
doi:10.1145/3205289.3205310 dblp:conf/ics/BarreraMALVC18 fatcat:shndpuchkzagdo5z7ka7vkee6q

Memory flipping: a threat to NUMA virtual machines in the Cloud

Djob Mvondo, Boris Teabe, Alain Tchana, Daniel Hagimont, Noel De Palma
2019 IEEE INFOCOM 2019 - IEEE Conference on Computer Communications  
By this way, all NUMA optimizations implemented by almost all VM's OS (e.g. Linux) can become effective.  ...  The evaluation results, performed with well known benchmarks, show that our two solutions are able to almost cancel the issue, while keeping memory flipping effective.  ...  The main lessons we learn from these experiments are: • memory flipping can effectively lead to NUMA topology changes for a userVM; • memory copy is a solution to the problem, but causes significant  ... 
doi:10.1109/infocom.2019.8737548 dblp:conf/infocom/MvondoTTHP19 fatcat:wv47spv56rem7dbufpitf4dn7a