A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2001; you can also visit the original URL.
The file type is application/pdf
.
Filters
Application restructuring and performance portability on shared virtual memory and hardware-coherent multiprocessors
1997
SIGPLAN notices
This paper studies this issue of performance portability, with the commodity communication architecture of interest being page-grained shared virtual memory. ...
We begin with applications that perform well on moderate-scale hardware cache-coherent systems, and nd that they do not do so well on SVM systems. ...
Acknowledgment We w ould like to thank Angelos Bilas and Liviu Iftode for their help with the SVM simulators. ...
doi:10.1145/263767.263792
fatcat:223ffyrufneujmxb6r4hgkcdxq
Application restructuring and performance portability on shared virtual memory and hardware-coherent multiprocessors
1997
Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming - PPOPP '97
This paper studies this issue of performance portability, with the commodity communication architecture of interest being page-grained shared virtual memory. ...
We begin with applications that perform well on moderate-scale hardware cache-coherent systems, and nd that they do not do so well on SVM systems. ...
Acknowledgment We w ould like to thank Angelos Bilas and Liviu Iftode for their help with the SVM simulators. ...
doi:10.1145/263764.263792
dblp:conf/ppopp/JiangSS97
fatcat:4tvoxplw65h2rd7viicyxxnxsm
Application scaling under shared virtual memory on a cluster of SMPs
1999
Proceedings of the 13th international conference on Supercomputing - ICS '99
In this paper we examine how application performance scales on a state-of-the-art shared virtual memory (SVM) system on a cluster with 64 processors, comprising 4-way SMPs connected with a fast system ...
We find that while the level of application restructuring needed is quite high compared to applications that perform well on a hardware-coherent system of this scale, and larger problem sizes are needed ...
Conclusions We have investigated the scaling of application performance on a shared virtual memory cluster of SMPs for a wide range of application classes, using a hardware-coherent SGI Ori-gin2000 multiprocessor ...
doi:10.1145/305138.305190
dblp:conf/ics/JiangOYKBS99
fatcat:o5zayofrq5d5zpevxoder56lfa
Retrospective: the Cedar system
1998
25 years of the international symposia on Computer architecture (selected papers) - ISCA '98
We felt that major advances in the state of hardware technology, architecture, compilers, and parallel algorithms made such a demonstration possible. ...
Another goal was to have the prototype "achieve Cray-1 speeds for programs written in high-level languages and automatically restructured" by a compiler. ...
Word-interleaved, UMA shared memory was part of the virtual address space and directly accessible via processor instructions. ...
doi:10.1145/285930.285965
dblp:conf/isca/VeidenbaumYKPPDG98
fatcat:obv6ipwmffbh7e27pzrhkzaza4
Experience Developing the RP3 Operating System
1991
Computing Systems
This paper discusses what we learned about developing operating systems for shared-memory parallel machines such as RP3 and includes recommendations on how we feel such systems should and should not be ...
'We now believe multiprocessor architectures should support both coherent and non-coherent modes of execution. ...
The overhead of hardware cache coherence protocols on highly-parallel machines may be unacceptably high for some applications. ...
dblp:journals/csys/BryantCR91
fatcat:wplj4toye5dt7pib5frk4u725y
Comparative evaluation of fine- and coarse-grain approaches for software distributed shared memory
1999
Proceedings Fifth International Symposium on High-Performance Computer Architecture
We do work in the design, fabrication and packaging of hardware; language processing and scaling issues in system software design; and the exploration of new applications areas that are opening up with ...
Our focus is computer science research relevant to the design and application of high performance scientific computers. We test our ideas by designing, building, and using real systems. ...
This part allows us to evaluate the portability of applications developed for hardware multiprocessors and to measure the performance gap on applications developed for page-based systems. ...
doi:10.1109/hpca.1999.744377
dblp:conf/hpca/DwarkadasGKSSS99
fatcat:3ngvqrvmofh2jmv3twk7ujtvwa
Distributed shared memory: a survey of issues and algorithms
1991
Computer
All DSM systems provide some form of memory coherence. If the shared data is not replicated, then enforcing memory coherence is trivial. ...
A Distributed sharedmemory systems implement the sharedmemory abstraction On multicomputer architectures, combining the scalability of networkbased architectures with the convenience Of shared-memory programming ...
Acknowledgments This work was supported in part by NSF grant CCR-8808532, a Tektronix research fellowship, and the NSF Research Experiences for Undergraduates program. ...
doi:10.1109/2.84877
fatcat:cy664qqsdvetfbl6oill7bpk3u
Disco
1997
ACM SIGOPS Operating Systems Review
In this paper we examine the problem of extending modem operating systems to run efficiently on large-scale shared memory multiprocessors without a large implementation effort. ...
Our approach brings back an idea popular in the 197Os, virtual machine monitors. We use virtual machines to run multiple commodity operating systems on a scalable multiprocessor. ...
Our colleagues Kinshuk Govil, Dan Teodosiu, and Ben Verghese participated in many lively discussions on Disco and carefully read drafts of the paper. ...
doi:10.1145/269005.266672
fatcat:uvcwdv63yjgqbaapjkfxbzs474
A study of performance on SMP and distributed memory architectures using a shared memory programming model
1997
Proceedings of the 1997 ACM/IEEE conference on Supercomputing (CDROM) - Supercomputing '97
In this paper we examine the use of a shared memory programming model to address the problem of portability of application codes between distributed memory and shared memory architectures. ...
The type qualifier declaration supports an abstract shared memory facility on distributed memory machines while making direct use of hardware support on shared memory architectures. ...
W-7405-ENG-48. 1 This work was partially supported by National Center for Supercomputing Applications under grant number CCR970005N and utilized the Silicon Graphics Origin 2000 at the National Center ...
doi:10.1145/509593.509637
dblp:conf/sc/BrooksW97
fatcat:62k6mlfeebgzfnp3zubqe4ticq
Our colleagues Kinshuk Govil, Dan Teodosiu, and Ben Verghese participated in many lively discussions on Disco and carefully read drafts of the paper. ...
Acknowledgments The authors would like to thank John Chapin, John Gerth, Mike Nelson, Rick Rashid, Steve Ofsthun, Volker Strumpen, and our shepherd Rich Draves for their feedback. ...
The machines use a directory to maintain cache coherency, providing to the software the view of a shared-memory multiprocessor with non-uniform memory access times. ...
doi:10.1145/268998.266672
dblp:conf/sosp/BugnionDR97
fatcat:g3pofnftqrabri5qef5zzzwsfm
Disco: running commodity operating systems on scalable multiprocessors
1997
ACM Transactions on Computer Systems
Our colleagues Kinshuk Govil, Dan Teodosiu, and Ben Verghese participated in many lively discussions on Disco and carefully read drafts of the paper. ...
Acknowledgments The authors would like to thank John Chapin, John Gerth, Mike Nelson, Rick Rashid, Steve Ofsthun, Volker Strumpen, and our shepherd Rich Draves for their feedback. ...
The machines use a directory to maintain cache coherency, providing to the software the view of a shared-memory multiprocessor with non-uniform memory access times. ...
doi:10.1145/265924.265930
fatcat:t5pdtgvenrforgy5mtnzbwqth4
The multikernel
2009
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles - SOSP '09
An evaluation of our prototype on multicore systems shows that, even on present-day machines, the performance of a multikernel is comparable with a conventional OS, and can scale better to support future ...
Commodity computer systems contain more and more processor cores and exhibit increasingly diverse architectural tradeoffs, including memory hierarchies, interconnects, instruction sets and variants, and ...
Acknowledgments We would like to thank our shepherd, Jeff Dean, the anonymous reviewers, and Tom Anderson, Steven Hand, and Michael Scott for their helpful suggestions for how to improve this paper and ...
doi:10.1145/1629575.1629579
dblp:conf/sosp/BaumannBDHIPRSS09
fatcat:6dha7l4czrd7pjuvcu6xt2v56i
Accelerating shared virtual memory via general-purpose network interface support
2001
ACM Transactions on Computer Systems
However, the performance of software virtual memory (SVM) is still far from that achieved on hardware DSM systems. ...
With the success of hardware cache-coherent distributed shared memory (DSM), a lot of effort has also been made to support the coherent shared address space programming model in software on clusters. ...
We gratefully acknowledge the support of NSF and DARPA. ...
doi:10.1145/367742.367747
fatcat:fxb2cwkep5h3hh6nhjqfbsr5da
Improving server software support for simultaneous multithreaded processors
2003
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '03
This paper evaluates how SMT's changes to the underlying hardware affects server software, and in particular, SMT's effects on memory allocation and synchronization. ...
Overall, our results demonstrate how a few simple changes to applications' run-time support libraries can dramatically boost the performance of multi-threaded servers on SMT, without requiring modifications ...
Overall, our goal is to identify the minimal set of changes that enables applications written for a uniprocessor or a shared memory multiprocessor to achieve good performance on an SMT. ...
doi:10.1145/781503.781504
fatcat:3ewwhcryubafbleaaqrq6wf4hq
Improving server software support for simultaneous multithreaded processors
2003
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '03
This paper evaluates how SMT's changes to the underlying hardware affects server software, and in particular, SMT's effects on memory allocation and synchronization. ...
Overall, our results demonstrate how a few simple changes to applications' run-time support libraries can dramatically boost the performance of multi-threaded servers on SMT, without requiring modifications ...
Overall, our goal is to identify the minimal set of changes that enables applications written for a uniprocessor or a shared memory multiprocessor to achieve good performance on an SMT. ...
doi:10.1145/781498.781504
dblp:conf/ppopp/McDowellEG03
fatcat:he5oz7dmwrbobhpigptv2c43l4
« Previous
Showing results 1 — 15 out of 177 results