Filters








177 Hits in 7.2 sec

Application restructuring and performance portability on shared virtual memory and hardware-coherent multiprocessors

Dongming Jiang, Hongzhang Shan, Jaswinder Pal Singh
1997 SIGPLAN notices  
This paper studies this issue of performance portability, with the commodity communication architecture of interest being page-grained shared virtual memory.  ...  We begin with applications that perform well on moderate-scale hardware cache-coherent systems, and nd that they do not do so well on SVM systems.  ...  Acknowledgment We w ould like to thank Angelos Bilas and Liviu Iftode for their help with the SVM simulators.  ... 
doi:10.1145/263767.263792 fatcat:223ffyrufneujmxb6r4hgkcdxq

Application restructuring and performance portability on shared virtual memory and hardware-coherent multiprocessors

Dongming Jiang, Hongzhang Shan, Jaswinder Pal Singh
1997 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming - PPOPP '97  
This paper studies this issue of performance portability, with the commodity communication architecture of interest being page-grained shared virtual memory.  ...  We begin with applications that perform well on moderate-scale hardware cache-coherent systems, and nd that they do not do so well on SVM systems.  ...  Acknowledgment We w ould like to thank Angelos Bilas and Liviu Iftode for their help with the SVM simulators.  ... 
doi:10.1145/263764.263792 dblp:conf/ppopp/JiangSS97 fatcat:4tvoxplw65h2rd7viicyxxnxsm

Application scaling under shared virtual memory on a cluster of SMPs

Dongming Jiang, Brian O'Kelley, Xiang Yu, Sanjeev Kumar, Angelos Bilas, Jaswinder Pal Singh
1999 Proceedings of the 13th international conference on Supercomputing - ICS '99  
In this paper we examine how application performance scales on a state-of-the-art shared virtual memory (SVM) system on a cluster with 64 processors, comprising 4-way SMPs connected with a fast system  ...  We find that while the level of application restructuring needed is quite high compared to applications that perform well on a hardware-coherent system of this scale, and larger problem sizes are needed  ...  Conclusions We have investigated the scaling of application performance on a shared virtual memory cluster of SMPs for a wide range of application classes, using a hardware-coherent SGI Ori-gin2000 multiprocessor  ... 
doi:10.1145/305138.305190 dblp:conf/ics/JiangOYKBS99 fatcat:o5zayofrq5d5zpevxoder56lfa

Retrospective: the Cedar system

A. Veidenbaum, P.-C. Yew, D. J. Kuck, C. D. Polychronopoulos, D. H. Padua, E. S. Davidson, K. Gallivan
1998 25 years of the international symposia on Computer architecture (selected papers) - ISCA '98  
We felt that major advances in the state of hardware technology, architecture, compilers, and parallel algorithms made such a demonstration possible.  ...  Another goal was to have the prototype "achieve Cray-1 speeds for programs written in high-level languages and automatically restructured" by a compiler.  ...  Word-interleaved, UMA shared memory was part of the virtual address space and directly accessible via processor instructions.  ... 
doi:10.1145/285930.285965 dblp:conf/isca/VeidenbaumYKPPDG98 fatcat:obv6ipwmffbh7e27pzrhkzaza4

Experience Developing the RP3 Operating System

Ray Bryant, Hung-Yang Chang, Bryan S. Rosenburg
1991 Computing Systems  
This paper discusses what we learned about developing operating systems for shared-memory parallel machines such as RP3 and includes recommendations on how we feel such systems should and should not be  ...  'We now believe multiprocessor architectures should support both coherent and non-coherent modes of execution.  ...  The overhead of hardware cache coherence protocols on highly-parallel machines may be unacceptably high for some applications.  ... 
dblp:journals/csys/BryantCR91 fatcat:wplj4toye5dt7pib5frk4u725y

Comparative evaluation of fine- and coarse-grain approaches for software distributed shared memory

S. Dwarkadas, K. Gharachorloo, L. Kontothanassis, D.J. Scales, M.L. Scott, R. Stets
1999 Proceedings Fifth International Symposium on High-Performance Computer Architecture  
We do work in the design, fabrication and packaging of hardware; language processing and scaling issues in system software design; and the exploration of new applications areas that are opening up with  ...  Our focus is computer science research relevant to the design and application of high performance scientific computers. We test our ideas by designing, building, and using real systems.  ...  This part allows us to evaluate the portability of applications developed for hardware multiprocessors and to measure the performance gap on applications developed for page-based systems.  ... 
doi:10.1109/hpca.1999.744377 dblp:conf/hpca/DwarkadasGKSSS99 fatcat:3ngvqrvmofh2jmv3twk7ujtvwa

Distributed shared memory: a survey of issues and algorithms

B. Nitzberg, V. Lo
1991 Computer  
All DSM systems provide some form of memory coherence. If the shared data is not replicated, then enforcing memory coherence is trivial.  ...  A Distributed sharedmemory systems implement the sharedmemory abstraction On multicomputer architectures, combining the scalability of networkbased architectures with the convenience Of shared-memory programming  ...  Acknowledgments This work was supported in part by NSF grant CCR-8808532, a Tektronix research fellowship, and the NSF Research Experiences for Undergraduates program.  ... 
doi:10.1109/2.84877 fatcat:cy664qqsdvetfbl6oill7bpk3u

Disco

Edouard Bugnion, Scott Devine, Mendel Rosenblum
1997 ACM SIGOPS Operating Systems Review  
In this paper we examine the problem of extending modem operating systems to run efficiently on large-scale shared memory multiprocessors without a large implementation effort.  ...  Our approach brings back an idea popular in the 197Os, virtual machine monitors. We use virtual machines to run multiple commodity operating systems on a scalable multiprocessor.  ...  Our colleagues Kinshuk Govil, Dan Teodosiu, and Ben Verghese participated in many lively discussions on Disco and carefully read drafts of the paper.  ... 
doi:10.1145/269005.266672 fatcat:uvcwdv63yjgqbaapjkfxbzs474

A study of performance on SMP and distributed memory architectures using a shared memory programming model

Eugene D. Brooks, Karen H. Warren
1997 Proceedings of the 1997 ACM/IEEE conference on Supercomputing (CDROM) - Supercomputing '97  
In this paper we examine the use of a shared memory programming model to address the problem of portability of application codes between distributed memory and shared memory architectures.  ...  The type qualifier declaration supports an abstract shared memory facility on distributed memory machines while making direct use of hardware support on shared memory architectures.  ...  W-7405-ENG-48. 1 This work was partially supported by National Center for Supercomputing Applications under grant number CCR970005N and utilized the Silicon Graphics Origin 2000 at the National Center  ... 
doi:10.1145/509593.509637 dblp:conf/sc/BrooksW97 fatcat:62k6mlfeebgzfnp3zubqe4ticq

Disco

Edouard Bugnion, Scott Devine, Mendel Rosenblum
1997 Proceedings of the sixteenth ACM symposium on Operating systems principles - SOSP '97  
Our colleagues Kinshuk Govil, Dan Teodosiu, and Ben Verghese participated in many lively discussions on Disco and carefully read drafts of the paper.  ...  Acknowledgments The authors would like to thank John Chapin, John Gerth, Mike Nelson, Rick Rashid, Steve Ofsthun, Volker Strumpen, and our shepherd Rich Draves for their feedback.  ...  The machines use a directory to maintain cache coherency, providing to the software the view of a shared-memory multiprocessor with non-uniform memory access times.  ... 
doi:10.1145/268998.266672 dblp:conf/sosp/BugnionDR97 fatcat:g3pofnftqrabri5qef5zzzwsfm

Disco: running commodity operating systems on scalable multiprocessors

Edouard Bugnion, Scott Devine, Kinshuk Govil, Mendel Rosenblum
1997 ACM Transactions on Computer Systems  
Our colleagues Kinshuk Govil, Dan Teodosiu, and Ben Verghese participated in many lively discussions on Disco and carefully read drafts of the paper.  ...  Acknowledgments The authors would like to thank John Chapin, John Gerth, Mike Nelson, Rick Rashid, Steve Ofsthun, Volker Strumpen, and our shepherd Rich Draves for their feedback.  ...  The machines use a directory to maintain cache coherency, providing to the software the view of a shared-memory multiprocessor with non-uniform memory access times.  ... 
doi:10.1145/265924.265930 fatcat:t5pdtgvenrforgy5mtnzbwqth4

The multikernel

Andrew Baumann, Paul Barham, Pierre-Evariste Dagand, Tim Harris, Rebecca Isaacs, Simon Peter, Timothy Roscoe, Adrian Schüpbach, Akhilesh Singhania
2009 Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles - SOSP '09  
An evaluation of our prototype on multicore systems shows that, even on present-day machines, the performance of a multikernel is comparable with a conventional OS, and can scale better to support future  ...  Commodity computer systems contain more and more processor cores and exhibit increasingly diverse architectural tradeoffs, including memory hierarchies, interconnects, instruction sets and variants, and  ...  Acknowledgments We would like to thank our shepherd, Jeff Dean, the anonymous reviewers, and Tom Anderson, Steven Hand, and Michael Scott for their helpful suggestions for how to improve this paper and  ... 
doi:10.1145/1629575.1629579 dblp:conf/sosp/BaumannBDHIPRSS09 fatcat:6dha7l4czrd7pjuvcu6xt2v56i

Accelerating shared virtual memory via general-purpose network interface support

Angelos Bilas, Dongming Jiang, Jaswinder Pal Singh
2001 ACM Transactions on Computer Systems  
However, the performance of software virtual memory (SVM) is still far from that achieved on hardware DSM systems.  ...  With the success of hardware cache-coherent distributed shared memory (DSM), a lot of effort has also been made to support the coherent shared address space programming model in software on clusters.  ...  We gratefully acknowledge the support of NSF and DARPA.  ... 
doi:10.1145/367742.367747 fatcat:fxb2cwkep5h3hh6nhjqfbsr5da

Improving server software support for simultaneous multithreaded processors

Luke K. McDowell, Susan J. Eggers, Steven D. Gribble
2003 Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '03  
This paper evaluates how SMT's changes to the underlying hardware affects server software, and in particular, SMT's effects on memory allocation and synchronization.  ...  Overall, our results demonstrate how a few simple changes to applications' run-time support libraries can dramatically boost the performance of multi-threaded servers on SMT, without requiring modifications  ...  Overall, our goal is to identify the minimal set of changes that enables applications written for a uniprocessor or a shared memory multiprocessor to achieve good performance on an SMT.  ... 
doi:10.1145/781503.781504 fatcat:3ewwhcryubafbleaaqrq6wf4hq

Improving server software support for simultaneous multithreaded processors

Luke K. McDowell, Susan J. Eggers, Steven D. Gribble
2003 Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming - PPoPP '03  
This paper evaluates how SMT's changes to the underlying hardware affects server software, and in particular, SMT's effects on memory allocation and synchronization.  ...  Overall, our results demonstrate how a few simple changes to applications' run-time support libraries can dramatically boost the performance of multi-threaded servers on SMT, without requiring modifications  ...  Overall, our goal is to identify the minimal set of changes that enables applications written for a uniprocessor or a shared memory multiprocessor to achieve good performance on an SMT.  ... 
doi:10.1145/781498.781504 dblp:conf/ppopp/McDowellEG03 fatcat:he5oz7dmwrbobhpigptv2c43l4
« Previous Showing results 1 — 15 out of 177 results