Filters








41 Hits in 6.0 sec

Experience distributing objects in an SMMP OS

Jonathan Appavoo, Michael Stumm, Livio Soares, Dilma Da Silva, Orran Krieger, Marc Auslander, Michal Ostrowski, Bryan Rosenburg, Amos Waterland, Robert W. Wisniewski, Jimi Xenidis
2007 ACM Transactions on Computer Systems  
Designing and implementing system software so that it scales well on shared-memory multiprocessors (SMMPs) has proven to be surprisingly challenging.  ...  In this paper, we describe a methodology and a framework for constructing system software structured for locality, exploiting techniques similar to those used in distributed systems.  ...  However, Clustered Objects in K42 are specifically designed for shared-memory multiprocessors as opposed to loosely-coupled distributed systems, and focus primarily on maximizing SMMP locality.  ... 
doi:10.1145/1275517.1275518 fatcat:ciavjiuem5bdxgueezrqyj7fpy

Hierarchical clustering: A structure for scalable multiprocessor operating system design

Ronald C. Unrau, Orran Krieger, Benjamin Gamsa, Michael Stumm
1995 Journal of Supercomputing  
We introduce the concept of Hierarchical Clustering as a way to structure shared memory multiprocessor operating systems for scalability.  ...  The resulting structure is scalable because it i) maximizes locality, which is key to good performance in NUMA systems, and ii) provides for concurrency that increases linearly with the number of processors  ...  Acknowledgments We wish to thank David Blythe, Yonatan Hanna, and Songnian Zhou for their significant contribution to the design and implementation of HURRICANE.  ... 
doi:10.1007/bf01245400 fatcat:scrxaexlcrcsznvlavuexibwpq

Everything you always wanted to know about synchronization but were afraid to ask

Tudor David, Rachid Guerraoui, Vasileios Trigonakis
2013 Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles - SOSP '13  
We span multiple layers, from hardware cache-coherence protocols up to high-level concurrent software.  ...  We draw a set of observations that, roughly speaking, imply that scalability of synchronization is mainly a property of the hardware. *  ...  Reusable Concurrent Data Types.  ... 
doi:10.1145/2517349.2522714 dblp:conf/sosp/DavidGT13 fatcat:us6lcqct45fmblrpavjie4o7we

Comparing Operating Systems Scalability on Multicore Processors by Microbenchmarking

Yan CUI, Yu CHEN, Yuanchun SHI
2012 IEICE transactions on information and systems  
As open-source operating systems are of particular interests in the research and industry communities, in this paper we choose three operating systems (Linux, Solaris and FreeBSD) to systematically evaluate  ...  With the help of performance tools and source code instrumentation and analysis, we find that synchronization primitives protecting shared data structures in the kernels are the major bottleneck limiting  ...  For example, K42 [2] and Tornado [4] , which are designed in the object-oriented manner, can achieve both locality and scalability on multiprocessors.  ... 
doi:10.1587/transinf.e95.d.2810 fatcat:ketv335yvrdivc4eeerza3xb24

TS-Router: On maximizing the Quality-of-Allocation in the On-Chip Network

Yuan-Ying Chang, Yoshi Shih-Chieh Huang, M. Poremba, V. Narayanan, Yuan Xie, C. King
2013 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA)  
In this paper, we demonstrate that the matching decisions made in a router along time actually form a time series, and the Quality-of-Allocation (QoA) can be maximized if the matching decision is made  ...  This allocation is in essence a matching between the input requests and output port resources. Efficient router designs strive to maximize the matching.  ...  Acknowledgements This work was supported in part by NSF grants 1205618, 1213052, 1147388, 0916887, 0905365 and 0903432 as well as Industrial Technology Research Institute and National Science Council grant  ... 
doi:10.1109/hpca.2013.6522335 dblp:conf/hpca/ChangHPNXK13 fatcat:pobkx2dqz5gzrhl2t6ytyjdkna

Experience on Comparison of Operating Systems Scalability on the Multi-core Architecture

Yan Cui, Yingxin Wang, Yu Chen, Yuanchun Shi
2011 2011 IEEE International Conference on Cluster Computing  
As open-source operating systems are of particular interests in the research and industry communities, in this paper we choose three operating systems (Linux, Solaris and FreeBSD) to systematically evaluate  ...  running on a system of a larger number of cores.  ...  ACKNOWLEDGMENT We wish to thank Frans Kaashoek and Song Jiang for their insightful advice and comments in this paper.  ... 
doi:10.1109/cluster.2011.31 dblp:conf/cluster/CuiWCS11 fatcat:qexxp6uu2rcjvfggrx6o2p2cgi

Single system image: A survey

Philip Healy, Theo Lynn, Enda Barrett, John P. Morrison
2016 Journal of Parallel and Distributed Computing  
This approach encompasses decades of research using a broad variety of techniques at varying levels of abstraction, from custom hardware and distributed hypervisors to specialized operating system kernels  ...  h i g h l i g h t s • We provide a retrospective survey of single system image. • There has been novel recent work in the area of distributed hypervisors. • Despite a peak in interest in the 2000s, kernel-level  ...  The research work described in this paper was supported by the Irish Centre for Cloud Computing and Commerce, an Irish national technology centre funded by Enterprise Ireland and the Irish Industrial Development  ... 
doi:10.1016/j.jpdc.2016.01.004 fatcat:6eqzcmrmu5eptmktzn52i7yy7e

Realtime forest animation in wind

Nimish J. Oliapuram, Subodh Kumar
2010 Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing - ICVGIP '10  
We present a system for interactive animation of trees in a wind-field. Wind forces are simulated by Navier-Stokes equations, solved in real-time using multi-processor CUDA architecture.  ...  We demonstrate animation of a forest of trees in a variety of wind conditions under interactive control of a user.  ...  Local memory resides in global memory and therefore has the same performance overhead. However, like registers, it has the scope of a thread. It may be used for local variables, especially arrays.  ... 
doi:10.1145/1924559.1924586 dblp:conf/icvgip/OliapuramK10 fatcat:omc46qx3pjdahh7mdh4cno3iki

Globally Synchronized Frames for guaranteed quality-of-service in on-chip networks

Jae W. Lee, Man Cheuk Ng, Krste Asanović
2012 Journal of Parallel and Distributed Computing  
Future chip multiprocessors (CMPs) may have hundreds to thousands of threads competing to access shared resources, and will require quality-of-service (QoS) support to improve system utilization.  ...  According to our simulations, all concurrent flows receive their guaranteed minimum share of bandwidth in compliance with a given bandwidth allocation.  ...  Introduction Advances in fabrication technology allow the integration of many processors on a chip to form a chip multiprocessor (CMP), possibly in the form of a complex system-on-a-chip (SoC) with custom  ... 
doi:10.1016/j.jpdc.2012.01.013 fatcat:k7rn5pj6w5gm5e6rzfn5cxinfm

Helios

Edmund B. Nightingale, Orion Hodson, Ross McIlroy, Chris Hawblitzel, Galen Hunt
2009 Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles - SOSP '09  
We offloaded several applications and operating system components, often by changing only a single line of metadata.  ...  Helios is an operating system designed to simplify the task of writing, deploying, and tuning applications for heterogeneous platforms.  ...  We also thank Jeremy Condit and Derrick Coetzee for their work in writing applications for Helios.  ... 
doi:10.1145/1629575.1629597 dblp:conf/sosp/NightingaleHMHH09 fatcat:pcbyzqrw2bdhbf67h6ryqasofu

Cluster Computing White Paper [article]

Mark Baker, et. al
2001 arXiv   pre-print
It is, however, evident that there is a growing interest in its usage in all areas where applications have traditionally used parallel or distributed computing platforms.  ...  The status of this paper is draft and we are at the stage of publicizing its presence and making a Request For Comments (RFC).  ...  References There are many different HA products available to the cluster community, each with several HA applications enabled for them in a pre-packaged form, and others being supported by various vendors  ... 
arXiv:cs/0004014v2 fatcat:cx3lbz6fcbcmrgicye7ure7u3u

Region Templates: Data Representation and Management for Large-Scale Image Analysis [article]

George Teodoro, Tony Pan, Tahsin Kurc, Jun Kong, Lee Cooper, Scott Klasky, Joel Saltz
2014 arXiv   pre-print
A number of optimizations for hybrid machines are available in our runtime system, including performance-aware scheduling for maximizing utilization of computing devices and techniques to reduce impact  ...  The execution of region templates applications is coordinated by a runtime system that supports efficient execution in hybrid machines.  ...  This work was supported in part by HHSN261200800001E from the NCI, R24HL085343 from the NHLBI, R01LM011119-01 and R01LM009239 from the NLM, RC4MD005964 from the NIH, PHS UL1RR025008 from the NIH CTSA,  ... 
arXiv:1405.7958v1 fatcat:doemcxj4djhmnplhvhkimy2x3q

Region templates: Data representation and management for high-throughput image analysis

George Teodoro, Tony Pan, Tahsin Kurc, Jun Kong, Lee Cooper, Scott Klasky, Joel Saltz
2014 Parallel Computing  
The execution of the application is coordinated by a runtime system that implements optimizations for hybrid machines, including performance-aware scheduling for maximizing the utilization of computing  ...  Optimizations in a high speed disk based storage implementation of the abstraction to support asynchronous data transfers and computation result in an application performance gain of about 1.13×.  ...  Acknowledgments This work was supported in part by HHSN261200800001E and 1U24CA180924-01A1 from the NCI, R24HL085343 from the NHLBI, R01LM011119-01 and R01LM009239 from the NLM, RC4MD005964 from the NIH  ... 
doi:10.1016/j.parco.2014.09.003 pmid:26139953 pmcid:PMC4484879 fatcat:4miblqmyyzad5bdcvmzxngv2oy

OceanStore

John Kubiatowicz, Chris Wells, Ben Zhao, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels, Ramakrishna Gummadi, Sean Rhea, Hakim Weatherspoon
2000 ACM SIGOPS Operating Systems Review  
OceanStore is a utility infrastructure designed to span the globe and provide continuous access to persistent information.  ...  A prototype implementation is currently under development.  ...  support of our DARPA program manager, Jean Scholtz, and industrial funding from EMC and IBM.  ... 
doi:10.1145/384264.379239 fatcat:fuvjbfixt5ge7kmin6iuu4suhi

OceanStore

John Kubiatowicz, Chris Wells, Ben Zhao, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels, Ramakrishna Gummadi, Sean Rhea, Hakim Weatherspoon
2000 SIGARCH Computer Architecture News  
OceanStore is a utility infrastructure designed to span the globe and provide continuous access to persistent information.  ...  A prototype implementation is currently under development.  ...  support of our DARPA program manager, Jean Scholtz, and industrial funding from EMC and IBM.  ... 
doi:10.1145/378995.379239 fatcat:wcsnxbrcojhqpiwnppspqa7tym
« Previous Showing results 1 — 15 out of 41 results