Filters








10 Hits in 2.3 sec

DMTCP: Transparent checkpointing for cluster computations and the desktop

Jason Ansel, Kapil Arya, Gene Cooperman
2009 2009 IEEE International Symposium on Parallel & Distributed Processing  
DMTCP (Distributed MultiThreaded CheckPointing) is a transparent user-level checkpointing package for distributed applications.  ...  On 128 distributed cores (32 nodes), checkpoint and restart times are typically 2 seconds, with negligible run-time overhead.  ...  Acknowledgements We thank our colleagues at CERN who have discussed, helped debug, and tested the use of DMTCP on runCMS and on ParGeant4.  ... 
doi:10.1109/ipdps.2009.5161063 dblp:conf/ipps/AnselAC09 fatcat:vh4wyp3a4fey5aumolruqpbthy

DMTCP: Transparent Checkpointing for Cluster Computations and the Desktop [article]

Jason Ansel, Kapil Arya, Gene Cooperman
2009 arXiv   pre-print
DMTCP (Distributed MultiThreaded CheckPointing) is a transparent user-level checkpointing package for distributed applications.  ...  On 128 distributed cores (32 nodes), checkpoint and restart times are typically 2 seconds, with negligible run-time overhead.  ...  Acknowledgements We thank our colleagues at CERN who have discussed, helped debug, and tested the use of DMTCP on runCMS and on ParGeant4.  ... 
arXiv:cs/0701037v3 fatcat:cdqecaf6nre2fifjd5unro4kpm

PMI: A Scalable Parallel Process-Management Interface for Extreme-Scale Systems [chapter]

Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Jayesh Krishna, Ewing Lusk, Rajeev Thakur
2010 Lecture Notes in Computer Science  
Parallel programming models on large-scale systems require a scalable system for managing the processes that make up the execution of a parallel program.  ...  them communicate with each other.  ...  Our performance results demonstrate significant advantages of PMI-2 compared with PMI-1.  ... 
doi:10.1007/978-3-642-15646-5_4 fatcat:fnfmqbw6lvfr3pobr6dylv5ikq

Implementing an Application-Specific Instruction-Set Processor for System-Level Dynamic Program Analysis Engines

Ingoo Heo, Minsu Kim, Yongje Lee, Changho Choi, Jinyong Lee, Brent Byunghoon Kang, Yunheung Paek
2015 ACM Transactions on Design Automation of Electronic Systems  
With the specialized architecture and programmability of software, our PAU aims at fast computation and sufficient flexibility.  ...  To address this, in this paper, we propose an implementation of programmable DPA hardware engine, called program analysis unit (PAU).  ...  SWD multi is about 4 times faster than SWD, and MPD multi improves the performance of MPD by 31.9%.  ... 
doi:10.1145/2746238 fatcat:w6vewbvm7jc3jfkj7i6b6pkp4i

MpD: A Multiprocessor C Debugger

Krish Ponamgi, Columbia University. Computer Science
2017
MpD is a multiprocessor C debugger designed for multithreaded applications running under the Mach operating system. MpD is built on top of gdb, an existing sequential debugger.  ...  The MpD layer utilizes the modeling languages Data Path Expressions developed by Hseush and Kaiser to provide a rich set of commands to trace sequential and parallel execution of a program.  ...  The Caswell-Black version of gdb implements the ptracefdebugger extensiolls needed to deal with multithreaded programs.  ... 
doi:10.7916/d8p84m0z fatcat:pdubzb6ljzbkjovaui3qvmnncu

B.Y.O.C (1,342 times and counting)

Poul-Henning Kamp
2011 Communications of the ACM  
aribas-1.64 --Interpreter for big integer/multi-precision floating point arithmetic ario-1.5 --Ario is a GTK2 client for MPD arirang-2.00,1 --Powerful webserver security scanner for network arista-0.9.5  ...  mod_blowchunks-1.3_1 --Apache module for rejecting and logging chunked requests ap13-mod_bunzip2-1 --Apache module for server-side decompression of bzip2 files ap13-mod_cgi_debug-0.7 --Apache module to make debugging  ...  For data mining FOSS (free and open source software) programs, the FreeBSD operating system ships with a handy facility called the Ports Collection, containing strategic metadata for 22.003 pieces of FOSS  ... 
doi:10.1145/1897852.1897870 fatcat:2elva4fgp5cdnn5b5lgmlp5shi

Data-centric parallel debugging technique for petascale computers

Minh Ngoc Dinh
2017
These assertions are modeled on programming language systems that support the verification and validation of program state as a whole rather than focusing on that of only a single process state.  ...  Petascale scientific applications, such as the Weather Research and Forecasting Model (WRF), involve enormous multi-dimensional data structures and operate with hundreds of thousands of concurrent processing  ...  These programs were selected because debugging them with traditional debugging techniques is ineffective and expensive.  ... 
doi:10.4225/03/58a652813d3bc fatcat:dzln6kyv3zexlo7cgmuqr4uixu

Component-based cluster systems software architecture a case study

N. Desai, R. Bradshaw, E. Lusk, R. Butler
2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935)  
By "cluster system software," we mean the collection of programs used in configuring and maintaining individual nodes, together with the software involved in submission, scheduling, monitoring, and termination  ...  cluster with unusually complex systems software requirements.  ...  W-31-109-ENG-38 with the U.S. Department of Energy. The U.S.  ... 
doi:10.1109/clustr.2004.1392629 dblp:conf/cluster/DesaiBLB04 fatcat:zykwfzguurcivjkr6ayokbi4iq

Android Stack Machine [chapter]

Taolue Chen, Jinlong He, Fu Song, Guozhen Wang, Zhilin Wu, Jun Yan
2018 Lecture Notes in Computer Science  
The model is based on pushdown systems with multiple stacks, and focuses on the evolution of the back stack of the Android system when interacting with activities carrying specific launch modes and task  ...  For Android, technically ASM can be viewed as the counterpart of pushdown systems with multiple stacks, which are the de facto model for (multithreaded) concurrent programs.  ...  A multitude of classes of MPDSs have been considered, mostly as a model for concurrent recursive programs. In general, an ASM can be encoded as an MPDS.  ... 
doi:10.1007/978-3-319-96142-2_29 fatcat:s5ogoekvlbbqjnq45y6qespgb4

Sensor Fusion for Location Estimation Technologies

Matei-Eugen Vasile, John Darlington
2012
drivers and kernel programming.  ...  This is done using a custom-built program that implements the algorithm discussed next. 20 .  ... 
doi:10.25560/9856 fatcat:kfxcuzzs5rfvfhb3osa472mjca