1,031 Hits in 4.0 sec

NT-SwiFT: software implemented fault tolerance on Windows NT

Deron Liang, P. Emerald Chung, Yennun Huang, Chandra Kintala, Woei-Jyh Lee, Timothy K. Tsai, Chung-Yih Wang
2004 Journal of Systems and Software  
In recent years, an increasing number of highly available applications are being implemented on Windows NT.  ...  on Windows NT, 2000.  ...  Acknowledgements The authors would also like to thank Gaurav Suri and Yi-Min Wang who implemented the first prototype of watchd and libft in NT-SwiFT and are grateful to Dave Korn for his help in using  ... 
doi:10.1016/s0164-1212(02)00154-1 fatcat:m7lgq6dnuvdhbdyjvv4erksf4i

Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data [chapter]

Ravishankar K. Iyer, Zbigniew Kalbarczyk
2002 Lecture Notes in Computer Science  
We use example of LAN of Windows NT based computers to present methods for collecting and analyzing failure data to characterize network system dependability.  ...  For example, the study of failures in a network of Windows NT machines reveals that most of the problems that lead to reboots are software related and that though the average availability evaluates to  ...  This manuscript is based on a research supported in part by NASA under grant NAG-1-613, in cooperation with the Illinois Computer Laboratory for Aerospace Systems and Software (ICLASS), by Tandem Computers  ... 
doi:10.1007/3-540-45798-4_13 fatcat:uce7va7k6nerjehped7tlvxab4

Designing SSI clusters with hierarchical checkpointing and single I/O space

Kai Hwang, Hai Jin, E. Chow, Cho-Li Wang, Zhiwei Xu
1999 IEEE Concurrency  
In the industrial track, Wolfpack for Intelbased Windows NT servers, Berkeley NOW, and Solaris MC for Unix workstations are all aimed at high availability, scalability, and manageability.  ...  In an experimental cluster at USC, we modify the device-relevant system calls in Linux to run on Pentium-based PC hosts.  ...  He leads a design group at NCIC building a series of cluster-based superservers and scalable multicomputers.  ... 
doi:10.1109/4434.749136 fatcat:tk7exsomkjfevjuypo7woihsiq

Fault-Tolerant Dynamic Task Scheduling Based on Dataflow Graphs [chapter]

Erik Maehle, Franz-J. Markus
1998 Fault-Tolerant Parallel and Distributed Systems  
Fault tolerance is based on dynamic redundancy comprising checkpointing, self-diagnosis and rollback recovery. The schedule is computed dynamically during the runtime of the process system.  ...  The algorithm is described in detail for the 1-fault case and some performance measurements on a multi-transputer system are given.  ...  For workstationclusters the environment usually runs on one node of the virtual multicomputer. Supported operating systems are Solaris, Linux and Windows (95, NT).  ... 
doi:10.1007/978-1-4615-5449-3_21 fatcat:qrg5p3jearbetlus4xjarudddq

Dynamic scheduling of process groups

1998 Concurrency Practice and Experience  
for checkpointing of process groups.  ...  of isolation for some time.  ...  Acknowledgments The authors express their thanks to Vietor Abell for pointing out the importance of memory constraints for process group context switching.  ... 
doi:10.1002/(sici)1096-9128(19980410)10:4<265::aid-cpe314>;2-u fatcat:mpfvzhetonb45brlmuarfd4xh4

Chameleon: a software infrastructure for adaptive fault tolerance

Z.T. Kalbarczyk, R.K. Iyer, S. Bagchi, K. Whisnant
1999 IEEE Transactions on Parallel and Distributed Systems  
Employing ARMORs, Chameleon makes available different fault-tolerant configurations and maintains run-time adaptation to changes in the availability requirements of an application.  ...  They also make available a host's resources to the Chameleon environment. 3) Common ARMORs implement specific techniques for providing application-required dependability.  ...  Mathur) for many insightful discussions. We would like to thank J. Wang and M. Kalyanakrishnan for their contributions in developing the application and porting the environment to Windows NT.  ... 
doi:10.1109/71.774907 fatcat:y3f7mkvjtbfctmh4kiboeeoucm

The Design and Architecture of the Microsoft Cluster Service -- A Practical Approach to High-Availability and Scalability [article]

Werner Vogels, Dan Dumitriu, Ken Birman, Rod Gamache, Mike Massa, Rob Short, John Vert, Joe Barrera
1998 arXiv   pre-print
Microsoft Cluster Service (MSCS) extends the Win-dows NT operating system to support high-availability services.  ...  Later ver-sions of MSCS will provide scalability via a node and application management system that allows applications to scale to hundreds of nodes.  ...  Node A node is a self-contained Windows NT™ system that can run an instance of the Cluster Service. Groups of nodes implement a cluster.  ... 
arXiv:cs/9809006v1 fatcat:c7zr3ayiknh45nbkqadvn4ko6e

A Framework for Opportunistic Cluster Computing Using JavaSpaces1) [chapter]

Jyoti Batheja, Manish Parashar
2001 Lecture Notes in Computer Science  
The fundamental idea is to provide large amounts of processing capacity over extended periods of time by harnessing the idle and available resources on the network in an "opportunistic" manner.  ...  Heterogeneous networked clusters are being increasingly used as platforms for resourceintensive parallel and distributed applications.  ...  The segment size for each strip is kept to be 20 leading to 25 sub tasks for each iteration. Each page request spawns off a new using a cluster of Windows NT workstations.  ... 
doi:10.1007/3-540-48228-8_74 fatcat:72oua4fshnealddyxfhcqfwtxm

High-Performance Throughput Computing

S. Chaudhry, P. Caprioli, S. Yip, M. Tremblay
2005 IEEE Micro  
This helps ensure that the memory system will have responded with the data in a timely fashion.  ...  Throughput computing Systems designed for throughput computing emphasize the overall work performed over a fixed time period as opposed to focusing on a metric describing how fast a single core or a thread  ... 
doi:10.1109/mm.2005.49 fatcat:wq3ukuhpg5gubkia7bdjwz2nfu

Control Theoretic Approach to Platform Optimization using HMM [chapter]

Rahul Khanna, Huaping Liu, Mariette Aw
2011 Hidden Markov Models, Theory and Applications  
Distributing the available power to system components in a manner that maximizes the overall performance.  ...  This profile is the summary of system resource usage trends over a period of time (or system DNA).  ...  Running Average Power Synthesizer (RAPS) RAPS is a running average power calculator for a monitored quantity over an enforcement window.  ... 
doi:10.5772/15038 fatcat:angt6jtebbh3tjv5fa5eorn7tu

Cluster Computing [chapter]

Mark Baker, John Brooke, Ken Hawick, Rajkumar Buyya
2001 Lecture Notes in Computer Science  
This ownership often brings problems when attempting to form a cluster of workstations.  ...  Generally a workstation will be "owned" by, for example, an individual, a groups, a department, or an organisation. They are dedicated to the exclusive use by the "owners".  ...  The WANE client distribution provides support for MacintoshOS, DOS, Windows, Windows NT, OS/2, and Linux. Chapter6 6.  ... 
doi:10.1007/3-540-44681-8_100 fatcat:cr6rpynstjgufacwbmciciriha

Cluster Computing White Paper [article]

Mark Baker, et. al
2001 arXiv   pre-print
Cluster computing is not a new area of computing.  ...  components available to support high performance and high availability applications.  ...  References There are many different HA products available to the cluster community, each with several HA applications enabled for them in a pre-packaged form, and others being supported by various vendors  ... 
arXiv:cs/0004014v2 fatcat:cx3lbz6fcbcmrgicye7ure7u3u

Efficient network and I/O throttling for fine-grain cycle stealing

Kyung D. Ryu, Jeffrey K. Hollingsworth, Peter J. Keleher
2001 Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM) - Supercomputing '01  
We use a sliding window of recent events to compute the average rate for a target resource.  ...  This paper proposes and evaluates a new mechanism, rate windows, for I/O and network rate policing.  ...  Windows 2000 (nee Windows NT) and the stackable file system [10] provide the required calls.  ... 
doi:10.1145/582034.582037 dblp:conf/sc/RyuHK01 fatcat:xtdakvrttzbydpuqob3dfu5tcm

Apache Flink™: Stream and Batch Processing in a Single Engine

Paris Carbone, Asterios Katsifodimos, Stephan Ewen, Volker Markl, Seif Haridi, Kostas Tzoumas
2015 IEEE Data Engineering Bulletin  
Architectural patterns such as the "lambda architecture" [21] combine batch and stream processing systems to implement multiple paths of computation: a streaming fast path for timely approximate results  ...  Apache Flink 1 is an open-source system for processing streaming and batch data.  ...  In a high-availability setup, the JobManager persists a minimal set of metadata at each checkpoint to a fault-tolerant storage, such that a standby JobManager can reconstruct the checkpoint and recover  ... 
dblp:journals/debu/CarboneKEMHT15 fatcat:xzgvdr6pljctzb75xecvg74m3q

Analyzing Virtual Machine Live Migration in Application Data Context

Mutiullah Shaikh, Asadullah Shaikh, Muhammad Ali, Farah Deeba
2016 International Journal of Advanced Computer Science and Applications  
memory while the allocated memory could be discarded for a while.  ...  The migration of one virtual machine to another requires some time depending on the network bandwidth, guest availability, hardware limitation overcomes, resource allocation, server reallocation, hypervisor  ...  based system.  ... 
doi:10.14569/ijacsa.2016.070554 fatcat:rjywhgmyrzdzblryan72nos3u4
« Previous Showing results 1 — 15 out of 1,031 results