A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Filters
NT-SwiFT: software implemented fault tolerance on Windows NT
2004
Journal of Systems and Software
In recent years, an increasing number of highly available applications are being implemented on Windows NT. ...
on Windows NT, 2000. ...
Acknowledgements The authors would also like to thank Gaurav Suri and Yi-Min Wang who implemented the first prototype of watchd and libft in NT-SwiFT and are grateful to Dave Korn for his help in using ...
doi:10.1016/s0164-1212(02)00154-1
fatcat:m7lgq6dnuvdhbdyjvv4erksf4i
Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data
[chapter]
2002
Lecture Notes in Computer Science
We use example of LAN of Windows NT based computers to present methods for collecting and analyzing failure data to characterize network system dependability. ...
For example, the study of failures in a network of Windows NT machines reveals that most of the problems that lead to reboots are software related and that though the average availability evaluates to ...
This manuscript is based on a research supported in part by NASA under grant NAG-1-613, in cooperation with the Illinois Computer Laboratory for Aerospace Systems and Software (ICLASS), by Tandem Computers ...
doi:10.1007/3-540-45798-4_13
fatcat:uce7va7k6nerjehped7tlvxab4
Designing SSI clusters with hierarchical checkpointing and single I/O space
1999
IEEE Concurrency
In the industrial track, Wolfpack for Intelbased Windows NT servers, Berkeley NOW, and Solaris MC for Unix workstations are all aimed at high availability, scalability, and manageability. ...
In an experimental cluster at USC, we modify the device-relevant system calls in Linux to run on Pentium-based PC hosts. ...
He leads a design group at NCIC building a series of cluster-based superservers and scalable multicomputers. ...
doi:10.1109/4434.749136
fatcat:tk7exsomkjfevjuypo7woihsiq
Fault-Tolerant Dynamic Task Scheduling Based on Dataflow Graphs
[chapter]
1998
Fault-Tolerant Parallel and Distributed Systems
Fault tolerance is based on dynamic redundancy comprising checkpointing, self-diagnosis and rollback recovery. The schedule is computed dynamically during the runtime of the process system. ...
The algorithm is described in detail for the 1-fault case and some performance measurements on a multi-transputer system are given. ...
For workstationclusters the environment usually runs on one node of the virtual multicomputer. Supported operating systems are Solaris, Linux and Windows (95, NT). ...
doi:10.1007/978-1-4615-5449-3_21
fatcat:qrg5p3jearbetlus4xjarudddq
Dynamic scheduling of process groups
1998
Concurrency Practice and Experience
for checkpointing of process groups. ...
of isolation for some time. ...
Acknowledgments The authors express their thanks to Vietor Abell for pointing out the importance of memory constraints for process group context switching. ...
doi:10.1002/(sici)1096-9128(19980410)10:4<265::aid-cpe314>3.0.co;2-u
fatcat:mpfvzhetonb45brlmuarfd4xh4
Chameleon: a software infrastructure for adaptive fault tolerance
1999
IEEE Transactions on Parallel and Distributed Systems
Employing ARMORs, Chameleon makes available different fault-tolerant configurations and maintains run-time adaptation to changes in the availability requirements of an application. ...
They also make available a host's resources to the Chameleon environment. 3) Common ARMORs implement specific techniques for providing application-required dependability. ...
Mathur) for many insightful discussions. We would like to thank J. Wang and M. Kalyanakrishnan for their contributions in developing the application and porting the environment to Windows NT. ...
doi:10.1109/71.774907
fatcat:y3f7mkvjtbfctmh4kiboeeoucm
The Design and Architecture of the Microsoft Cluster Service -- A Practical Approach to High-Availability and Scalability
[article]
1998
arXiv
pre-print
Microsoft Cluster Service (MSCS) extends the Win-dows NT operating system to support high-availability services. ...
Later ver-sions of MSCS will provide scalability via a node and application management system that allows applications to scale to hundreds of nodes. ...
Node A node is a self-contained Windows NT™ system that can run an instance of the Cluster Service. Groups of nodes implement a cluster. ...
arXiv:cs/9809006v1
fatcat:c7zr3ayiknh45nbkqadvn4ko6e
A Framework for Opportunistic Cluster Computing Using JavaSpaces1)
[chapter]
2001
Lecture Notes in Computer Science
The fundamental idea is to provide large amounts of processing capacity over extended periods of time by harnessing the idle and available resources on the network in an "opportunistic" manner. ...
Heterogeneous networked clusters are being increasingly used as platforms for resourceintensive parallel and distributed applications. ...
The segment size for each strip is kept to be 20 leading to 25 sub tasks for each iteration. Each page request spawns off a new
using a cluster of Windows NT workstations. ...
doi:10.1007/3-540-48228-8_74
fatcat:72oua4fshnealddyxfhcqfwtxm
High-Performance Throughput Computing
2005
IEEE Micro
This helps ensure that the memory system will have responded with the data in a timely fashion. ...
Throughput computing Systems designed for throughput computing emphasize the overall work performed over a fixed time period as opposed to focusing on a metric describing how fast a single core or a thread ...
doi:10.1109/mm.2005.49
fatcat:wq3ukuhpg5gubkia7bdjwz2nfu
Control Theoretic Approach to Platform Optimization using HMM
[chapter]
2011
Hidden Markov Models, Theory and Applications
Distributing the available power to system components in a manner that maximizes the overall performance. ...
This profile is the summary of system resource usage trends over a period of time (or system DNA). ...
Running Average Power Synthesizer (RAPS) RAPS is a running average power calculator for a monitored quantity over an enforcement window. ...
doi:10.5772/15038
fatcat:angt6jtebbh3tjv5fa5eorn7tu
Cluster Computing
[chapter]
2001
Lecture Notes in Computer Science
This ownership often brings problems when attempting to form a cluster of workstations. ...
Generally a workstation will be "owned" by, for example, an individual, a groups, a department, or an organisation. They are dedicated to the exclusive use by the "owners". ...
The WANE client distribution provides support for MacintoshOS, DOS, Windows, Windows NT, OS/2, and Linux.
Chapter6 6. ...
doi:10.1007/3-540-44681-8_100
fatcat:cr6rpynstjgufacwbmciciriha
Cluster Computing White Paper
[article]
2001
arXiv
pre-print
Cluster computing is not a new area of computing. ...
components available to support high performance and high availability applications. ...
References There are many different HA products available to the cluster community, each with several HA applications enabled for them in a pre-packaged form, and others being supported by various vendors ...
arXiv:cs/0004014v2
fatcat:cx3lbz6fcbcmrgicye7ure7u3u
Efficient network and I/O throttling for fine-grain cycle stealing
2001
Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM) - Supercomputing '01
We use a sliding window of recent events to compute the average rate for a target resource. ...
This paper proposes and evaluates a new mechanism, rate windows, for I/O and network rate policing. ...
Windows 2000 (nee Windows NT) and the stackable file system [10] provide the required calls. ...
doi:10.1145/582034.582037
dblp:conf/sc/RyuHK01
fatcat:xtdakvrttzbydpuqob3dfu5tcm
Apache Flink™: Stream and Batch Processing in a Single Engine
2015
IEEE Data Engineering Bulletin
Architectural patterns such as the "lambda architecture" [21] combine batch and stream processing systems to implement multiple paths of computation: a streaming fast path for timely approximate results ...
Apache Flink 1 is an open-source system for processing streaming and batch data. ...
In a high-availability setup, the JobManager persists a minimal set of metadata at each checkpoint to a fault-tolerant storage, such that a standby JobManager can reconstruct the checkpoint and recover ...
dblp:journals/debu/CarboneKEMHT15
fatcat:xzgvdr6pljctzb75xecvg74m3q
Analyzing Virtual Machine Live Migration in Application Data Context
2016
International Journal of Advanced Computer Science and Applications
memory while the allocated memory could be discarded for a while. ...
The migration of one virtual machine to another requires some time depending on the network bandwidth, guest availability, hardware limitation overcomes, resource allocation, server reallocation, hypervisor ...
based system. ...
doi:10.14569/ijacsa.2016.070554
fatcat:rjywhgmyrzdzblryan72nos3u4
« Previous
Showing results 1 — 15 out of 1,031 results