This book presents the most important fault-tolerant distributed programming abstractions and their associated distributed algorithms, in particular in terms of reliable communication and agreement, which lie at the heart of nearly all ...
Fault tolerant Agreement in Synchronous Message passing Systems
Communication and Agreement Abstractions for Fault tolerant Asynchronous Distributed Systems
Fault Tolerant Parallel and Distributed Systems
International e Conference of Computer Science 2006
Programming Distributed Systems
Distributed Systems for System Architects
Wiley Encyclopedia of Electrical and Electronics Engineering Volume 17
Space Reclamation for Uncoordinated Checkpointing in Message passing Systems
Fault tolerance Implemented by Voting Protocols in Distributed Systems
Recent Advances in Parallel Virtual Machine and Message Passing Interface
To overcome the high overhead drawbacks of current fault tolerant MPI systems ,
this paper presents TH - MPI for parallel cluster systems . ... 1 Introduction The
clusters of PCs have become popular platforms for computationally intensive
distributed applications . ... open source operation system , such as Linux , and
the availability of standard message passing systems , such as Message Passing
...
Parallel Architectures
Proceedings
An Algorithm for Supporting Fault Tolerant Objects in Distributed Object Oriented
Operating Systems Ganesha Beedubail, Anish ... A simple message logging
scheme that pairs the logging of response message and the next request
message reduces the message ... tolerance for distributed system is primarily
addressed for process based systems with asynchronous message passing[5, 10
, 11, 7, 12, 13].
PARBASE 90 International Conference on Databases Parallel Architectures and Their Applications
On the Fault Tolerance of Process Allocation and Communication on Message Passing Multiprocessor Systems
High Performance Computing and Communications
Formal Models and Semantics
In the term distributed computing , the word distributed means spread out across
space . ... The process models that are most obviously distributed are ones in
which processes communicate by message passing - a process sends a
message by adding it to a ... of most real distributed systems , but one often
studies distributed algorithms that are not fault tolerant , leaving other
mechanisms ( such as ...