Princeton University Library Catalog
Reliable Distributed Systems, 13th Symposium On
Author
IEEE Computer Society Press
Format
Book
Language
English
Published/Created
[Place of publication not identified] : IEEE Computer Society Press, 1994
Description
1 online resource (240 pages)
Details
Subject(s)
Electronic data processing -- Distributed processing -- Congresses
Database management -- Congresses
Summary note
Parallel computing systems provide hardware redundancy that helps achieve low-cost fault tolerance: a task is duplicated on more than one processor, and the processors' states are compared at checkpoints. This paper proposes a novel technique, based on a Markov Reward Model (MRM), for analyzing the performance of checkpointing schemes with task duplication. We show how this technique can be used to derive the average execution time of a task and other important parameters related to the performance of checkpointing schemes. Our analytical results agree well with the values obtained from a simulation program. We compare the average task execution time and total work of four checkpointing schemes and show that, in general, increasing the number of processors reduces the average execution time but increases the total work done by the processors. However, when the times required to perform different operations differ greatly, these results can change.
Notes
Bibliographic Level Mode of Issuance: Monograph
Language note
English
DOI
10.1109/RELDIS.1994
Supplementary Information
Other versions
Proceedings, 13th Symposium on Reliable Distributed Systems : Dana Point, California, October 25-27, 1994 / sponsored by IEEE Computer Society Technical Committee on Distributed Processing, IEEE Computer Society Technical Committee on Fault-Tolerant Computing, IFIP WG 10.4 on Dependable Computing.
id
999511093506421