Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Masters Theses
  5. Diskless checkpointing
Details

Diskless checkpointing

Date Issued
May 1, 1997
Author(s)
Puening, Michael A.
Advisor(s)
James S. Plank
Additional Advisor(s)
Brad Vader Zanden
Permanent URI
https://trace.tennessee.edu/handle/20.500.14382/31909
Abstract

As the choice of parallel platforms shifts from dedicated parallel machines to networks of workstations, the need for program fault-tolerance has never been greater. Checkpointing is the only means to provide programs with fault-tolerance in general-purpose computing environments. Checkpointing usually involves saving program states to disk. However, in parallel environments, stable storage becomes a bottleneck that prevents efficient checkpointing. Presented in this thesis are algorithms to provide parallel programs with fault-tolerance without relying on stable storage. An implementation of these algorithms was created and compared with the traditional disk-based algorithms. Results show that diskless checkpointing is a viable option to provide efficient fault-tolerance with low overhead.

Degree
Master of Science
Major
Computer Science
File(s)
Thumbnail Image
Name

Thesis97.P84.pdf

Size

5.83 MB

Format

Unknown

Checksum (MD5)

a16511a1db9e79eca201b567e290ffc3

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify