Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Doctoral Dissertations
  5. Improving Scalability and Usability of Parallel Runtime Environments for High Availability and High Performance Systems
Details

Improving Scalability and Usability of Parallel Runtime Environments for High Availability and High Performance Systems

Date Issued
December 1, 2007
Author(s)
Angskun, Thara
Advisor(s)
Jack Dongarra
Additional Advisor(s)
George Bosilca
Hairong Qi
Bradley Vander Zanden
Link to full text
http://etd.utk.edu/2007/AngskunThara.pdf
Permanent URI
https://trace.tennessee.edu/handle/20.500.14382/19722
Abstract

The number of processors embedded in high performance computing platforms is growing daily to solve larger and more complex problems. Hence, parallel runtime environments have to support and adapt to the underlying platforms that require scalability and fault management in more and more dynamic environments. This dissertation aims to analyze, understand and improve the state of the art mechanisms for managing highly dynamic, large scale applications.


This dissertation demonstrates that the use of new scalable and fault-tolerant topologies, combined with rerouting techniques, builds parallel runtime environments, which are able to efficiently and reliably deliver sets of information to a large number of processes. Several important graph properties are provided to illustrate the theoretical capability of these topologies in terms of both scalability and fault-tolerance, such as reasonable degree, regular graph, low diameter, symmetric graph, low cost factor, low message traffic density, optimal connectivity, low fault-diameter and strongly resilient.

The dissertation builds a communication framework based on these topologies to support parallel runtime environments. Such a framework can handle multiple types of messages, e.g., unicast, multicast, broadcast and all-gather. Additionally, the communication framework has been formally verified to work in both normal and failure circumstances without creating any of the common problems such as broadcast storm, deadlock and non-progress cycle.

Disciplines
Computer Sciences
Degree
Doctor of Philosophy
Major
Computer Science
Embargo Date
December 1, 2011
File(s)
Thumbnail Image
Name

AngskunThara.pdf

Size

3.05 MB

Format

Adobe PDF

Checksum (MD5)

3b11894aeeb7ac027542f4b7401793ca

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify