Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Doctoral Dissertations
  5. Protein Threading for Genome-Scale Structural Analysis
Details

Protein Threading for Genome-Scale Structural Analysis

Date Issued
December 1, 2007
Author(s)
Ellrott, Kyle P
Advisor(s)
Ying Xu
Additional Advisor(s)
Robert L. Hettich
Victor Olman
Hong Guo
Elizabeth Howell
Link to full text
http://etd.utk.edu/2007/EllrottKyle.pdf
Permanent URI
https://trace.tennessee.edu/handle/20.500.14382/22547
Abstract

Protein structure prediction is a necessary tool in the field of bioinformatic analysis. It is a non-trivial process that can add a great deal of information to a genome annotation. This dissertation deals with protein structure prediction through the technique of protein fold recognition and outlines several strategies for the improvement of protein threading techniques. In order to improve protein threading performance, this dissertation begins with an outline of sequence/structure alignment energy functions. A technique called Violated Inequality Minimization is used to quickly adapt to the changing energy landscape as new energy functions are added. To continue the improvement of alignment accuracy and fold recognition, new formulations of energy functions are used for the creation of the sequence/structure alignment. These energies include a formulation of a gap penalty which is dependent on sequence characteristics different from the traditional constant penalty. Another proposed energy is dependent on conserved structural patterns found during threading. These structural patterns have been employed to refine the sequence/structure alignment in my research. The section on Linear Programming Algorithm for protein structure alignment deals with the optimization of an alignment using additional residue-pair energy functions. In the original version of the model, all cores had to be aligned to the target sequence. Our research outlines an expansion of the original threading model which allows for a more flexible alignment by allowing core deletions. Aside from improvements in fold recognition and alignment accuracy, there is also a need to ensure that these techniques can scale for the computational demands of genome level structure prediction. A heuristic decision making processes has been designed to automate the classification and preparation of proteins for prediction. A graph analysis has been applied to the integration of different tools involved in the pipeline. Analysis of the data dependency graph allows for automatic parallelization of genome structure prediction. These different contributions help to improve the overall performance of protein threading and help distribute computations across a large set of computers to help make genome scale protein structure prediction practically feasible.

Disciplines
Biochemistry, Biophysics, and Structural Biology
Life Sciences
Degree
Doctor of Philosophy
Major
Life Sciences
Embargo Date
December 1, 2011
File(s)
Thumbnail Image
Name

EllrottKyle.pdf

Size

1.08 MB

Format

Adobe PDF

Checksum (MD5)

661469be2276db510ee93fd67b8c155b

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify