Date of Award

8-2008

Degree Type

Dissertation

Degree Name

Doctor of Philosophy

Major

Computer Science

Major Professor

Michael A. Langston

Committee Members

Jian Huang, Robert C. Ward, Arnold M. Saxton

Abstract

Graph-based methods used in the analysis of DNA microarray technology can be powerful tools in the elucidation of biological relationships. As these methods are developed and applied to various types of data, challenges arise that test the limits of current algorithms. These challenges arise in all phases of data analysis: data normalization, modeling biological networks, and interpreting results.

Spectral graph theory methods are investigated as means of threshold selection, a key step in constructing graphical models of biological data. Also important in constructing graphs is the selection of an appropriate gene-gene similarity metric, and an overview of similarity pro les for some biological data sets is present, along with a similarity thresholding method based upon structural properties of random graphs.

The identi cation of altered relationships between two or more conditions is a goal of many microarray gene expression studies. Clique-based methods can identify sets of coexpressed genes within each group, but additional computational methods are required to uncover the di erential relationships and sets of genes changing together between groups. Di erential lters are reviewed to highlight those changing interactions and sets of changing genes. The e ect of various normalization methods on these di erential results is also studied.

Finally, how methods commonly used in the analysis of gene expression data can be used to investigate relationships in noisy and incomplete historical ecosystem data is explored.

Files over 3MB may be slow to open. For best results, right-click and select "save as..."

Share

COinS