Masters Theses

Date of Award

5-2009

Degree Type

Thesis

Degree Name

Master of Science

Major

Computer Science

Major Professor

Michael A. Langston

Abstract

During our collaborations with scientists interested in high-throughput analysis of biological data, we have made much progress and facilitated some interesting findings using our clique-finding tools. However, we have also uncovered ways in which we can make our tools more efficient but have yet to write the programs to perform these tasks. Part of the problem is time constraints: in order to be useful to us, an application must be quite flexible and run efficiently. Programming such a tool is no small task, so we have resorted to scripting solutions that are geared to the specific task at hand. The first aim of this work is to produce a tool that is usable by both us and our collaborators to meet our common data processing needs. Also during our collaborations, we have been tasked with finding new ways to help find potentially interesting data among a large amount of information that would be prohibitive to analyze by hand. One of our current tools is one that can take a graph and return all the maximal cliques, which, using real data, can be done in a reasonable amount of time. However, the list of maximal cliques itself is usually long and impractical to analyze by hand. Thus, we have needed to come up with new ways to sleuth out those genes and cliques that may be of most interest from a list of millions of cliques. The second aim of this work is to describe new methods that we have been using to achieve this.

Files over 3MB may be slow to open. For best results, right-click and select "save as..."

Share

COinS