Masters Theses

Date of Award

8-2003

Degree Type

Thesis

Degree Name

Master of Science

Major

Life Sciences

Major Professor

Dong Xu

Committee Members

Jeff Becker, Loren Hauser

Abstract

We have developed an integrated probabilistic prediction method, which combines the information from protein-protein interactions, protein complexes, microarray gene-expression profiles and functional annotations for known proteins. Our approach differs from the other approaches to use high-throughput data in a variety of ways. First, we utilize the GO biological process functional annotation in comparison to the MIPS classification followed by others. Second, we incorporate information from multiple sources of high-throughput data, including genetic interactions, to develop a better model for function prediction. By incorporating information from the multiple sources of high-throughput data, we identify the parameters important for protein function prediction. Third, we estimate the probability for the proteins to have a function of interest by designing a new statistical method for function prediction. Fourth, our approach assigns multiple functions to the hypothetical proteins and allows confidence assessment, based on the supportive evidences from the high-throughput data. Our work demonstrates the power of integrating multiple sources of high-throughput data with biological functional annotations, in the function prediction for unknown proteins. In addition to this, we have also developed a Web server for function prediction in yeast as well as other organisms. We have applied our method to the Saccharomyces cerevisiae proteome and are able to assign function to 1548 out of the 2472 unannotated proteins in yeast with our approach.

Files over 3MB may be slow to open. For best results, right-click and select "save as..."

Included in

Life Sciences Commons

Share

COinS