Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Masters Theses
  5. Comparative Analysis of Thresholding Algorithms for Microarray-derived Gene Correlation Matrices
Details

Comparative Analysis of Thresholding Algorithms for Microarray-derived Gene Correlation Matrices

Date Issued
August 1, 2008
Author(s)
Borate, Bhavesh Ram
Advisor(s)
Michael Langston
Additional Advisor(s)
Arnold Saxton, Elissa Chesler, Brynn Voy
Link to full text
http://etd.utk.edu/2008/December2008MastersTheses/BorateBhaveshRam.pdf
Abstract

The thresholding problem is important in today’s data-rich research scenario. A threshold is a well-defined point in the data distribution beyond which the data is highly likely to have scientific meaning. The selection of threshold is crucial since it heavily influences any downstream analysis and inferences made there from. A legitimate threshold is one that is not arbitrary but scientifically well grounded, data-dependent and best segregates the information-rich and noisy sections of data. Although the thresholding problem is not restricted to any particular field of study, little research has been done. This study investigates the problem in context of network-based analysis of transcriptomic data. Six conceptually diverse algorithms – based on number of maximal cliques, correlations of control spots with genes, top 1% of correlations, spectral graph clustering, Bonferroni correction of p-values and statistical power – are used to threshold the gene correlation matrices of three time-series microarray datasets and tested for stability and validity. Stability or reliability of the first four algorithms towards thresholding is tested upon block bootstrapping of arrays in the datasets and comparing the estimated thresholds against the bootstrap threshold distributions. Validity of thresholding algorithms is tested by comparison of the estimated thresholds against threshold based on biological information. Thresholds based on the modular basis of gene networks are concluded to perform better both in terms of stability as well as validity. Future challenges to research the problem have been identified. Although the study utilizes transcriptomic data for analysis, we assert its applicability to thresholding across various fields.

Disciplines
Life Sciences
Degree
Master of Science
Major
Life Sciences
Embargo Date
December 1, 2011
File(s)
Thumbnail Image
Name

BorateBhaveshRam.pdf

Size

475.08 KB

Format

Adobe PDF

Checksum (MD5)

effb90d4eeca7b452b9aa0c0854f8bd3

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify