Repository logo
Log In(current)
  1. Home
  2. Colleges & Schools
  3. Graduate School
  4. Masters Theses
  5. Level search schemes for scalable information retrieval
Details

Level search schemes for scalable information retrieval

Date Issued
December 1, 1999
Author(s)
Zhang, Xiaoyan
Advisor(s)
Michael W. Berry
Additional Advisor(s)
Padma Raghavan
Peiling Wang
Permanent URI
https://trace.tennessee.edu/handle/20.500.14382/31251
Abstract

Latent Semantic Indexing (LSI) has been demonstrated to outperform lexical matching in information retrieval. However, the enormous cost associated with the Singular Value Decomposition (SVD) of the large term-by-document matrix becomes a barrier for its application to scalable information retrieval. This thesis shows that information filtering using level search techniques can reduce the SVD computation cost for LSI. For each query, level search extracts a much smaller subset of the original term-by-document matrix with an average of 25% of the original non-zero entries. When LSI is applied to such subsets, the average precision only degrades by 5% due to level search filtering; however, for some document collections an increase in precision has been observed. Level search techniques are enhanced by a pruning scheme that deletes terms connected to only one document from the query-specific submatrix. An average 65% reduction in the number of non-zeros has been observed with a precision loss of 5% for most collections.

Degree
Master of Science
Major
Computer Science
File(s)
Thumbnail Image
Name

Thesis99Z4.pdf

Size

1.13 MB

Format

Unknown

Checksum (MD5)

31209cfb1b09ef9e0ab3b6afe43ffe8d

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback
  • Contact
  • Libraries at University of Tennessee, Knoxville
Repository logo COAR Notify