
Information retrieval and filtering using the Riemannian SVD

Date Issued
August 1, 1998
Author(s)
Jiang, Eric Peiqing
Advisor(s)
Michael W. Berry
Additional Advisor(s)
Charles Collines, Jack Dongarra, Padma Raghavan
Abstract

Latent Semantic Indexing (LSI) is an SVD-based conceptual retrieval technique that employs a rank-reduced model of the original (sparse) term-by-document matrix. This approach has achieved significant performance improvements over traditional lexical searching methods. Current LSI implementations, however, lack the ability to overcome polysemy problems (multiple meanings for a word or words). The Riemannian SVD (R-SVD) is a recent nonlinear generalization of the SVD that has been used for applications in systems and control. Updating LSI models based on user feedback can be accomplished using constraints modeled by the R-SVD of a low-rank approximation to the original term-by-document matrix. This dissertation presents the formulation, implementation, and performance analysis of a new LSI model (RSVD-LSI) equipped with an effective information filtering mechanism. Two iterative algorithms for computing the related R-SVD are proposed. Experiments have shown that the RSVD-LSI model provides an efficient and robust information retrieval/filtering technique and demonstrates a new approach to updating LSI and similar vector-space models to circumvent polysemy problems and improve retrieval performance. The nonlinear filtering mechanism in RSVD-LSI may also have potential applications in designing certain control and security systems for information retrieval from large collections.
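To make the rank-reduced model mentioned in the abstract concrete, the following is a minimal sketch of conventional LSI retrieval with a truncated SVD, written in Python with NumPy. It is not the dissertation's RSVD-LSI model and does not compute the R-SVD; it only illustrates the baseline that such a model would update. The toy vocabulary, the matrix entries, and the helper names `lsi_fit` and `lsi_query` are assumptions invented for illustration.

```python
import numpy as np

# Toy term-by-document matrix (rows = terms, columns = documents).
# Vocabulary and counts are made up; "bank" is deliberately polysemous.
terms = ["bank", "money", "loan", "river", "water"]
A = np.array([
    [1, 1, 0, 1],  # bank
    [1, 1, 0, 0],  # money
    [1, 0, 0, 0],  # loan
    [0, 0, 1, 1],  # river
    [0, 0, 1, 1],  # water
], dtype=float)


def lsi_fit(A, k):
    """Return the rank-k factors (U_k, s_k, V_k^T) of the truncated SVD of A."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


def lsi_query(q, Uk, sk, Vtk):
    """Cosine similarity between a term-space query and each document,
    measured in the k-dimensional latent space."""
    # Fold the query into the latent space: q_hat = S_k^{-1} U_k^T q
    q_hat = (Uk.T @ q) / sk
    docs = Vtk.T  # row j holds the latent coordinates of document j
    num = docs @ q_hat
    den = np.linalg.norm(docs, axis=1) * np.linalg.norm(q_hat)
    return num / den


# Build a rank-2 model and score a two-term query ("money loan").
Uk, sk, Vtk = lsi_fit(A, k=2)
q = np.array([0, 1, 1, 0, 0], dtype=float)
scores = lsi_query(q, Uk, sk, Vtk)

for j, score in enumerate(scores):
    print(f"document {j}: cosine similarity {score:.3f}")
```

In this toy collection the last document uses "bank" alongside "river" and "water", which is the kind of polysemy the abstract says plain LSI handles poorly; the sketch shows only the unmodified baseline and makes no attempt to correct for it.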

Degree
Doctor of Philosophy
Major
Computer Science
File(s)
Name

Thesis98b.J53.pdf

Size

2.52 MB

Format

Unknown

Checksum (MD5)

33187ea9074e0cea24d13a1eaab7175d

Libraries at University of Tennessee, Knoxville