
Cross-language information retrieval using latent semantic indexing

Date Issued
December 1, 1994
Author(s)
Young, Paul Geoffrey
Advisor(s)
Michael W. Berry
Additional Advisor(s)
Brad Vander Zanden
David Straight
Permanent URI
https://trace.tennessee.edu/handle/20.500.14382/33081
Abstract

In this thesis, a method for indexing cross-language databases for conceptual query-matching is presented. Two languages (Greek and English) are combined by appending a small portion of documents from one language to the identical documents in the other language. The proposed merging strategy duplicates less than 7% of the entire database (made up of different translations of the Gospels). Previous strategies duplicated up to 34% of the initial database in order to perform the merger. The proposed method retrieves a larger number of relevant documents for both languages with higher cosine rankings when Latent Semantic Indexing (LSI) is employed.


Using the proposed merge strategies, LSI is shown to be effective in retrieving documents from either language (Greek or English) without requiring any translation of a user's query. An effective Bible search product needs to allow natural-language queries. LSI enables users to form queries using natural expressions in their own native language. The merging strategy proposed in this study enables LSI to retrieve relevant documents effectively while duplicating only a small fraction of the database.
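
The general idea described in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation or data: the six-document toy corpus, the function names (build_term_doc, lsi_fit, fold_in, rank), the choice of k = 2, and the use of NumPy are all assumptions made for illustration. Two of the six documents are "merged" (Greek appended to English); the rest are monolingual. A truncated SVD of the term-document matrix gives the LSI space, an untranslated English query is folded into that space, and documents are ranked by cosine similarity.

```python
import numpy as np

# Hypothetical toy corpus (not the thesis data). Documents 0 and 1 are
# "merged": the Greek translation is appended to the English text.
docs = [
    "light φως",    # 0: merged dual-language document
    "bread άρτος",  # 1: merged dual-language document
    "light",        # 2: English only
    "φως",          # 3: Greek only
    "bread",        # 4: English only
    "άρτος",        # 5: Greek only
]

def build_term_doc(docs):
    """Return a raw-count term-by-document matrix and the term index."""
    vocab = sorted({t for d in docs for t in d.split()})
    index = {t: i for i, t in enumerate(vocab)}
    A = np.zeros((len(vocab), len(docs)))
    for j, d in enumerate(docs):
        for t in d.split():
            A[index[t], j] += 1.0
    return A, index

def lsi_fit(A, k):
    """Truncated SVD: keep the k largest singular triplets."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :].T  # rows of the last item are documents

def fold_in(query, index, U_k, s_k):
    """Project a query into the k-dimensional space: q^T U_k S_k^{-1}."""
    q = np.zeros(U_k.shape[0])
    for t in query.split():
        if t in index:  # terms outside the vocabulary are ignored
            q[index[t]] += 1.0
    return (q @ U_k) / s_k

def rank(query_vec, doc_vecs):
    """Return (document id, cosine) pairs sorted by decreasing cosine."""
    norms = np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(query_vec)
    scores = doc_vecs @ query_vec / norms
    order = np.argsort(-scores)
    return [(int(j), float(scores[j])) for j in order]

A, index = build_term_doc(docs)
U_k, s_k, D_k = lsi_fit(A, k=2)

# English query, no translation step.
ranking = rank(fold_in("light", index, U_k, s_k), D_k)
scores = dict(ranking)
for doc_id, cosine in ranking:
    print(doc_id, docs[doc_id], round(cosine, 3))
```

In this toy setup the Greek-only document "φως" ranks alongside the English "light" documents, because the merged document places both terms on the same latent dimension. The sketch assumes the query contains at least one known term; an all-unknown query would produce a zero vector and an undefined cosine.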

Degree
Master of Science
Major
Computer Science
File(s)
Name

Thesis94.Y68.pdf

Size

4.08 MB

Format

Unknown

Checksum (MD5)

fb8c69973e82530fe89d2d347df84fc0
