Applications of Approximate Word Matching in Information Retrieval

Report
Authors: French, James, Department of Computer Science, University of Virginia; Powell, Allison, Department of Computer Science, University of Virginia; Schulman, Eric, Department of Computer Science, University of Virginia
Abstract:

As more online databases are integrated into digital libraries, the issue of quality control of the data becomes increasingly important, especially as it relates to the effective retrieval of information. The need to discover and reconcile variant forms of strings in bibliographic entries, i.e., authority work, will become more difficult. Spelling variants, misspellings, and transliteration differences will all increase the difficulty of retrieving information. Approximate string matching has traditionally been used to help with this problem. In this paper we introduce the notion of approximate word matching and show how it can be used to improve detection and categorization of variant forms.

Rights:
All rights reserved (no additional license for public reuse)
Language:
English
Source Citation:

French, James, Allison Powell, and Eric Schulman. "Applications of Approximate Word Matching in Information Retrieval." University of Virginia Dept. of Computer Science Tech Report (1997).

Publisher:
University of Virginia, Department of Computer Science
Published Date:
1997