A hybrid approach to fuzzy name search incorporating language-based and text-based principles

Paul Wu Horng-Jyh*, Na Jin-Cheon, Christopher Khoo Soo-Guan

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

Name Search is an important search function in various types of information retrieval systems, such as online library catalogs and electronic yellow pages. It is also difficult, because of the high degree of fuzziness required in matching name variants. Previous approaches to name search systems use ad hoc combinations of search heuristics. This paper first discusses two approaches to name modeling - the natural language processing (NLP) and information retrieval (IR) models - and proposes a hybrid approach. The approach demonstrates a critical combination of complementary NLP and IR features that produces more effective fuzzy name matching. Two principles, position-as-attribute and position-transition-likelihood, are introduced as the principles for integrating the advantageous aspects of both approaches. They have been implemented in an NLP- and IR-hybrid model system called Friendly Name Search (FNS) for real world applications in multilingual directory searches on the Singapore Yellow pages website.

Original languageEnglish
Pages (from-to)3-19
Number of pages17
JournalJournal of Information Science
Volume33
Issue number1
DOIs
Publication statusPublished - Feb 2007
Externally publishedYes

ASJC Scopus Subject Areas

  • Information Systems
  • Library and Information Sciences

Keywords

  • Fuzzy name search
  • Hybrid system
  • Information retrieval
  • Language and text
  • Natural language processing

Fingerprint

Dive into the research topics of 'A hybrid approach to fuzzy name search incorporating language-based and text-based principles'. Together they form a unique fingerprint.

Cite this