Effectiveness of web search results for genre and sentiment classification

Jin Cheon Na*, Tun Thura Thet

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

12 Citations (Scopus)

Abstract

The motivation of this study is to enhance general topical search with a sentiment-based one where the search results (snippets) returned by the web search engine are clustered by sentiment categories. Firstly we developed an automatic method to identify product review documents using the snippets (summary information that includes the URL, title, and summary text), which is genre classification. Then the identified snippets were automatically classified into positive (recommended) and negative (non-recommended) documents, which is sentiment classification. Thereafter the user may directly decide to access the positive or negative review documents. In this study we used only the snippets rather than their original full-text documents, and applied a common machine learning technique, SVM (support vector machine), and heuristic approaches to investigate how effectively the snippets can be used for genre and sentiment classification. The results show that the web search engine should improve the quality of the snippets especially for opinionated documents (i.e. review documents).

Original languageEnglish
Pages (from-to)709-726
Number of pages18
JournalJournal of Information Science
Volume35
Issue number6
DOIs
Publication statusPublished - Dec 2009
Externally publishedYes

ASJC Scopus Subject Areas

  • Information Systems
  • Library and Information Sciences

Keywords

  • Genre classification
  • Product review documents
  • Sentiment classification
  • Snippets
  • Web search results

Fingerprint

Dive into the research topics of 'Effectiveness of web search results for genre and sentiment classification'. Together they form a unique fingerprint.

Cite this