Filtering product reviews from web search results

Tun Thura Thet*, Jin Cheon Na, Christopher S.G. Khoo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

This study seeks to develop an automatic method to identify product reviews on the Web using the snippets (summary information) returned by search engines. Determining whether a snippet is a review or non-review is a challenging task, since the snippet usually does not contain many useful features for identifying review documents. Firstly we applied a common machine learning technique, SVM (Support Vector Machine), to investigate which features of snippets are useful for the classification. Then we employed a heuristic approach utilizing domain knowledge and found that the heuristic approach performs equally well as the machine learning approach. A hybrid approach which combines the machine learning technique and domain knowledge performs slightly better than the machine learning approach alone.

Original languageEnglish
Title of host publicationDocEng'07
Subtitle of host publicationProceedings of the 2007 ACM Symposium on Document Engineering
Pages196-198
Number of pages3
DOIs
Publication statusPublished - 2007
Externally publishedYes
EventDocEng'07: 2007 ACM Symposium on Document Engineering - Winnipeg, MB, Canada
Duration: Aug 28 2007Aug 31 2007

Publication series

NameDocEng'07: Proceedings of the 2007 ACM Symposium on Document Engineering

Conference

ConferenceDocEng'07: 2007 ACM Symposium on Document Engineering
Country/TerritoryCanada
CityWinnipeg, MB
Period8/28/078/31/07

ASJC Scopus Subject Areas

  • Information Systems
  • Software

Keywords

  • Genre classification
  • Product review documents
  • Snippets

Fingerprint

Dive into the research topics of 'Filtering product reviews from web search results'. Together they form a unique fingerprint.

Cite this