Effectiveness of web search results for genre and sentiment classification

Jin Cheon Na; Tun Thura Thet

doi:10.1177/0165551509104233

Jin Cheon Na^*, Tun Thura Thet

^*Corresponding author for this work

Nanyang Technological University

Research output: Contribution to journal › Article › peer-review

12 Citations (Scopus)

Abstract

This study aims to enhance general topical search with sentiment-based search, in which the search results (snippets) returned by a web search engine are clustered by sentiment category. First, we developed an automatic method to identify product review documents from their snippets (summary information that includes the URL, title, and summary text); this is genre classification. The identified snippets were then automatically classified as positive (recommended) or negative (non-recommended); this is sentiment classification. The user can then choose directly whether to access the positive or the negative review documents. We used only the snippets rather than the original full-text documents, and applied a common machine learning technique, SVM (support vector machine), together with heuristic approaches to investigate how effectively snippets can be used for genre and sentiment classification. The results show that web search engines should improve the quality of their snippets, especially for opinionated documents (i.e. review documents).
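
The two-stage flow described in the abstract (genre classification, then sentiment classification, then grouping) might be sketched as follows. This is a minimal illustration only, not the authors' method: it swaps the SVM for a simple cue-word heuristic, and the `Snippet` class, the cue-word lists, and all function names are hypothetical, invented here for the example.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical cue lists, invented for illustration; they are not from the paper.
REVIEW_CUES = {"review", "reviews", "rating", "rated", "pros", "cons"}
POSITIVE_CUES = {"excellent", "great", "recommend", "recommended", "best"}
NEGATIVE_CUES = {"poor", "bad", "avoid", "disappointing", "worst"}


@dataclass
class Snippet:
    """A search result snippet: URL, title, and summary text."""
    url: str
    title: str
    summary: str

    def tokens(self) -> list[str]:
        # Lowercase all three fields and split on non-alphanumeric characters.
        text = f"{self.url} {self.title} {self.summary}".lower()
        cleaned = "".join(ch if ch.isalnum() else " " for ch in text)
        return cleaned.split()


def is_review(snippet: Snippet) -> bool:
    # Stage 1 (genre): treat the snippet as a product review if any cue word appears.
    return any(tok in REVIEW_CUES for tok in snippet.tokens())


def sentiment(snippet: Snippet) -> str:
    # Stage 2 (sentiment): compare counts of positive and negative cue words.
    # Ties (including no cue words at all) fall back to "positive" arbitrarily.
    toks = snippet.tokens()
    pos = sum(tok in POSITIVE_CUES for tok in toks)
    neg = sum(tok in NEGATIVE_CUES for tok in toks)
    return "positive" if pos >= neg else "negative"


def cluster_by_sentiment(snippets: list[Snippet]) -> dict[str, list[Snippet]]:
    # Keep only snippets identified as reviews, then group them by sentiment label.
    clusters: dict[str, list[Snippet]] = {"positive": [], "negative": []}
    for s in snippets:
        if is_review(s):
            clusters[sentiment(s)].append(s)
    return clusters
```

Calling `cluster_by_sentiment` on a list of snippets returns a dictionary with `"positive"` and `"negative"` keys, each holding the review snippets assigned to that group; snippets not identified as reviews are dropped. A trained classifier such as an SVM would replace `is_review` and `sentiment` without changing the grouping step.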

Original language	English
Pages (from-to)	709-726
Number of pages	18
Journal	Journal of Information Science
Volume	35
Issue number	6
DOIs	https://doi.org/10.1177/0165551509104233
Publication status	Published - Dec 2009
Externally published	Yes

ASJC Scopus Subject Areas

Information Systems
Library and Information Sciences

Keywords

Genre classification
Product review documents
Sentiment classification
Snippets
Web search results

Access to Document

10.1177/0165551509104233

Cite this

@article{9c472d47d2f649abb63ca00de3524ae0,

title = "Effectiveness of web search results for genre and sentiment classification",

abstract = "The motivation of this study is to enhance general topical search with a sentiment-based one where the search results (snippets) returned by the web search engine are clustered by sentiment categories. Firstly we developed an automatic method to identify product review documents using the snippets (summary information that includes the URL, title, and summary text), which is genre classification. Then the identified snippets were automatically classified into positive (recommended) and negative (non-recommended) documents, which is sentiment classification. Thereafter the user may directly decide to access the positive or negative review documents. In this study we used only the snippets rather than their original full-text documents, and applied a common machine learning technique, SVM (support vector machine), and heuristic approaches to investigate how effectively the snippets can be used for genre and sentiment classification. The results show that the web search engine should improve the quality of the snippets especially for opinionated documents (i.e. review documents).",

keywords = "Genre classification, Product review documents, Sentiment classification, Snippets, Web search results",

author = "Na, {Jin Cheon} and Thet, {Tun Thura}",

year = "2009",

month = dec,

doi = "10.1177/0165551509104233",

language = "English",

volume = "35",

pages = "709--726",

journal = "Journal of Information Science",

issn = "0165-5515",

publisher = "SAGE Publications Ltd",

number = "6",

}

TY - JOUR

T1 - Effectiveness of web search results for genre and sentiment classification

AU - Na, Jin Cheon

AU - Thet, Tun Thura

PY - 2009/12

Y1 - 2009/12

N2 - The motivation of this study is to enhance general topical search with a sentiment-based one where the search results (snippets) returned by the web search engine are clustered by sentiment categories. Firstly we developed an automatic method to identify product review documents using the snippets (summary information that includes the URL, title, and summary text), which is genre classification. Then the identified snippets were automatically classified into positive (recommended) and negative (non-recommended) documents, which is sentiment classification. Thereafter the user may directly decide to access the positive or negative review documents. In this study we used only the snippets rather than their original full-text documents, and applied a common machine learning technique, SVM (support vector machine), and heuristic approaches to investigate how effectively the snippets can be used for genre and sentiment classification. The results show that the web search engine should improve the quality of the snippets especially for opinionated documents (i.e. review documents).

KW - Genre classification

KW - Product review documents

KW - Sentiment classification

KW - Snippets

KW - Web search results

UR - http://www.scopus.com/inward/record.url?scp=70849127072&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70849127072&partnerID=8YFLogxK

U2 - 10.1177/0165551509104233

DO - 10.1177/0165551509104233

M3 - Article

AN - SCOPUS:70849127072

SN - 0165-5515

VL - 35

SP - 709

EP - 726

JO - Journal of Information Science

JF - Journal of Information Science

IS - 6

ER -
