Evaluation of a general-purpose sentiment lexicon on a product review corpus

Christopher S.G. Khoo*, Sathik Basha Johnkhan, Jin Cheon Na

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

This paper introduces a new general-purpose sentiment lexicon called the WKWSCI Sentiment Lexicon and compares it with three existing lexicons. The WKWSCI Sentiment Lexicon is based on the 6of12dict lexicon, and currently covers adjectives, adverbs and verbs. The words were manually coded with a value on a 7-point sentiment strength scale. The effectiveness of the four sentiment lexicons for sentiment categorization at the document-level and sentence-level was evaluated using an Amazon product review dataset. The WKWSCI lexicon obtained the best results for document-level sentiment categorization, with an accuracy of 75%. The Hu & Liu lexicon obtained the best results for sentence-level sentiment categorization, with an accuracy of 77%. The best bag-of-words machine learning model obtained an accuracy of 82% for document-level sentiment categorization model. The strength of the lexiconbased method is in sentence-level and aspect-based sentiment analysis, where it is difficult to apply machine-learning because of the small number of features.

Original languageEnglish
Title of host publicationDigital Libraries
Subtitle of host publicationProviding Quality Information - 17th International Conference on Asia-Pacific Digital Libraries, ICADL 2015, Proceedings
EditorsRobert B. Allen, Jane Hunter, Marcia L. Zeng
PublisherSpringer Verlag
Pages82-93
Number of pages12
ISBN (Print)9783319279732
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event17th International Conference on Digital Libraries: Providing Quality Information, ICADL 2015 - Seoul, Korea, Republic of
Duration: Dec 9 2015Dec 12 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9469
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th International Conference on Digital Libraries: Providing Quality Information, ICADL 2015
Country/TerritoryKorea, Republic of
CitySeoul
Period12/9/1512/12/15

Bibliographical note

Publisher Copyright:
© Springer International Publishing Switzerland 2015.

ASJC Scopus Subject Areas

  • Theoretical Computer Science
  • General Computer Science

Keywords

  • Sentiment analysis
  • Sentiment categorization
  • Sentiment lexicon

Fingerprint

Dive into the research topics of 'Evaluation of a general-purpose sentiment lexicon on a product review corpus'. Together they form a unique fingerprint.

Cite this