Abstract
A central challenge in building sentiment classifiers using machine learning approach is the generation of discriminative features that allow sentiment to be implied. Researchers have made significant progress with various features such as n-grams, sentiment shifters, and lexicon features. However, the potential of semantics-based features in sentiment classification has not been fully explored. By integrating PropBank-based semantic parsing and class association rule (CAR) mining, this study aims to mine patterns of semantic labels from domain corpus for sentence-level sentiment analysis of product reviews. With the features generated from the semantic patterns, the F-score of the sentiment classifier was boosted to 82.31% at minimum confidence level of 0.75, which not only indicated a statistically significant improvement over the baseline classifier with unigram and negation features (F-score = 73.93%) but also surpassed the best performance obtained with other classifiers trained on generic lexicon features (F-score = 76.25%) and domain-specific lexicon features (F-score = 78.91%).
Original language | English |
---|---|
Title of host publication | Research and Advanced Technology for Digital Libraries - 21st International Conference on Theory and Practice of Digital Libraries, TPDL 2017, Proceedings |
Editors | Yannis Manolopoulos, Jaap Kamps, Giannis Tsakonas, Lazaros Iliadis, Ioannis Karydis |
Publisher | Springer Verlag |
Pages | 382-393 |
Number of pages | 12 |
ISBN (Print) | 9783319670072 |
DOIs | |
Publication status | Published - 2017 |
Externally published | Yes |
Event | 21st International Conference on Theory and Practice of Digital Libraries, TPDL 2017 - Thessaloniki, Greece Duration: Sept 18 2017 → Sept 21 2017 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 10450 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 21st International Conference on Theory and Practice of Digital Libraries, TPDL 2017 |
---|---|
Country/Territory | Greece |
City | Thessaloniki |
Period | 9/18/17 → 9/21/17 |
Bibliographical note
Publisher Copyright:© Springer International Publishing AG 2017.
ASJC Scopus Subject Areas
- Theoretical Computer Science
- General Computer Science
Keywords
- Machine learning
- Pattern mining
- Semantic parsing
- Sentiment analysis
- Sentiment classification