Improving aspect extraction by augmenting a frequency-based method with web-based similarity measures |
| |
Authors: | Shi Li Lina Zhou Yijun Li |
| |
Institution: | 1. School of Information and Computer Engineering, Northeast Forestry University, Harbin 150040, PR China;2. Department of Information Systems, UMBC, Baltimore, MD 21250, United States;3. School of Management, Harbin Institute of Technology, Harbin 150001, PR China |
| |
Abstract: | Online review mining has been used to help manufacturers and service providers improve their products and services, and to provide valuable support for consumer decision making. Product aspect extraction is fundamental to online review mining. This research is aimed to improve the performance of aspect extraction from online consumer reviews. To this end, we augment a frequency-based extraction method with PMI-IR, which utilizes web search in measuring the semantic similarity between aspect candidates and target entities. In addition, we extend RCut, an algorithm originally developed for text classification, to learn the threshold for selecting candidate aspects. Experiment results with Chinese online reviews show that our proposed method not only outperforms the state of the art frequency-based method for aspect extraction but also generalizes across different product domains and various data sizes. |
| |
Keywords: | Aspect extraction Online reviews PMI-IR Frequent aspects |
本文献已被 ScienceDirect 等数据库收录! |
|