A New Feature Selection Method for Text Categorization of Customer Reviews |
| |
Authors: | Miao Liu Jie Song |
| |
Institution: | 1. School of Statistics and Mathematics, Central University of Finance and Economics, Beijing, China;2. School of Statistics, Capital University of Economics and Business, Beijing, China |
| |
Abstract: | With the rapid development of e-commerce, online consumer review plays an increasingly important role in consumers’ purchase decisions. Most research papers use the quantitative measures of consumer reviews for statistical analysis. Here we focus on analyzing the texts of customer reviews with text mining tools. We propose a new feature selection method called maximizing the difference. Various classification methods such as boosting, random forest and SVM are used to test the performance of the new method along with different evaluation criteria. Both simulation and empirical results show that it improves the effectiveness of the classifier over the existing methods. |
| |
Keywords: | Customer reviews Feature selection Maximizing the difference Term document matrix |
|
|