Proceedings Article10.1109/SMC.2017.8122782
Adaptive two-stage feature selection for sentiment classification
Xu Chi,Tan Puay Siew,Erik Cambria +2 more
- 01 Oct 2017
- pp 1238-1243
19
TL;DR: This paper proposes an adaptive two-stage feature selection approach, which generates base feature scores from a training dataset and then weights them based on individual test sample so that the feature importance evaluation is adapted to the characteristic of test data as well.
read more
Abstract: Sentiment analysis is able to automatically extract valuable customer information from large amount of unstructured text data to support decision making in manufacturing applications such as product design and demand planning. One of the key issues of sentiment analysis is the high dimensionality of data, which can be effectively solved by feature selection. Existing feature selection techniques compute feature scores solely based on training data statistics or by modifying a specific feature metric formula to include test data information which can not be generalized to other types of feature metrics. In this paper, we propose an adaptive two-stage feature selection approach, which generates base feature scores from a training dataset and then weights them based on individual test sample so that the feature importance evaluation is adapted to the characteristic of test data as well. The proposed method is applicable to arbitrary type of feature metrics and sentiment classifiers. The experiments show that our approach can consistently outperform other methods, especially for the setting of small number of selected features.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Multimodal sentiment analysis using hierarchical fusion with context modeling
TL;DR: This article proposed a hierarchical feature fusion strategy that fuses the modalities two in two and only then fuses all three modalities in a hierarchical fashion to improve the multimodal fusion mechanism.
319
Learning multi-grained aspect target sequence for Chinese sentiment analysis
TL;DR: This paper formalizes the problem of aspect-level sentiment analysis from a different perspective, i.e., that sentiment at aspect target level should be the main focus and proposes to explicitly model the aspect target and conduct sentiment classification directly at the aspect targets level via three granularities.
172
Managing Marketing Decision-Making with Sentiment Analysis: An Evaluation of the Main Product Features Using Text Data Mining
TL;DR: This work is able to improve recommender systems by using positive, neutral, and negative customer opinions and by classifying customers based on their comments, and proves the validity of the approach in a case study using big data extracted from Amazon online reviews, obtaining satisfactory and promising results.
Explicit aspects extraction in sentiment analysis using optimal rules combination
TL;DR: The Whale Optimization Algorithm was improved to address rules selection problem with an improved algorithm called improved WOA (IWOA), and a new pruning algorithm has been developed to remove incorrect aspects and retain correct aspects.
58
Efficient feature selection techniques for sentiment analysis
Avinash Madasu,Sivasankar Elango +1 more
TL;DR: In this paper, the performance of different feature selection techniques for sentiment analysis was evaluated using different machine learning classifiers such as Logistic Regression (LR), Support Vector Machines (SVM), Decision Tree (DT), Naïve Bayes (NB), and Bagging and Random Subspace (BS) techniques.
52
References
An algorithm for suffix stripping
TL;DR: An algorithm for suffix stripping is described, which has been implemented as a short, fast program in BCPL, and performs slightly better than a much more elaborate system with which it has been compared.
9.1K
Thumbs up? Sentiment Classification using Machine Learning Techniques
Bo Pang,Lillian Lee,Shivakumar Vaithyanathan +2 more
- 06 Jul 2002
TL;DR: This work considers the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, and concludes by examining factors that make the sentiment classification problem more challenging.
Thumbs up? Sentiment Classiflcation using Machine Learning Techniques
Bo Pang,Lillian Lee,Shivakumar Vaithyanathan +2 more
- 01 Jan 2002
TL;DR: In this paper, the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, was considered and three machine learning methods (Naive Bayes, maximum entropy classiflcation, and support vector machines) were employed.
•Proceedings Article
A Comparative Study on Feature Selection in Text Categorization
Yiming Yang,Jan O. Pedersen +1 more
- 08 Jul 1997
TL;DR: This paper finds strong correlations between the DF IG and CHI values of a term and suggests that DF thresholding the simplest method with the lowest cost in computation can be reliably used instead of IG or CHI when the computation of these measures are too expensive.
5.6K
A systematic analysis of performance measures for classification tasks
Marina Sokolova,Guy Lapalme +1 more
TL;DR: This paper presents a systematic analysis of twenty four performance measures used in the complete spectrum of Machine Learning classification tasks, i.e., binary, multi-class,multi-labelled, and hierarchical, to produce a measure invariance taxonomy with respect to all relevant label distribution changes in a classification problem.