Combine sentiment lexicon and dependency parsing for sentiment classification

Author(s): Changqin Quan, Xiquan Wei, Fuji Ren


2018, Vol 6, pp. 269-285
Author(s): Andrius Mudinas, Dell Zhang, Mark Levene

There is often the need to perform sentiment classification in a particular domain where no labeled document is available. Although we could make use of a general-purpose off-the-shelf sentiment classifier or a pre-built one for a different domain, the effectiveness would be inferior. In this paper, we explore the possibility of building domain-specific sentiment classifiers with unlabeled documents only. Our investigation indicates that in the word embeddings learned from the unlabeled corpus of a given domain, the distributed word representations (vectors) for opposite sentiments form distinct clusters, though those clusters are not transferable across domains. Exploiting such a clustering structure, we are able to utilize machine learning algorithms to induce a quality domain-specific sentiment lexicon from just a few typical sentiment words (“seeds”). An important finding is that simple linear-model-based supervised learning algorithms (such as linear SVM) can actually work better than more sophisticated semi-supervised/transductive learning algorithms which represent the state-of-the-art technique for sentiment lexicon induction. The induced lexicon could be applied directly in a lexicon-based method for sentiment classification, but a higher performance could be achieved through a two-phase bootstrapping method which uses the induced lexicon to assign positive/negative sentiment scores to unlabeled documents first, and then uses those documents found to have clear sentiment signals as pseudo-labeled examples to train a document sentiment classifier via supervised learning algorithms (such as LSTM). On several benchmark datasets for document sentiment classification, our end-to-end pipelined approach which is overall unsupervised (except for a tiny set of seed words) outperforms existing unsupervised approaches and achieves an accuracy comparable to that of fully supervised approaches.
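As a rough illustration of the seed-based induction idea described above, the sketch below trains a linear SVM on the embedding vectors of a handful of positive and negative seed words and scores the rest of the vocabulary by its signed distance from the separating hyperplane. The function name, seed lists, and the use of scikit-learn with a gensim-style vector lookup are assumptions for the example, not the authors' exact implementation.

```python
# Hypothetical sketch of seed-based sentiment lexicon induction from
# domain word embeddings; word_vectors can be any mapping from word to
# vector (e.g. a dict or gensim KeyedVectors). Illustrative only.
import numpy as np
from sklearn.svm import LinearSVC

def induce_lexicon(word_vectors, pos_seeds, neg_seeds, candidates):
    """Train a linear classifier on seed words and score candidate words."""
    X, y = [], []
    for w in pos_seeds:
        if w in word_vectors:
            X.append(word_vectors[w]); y.append(1)
    for w in neg_seeds:
        if w in word_vectors:
            X.append(word_vectors[w]); y.append(-1)
    clf = LinearSVC().fit(np.array(X), np.array(y))
    lexicon = {}
    for w in candidates:
        if w in word_vectors:
            # Signed distance to the separating hyperplane serves as a
            # graded sentiment score for the induced lexicon entry.
            lexicon[w] = float(clf.decision_function([word_vectors[w]])[0])
    return lexicon
```

Words with strongly positive or negative scores would then make up the domain lexicon that the bootstrapping phase uses to pseudo-label documents.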


Author(s):  
Jalel Akaichi

In this work, we focus on applying text mining and sentiment analysis techniques to Tunisian users' status updates on Facebook. We aim to extract useful information about their sentiment and behavior, especially during the “Arab Spring” era. To achieve this, we describe a method for sentiment analysis using Support Vector Machine and Naïve Bayes algorithms, applying a combination of more than two features. The output of this work consists, on the one hand, of the construction of a sentiment lexicon based on the emoticon and acronym lexicons that we developed from the extracted status updates; and, on the other hand, of detailed comparative experiments between the above algorithms, carried out by creating a training model for sentiment classification.
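For context, a minimal sketch of this kind of classifier comparison is shown below: bag-of-words features with a tokenizer permissive enough to keep emoticons, fed to both Naïve Bayes and a linear SVM. The toy status updates, labels, and scikit-learn components are illustrative assumptions, not the author's Facebook corpus or feature set.

```python
# Minimal sketch of an SVM vs. Naive Bayes sentiment comparison with
# emoticon-preserving tokenization; data and features are placeholders.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical labelled status updates (1 = positive, 0 = negative).
texts = ["great day :)", "so sad :(", "love this :D", "awful news :("]
labels = [1, 0, 1, 0]

for name, clf in [("Naive Bayes", MultinomialNB()), ("SVM", LinearSVC())]:
    # token_pattern kept permissive so emoticons such as ":)" survive tokenization.
    model = make_pipeline(CountVectorizer(token_pattern=r"[^\s]+"), clf)
    model.fit(texts, labels)
    print(name, model.predict(["what a wonderful :)"]))
```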


2019, Vol 9 (18), pp. 3717
Author(s): Wenkuan Li, Dongyuan Li, Hongxia Yin, Lindong Zhang, Zhenfang Zhu, ...

Text representation learning is an important but challenging issue for various natural language processing tasks. Recently, deep learning-based representation models have achieved great success for sentiment classification. However, these existing models focus more on semantic information than on sentiment linguistic knowledge, which provides rich sentiment information and plays a key role in sentiment analysis. In this paper, we propose a lexicon-enhanced attention network (LAN) based on text representation to improve the performance of sentiment classification. Specifically, we first propose a lexicon-enhanced attention mechanism that combines the sentiment lexicon with an attention mechanism to incorporate sentiment linguistic knowledge into deep learning methods. Second, we introduce a multi-head attention mechanism in the deep neural network to interactively capture contextual information from different representation subspaces at different positions. Furthermore, we stack the LAN model to build a hierarchical sentiment classification model for large-scale text. Extensive experiments are conducted to evaluate the effectiveness of the proposed models on four popular real-world sentiment classification datasets at both the sentence level and the document level. The experimental results demonstrate that our proposed models can achieve comparable or better performance than the state-of-the-art methods.
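To make the core idea concrete, here is a small numpy sketch of lexicon-biased attention: each token's attention score is the usual scaled dot product with a query vector plus a term derived from its sentiment lexicon value, so sentiment-bearing tokens receive more weight in the pooled representation. The weighting scheme and parameter names are assumptions for illustration and do not reproduce the LAN architecture.

```python
# Illustrative lexicon-biased attention pooling over token embeddings.
import numpy as np

def lexicon_attention(token_vecs, query, lexicon_scores, alpha=1.0):
    """token_vecs: (n, d) token embeddings, query: (d,), lexicon_scores: (n,)."""
    # Standard scaled dot-product content score per token.
    content = token_vecs @ query / np.sqrt(token_vecs.shape[1])
    # Bias the scores with the magnitude of each token's lexicon value,
    # so strongly positive or negative words attract more attention.
    scores = content + alpha * np.abs(lexicon_scores)
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    # Weighted average of token vectors = sentence representation.
    return weights @ token_vecs

# Tiny usage example with random embeddings and a toy lexicon.
rng = np.random.default_rng(0)
vecs = rng.normal(size=(5, 8))
rep = lexicon_attention(vecs, rng.normal(size=8),
                        np.array([0.0, 0.9, 0.0, -0.8, 0.1]))
print(rep.shape)  # (8,)
```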


Author(s):  
Thomas Bos ◽  
Flavius Frasincar

Financial investors make trades based on available information. Previous research has shown that microblogs are a useful source for supporting stock market decisions. However, the financial domain lacks specific sentiment lexicons that could be utilized to extract the sentiment from these microblogs. In this research, we investigate automatic approaches that can be used to build financial sentiment lexicons. We introduce weighted versions of the Pointwise Mutual Information approaches to build sentiment lexicons automatically. Furthermore, existing approaches often neglect negation when building sentiment lexicons. In this research, we also propose two methods (Negated Word and Flip Sentiment) that extend the lexicon-building approaches to take negation into account when constructing a sentiment lexicon. We build the financial sentiment lexicons by leveraging 200,000 messages from StockTwits. We evaluate the constructed financial sentiment lexicons in two different sentiment classification tasks (unsupervised and supervised). In addition, the created financial sentiment lexicons are compared with each other and with other existing sentiment lexicons. The best-performing financial sentiment lexicon is built by combining our Weighted Normalized Pointwise Mutual Information approach with the Negated Word approach. It outperforms all the other sentiment lexicons in the two sentiment classification tasks. In the unsupervised sentiment classification task, it has, on average, a balanced accuracy of 69.4%, and in the supervised setting, a balanced accuracy of 75.1%. Moreover, the various sentiment classification tasks confirm that sentiment lexicons can be improved by taking negation into account during their construction, using one of the proposed methods.
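The following sketch illustrates the general shape of PMI-style lexicon construction from labeled (or pseudo-labeled) messages, together with a simple negated-word counting variant in the spirit of the approach above. The class labels, Laplace smoothing, negation cue list, and scope rule (negation runs to the end of the message) are simplifying assumptions for the example, not the paper's weighted formulation.

```python
# Illustrative PMI-style sentiment lexicon construction with a
# "Negated Word" counting variant; details are assumptions.
import math
from collections import Counter

NEGATIONS = {"not", "no", "never", "n't"}

def build_pmi_lexicon(messages):
    """messages: iterable of (tokens, label) pairs with label in {'pos', 'neg'}."""
    word_counts = {"pos": Counter(), "neg": Counter()}
    for tokens, label in messages:
        negated = False
        for tok in tokens:
            if tok in NEGATIONS:
                negated = True  # simplistic scope: to the end of the message
                continue
            # Negated Word variant: count "not_good" separately from "good".
            key = ("not_" + tok) if negated else tok
            word_counts[label][key] += 1
    vocab = set(word_counts["pos"]) | set(word_counts["neg"])
    n_pos = sum(word_counts["pos"].values())
    n_neg = sum(word_counts["neg"].values())
    lexicon = {}
    for w in vocab:
        # Laplace-smoothed class-conditional probabilities.
        p_pos = (word_counts["pos"][w] + 1) / (n_pos + len(vocab))
        p_neg = (word_counts["neg"][w] + 1) / (n_neg + len(vocab))
        # PMI(w, pos) - PMI(w, neg) reduces to this log-ratio.
        lexicon[w] = math.log(p_pos / p_neg)
    return lexicon
```

A message could then be scored in the unsupervised setting by summing the lexicon values of its (possibly negation-marked) tokens.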


Author(s):  
Omar Alharbi

One crucial aspect of sentiment analysis is negation handling, where the occurrence of negation can flip the sentiment of a review and negatively affect machine learning-based sentiment classification. The role of negation in Arabic sentiment analysis has been explored only to a limited extent, especially for colloquial Arabic. In this paper, the authors address the negation problem in colloquial Arabic sentiment classification using the machine learning approach. To this end, they propose a simple rule-based algorithm for handling the problem, which otherwise degrades the performance of a machine learning classifier. The rules were crafted based on observing many cases of negation, simple linguistic knowledge, and a sentiment lexicon. They also examine the impact of the proposed algorithm on the performance of different machine learning algorithms. Furthermore, they compare the performance of the classifiers when their algorithm is used against three baselines. The experimental results show that the proposed algorithm has a positive impact on the classifiers compared to the baselines.
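As a loose illustration of rule-based negation handling feeding a machine learning classifier, the sketch below rewrites sentiment-bearing words that fall within a fixed window after a negation particle, so they become distinct features for the downstream model. The English particles, toy lexicon, and window size are placeholders and do not reflect the author's colloquial Arabic rules.

```python
# Illustrative negation-marking preprocessing rule; particles, lexicon,
# and scope window are hypothetical placeholders.
NEGATION_PARTICLES = {"not", "never", "no"}
SENTIMENT_LEXICON = {"good": 1, "bad": -1, "happy": 1, "terrible": -1}

def mark_negation(tokens, scope=3):
    """Prefix sentiment words with NEG_ within `scope` tokens of a negator."""
    out, remaining = [], 0
    for tok in tokens:
        if tok in NEGATION_PARTICLES:
            remaining = scope
            out.append(tok)
            continue
        if remaining > 0 and tok in SENTIMENT_LEXICON:
            out.append("NEG_" + tok)
        else:
            out.append(tok)
        remaining = max(0, remaining - 1)
    return out

print(mark_negation("this is not a good movie".split()))
# -> ['this', 'is', 'not', 'a', 'NEG_good', 'movie']
```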

