Design of adaptive ensemble classifier for online sentiment analysis and opinion mining

PeerJ Computer Science ◽

10.7717/peerj-cs.660 ◽

2021 ◽

Vol 7 ◽

pp. e660

Author(s):

Sanjeev Kumar ◽

Ravendra Singh ◽

Mohammad Zubair Khan ◽

Abdulfattah Noorwali

Keyword(s):

Sentiment Analysis ◽

False Positive ◽

Opinion Mining ◽

Negative Impact ◽

Concept Drift ◽

Data Distribution ◽

Detection Algorithm ◽

Ensemble Classifier ◽

Detection Algorithms ◽

Highly Sensitive

DataStream mining is a challenging task for researchers because of the change in data distribution during classification, known as concept drift. Drift detection algorithms emphasize detecting the drift. The drift detection algorithm needs to be very sensitive to change in data distribution for detecting the maximum number of drifts in the data stream. But highly sensitive drift detectors lead to higher false-positive drift detections. This paper proposed a Drift Detection-based Adaptive Ensemble classifier for sentiment analysis and opinion mining, which uses these false-positive drift detections to benefit and minimize the negative impact of false-positive drift detection signals. The proposed method creates and adds a new classifier to the ensemble whenever a drift happens. A weighting mechanism is implemented, which provides weights to each classifier in the ensemble. The weight of the classifier decides the contribution of each classifier in the final classification results. The experiments are performed using different classification algorithms, and results are evaluated on the accuracy, precision, recall, and F1-measures. The proposed method is also compared with these state-of-the-art methods, OzaBaggingADWINClassifier, Accuracy Weighted Ensemble, Additive Expert Ensemble, Streaming Random Patches, and Adaptive Random Forest Classifier. The results show that the proposed method handles both true positive and false positive drifts efficiently.

Download Full-text

Comparative Analysis of Drift Detection Based Adaptive Ensemble Model with Different Drift Detection Techniques

Journal of University of Shanghai for Science and Technology ◽

10.51201/jusst/21/06492 ◽

2021 ◽

Vol 23 (06) ◽

pp. 49-55

Author(s):

Sanjeev Kumar ◽

◽

Ravendra Singh ◽

Keyword(s):

Data Mining ◽

Comparative Analysis ◽

False Positive ◽

Opinion Mining ◽

Concept Drift ◽

Ensemble Classifier ◽

Stream Data ◽

Detection Techniques ◽

Stream Data Mining ◽

Detection Algorithms

Stream data mining is a popular research area these days. The concept drift detection and drift handling are the biggest challenges of stream data mining. Several drift detection algorithms have been developed which can accurately detect various drifts but have the problem of false-positive drift detection. The false-positive drift detection leads to the performance degradation of the classifier because of unnecessary training in between analyses. Classifier ensemble has shown its efficiency for drift detection, drift handling, and classification. But the ensemble classifiers could not detect the exact position of drift occurrence, so it has to update itself at some fixed interval, which leads to an unnecessary computational burden on the system. Combining the drift detection algorithm with an ensemble classifier can improve the performance and also solve the problems of false-positive drift detection and unnecessary updating of the ensemble classifier. In this paper, a model is proposed that creates a weighted adaptive ensemble classifier by updating it only when a drift detection signal is given by the used drift detection method. The proposed model is evaluated on text-based stream data for sentiment analysis and opinion mining with multiple drift detection algorithms and with multiple classification algorithms as base classifiers for the ensemble. A comparative analysis has been done, and the results have shown the efficiency of the proposed models.

Download Full-text

Concept Drift Identification using Classifier Ensemble Approach

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v8i1.pp19-25 ◽

2018 ◽

Vol 8 (1) ◽

pp. 19

Author(s):

Leena Deshpande ◽

M. Narsing Rao

Keyword(s):

Concept Drift ◽

Data Distribution ◽

Ensemble Classifier ◽

Classification Model ◽

Data Sets ◽

Ensemble Approach ◽

Telecommunication Systems ◽

Traditional Classification ◽

Financial Domain ◽

New Feature

<p>Abstract:-In Internetworking system, the huge amount of data is scattered, generated and processed over the network. The data mining techniques are used to discover the unknown pattern from the underlying data. A traditional classification model is used to classify the data based on past labelled data. However in many current applications, data is increasing in size with fluctuating patterns. Due to this new feature may arrive in the data. It is present in many applications like sensornetwork, banking and telecommunication systems, financial domain, Electricity usage and prices based on its demand and supplyetc .Thus change in data distribution reduces the accuracy of classifying the data. It may discover some patterns as frequent while other patterns tend to disappear and wrongly classify. To mine such data distribution, traditionalclassification techniques may not be suitable as the distribution generating the items can change over time so data from the past may become irrelevant or even false for the current prediction. For handlingsuch varying pattern of data, concept drift mining approach is used to improve the accuracy of classification techniques. In this paper we have proposed ensemble approach for improving the accuracy of classifier. The ensemble classifier is applied on 3 different data sets. We investigated different features for the different chunk of data which is further given to ensemble classifier. We observed the proposed approach improves the accuracy of classifier for different chunks of data.</p>

Download Full-text

Bi-directional online transfer learning: a framework

annals of telecommunications - annales des télécommunications ◽

10.1007/s12243-020-00776-1 ◽

2020 ◽

Vol 75 (9-10) ◽

pp. 523-547

Author(s):

Helen McKay ◽

Nathan Griffiths ◽

Phillip Taylor ◽

Theo Damoulas ◽

Zhou Xu

Keyword(s):

Transfer Learning ◽

Concept Drift ◽

Detection Algorithm ◽

Target Domain ◽

Real World Data ◽

Additional Information ◽

Detection Algorithms ◽

Home Heating ◽

Concept Drift Detection ◽

Communication Demands

Abstract Transfer learning uses knowledge learnt in source domains to aid predictions in a target domain. When source and target domains are online, they are susceptible to concept drift, which may alter the mapping of knowledge between them. Drifts in online environments can make additional information available in each domain, necessitating continuing knowledge transfer both from source to target and vice versa. To address this, we introduce the Bi-directional Online Transfer Learning (BOTL) framework, which uses knowledge learnt in each online domain to aid predictions in others. We introduce two variants of BOTL that incorporate model culling to minimise negative transfer in frameworks with high volumes of model transfer. We consider the theoretical loss of BOTL, which indicates that BOTL achieves a loss no worse than the underlying concept drift detection algorithm. We evaluate BOTL using two existing concept drift detection algorithms: RePro and ADWIN. Additionally, we present a concept drift detection algorithm, Adaptive Windowing with Proactive drift detection (AWPro), which reduces the computation and communication demands of BOTL. Empirical results are presented using two data stream generators: the drifting hyperplane emulator and the smart home heating simulator, and real-world data predicting Time To Collision (TTC) from vehicle telemetry. The evaluation shows BOTL and its variants outperform the concept drift detection strategies and the existing state-of-the-art online transfer learning technique.

Download Full-text

Design of Ensemble Classifier Selection Framework Based on Ant Colony Optimization for Sentiment Analysis and Opinion Mining

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l2755.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 5112-5117

Keyword(s):

Sentiment Analysis ◽

Ant Colony Optimization ◽

Opinion Mining ◽

Ensemble Classifier ◽

Ant Colony ◽

Experimental Result ◽

Ensemble Classifiers ◽

Classifier Selection ◽

Selection For ◽

Selection Framework

Ensemble Classifier provides a promising way to improve the accuracy of classification for sentiment analysis and opinion mining. Ensemble classifier should combine with diverse base classifiers. However, establishing a connection between diversity and accuracy of ensemble classifier is tedious task because of sensitivity between diversity and accuracy. In this paper an Ensemble classifier selection (ECS) framework based on Ant Colony Optimization (ACO) algorithm is presented. The framework provides a subset of base classifiers from a given set of classifiers with maximum possible diversity and accuracy to design an ensemble classifier for sentiment analysis and opinion mining. This framework uses diversity measures and accuracy as selection criteria for classifier selection for ensemble creation. The experimental result shows that the ensemble classifiers provided by this framework presents an efficient way for sentiment analysis and opinion mining.

Download Full-text

Efficient Topic Level Opinion Mining and Sentiment Analysis Algorithm using Latent Dirichlet Allocation Model

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2019/105852019 ◽

2019 ◽

Vol 8 (5) ◽

pp. 2568-2572

Author(s):

Vamshi Krishna B ◽

Keyword(s):

Sentiment Analysis ◽

Latent Dirichlet Allocation ◽

Opinion Mining ◽

Allocation Model ◽

Analysis Algorithm ◽

Latent Dirichlet Allocation Model ◽

Dirichlet Allocation

Download Full-text

Convolutional Neural Network for Customer’s Opinion on Amazon Products

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c5670.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 6634-6643 ◽

Cited By ~ 1

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Sentiment Analysis ◽

Latent Dirichlet Allocation ◽

Opinion Mining ◽

Text Documents ◽

Customer Churn ◽

Learning Classifier ◽

Review Spam

Opinion mining and sentiment analysis are valuable to extract the useful subjective information out of text documents. Predicting the customer’s opinion on amazon products has several benefits like reducing customer churn, agent monitoring, handling multiple customers, tracking overall customer satisfaction, quick escalations, and upselling opportunities. However, performing sentiment analysis is a challenging task for the researchers in order to find the users sentiments from the large datasets, because of its unstructured nature, slangs, misspells and abbreviations. To address this problem, a new proposed system is developed in this research study. Here, the proposed system comprises of four major phases; data collection, pre-processing, key word extraction, and classification. Initially, the input data were collected from the dataset: amazon customer review. After collecting the data, preprocessing was carried-out for enhancing the quality of collected data. The pre-processing phase comprises of three systems; lemmatization, review spam detection, and removal of stop-words and URLs. Then, an effective topic modelling approach Latent Dirichlet Allocation (LDA) along with modified Possibilistic Fuzzy C-Means (PFCM) was applied to extract the keywords and also helps in identifying the concerned topics. The extracted keywords were classified into three forms (positive, negative and neutral) by applying an effective machine learning classifier: Convolutional Neural Network (CNN). The experimental outcome showed that the proposed system enhanced the accuracy in sentiment analysis up to 6-20% related to the existing systems.

Download Full-text

A dual U-Net algorithm for automating feature extraction from satellite imagery

The Journal of Defense Modeling and Simulation Applications Methodology Technology ◽

10.1177/1548512920983549 ◽

2021 ◽

pp. 154851292098354

Author(s):

Samuel Humphries ◽

Trevor Parker ◽

Bryan Jonas ◽

Bryan Adams ◽

Nicholas J Clark

Keyword(s):

Neural Networks ◽

Object Detection ◽

Satellite Images ◽

Detection Algorithm ◽

Military Operations ◽

Detection Algorithms ◽

Us Military ◽

Standard Product ◽

Different Types ◽

Road Intersections

Quick identification of building and roads is critical for execution of tactical US military operations in an urban environment. To this end, a gridded, referenced, satellite images of an objective, often referred to as a gridded reference graphic or GRG, has become a standard product developed during intelligence preparation of the environment. At present, operational units identify key infrastructure by hand through the work of individual intelligence officers. Recent advances in Convolutional Neural Networks, however, allows for this process to be streamlined through the use of object detection algorithms. In this paper, we describe an object detection algorithm designed to quickly identify and label both buildings and road intersections present in an image. Our work leverages both the U-Net architecture as well the SpaceNet data corpus to produce an algorithm that accurately identifies a large breadth of buildings and different types of roads. In addition to predicting buildings and roads, our model numerically labels each building by means of a contour finding algorithm. Most importantly, the dual U-Net model is capable of predicting buildings and roads on a diverse set of test images and using these predictions to produce clean GRGs.

Download Full-text

Validating Sentiment Analysis on Opinion Mining Using Self-reported Attitude Scores

2020 Seventh International Conference on Social Networks Analysis, Management and Security (SNAMS) ◽

10.1109/snams52053.2020.9336540 ◽

2020 ◽

Author(s):

Jieyu Ding Featherstone ◽

George A. Barnett

Keyword(s):

Sentiment Analysis ◽

Opinion Mining

Download Full-text

A novel algorithm to detect non-wear time from raw accelerometer data using deep convolutional neural networks

Scientific Reports ◽

10.1038/s41598-021-87757-z ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Shaheen Syed ◽

Bente Morseth ◽

Laila A. Hopstock ◽

Alexander Horsch

Keyword(s):

Threshold Value ◽

Detection Algorithm ◽

Classification Model ◽

Type I ◽

Accelerometer Data ◽

Deep Convolutional Neural Networks ◽

Major Drawback ◽

Wear Time ◽

Detection Algorithms ◽

Type Ii Errors

AbstractTo date, non-wear detection algorithms commonly employ a 30, 60, or even 90 mins interval or window in which acceleration values need to be below a threshold value. A major drawback of such intervals is that they need to be long enough to prevent false positives (type I errors), while short enough to prevent false negatives (type II errors), which limits detecting both short and longer episodes of non-wear time. In this paper, we propose a novel non-wear detection algorithm that eliminates the need for an interval. Rather than inspecting acceleration within intervals, we explore acceleration right before and right after an episode of non-wear time. We trained a deep convolutional neural network that was able to infer non-wear time by detecting when the accelerometer was removed and when it was placed back on again. We evaluate our algorithm against several baseline and existing non-wear algorithms, and our algorithm achieves a perfect precision, a recall of 0.9962, and an F1 score of 0.9981, outperforming all evaluated algorithms. Although our algorithm was developed using patterns learned from a hip-worn accelerometer, we propose algorithmic steps that can easily be applied to a wrist-worn accelerometer and a retrained classification model.

Download Full-text

Opinion Mining and Sentiment Analysis

Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining ◽

10.1145/2915031.2915050 ◽

2016 ◽

pp. 389

Keyword(s):

Sentiment Analysis ◽

Opinion Mining

Download Full-text