An Enhanced Technique for Analyzing Sentiments of Public Reviews - I

doi:10.35940/ijies.d0926.095619

An Enhanced Technique for Analyzing Sentiments of Public Reviews - I

International Journal of Inventive Engineering and Sciences - Regular Issue ◽

10.35940/ijies.d0926.095619 ◽

2019 ◽

Vol 5 (6) ◽

pp. 1-6

Keyword(s):

Sentiment Analysis ◽

Semantic Information ◽

Traditional Approach ◽

Classification Performance ◽

Bag Of Words ◽

Text Representation ◽

Shift Problem

Sentiment analysis is the process of extracting the opinion expressed in a piece of text to determine the writer’s attitude towards a topic, product or any service in general and classify it into classes such as positive, negative or neutral. Bag of Words is the traditional approach for text representation in Sentiment Analysis where text is represented as bag of its words. This approach represents the text by breaking the sentence into words disregarding other semantic information. A problem that occurs due to this representation is Polarity Shift problem. To address polarity shift problem a dual sentiment analysis (DSA) system is created. It looks at the reviews from both the sides i.e. positive and negative. The existing work on dual sentiment analysis includes techniques where dual training and dual prediction is performed. The proposed system is to enhance the classification performance of the existing system by applying different classifiers apart from those used in existing system to obtain better results. After classification of reviews into appropriate classes, various graphs are plotted based on different parameters to validate the results and determine the best classifier from the applied classifiers.

Download Full-text

GENERATING SYNTHETIC 3D POINT SEGMENTS FOR IMPROVED CLASSIFICATION OF MOBILE LIDAR POINT CLOUDS

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b2-2021-139-2021 ◽

2021 ◽

Vol XLIII-B2-2021 ◽

pp. 139-144

Author(s):

S. A. Chitnis ◽

Z. Huang ◽

K. Khoshelham

Keyword(s):

Semantic Information ◽

State Of The Art ◽

Point Clouds ◽

Classification Performance ◽

Training Data ◽

Lidar Data ◽

Real Point ◽

The Road ◽

Training Samples

Abstract. Mobile lidar point clouds are commonly used for 3d mapping of road environments as they provide a rich, highly detailed geometric representation of objects on and around the road. However, raw lidar point clouds lack semantic information about the type of objects, which is necessary for various applications. Existing methods for the classification of objects in mobile lidar data, including state of the art deep learning methods, achieve relatively low accuracies, and a primary reason for this under-performance is the inadequacy of available 3d training samples to sufficiently train deep networks. In this paper, we propose a generative model for creating synthetic 3d point segments that can aid in improving the classification performance of mobile lidar point clouds. We train a 3d Adversarial Autoencoder (3dAAE) to generate synthetic point segments that exhibit a high resemblance to and share similar geometric features with real point segments. We evaluate the performance of a PointNet-like classifier trained with and without the synthetic point segments. The evaluation results support our hypothesis that training a classifier with training data augmented with synthetic samples leads to significant improvement in the classification performance. Specifically, our model achieves an F1 score of 0.94 for vehicles and pedestrians and 1.00 for traffic signs.

Download Full-text

An Unsupervised Sentiment Information Identification Approach

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.263-266.3330 ◽

2012 ◽

Vol 263-266 ◽

pp. 3330-3334

Author(s):

Pan Pan Xu ◽

Hui Lan Jin ◽

Han Xiao Shi ◽

Wei Chen

Keyword(s):

Sentiment Analysis ◽

Word Pair ◽

Semantic Information ◽

Contextual Information ◽

Experimental Results ◽

Bag Of Words ◽

Associative Information ◽

Identification Approach ◽

Sentiment Word

Existing research focuses on document-based sentiment analysis and documents are represented by the bag-of-words model. However, due to the loss of contextual information, this representation fails to capture the associative information between an opinion and its corresponding target. Additionally, several researchers focus on sentence-based approaches, which can effectively extract an aspect-sentiment word pair within one sentence. Nevertheless, their approaches can only deal with one aspect within one sentence and miss the identification of sentiment modifier. In order to solve these problems, this paper proposes a novel identification approach of aspect-modifier-sentiment word triple using shallow semantic information. Experimental results show that our approach is feasible and effective.

Download Full-text

A Study on the Auxiliary Diagnosis of Thyroid Disease Images Based on Multiple Dimensional Deep Learning Algorithms

Current Medical Imaging Formerly Current Medical Imaging Reviews ◽

10.2174/1573405615666190115155223 ◽

2020 ◽

Vol 16 (3) ◽

pp. 199-205

Author(s):

Yuejun Liu ◽

Yifei Xu ◽

Xiangzheng Meng ◽

Xuguang Wang ◽

Tianxu Bai

Keyword(s):

Deep Learning ◽

Learning Algorithms ◽

Region Of Interest ◽

Classification Performance ◽

Thyroid Diseases ◽

Great Success ◽

Learning Models ◽

Good Classification Performance ◽

Spect Images

Background: Medical imaging plays an important role in the diagnosis of thyroid diseases. In the field of machine learning, multiple dimensional deep learning algorithms are widely used in image classification and recognition, and have achieved great success. Objective: The method based on multiple dimensional deep learning is employed for the auxiliary diagnosis of thyroid diseases based on SPECT images. The performances of different deep learning models are evaluated and compared. Methods: Thyroid SPECT images are collected with three types, they are hyperthyroidism, normal and hypothyroidism. In the pre-processing, the region of interest of thyroid is segmented and the amount of data sample is expanded. Four CNN models, including CNN, Inception, VGG16 and RNN, are used to evaluate deep learning methods. Results: Deep learning based methods have good classification performance, the accuracy is 92.9%-96.2%, AUC is 97.8%-99.6%. VGG16 model has the best performance, the accuracy is 96.2% and AUC is 99.6%. Especially, the VGG16 model with a changing learning rate works best. Conclusion: The standard CNN, Inception, VGG16, and RNN four deep learning models are efficient for the classification of thyroid diseases with SPECT images. The accuracy of the assisted diagnostic method based on deep learning is higher than that of other methods reported in the literature.

Download Full-text

Document Preprocessing with TF-IDF to Improve the Polarity Classification Performance of Unstructured Sentiment Analysis

Kinetik Game Technology Information System Computer Network Computing Electronics and Control ◽

10.22219/kinetik.v5i3.1066 ◽

2020 ◽

pp. 235-242

Author(s):

Farrikh Alzami ◽

Erika Devi Udayanti ◽

Dwi Puji Prabowo ◽

Rama Aria Megantara

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Random Forest ◽

Sentiment Analysis ◽

Classification Performance ◽

Document Preparation ◽

Learning Models ◽

Polarity Classification ◽

Negative Sentiment ◽

Machine Learning Models

Sentiment analysis in terms of polarity classification is very important in everyday life, with the existence of polarity, many people can find out whether the respected document has positive or negative sentiment so that it can help in choosing and making decisions. Sentiment analysis usually done manually. Therefore, an automatic sentiment analysis classification process is needed. However, it is rare to find studies that discuss extraction features and which learning models are suitable for unstructured sentiment analysis types with the Amazon food review case. This research explores some extraction features such as Word Bags, TF-IDF, Word2Vector, as well as a combination of TF-IDF and Word2Vector with several machine learning models such as Random Forest, SVM, KNN and Naïve Bayes to find out a combination of feature extraction and learning models that can help add variety to the analysis of polarity sentiments. By assisting with document preparation such as html tags and punctuation and special characters, using snowball stemming, TF-IDF results obtained with SVM are suitable for obtaining a polarity classification in unstructured sentiment analysis for the case of Amazon food review with a performance result of 87,3 percent.

Download Full-text

Sentiment Analysis and Classification of Restaurant Reviews using Machine Learning

2020 21st International Arab Conference on Information Technology (ACIT) ◽

10.1109/acit50332.2020.9300098 ◽

2020 ◽

Author(s):

Kanwal Zahoor ◽

Narmeen Zakaria Bawany ◽

Soomaiya Hamid

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Restaurant Reviews

Download Full-text

Automated classification of clinical trial eligibility criteria text based on ensemble learning and metric learning

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-021-01492-z ◽

2021 ◽

Vol 21 (S2) ◽

Author(s):

Kun Zeng ◽

Yibin Xu ◽

Ge Lin ◽

Likeng Liang ◽

Tianyong Hao

Keyword(s):

Clinical Trial ◽

Ensemble Learning ◽

Metric Learning ◽

Classification Performance ◽

Ensemble Model ◽

Automated Classification ◽

Eligibility Criteria ◽

Data Imbalance ◽

The Impact

Abstract Background Eligibility criteria are the primary strategy for screening the target participants of a clinical trial. Automated classification of clinical trial eligibility criteria text by using machine learning methods improves recruitment efficiency to reduce the cost of clinical research. However, existing methods suffer from poor classification performance due to the complexity and imbalance of eligibility criteria text data. Methods An ensemble learning-based model with metric learning is proposed for eligibility criteria classification. The model integrates a set of pre-trained models including Bidirectional Encoder Representations from Transformers (BERT), A Robustly Optimized BERT Pretraining Approach (RoBERTa), XLNet, Pre-training Text Encoders as Discriminators Rather Than Generators (ELECTRA), and Enhanced Representation through Knowledge Integration (ERNIE). Focal Loss is used as a loss function to address the data imbalance problem. Metric learning is employed to train the embedding of each base model for feature distinguish. Soft Voting is applied to achieve final classification of the ensemble model. The dataset is from the standard evaluation task 3 of 5th China Health Information Processing Conference containing 38,341 eligibility criteria text in 44 categories. Results Our ensemble method had an accuracy of 0.8497, a precision of 0.8229, and a recall of 0.8216 on the dataset. The macro F1-score was 0.8169, outperforming state-of-the-art baseline methods by 0.84% improvement on average. In addition, the performance improvement had a p-value of 2.152e-07 with a standard t-test, indicating that our model achieved a significant improvement. Conclusions A model for classifying eligibility criteria text of clinical trials based on multi-model ensemble learning and metric learning was proposed. The experiments demonstrated that the classification performance was improved by our ensemble model significantly. In addition, metric learning was able to improve word embedding representation and the focal loss reduced the impact of data imbalance to model performance.

Download Full-text

Enhanced concept-level sentiment analysis system with expanded ontological relations for efficient classification of user reviews

Egyptian Informatics Journal ◽

10.1016/j.eij.2021.03.001 ◽

2021 ◽

Author(s):

Asad Khattak ◽

Muhammad Zubair Asghar ◽

Zain Ishaq ◽

Waqas Haider Bangyal ◽

Ibrahim A Hameed

Keyword(s):

Sentiment Analysis ◽

User Reviews ◽

Analysis System

Download Full-text

Space Precession Target Classification Based on Radar High-Resolution Range Profiles

International Journal of Antennas and Propagation ◽

10.1155/2019/8151620 ◽

2019 ◽

Vol 2019 ◽

pp. 1-9

Author(s):

Yizhe Wang ◽

Cunqian Feng ◽

Yongshun Zhang ◽

Sisan He

Keyword(s):

Parameter Extraction ◽

Classification Performance ◽

Support Vector ◽

Electromagnetic Data ◽

Feature Extractor ◽

Different Types ◽

Radar Echo ◽

High Level ◽

Cone Target

Precession is a common micromotion form of space targets, introducing additional micro-Doppler (m-D) modulation into the radar echo. Effective classification of space targets is of great significance for further micromotion parameter extraction and identification. Feature extraction is a key step during the classification process, largely influencing the final classification performance. This paper presents two methods for classifying different types of space precession targets from the HRRPs. We first establish the precession model of space targets and analyze the scattering characteristics and then compute electromagnetic data of the cone target, cone-cylinder target, and cone-cylinder-flare target. Experimental results demonstrate that the support vector machine (SVM) using histograms of oriented gradient (HOG) features achieves a good result, whereas the deep convolutional neural network (DCNN) obtains a higher classification accuracy. DCNN combines the feature extractor and the classifier itself to automatically mine the high-level signatures of HRRPs through a training process. Besides, the efficiency of the two classification processes are compared using the same dataset.

Download Full-text

Polarity Classification of Arabic Sentiments

International Journal of Information Technology and Web Engineering ◽

10.4018/ijitwe.2016070103 ◽

2016 ◽

Vol 11 (3) ◽

pp. 32-49 ◽

Cited By ~ 5

Author(s):

Mohammed N. Al-Kabi ◽

Heider A. Wahsheh ◽

Izzat M. Alsmadi

Keyword(s):

Social Media ◽

Sentiment Analysis ◽

Operating Characteristic ◽

Opinion Mining ◽

Online Social Network ◽

The Social ◽

Polarity Classification ◽

Arabic Sentiment Analysis ◽

Modern Standard

Sentiment Analysis/Opinion Mining is associated with social media and usually aims to automatically identify the polarities of different points of views of the users of the social media about different aspects of life. The polarity of a sentiment reflects the point view of its author about a certain issue. This study aims to present a new method to identify the polarity of Arabic reviews and comments whether they are written in Modern Standard Arabic (MSA), or one of the Arabic Dialects, and/or include Emoticons. The proposed method is called Detection of Arabic Sentiment Analysis Polarity (DASAP). A modest dataset of Arabic comments, posts, and reviews is collected from Online social network websites (i.e. Facebook, Blogs, YouTube, and Twitter). This dataset is used to evaluate the effectiveness of the proposed method (DASAP). Receiver Operating Characteristic (ROC) prediction quality measurements are used to evaluate the effectiveness of DASAP based on the collected dataset.

Download Full-text