Opinion Mining Analysis on Online Product Reviews Using Naïve Bayes and Feature Selection

Author(s):  
Wina Permana Sari ◽  
Hisyam Fahmi

The World Wide Web has boosted its content for the past years, it has a vast amount of multimedia resources that continuously grow specifically in documentary data. One of the major contributors of documentary contents can be evidently found on the social media called Facebook. People or netizens on Facebook are actively sharing their opinion about a certain topic or posts that can be related to them or not. With the huge amount of accessible documentary data that are seen on the so-called social media, there are research trends that can be made by the researchers in the field of opinion mining. A netizen’s comment on a particular post can either be a negative or a positive one. This study will discuss the opinion or comment of a netizen whether it is positive or negative or how she/he feels about a specific topic posted on Facebook; this is can be measured by the use of Sentiment Analysis. The combination of the Natural Language Processing and the analytics in textual form is also known as Sentiment Analysis that is use to the extraction of data in a useful manner. This study will be based on the product reviews of Filipinos in Filipino, English and Taglish (mixed Filipino and English) languages. To categorize a comment effectively, the Naïve Bayes Algorithm was implemented to the developed web system.


Author(s):  
Oman Somantri ◽  
Dyah Apriliani

<p>Conducting an assessment of consumer sentiments taken from social media in assessing a culinary food gives useful information for everyone who wants to get this information especially for migrants and tourists, in th other hand that information is very valuable for food stall and restaurant owners as information in improvinf food quality. Overcoming this problem, a sentiment analysis classification model using naïve bayes algorithm (NB) was applied to get this information. This problem occurs is the level of accuracy of classification of consumer ratings of culinary food is still not optimal because the weight of values in the data preprocessing process are not optimal. In this paper proposed a hybrid feature selection models to overcome the problems in the process of selecting the feature attributes that have not been optimal by using a combination of information gain (IG) and genetic algorithm (GA) algorithms. The result of this research showed that after the experiment and compared to using others algorithms produce the best of the level occuracy is 93%.</p>


2017 ◽  
Vol 7 (6) ◽  
pp. 2296-2302 ◽  
Author(s):  
J. Mir ◽  
A. Mahmood ◽  
S. Khatoon

Aspect based opinion mining investigates deeply, the emotions related to one’s aspects. Aspects and opinion word identification is the core task of aspect based opinion mining. In previous studies aspect based opinion mining have been applied on service or product domain. Moreover, product reviews are short and simple whereas, social reviews are long and complex. However, this study introduces an efficient model for social reviews which classifies aspects and opinion words related to social domain. The main contributions of this paper are auto tagging and data training phase, feature set definition and dictionary usage. Proposed model results are compared with CR model and Naïve Bayes classifier on same dataset having accuracy 98.17% and precision 96.01%, while recall and F1 are 96.00% and 96.01% respectively. The experimental results show that the proposed model performs better than the CR model and Naïve Bayes classifier.


Author(s):  
Mir Habeebullah Shah Quadri ◽  
R. K. Selvakumar

Both sellers and buyers heavily depend on the opinions of customers in purchasing and selling products online. When it comes to text-based data, sentiment analysis of user reviews has become a prominent facet of machine learning. Text data is generally unstructured which makes opinion mining very challenging. A wide array of pre-processing and post-processing techniques need to be applied. But the major challenge is selecting the right classifier for the job. Naïve Bayes algorithm is a commonly used machine learning classifier when it comes to opinion mining and sentiment analysis. The focus of this survey is to observe and analyze the performance of Naïve Bayes algorithm in sentiment analysis of user reviews online. Recent research from a wide array of use-cases such as sentiment analysis of movie reviews, product reviews, book reviews, blog posts, microblogs and other sources of data have been taken into account. The results show that Naïve Bayes algorithm performs exceptionally well with accuracies between 75% to 99% across the board.


2020 ◽  
Vol 4 (3) ◽  
pp. 504-512
Author(s):  
Faried Zamachsari ◽  
Gabriel Vangeran Saragih ◽  
Susafa'ati ◽  
Windu Gata

The decision to move Indonesia's capital city to East Kalimantan received mixed responses on social media. When the poverty rate is still high and the country's finances are difficult to be a factor in disapproval of the relocation of the national capital. Twitter as one of the popular social media, is used by the public to express these opinions. How is the tendency of community responses related to the move of the National Capital and how to do public opinion sentiment analysis related to the move of the National Capital with Feature Selection Naive Bayes Algorithm and Support Vector Machine to get the highest accuracy value is the goal in this study. Sentiment analysis data will take from public opinion using Indonesian from Twitter social media tweets in a crawling manner. Search words used are #IbuKotaBaru and #PindahIbuKota. The stages of the research consisted of collecting data through social media Twitter, polarity, preprocessing consisting of the process of transform case, cleansing, tokenizing, filtering and stemming. The use of feature selection to increase the accuracy value will then enter the ratio that has been determined to be used by data testing and training. The next step is the comparison between the Support Vector Machine and Naive Bayes methods to determine which method is more accurate. In the data period above it was found 24.26% positive sentiment 75.74% negative sentiment related to the move of a new capital city. Accuracy results using Rapid Miner software, the best accuracy value of Naive Bayes with Feature Selection is at a ratio of 9:1 with an accuracy of 88.24% while the best accuracy results Support Vector Machine with Feature Selection is at a ratio of 5:5 with an accuracy of 78.77%.


Healthcare ◽  
2021 ◽  
Vol 9 (7) ◽  
pp. 884
Author(s):  
Antonio García-Domínguez ◽  
Carlos E. Galván-Tejada ◽  
Ramón F. Brena ◽  
Antonio A. Aguileta ◽  
Jorge I. Galván-Tejada ◽  
...  

Children’s healthcare is a relevant issue, especially the prevention of domestic accidents, since it has even been defined as a global health problem. Children’s activity classification generally uses sensors embedded in children’s clothing, which can lead to erroneous measurements for possible damage or mishandling. Having a non-invasive data source for a children’s activity classification model provides reliability to the monitoring system where it is applied. This work proposes the use of environmental sound as a data source for the generation of children’s activity classification models, implementing feature selection methods and classification techniques based on Bayesian networks, focused on the recognition of potentially triggering activities of domestic accidents, applicable in child monitoring systems. Two feature selection techniques were used: the Akaike criterion and genetic algorithms. Likewise, models were generated using three classifiers: naive Bayes, semi-naive Bayes and tree-augmented naive Bayes. The generated models, combining the methods of feature selection and the classifiers used, present accuracy of greater than 97% for most of them, with which we can conclude the efficiency of the proposal of the present work in the recognition of potentially detonating activities of domestic accidents.


Sign in / Sign up

Export Citation Format

Share Document