Reviews’ length and sentiment as correlates of online reviews’ ratings

Purpose The purpose of this paper is twofold: first, to identify and validate reviews’ length and sentiment as correlates of online reviews’ ratings; and second, to understand the emotions embedded in online reviews and how they associate with specific words used in such reviews. Design/methodology/approach A panel data set of customer reviews was collected for auto, life, and home insurance from January 2012 to December 2015 using a web scraping technique. Using a sentiment analysis approach, 1,584 reviews for the auto, home, and life insurance services of 156 insurance companies were analyzed. Findings The results indicate that, since 2013, consumers have generally had more negative emotions than positive ones toward insurance services. The results also show that consumer review sentiment correlates positively and review length correlates negatively with consumer online review ratings. Furthermore, a two-way ANOVA analysis shows that, in general, short reviews with positive sentiment are associated with high review ratings. Practical implications The findings of this study provide service companies, in general, and insurance companies, in particular, with important guidelines that should be considered to increase consumers’ positive attitude toward their services. Originality/value This paper highlights the importance of sentiment analysis in identifying consumer reviews’ emotions and understanding the associations and interactions of reviews’ length and sentiment on online review rating, which can lead to improved marketing strategies.

Download Full-text

Determining banking service attributes from online reviews: text mining and sentiment analysis

International Journal of Bank Marketing ◽

10.1108/ijbm-08-2021-0380 ◽

2022 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Divya Mittal ◽

Shiv Ratan Agrawal

Keyword(s):

Text Mining ◽

Customer Satisfaction ◽

Sentiment Analysis ◽

Banking Sector ◽

Online Reviews ◽

Socioeconomic Background ◽

Content Type ◽

Customer Reviews ◽

Banking Service ◽

The Interest Rate

PurposeThe current study employs text mining and sentiment analysis to identify core banking service attributes and customer sentiment in online user-generated reviews. Additionally, the study explains customer satisfaction based on the identified predictors.Design/methodology/approachA total of 32,217 customer reviews were collected across 29 top banks on bankbazaar.com posted from 2014 to 2021. In total three conceptual models were developed and evaluated employing regression analysis.FindingsThe study revealed that all variables were found to be statistically significant and affect customer satisfaction in their respective models except the interest rate.Research limitations/implicationsThe study is confined to the geographical representation of its subjects' i.e. Indian customers. A cross-cultural and socioeconomic background analysis of banking customers in different countries may help to better generalize the findings.Practical implicationsThe study makes essential theoretical and managerial contributions to the existing literature on services, particularly the banking sector.Originality/valueThis paper is unique in nature that focuses on banking customer satisfaction from online reviews and ratings using text mining and sentiment analysis.

Download Full-text

FLORA: a hierarchical fuzzy system for online accommodation review analysis

Journal of Systems and Information Technology ◽

10.1108/jsit-03-2018-0046 ◽

2019 ◽

Vol 21 (3) ◽

pp. 347-367

Author(s):

Thara Angskun ◽

Jitimon Angskun

Keyword(s):

Sentiment Analysis ◽

Semantic Analysis ◽

Online Reviews ◽

Online Review ◽

Probability Weighting ◽

Content Type ◽

Analysis Process ◽

Review Analysis ◽

Accuracy Comparison ◽

Hierarchical Fuzzy System

Purpose This paper aims to introduce a hierarchical fuzzy system for an online review analysis named FLORA. FLORA enables tourists to decide their destination without reading numerous reviews from experienced tourists. It summarizes reviews and visualizes them through a hierarchical structure. The visualization does not only present overall quality of an accommodation, but it also presents the condition of the bed, hospitality of the front desk receptionist and much more in a snap. Design/methodology/approach FLORA is a complete system which acquires online reviews, analyzes sentiments, computes feature scores and summarizes results in a hierarchical view. FLORA is designed to use an overall score, rated by real tourists as a baseline for accuracy comparison. The accuracy of FLORA has achieved by a novel sentiment analysis process (as part of a knowledge acquisition engine) based on semantic analysis and a novel rating technique, called hierarchical fuzzy calculation, in the knowledge inference engine. Findings The performance comparison of FLORA against related work has been assessed in two aspects. The first aspect focuses on review analysis with binary format representation. The results reveal that the hierarchical fuzzy method, with probability weighting of FLORA, is achieved with the highest values in precision, recall and F-measure. The second aspect looks at review analysis with a five-point rating scale rating by comparing with one of the most advanced research methods, called fuzzy domain ontology. The results reveal that the hierarchical fuzzy method, with probability weighting of FLORA, returns the closest results to the tourist-defined rating. Research limitations/implications This research advances knowledge of online review analysis by contributing a novel sentiment analysis process and a novel rating technique. The FLORA system has two limitations. First, the reviews are based on individual expression, which is an arbitrary distinction and not always grammatically correct. Consequently, some opinions may not be extracted because the context free grammar rules are insufficient. Second, natural languages evolve and diversify all the time. Many emerging words or phrases, including idioms, proverbs and slang, are often used in online reviews. Thus, those words or phrases need to be manually updated in the knowledge base. Practical implications This research contributes to the tourism business and assists travelers by introducing comprehensive and easy to understand information about each accommodation to travelers. Although the FLORA system was originally designed and tested with accommodation reviews, it can also be used with reviews of any products or services by updating data in the knowledge base. Thus, businesses, which have online reviews for their products or services, can benefit from the FLORA system. Originality/value This research proposes a FLORA system which analyzes sentiments from online reviews, computes feature scores and summarizes results in a hierarchical view. Moreover, this work is able to use the overall score, rated by real tourists, as a baseline for accuracy comparison. The main theoretical implication is a novel sentiment analysis process based on semantic analysis and a novel rating technique called hierarchical fuzzy calculation.

Download Full-text

Customized reputation generation of entities using sentiment analysis

World Journal of Engineering ◽

10.1108/wje-09-2020-0470 ◽

2020 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Arpita Gupta ◽

Saloni Priyani ◽

Ramadoss Balakrishnan

Keyword(s):

Sentiment Analysis ◽

Classification Models ◽

Data Set ◽

Content Type ◽

Customer Reviews ◽

Art Methods ◽

Review Helpfulness ◽

Sentiment Score ◽

State Of Art

Purpose In this study, the authors have used the customer reviews of books and movies in natural language for the purpose of sentiment analysis and reputation generation on the reviews. Most of the existing work has performed sentiment analysis and reputation generation on the reviews by using single classification models and considered other attributes for reputation generation. Design/methodology/approach The authors have taken review, helpfulness and rating into consideration. In this paper, the authors have performed sentiment analysis for extracting the probability of the review belonging to a class, which is further used for generating the sentiment score and reputation of the review. The authors have used pre-trained BERT fine-tuned for sentiment analysis on movie and book reviews separately. Findings In this study, the authors have also combined the three models (BERT, Naïve Bayes and SVM) for more accurate sentiment classification and reputation generation, which has outperformed the best BERT model in this study. They have achieved the best accuracy of 91.2% for the movie review data set and 89.4% for the book review data set which is better than the existing state-of-art methods. They have used the transfer learning concept in deep learning where you take knowledge gained from one problem and apply it to a similar problem. Originality/value The authors have proposed a novel model based on combination of three classification models, which has outperformed the existing state-of-art methods. To the best of the authors’ knowledge, there is no existing model which combines three models for sentiment score calculation and reputation generation for the book review data set.

Download Full-text

Finding eWOM customers from customer reviews

Industrial Management & Data Systems ◽

10.1108/imds-09-2017-0418 ◽

2019 ◽

Vol 119 (1) ◽

pp. 129-147 ◽

Cited By ~ 3

Author(s):

Pengfei Zhao ◽

Ji Wu ◽

Zhongsheng Hua ◽

Shijian Fang

Keyword(s):

Evaluation Method ◽

Online Reviews ◽

Customer Segmentation ◽

Product Reviews ◽

Data Set ◽

Content Type ◽

Customer Reviews ◽

Product Sales ◽

The Relationship ◽

New Framework

PurposeThe purpose of this paper is to identify electronic word-of-mouth (eWOM) customers from customer reviews. Thus, firms can precisely leverage eWOM customers to increase their product sales.Design/methodology/approachThis research proposed a framework to analyze the content of consumer-generated product reviews. Specific algorithms were used to identify potential eWOM reviewers, and then an evaluation method was used to validate the relationship between product sales and the eWOM reviewers identified by the authors’ proposed method.FindingsThe results corroborate that online product reviews that are made by the eWOM customers identified by the authors’ proposed method are more related to product sales than customer reviews that are made by non-eWOM customers and that the predictive power of the reviews generated by eWOM customers are significantly higher than the reviews generated by non-eWOM customers.Research limitations/implicationsThe proposed method is useful in the data set, which is based on one type of products. However, for other products, the validity must be tested. Previous eWOM customers may have no significant influence on product sales in the future. Therefore, the proposed method should be tested in the new market environment.Practical implicationsBy combining the method with the previous customer segmentation method, a new framework of customer segmentation is proposed to help firms understand customers’ value specifically.Originality/valueThis study is the first to identify eWOM customers from online reviews and to evaluate the relationship between reviewers and product sales.

Download Full-text

Assessing technology adoption practices in Chinese theme parks: text mining and sentiment analysis

Journal of Hospitality and Tourism Technology ◽

10.1108/jhtt-05-2020-0126 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Tingting Zhang ◽

Bin Li ◽

Ady Milman ◽

Nan Hua

Keyword(s):

Text Mining ◽

Technology Adoption ◽

Sentiment Analysis ◽

Statistical Tests ◽

Online Reviews ◽

Theme Park ◽

Theme Parks ◽

Content Type ◽

Customer Reviews ◽

Technology Applications

Purpose This study aims to examine technology adoption practices in Chinese theme parks by leveraging text mining and sentiment analysis approaches on actual theme park customers’ online reviews. Design/methodology/approach The study text mined a total of 65,518 reviews of 490 Chinese theme parks with the aid of the Python program. Further, it computed sentiment scores of the customer reviews associated with the ratings of each categorized technology practice applied in the theme parks. Findings The study identified two major categories of technology applications in theme parks: supporting and experiential technologies. Multiple statistical tests confirmed that supporting technologies consisted of three types: intelligent services, ticketing and in-park transportation. Experiential technologies further included five aspects of technologies according to Schmitt’s strategic experiential modules (SEMs): sense, feel, act, think and relate. Originality/value The study findings contribute to the current understanding of theme park visitors’ perceptions of technology adoption practices and provide insightful implications for theme park practitioners who intend to invest in high technology solutions to deliver a better customer experience.

Download Full-text

Does the length of a review matter in perceived helpfulness? The moderating role of product experience

Journal of Research in Interactive Marketing ◽

10.1108/jrim-04-2020-0086 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Han Jia ◽

Sumin Shin ◽

Jinfeng Jiao

Keyword(s):

Online Reviews ◽

Product Type ◽

Online Review ◽

Content Type ◽

Level Of Involvement ◽

Customer Reviews ◽

Product Experience ◽

Online Customer Reviews ◽

Review Helpfulness ◽

Product Ratings

PurposeThis paper aims to offer a framework explaining how product experience (i.e. think vs feel) and product involvement (high vs low) influence the helpfulness of online reviews. It also reexamined how online consumer review dimensions help to build online review helpfulness under different contexts.Design/methodology/approachData were collected using content analysis on 1,200 online customer reviews on 12 products from four categories to measure the relationships between online review dimensions and the helpfulness of reviews. The regression analysis and analysis of variance (ANOVA) were used to test the hypotheses.FindingsThe findings indicate that the effectiveness of length of a review is moderated by product type; for think products, longer reviews yield higher helpfulness. Furthermore, the level of consistency between individual review ratings and overall product ratings is associated with review helpfulness. The length of product descriptions and product ratings is moderated by the level of involvement. For products with high involvement, longer descriptions yield higher helpfulness.Originality/valueA conceptual connection to customer interaction is proposed by online customer reviews that vary by product type. The findings provide implications for online retailers to better manage online customer reviews and increase the value of product ratings.

Download Full-text

Do online consumer reviews help to evaluate the performance of automobile manufacturers?

Journal of Enterprise Information Management ◽

10.1108/jeim-09-2019-0292 ◽

2020 ◽

Vol 33 (5) ◽

pp. 1153-1198

Author(s):

Amit Singh ◽

Mamata Jenamani ◽

Jitesh Thakkar

Keyword(s):

Sentiment Analysis ◽

Data Set ◽

Pareto Chart ◽

Content Type ◽

Customer Reviews ◽

Online Consumer Reviews ◽

Online Customer Reviews ◽

Fishbone Diagram ◽

Online Consumer ◽

Practical Implications

PurposeThis research proposes a text analytics–based framework that examines the utility of online customer reviews in evaluating automobile manufacturers and discovering their consumer-perceived weaknesses.Design/methodology/approachThe proposed framework integrates aspect-level sentiment analysis with the house of quality (HoQ), TOPSIS, Pareto chart and fishbone diagram. While sentiment analysis mines and quantifies review-embedded consumer opinions on various automobile attributes, the integrated HoQ-TOPSIS analyzes the quantified opinions and evaluates the manufacturers. The Pareto charts assist in discovering consumer-perceived weaknesses of the underperforming manufacturers. Finally, the fishbone diagram visually represents the results in the form with which the manufacturing community is acquainted.FindingsThe proposed framework is tested on a review data set collected from CarWale, a well-known car portal in India. Selecting five manufacturers from the mid-size car segment, the authors identified the worst-performing one and discovered its weak attributes.Practical implicationsThe proposed framework can help the manufacturers in evaluating competitor; identifying consumers' contemporary interests; discovering own and their competitors' weak attributes; assessing the suppliers and sending early warnings; detecting the hazardous defects. It can assist the component suppliers in devising process improvement strategies; improving their customer network; comparing them with competitors. It can support the customers in identifying the best available alternative.Originality/valueThe proposed framework is first of its kind to integrate the sentiment analysis with (1) HoQ-TOPSIS to assess the manufacturers; (2) Pareto chart to discover their weaknesses; (3) fishbone diagram to visually represent the results.

Download Full-text

The asymmetric effect of review valence on numerical rating

Online Information Review ◽

10.1108/oir-11-2017-0307 ◽

2019 ◽

Vol 43 (2) ◽

pp. 283-300 ◽

Cited By ~ 6

Author(s):

Hsiu-Yuan Tsao ◽

Ming-Yi Chen ◽

Hao-Chiang Koong Lin ◽

Yu-Chun Ma

Keyword(s):

Text Mining ◽

Sentiment Analysis ◽

Negative Impact ◽

Online Reviews ◽

Data Set ◽

Brand Strength ◽

Content Type ◽

Numerical Rating ◽

The Impact ◽

Moderating Role

PurposeThe basic assumption is that there is a symmetric relationship between review valence and rating, but what if review valence and rating were linked asymmetrically? There are few studies which have investigated the situations in which positive and negative online reviews exert different influences on ratings. This study considers brand strength as having an important moderating role because the average rating of existing reviews for a particular product is a heuristic cue for decision makers. Thus, the purpose of this paper is to argue that an asymmetric relationship between review content valence and numerical rating will depend on brand strength.Design/methodology/approachThe authors have conducted a sentiment analysis via text mining, using self-developed computer programs to retrieve a data set from the TripAdvisor website.FindingsThis study finds there is an asymmetric relationship between review valence (verbal) and numerical rating. The authors further find brand strength to have an important moderating role. For a stronger brand, negative review content will have a greater impact on numerical ratings than positive review content, while for a weaker brand, positive review content will have a greater impact on numerical ratings than negative review content.Practical implicationsMarketers could adopt sentiment analysis via text mining of online reviews as a valid measure or predictor of consumer satisfaction or numerical ratings. Strong brands should direct more attention to negative reviews, because in such reviews the negative impact transcends the positive. In contrast, weak brands should aim to exploit as many positive reviews as possible to minimize the impact of any negative reviews.Originality/valueThis study finds there is an asymmetric relationship between review valence (verbal) and numerical rating and considers brand strength to play an important moderating role. The authors have used real data from the TripAdvisor website, which allow people to express themselves in an unsolicited manner, and linked these with the results from the sentiment analysis.

Download Full-text

Mining numerical measure of consumers’ product evaluation expressed in words based on latent Dirichlet allocation

Journal of Modelling in Management ◽

10.1108/jm2-07-2021-0163 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Ziang Wang ◽

Feng Yang

Keyword(s):

Latent Dirichlet Allocation ◽

Product Evaluation ◽

Online Reviews ◽

Product Evaluations ◽

Product Attributes ◽

Data Set ◽

Content Type ◽

Face To Face ◽

Online Retailers ◽

Dirichlet Allocation

Purpose It has always been a hot topic for online retailers to obtain consumers’ product evaluations from massive online reviews. In the process of online shopping, there is no face-to-face interaction between online retailers and customers. After collecting online reviews left by customers, online retailers are eager to acquire answers to some questions. For example, which product attributes will attract consumers? Or which step brings a better experience to consumers during the process of shopping? This paper aims to associate the latent Dirichlet allocation (LDA) model with the consumers’ attitude and provides a method to calculate the numerical measure of consumers’ product evaluation expressed in each word. Design/methodology/approach First, all possible pairs of reviews are organized as a document to build the corpus. After that, latent topics of the traditional LDA model noted as the standard LDA model, are separated into shared and differential topics. Then, the authors associate the model with consumers’ attitudes toward each review which is distinguished as positive review and non-positive review. The product evaluation reflected in consumers’ binary attitude is expanded to each word that appeared in the corpus. Finally, a variational optimization is introduced to calculate parameters mentioned in the expanded LDA model. Findings The experiment’s result illustrates that the LDA model in the research noted as an expanded LDA model, can successfully assign sufficient probability with words related to products attributes or consumers’ product evaluation. Compared with the standard LDA model, the expanded model intended to assign higher probability with words, which have a higher ranking within each topic. Besides, the expanded model also has higher precision on the prediction set, which shows that breaking down the topics into two categories fits better on the data set than the standard LDA model. The product evaluation of each word is calculated by the expanded model and depicted at the end of the experiment. Originality/value This research provides a new method to calculate consumers’ product evaluation from reviews in the level of words. Words may be used to describe product attributes or consumers’ experiences in reviews. Assigning words with numerical measures can analyze consumers’ products evaluation quantitatively. Besides, words are labeled themselves, they can also be ranked if a numerical measure is given. Online retailers can benefit from the result for label choosing, advertising or product recommendation.

Download Full-text

Which cues influence the perceived usefulness and credibility of an online review? A conjoint analysis

Online Information Review ◽

10.1108/oir-09-2019-0287 ◽

2020 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Ana Isabel Lopes ◽

Nathalie Dens ◽

Patrick De Pelsmacker ◽

Freya De Keyzer

Keyword(s):

Conjoint Analysis ◽

Product Category ◽

Online Reviews ◽

Perceived Usefulness ◽

Online Review ◽

Large Set ◽

Relative Importance ◽

Writing Quality ◽

Content Type ◽

Argument Strength

PurposeThis study aims to assess the relative importance of the argument strength, argument sidedness, writing quality, number of arguments, rated review usefulness, summary review rating and number of reviews in determining the perceived usefulness and credibility of an online review. Additionally, the authors use insights from the elaboration likelihood model (ELM) to explore the effect of consumers' product category involvement on the cues' relative importance.Design/methodology/approachA conjoint analysis (N = 287) is used to study the relative importance of the seven previously mentioned attributes. A balanced orthogonal design generated eight cards that correspond to individual reviews. Respondents scored all eight cards in a random order for perceived usefulness and credibility.FindingsOverall, argument strength is the most important cue, while summary review rating and the number of reviews are the least important for perceived review usefulness and credibility. The number of arguments is more important for people who are more highly involved with the product, while writing quality and rated review usefulness are relatively more important for the low-involvement group.Originality/valueThis study provides a comprehensive test of how consumers perceive online reviews, as it the first to the authors’ knowledge to simultaneously investigate a large set of cues using conjoint analysis. This method allows for the implicit valuation (utility) of the individual cues, revealing the cues' relative importance, in a setting that comes close to a real-life context. Besides, insights of the ELM are used to understand how the relative importance of cues differs depending on the level of review readers' product category involvement.

Download Full-text