JUMRv1: A Sentiment Analysis Dataset for Movie Recommendation

Machine learning is now applied in nearly every field, from quality testing of materials to building powerful computer vision tools. One recent application is the recommendation system, which suggests products to users based on their preferences. In this paper, we focus on one kind of recommendation system: movie recommendation. We use user reviews of a movie to establish a general outlook on it and then use that outlook to recommend the movie to other users. However, the sheer number of available reviews overwhelms even sophisticated review systems. Consequently, there is a need for a method that extracts meaningful information from the available reviews and uses it to classify each review and predict its sentiment. In a typical scenario, a review can be positive, negative, or indifferent about a movie. However, the available research in the field mainly treats this as a two-class classification problem: positive and negative. The most popular work in this field was performed on the Stanford and Rotten Tomatoes datasets, which are somewhat outdated. Our work is based on self-scraped reviews from the IMDB website, which we have annotated into one of three classes: positive, negative, and neutral. Our dataset is called JUMRv1 (Jadavpur University Movie Recommendation dataset version 1). For the evaluation of JUMRv1, we took an exhaustive approach, testing various combinations of word embeddings, feature selection methods, and classifiers. We also analysed any performance trends we found and attempted to explain them. Our work sets a benchmark for movie recommendation systems based on the newly developed dataset using three-class sentiment classification.

User Centric and Collaborative Movie Recommendation System Under Customized Platforms

2021 3rd International Conference on Signal Processing and Communication (ICPSC), 2021

DOI: 10.1109/icspc51351.2021.9451672

Author(s): Souptik Saha, S. Ramamoorthy, Eisha Raghav

Keyword(s): Recommendation System, Movie Recommendation, User Centric

Systematic analysis of Movie Recommendation System through Sentiment Analysis

2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), 2021

DOI: 10.1109/icais50930.2021.9395854

Author(s): R Lavanya, B. Bharathi

Keyword(s): Sentiment Analysis, Recommendation System, Systematic Analysis, Movie Recommendation

A Personalized Movie Recommendation System based on LSTM-CNN

2020 2nd International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), 2020

DOI: 10.1109/mlbdbi51377.2020.00102

Author(s): Haili Wang, Nana Lou, Zhenlin Chao

Keyword(s): Recommendation System, Movie Recommendation

User profile correlation-based similarity (UPCSim) algorithm in movie recommendation system

Journal of Big Data, Vol 8 (1), 2021

DOI: 10.1186/s40537-021-00425-x

Author(s): Triyanna Widiyaningtyas, Indriana Hidayah, Teguh B. Adji

Keyword(s): Collaborative Filtering, Recommendation System, User Behavior, Correlation Coefficients, User Profile, Profile Data, Similarity Algorithm, Previous Algorithm, Movie Recommendation, Recommendation Accuracy

Abstract: Collaborative filtering is one of the most widely used recommendation system approaches. One issue in collaborative filtering is how to use a similarity algorithm to increase the accuracy of the recommendation system. Most recently, a similarity algorithm that combines the user rating value and the user behavior value has been proposed. The user behavior value is obtained from the user score probability in assessing the genre data. The problem with that algorithm is that it considers only genre data when capturing the user behavior value. Therefore, this study proposes a new similarity algorithm, called User Profile Correlation-based Similarity (UPCSim), that examines both the genre data and the user profile data, namely age, gender, occupation, and location. All the user profile data are used to find the weights of the similarities of user rating value and user behavior value. The weights of both similarities are obtained by calculating the correlation coefficients between the user profile data and the user rating or behavior values. An experiment shows that the UPCSim algorithm outperforms the previous algorithm on recommendation accuracy, reducing MAE by 1.64% and RMSE by 1.4%.
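The quantities named in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the UPCSim formulas, so the normalized weighted sum in `combined_similarity` is an assumption, and all function names here are hypothetical. The sketch shows a Pearson correlation coefficient (one common way to compute a correlation-based weight), a weighted combination of a rating similarity and a behavior similarity, and the two reported error metrics, MAE and RMSE.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # correlation is undefined for a constant sequence
    return cov / math.sqrt(var_x * var_y)


def combined_similarity(sim_rating, sim_behavior, w_rating, w_behavior):
    """Weighted mix of two similarities.

    Assumption: a normalized weighted sum. The abstract only says that
    the weights come from correlation coefficients, not how they are
    combined.
    """
    total = w_rating + w_behavior
    if total == 0:
        return 0.0
    return (w_rating * sim_rating + w_behavior * sim_behavior) / total


def mae(actual, predicted):
    """Mean absolute error between actual and predicted ratings."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error between actual and predicted ratings."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))
```

In this sketch, `w_rating` and `w_behavior` would be derived from `pearson` applied to profile attributes (age, gender, occupation, location) against rating or behavior values; how the paper aggregates those per-attribute correlations is not stated in the abstract.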
