scholarly journals Wine Ontology Influence in a Recommendation System

2021 ◽  
Vol 5 (2) ◽  
pp. 16
Author(s):  
Luís Oliveira ◽  
Rodrigo Rocha Silva ◽  
Jorge Bernardino

Wine is the second most popular alcoholic drink in the world behind beer. With the rise of e-commerce, recommendation systems have become a very important factor in the success of business. Recommendation systems analyze metadata to predict if, for example, a user will recommend a product. The metadata consist mostly of former reviews or web traffic from the same user. For this reason, we investigate what would happen if the information analyzed by a recommendation system was insufficient. In this paper, we explore the effects of a new wine ontology in a recommendation system. We created our own wine ontology and then made two sets of tests for each dataset. In both sets of tests, we applied four machine learning clustering algorithms that had the objective of predicting if a user recommends a wine product. The only difference between each set of tests is the attributes contained in the dataset. In the first set of tests, the datasets were influenced by the ontology, and in the second set, the only information about a wine product is its name. We compared the two test sets’ results and observed that there was a significant increase in classification accuracy when using a dataset with the proposed ontology. We demonstrate the general applicability of the methodology to other cases, applying our proposal to an Amazon product review dataset.

Author(s):  
Gandhali Malve ◽  
Lajree Lohar ◽  
Tanay Malviya ◽  
Shirish Sabnis

Today the amount of information in the internet growth very rapidly and people need some instruments to find and access appropriate information. One of such tools is called recommendation system. Recommendation systems help to navigate quickly and receive necessary information. Many of us find it difficult to decide which movie to watch and so we decided to make a recommender system for us to better judge which movie we are more likely to love. In this project we are going to use Machine Learning Algorithms to recommend movies to users based on genres and user ratings. Recommendation system attempt to predict the preference or rating that a user would give to an item.


2019 ◽  
Vol 13 ◽  
pp. 267-271
Author(s):  
Jacek Bielecki ◽  
Oskar Ceglarski ◽  
Maria Skublewska-Paszkowska

Recommendation systems are class of information filter applications whose main goal is to provide personalized recommendations. The main goal of the research was to compare two ways of creating personalized recommendations. The recommendation system was built on the basis of a content-based cognitive filtering method and on the basis of a collaborative filtering method based on user ratings. The conclusions of the research show the advantages and disadvantages of both methods.


Symmetry ◽  
2020 ◽  
Vol 12 (9) ◽  
pp. 1566 ◽  
Author(s):  
Zeinab Shahbazi ◽  
Debapriya Hazra ◽  
Sejoon Park ◽  
Yung Cheol Byun

With the spread of COVID-19, the “untact” culture in South Korea is expanding and customers are increasingly seeking for online services. A recommendation system serves as a decision-making indicator that helps users by suggesting items to be purchased in the future by exploring the symmetry between multiple user activity characteristics. A plethora of approaches are employed by the scientific community to design recommendation systems, including collaborative filtering, stereotyping, and content-based filtering, etc. The current paradigm of recommendation systems favors collaborative filtering due to its significant potential to closely capture the interest of a user as compared to other approaches. The collaborative filtering harnesses features like user-profile details, visited pages, and click information to determine the interest of a user, thereby recommending the items that are related to the user’s interest. The existing collaborative filtering approaches exploit implicit and explicit features and report either good classification or prediction outcome. These systems fail to exhibit good results for both measures at the same time. We believe that avoiding the recommendation of those items that have already been purchased could contribute to overcoming the said issue. In this study, we present a collaborative filtering-based algorithm to tackle big data of user with symmetric purchasing order and repetitive purchased products. The proposed algorithm relies on combining extreme gradient boosting machine learning architecture with word2vec mechanism to explore the purchased products based on the click patterns of users. Our algorithm improves the accuracy of predicting the relevant products to be recommended to the customers that are likely to be bought. The results are evaluated on the dataset that contains click-based features of users from an online shopping mall in Jeju Island, South Korea. We have evaluated Mean Absolute Error, Mean Square Error, and Root Mean Square Error for our proposed methodology and also other machine learning algorithms. Our proposed model generated the least error rate and enhanced the prediction accuracy of the recommendation system compared to other traditional approaches.


2016 ◽  
Vol 2016 ◽  
pp. 1-6 ◽  
Author(s):  
Malik Yousef ◽  
Müşerref Duygu Saçar Demirci ◽  
Waleed Khalifa ◽  
Jens Allmer

MicroRNAs (miRNAs) are short RNA sequences involved in posttranscriptional gene regulation. Their experimental analysis is complicated and, therefore, needs to be supplemented with computational miRNA detection. Currently computational miRNA detection is mainly performed using machine learning and in particular two-class classification. For machine learning, the miRNAs need to be parametrized and more than 700 features have been described. Positive training examples for machine learning are readily available, but negative data is hard to come by. Therefore, it seems prerogative to use one-class classification instead of two-class classification. Previously, we were able to almost reach two-class classification accuracy using one-class classifiers. In this work, we employ feature selection procedures in conjunction with one-class classification and show that there is up to 36% difference in accuracy among these feature selection methods. The best feature set allowed the training of a one-class classifier which achieved an average accuracy of ~95.6% thereby outperforming previous two-class-based plant miRNA detection approaches by about 0.5%. We believe that this can be improved upon in the future by rigorous filtering of the positive training examples and by improving current feature clustering algorithms to better target pre-miRNA feature selection.


Author(s):  
A.Y. Zhubatkhan ◽  
Z.A. Buribayev ◽  
S.S. Aubakirov ◽  
M.D. Dilmagambetova ◽  
S.A. Ryskulbek

The trend of the Internet makes the presentation of the right content for the right user inevitable. To this end, recommendation systems are used in areas such as music, books, movies, travel planning, e-commerce, education, and more. One of the most popular recommendation systems in the world is Netflix, which generated record profits during quarantine in the first quartile of 2020. The systematic approach of recommendations is based on the history of user selections, likes and reviews, each of which is interpreted to predict future user selections. This article provides a meaningful analysis of various recommendation systems, such as content-based, collaborative filtering and popularity. We reviewed 7 articles published from 2005 to 2019 to discuss issues related to existing models. The purpose of this article is to compare machine learning algorithms in the Surprise library for a recommendation system. Recommendation system has been implemented and quality has been evaluated using the MAE and RMSE metrics.


2020 ◽  
Author(s):  
Daniel B Hier ◽  
Jonathan Kopel ◽  
Steven U Brint ◽  
Donald C Wunsch II ◽  
Gayla R Olbricht ◽  
...  

Abstract Background: When patient distances are calculated based on phenotype, signs and symptoms are often converted to concepts from an ontological hierarchy. There is controversy as to whether patient distance metrics that consider the semantic similarity between concepts can outperform standard patient distance metrics that are agnostic to concept similarity. The choice of distance metric often dominates the performance of classification or clustering algorithms. Our objective was to determine if semantically augmented distance metrics would outperform standard metrics on machine learning tasks. Methods: We converted the neurological signs and symptoms from 382 published neurology cases into sets of concepts with corresponding machine-readable codes. We calculated inter-patient distances by four different metrics (cosine distance, a semantically augmented cosine distance, Jaccard distance, and a semantically augmented bipartite distance). Semantic augmentation for two of the metrics depended on concept similarities from a hierarchical neuro-ontology. For machine learning algorithms, we used the patient diagnosis as the ground truth label and patient signs and symptoms as the machine learning features . We assessed classification accuracy for four classifiers and cluster quality for two clustering algorithms for each of the distance metrics. Results: Inter-patient distances were smaller when the distance metric was semantically augmented. Classification accuracy and cluster quality were not significantly different by distance metric. Conclusion: Using patient diagnoses as labels and patient signs and symptoms as features, we did not find improved classification accuracy or improved cluster quality with semantically augmented distance metrics. Semantic augmentation reduced inter-patient distances but did not improve machine learning performance.


2020 ◽  
Author(s):  
Daniel B Hier ◽  
Jonathan Kopel ◽  
Steven U Brint ◽  
Donald C Wunsch II ◽  
Gayla R Olbricht ◽  
...  

Abstract Background: Patient distances can be calculated based on signs and symptoms derived from an ontological hierarchy. There is controversy as to whether patient distance metrics that consider the semantic similarity between concepts can outperform standard patient distance metrics that are agnostic to concept similarity. The choice of distance metric can dominate the performance of classification or clustering algorithms. Our objective was to determine if semantically augmented distance metrics would outperform standard metrics on machine learning tasks. Methods: We converted the neurological findings from 382 published neurology cases into sets of concepts with corresponding machine-readable codes. We calculated patient distances by four different metrics (cosine distance, a semantically augmented cosine distance, Jaccard distance, and a semantically augmented bipartite distance). Semantic augmentation for two of the metrics depended on concept similarities from a hierarchical neuro-ontology. For machine learning algorithms, we used the patient diagnosis as the ground truth label and patient findings as machine learning features. We assessed classification accuracy for four classifiers and cluster quality for two clustering algorithms for each of the distance metrics.Results: Inter-patient distances were smaller when the distance metric was semantically augmented. Classification accuracy and cluster quality were not significantly different by distance metric.Conclusion: Although semantic augmentation reduced inter-patient distances, we did not find improved classification accuracy or improved cluster quality with semantically augmented patient distance metrics.


2020 ◽  
Author(s):  
Daniel B Hier ◽  
Jonathan Kopel ◽  
Steven U Brint ◽  
Donald C Wunsch II ◽  
Gayla R Olbricht ◽  
...  

Abstract Background: Patient distances can be calculated based on signs and symptoms derived from an ontological hierarchy. There is controversy as to whether patient distance metrics that consider the semantic similarity between concepts can outperform standard patient distance metrics that are agnostic to concept similarity. The choice of distance metric can dominate the performance of classification or clustering algorithms. Our objective was to determine if semantically augmented distance metrics would outperform standard metrics on machine learning tasks.Methods: We converted the neurological findings from 382 published neurology cases into sets of concepts with corresponding machine-readable codes. We calculated patient distances by four different metrics (cosine distance, a semantically augmented cosine distance, Jaccard distance, and a semantically augmented bipartite distance). Semantic augmentation for two of the metrics depended on concept similarities from a hierarchical neuro-ontology. For machine learning algorithms, we used the patient diagnosis as the ground truth label and patient findings as machine learning features . We assessed classification accuracy for four classifiers and cluster quality for two clustering algorithms for each of the distance metrics.Results: Inter-patient distances were smaller when the distance metric was semantically augmented. Classification accuracy and cluster quality were not significantly different by distance metric.Conclusion: Although semantic augmentation reduced inter-patient distances, we did not find improved classification accuracy or improved cluster quality with semantically augmented patient distance metrics when applied to a dataset of neurology patients. Further work is needed to assess the utility of semantically augmented patient distances.


2018 ◽  
Vol 7 (2.24) ◽  
pp. 417
Author(s):  
Ratna Sathappan ◽  
Tholu Sai Indira ◽  
A Meenapriyadarsini

Internet usage has been at an all-time high from 2000’s vintage years. The people who have access to the internet use it for numerous reasons such as social networking, marketing, promoting, enhancing businesses, consultancy, research, gaming and the list goes on. In the recent years, Review websites have flourished, where people share their opinion about a product, with an increase in response rate and reliability. Recommendations are made by mining data from review websites. Traditional Recommendation systems are limited as they only consider certain metrics, such as product purchase details, product category. Recommendation systems are yet to gain popularity in the medical field. These days most patients are unable to figure out the medication that works in healing them in the best way possible, hence they turn to review websites in order to obtain a second opinion on the prescribed medication. In this work, we have developed a smart recommendation system for off-the Shelf Medical Drugs using machine learning and data analytics based on patient feedback. The patient feedback is unstructured data which is processed using data analytic tools. After which machine learning is used to recommend the best fit and compare the drugs. In this work, we predict the impact of a drug/ medicine on the patient to whom the medication was prescribed, using data mining techniques. Firstly, we detect the user’s polarity (positive/ negative/neutral) based on the patient feedback for a certain drug using sentiment analysis and opinion mining following which we use machine learning algorithms to track sentiment variation and to make a recommendation based on user polarity


Connectivity ◽  
2020 ◽  
Vol 148 (6) ◽  
Author(s):  
D. D. Demydov ◽  
◽  
I. S. Shcherbyna ◽  
N. A. Trintina ◽  
A. M. Shtimmerman ◽  
...  

The article analyzes the method of singular value decomposition (SVD) as an effective way to build a recommender system. With the development of information technologies and their introduction into public life, there is a need to search for accentuated information in conditions of uncertainty. To solve such problems, recently created intelligent recommendation systems [1]. The popularity of recommendation systems is growing in every segment of goods and services, in particular music. From a socio-economic point of view, such systems are the main tool for the dissemination of new compositions in the field of music promotes the promotion of these compositions in accordance with the preferences of the target audience and encourages users to purchase new music tracks. In addition, such systems significantly reduce the time and facilitate the search for appropriate musical compositions under conditions of uncertainty. The main problem in developing machine learning algorithms is the lack of an individual approach to each user. All recommendations are based on the statistical behavior of the majority, resulting in a percentage of people who do not receive recommendations that match their personal preferences. In the case of a separate analysis of each of the users and the implementation of recommendations in accordance with their personal use of Internet resources, the number of quality and more accurate proposals in the list of recommendations would increase significantly. Machine learning methods are effectively used to build recommendation systems, namely: the k-nearest neighbors method, the Bayesian algorithm and the singular matrix decomposition method. Among these methods, the SVD method is the most widely used in practice. This method is used to reduce the number of non-significant data set factors. Factors in recommendation systems are properties that describe the user or subject. In music recommendation systems, this can be a genre. SVD reduces the dimension of the matrix by removing its hidden factors.


Sign in / Sign up

Export Citation Format

Share Document