Recommendation and visualization of similar movies using minimum spanning dendrograms

Exploration of graph structures is an important topic in data mining and data visualization. This work presents a novel technique for visualizing neighbourhood and cluster relationships in graphs; we also show how this methodology can be used within the setting of a recommendation system. Our technique works by projecting the original object distances onto two dimensions while carefully retaining the ‘backbone’ of important distances. Cluster information is also overlayed on the same projected space. A significant advantage of our approach is that it can accommodate both metric and non-metric distance functions. Our methodology is applied to a visual recommender system for movies to allow easy exploration of the actor–movie bipartite graph. The work offers intuitive movie recommendations based on a selected pivot movie and allows the interactive discovery of related movies based on both textual and semantic features.

Download Full-text

Towards a Knowledge-Based Recommendation System for Genomics Visualization

10.31219/osf.io/t74da ◽

2020 ◽

Author(s):

Aditeya Pandey ◽

Sehi L’Yi ◽

Nils Gehlenborg

Keyword(s):

Data Visualization ◽

Biomedical Research ◽

Recommender System ◽

Recommendation System ◽

Formal Training ◽

Preliminary Design ◽

Knowledge Based ◽

Visualization Tools ◽

Tools And Techniques ◽

Selection Of

Analysis and interpretation of genomics data are the backbones of breakthroughs and discoveries in biomedical research. Visualization tools and techniques play a significant role in the workflow of genomics researchers, and they are regularly employed in the interpretation of genomics data. However, the vast majority of genomics researchers have little or no formal training in data visualization design. Therefore, they require guidance on designing effective visualizations for a given set of data and analysis tasks. In this poster, we present the methodology behind a recommender system for genomics data and our preliminary design of a knowledge-based recommendation system. The system allows genomics researchers to navigate through a selection of visualization options and identify the techniques that meet their preferences and requirements.

Download Full-text

Data Visualization Indicator Disease (Malaria, Dengue Fever, and Measles) in The Year 2012-2015

International Journal of New Media Technology ◽

10.31937/ijnmt.v4i2.785 ◽

2017 ◽

Vol 4 (2) ◽

pp. 87-93

Author(s):

Immanuel Luigi Da Gusta ◽

Johan Setiawan

Keyword(s):

Data Mining ◽

Human Resources ◽

Data Visualization ◽

Health Sector ◽

Medical Personnel ◽

Database Systems ◽

Process Time ◽

Acceptance Test ◽

Visual Data Mining ◽

The Government

The aim of this paper are: to create a data visualization that can assist the Government in evaluating the return on the development of health facilities in the region and province area in term of human resources for medical personnel, to help community knowing the amount of distribution of hospitals with medical personnel in the regional area and to map disease indicator in Indonesia. The issue of tackling health is still a major problem that is not resolved by the Government of Indonesia. There are three big things that become problems in the health sector in Indonesia: infrastructure has not been evenly distributed and less adequate, the lack of human resources professional health workforce, there is still a high number of deaths in the outbreak of infectious diseases. Data for the research are taken from BPS, in total 10,600 records after the Extract, Transform and Loading process. Time needed to convert several publications from PDF, to convert to CSV and then to MS Excel 3 weeks. The method used is Eight-step Data Visualization and Data Mining methodology. Tableau is chosen as a tool to create the data visualization because it can combine each dasboard inside a story interactive, easier for the user to analyze the data. The result is a story with 3 dashboards that can fulfill the requirement from BPS staff and has been tested with a satisfied result in the UAT (User Acceptance Test). Index Terms—Dashboard, data visualization, disease, malaria, Tableau REFERENCES [1] S. Arianto, Understanding of learning and others, 2008. [2] Rainer; Turban, Introduction to Information Systems, Danvers: John Wiley & Sons, Inc, 2007. [3] V. Friedman, Data Visualization Infographics, Monday Inspirition, 2008. [4] D. A. Keim, "Information Visualization and Visual Data Mining," IEEE Transactions on Visualization and Computer Graphics 8.1, pp. 1-8, 2002. [5] Connolly and Begg, Database Systems, Boston: Pearson Education, Inc, 2010. [6] E. Hariyanti, "Pengembangan Metodologi Pembangunan Information Dashboard Untuk Monitoring kinerja Organisasi," Konferensi dan Temu Nasional Teknologi Informasi dan Komunikasi untuk Indonesia, p. 1, 2008. [7] S. Darudiato, "Perancangan Data Warehouse Penjualan Untuk Mendukung Kebutuhan Informasi Eksekutif Cemerlang Skin Care," Seminar Nasional Informatika 2010, pp. E-353, 2010.

Download Full-text

Identifying Attack Models for Securing Cluster-Based Recommendation System

Recent Patents on Engineering ◽

10.2174/1872212114666200403091053 ◽

2020 ◽

Vol 14 ◽

Author(s):

Amreen Ahmad ◽

Tanvir Ahmad ◽

Ishita Tripathi

Keyword(s):

Collaborative Filtering ◽

Recommender System ◽

Recommendation System ◽

Relevant Information ◽

System Function ◽

Recommendation Algorithm ◽

Underlying Network ◽

Entire Dataset ◽

Attack Models ◽

Wide Usage

: The immense growth of information has led to the wide usage of recommender systems for retrieving relevant information. One of the widely used methods for recommendation is collaborative filtering. However, such methods suffer from two problems, scalability and sparsity. In the proposed research, the two issues of collaborative filtering are addressed and a cluster-based recommender system is proposed. For the identification of potential clusters from the underlying network, Shapley value concept is used, which divides users into different clusters. After that, the recommendation algorithm is performed in every respective cluster. The proposed system recommends an item to a specific user based on the ratings of the item’s different attributes. Thus, it reduces the running time of the overall algorithm, since it avoids the overhead of computation involved when the algorithm is executed over the entire dataset. Besides, the security of the recommender system is one of the major concerns nowadays. Attackers can come in the form of ordinary users and introduce bias in the system to force the system function that is advantageous for them. In this paper, we identify different attack models that could hamper the security of the proposed cluster-based recommender system. The efficiency of the proposed research is validated by conducting experiments on student dataset.

Download Full-text

The Application of Data Mining Algorithms in the Construction of Travel Recommendation System

The 2nd International Conference on Computing and Data Science ◽

10.1145/3448734.3450477 ◽

2021 ◽

Author(s):

Chao Lou ◽

Zhaonan Mu ◽

Mengzhu Liu

Keyword(s):

Data Mining ◽

Recommendation System ◽

Data Mining Algorithms ◽

Mining Algorithms ◽

Travel Recommendation

Download Full-text

Research on Personalized Recommendation System Based on Big Data Mining Technology

2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE) ◽

10.1109/icmcce51767.2020.00052 ◽

2020 ◽

Author(s):

Hongwei Li

Keyword(s):

Data Mining ◽

Big Data ◽

Recommendation System ◽

Personalized Recommendation ◽

Mining Technology ◽

Big Data Mining

Download Full-text

Application of Improved Recommendation System Based on Spark Platform in Big Data Analysis

Cybernetics and Information Technologies ◽

10.1515/cait-2016-0092 ◽

2016 ◽

Vol 16 (6) ◽

pp. 245-255 ◽

Cited By ~ 1

Author(s):

Li Xie ◽

Wenbo Zhou ◽

Yaosen Li

Keyword(s):

Big Data ◽

Recommender System ◽

Recommendation System ◽

Parallel Implementation ◽

Implementation Process ◽

Recommendation Algorithm ◽

Filtration Problem ◽

Information Filtration ◽

Similarity Information ◽

Better Than

Abstract In the era of big data, people have to face information filtration problem. For those cases when users do not or cannot express their demands clearly, recommender system can analyse user’s information more proactive and intelligent to filter out something users want. This property makes recommender system play a very important role in the field of e-commerce, social network and so on. The collaborative filtering recommendation algorithm based on Alternating Least Squares (ALS) is one of common algorithms using matrix factorization technique of recommendation system. In this paper, we design the parallel implementation process of the recommendation algorithm based on Spark platform and the related technology research of recommendation systems. Because of the shortcomings of the recommendation algorithm based on ALS model, a new loss function is designed. Before the model is trained, the similarity information of users and items is fused. The experimental results show that the performance of the proposed algorithm is better than that of algorithm based on ALS.

Download Full-text

Data Mining Techniques to Build A Recommender System

10.1109/iscsic54682.2021.00047 ◽

2021 ◽

Author(s):

Alicia Huidobro Espejel ◽

Francisco J. Cantu-Ortiz

Keyword(s):

Data Mining ◽

Recommender System ◽

Data Mining Techniques

Download Full-text

Application of Data Mining and Data Visualization in Strategic Management Data at Israel Technological University of Ecuador

Advances in Intelligent Systems and Computing - Advances in Emerging Trends and Technologies ◽

10.1007/978-3-030-32022-5_39 ◽

2019 ◽

pp. 419-431

Author(s):

Paul Francisco Baldeon Egas ◽

Miguel Alfredo Gaibor Saltos ◽

Renato Toasa

Keyword(s):

Data Mining ◽

Strategic Management ◽

Data Visualization ◽

Technological University

Download Full-text

A Scalable Approach for Data Mining – AHUIM

Webology ◽

10.14704/web/v18i1/web18029 ◽

2021 ◽

Vol 18 (1) ◽

pp. 92-103

Author(s):

Vandna Dahiya ◽

Sandeep Dalal

Keyword(s):

Data Mining ◽

Research Paper ◽

Large Datasets ◽

Novel Technique ◽

Itemset Mining ◽

Essential Form ◽

High Utility

Utility itemset mining, which finds the item sets based on utility factors, has established itself as an essential form of data mining. The utility is defined in terms of quantity and some interest factor. Various methods have been developed so far by the researchers to mine these itemsets but most of them are not scalable. In the present times, a scalable approach is required that can fulfill the budding needs of data mining. A Spark based novel technique has been recommended in this research paper for mining the data in a distributed way, called as Absolute High Utility Itemset Mining (AHUIM). The technique is suitable for small as well as large datasets. The performance of the technique is being measured for various parameters such as speed, scalability, and accuracy etc.

Download Full-text

A review of recommendation system research based on bipartite graph

MATEC Web of Conferences ◽

10.1051/matecconf/202133605010 ◽

2021 ◽

Vol 336 ◽

pp. 05010

Author(s):

Ziteng Wu ◽

Chengyun Song ◽

Yunqing Chen ◽

Lingxuan Li

Keyword(s):

Neural Network ◽

Bipartite Graph ◽

Recommendation System ◽

Research Priorities ◽

Research Progress ◽

Full Paper ◽

Data Sparseness ◽

The Past ◽

Interaction History ◽

Network Processing

The interaction history between users and items is usually stored and displayed in the form of bipartite graphs. Neural network recommendation based on the user-item bipartite graph has a significant effect on alleviating the long-standing data sparseness and cold start of the recommendation system. The whole paper is based on the bipartite graph. An review of the recommendation system of graphs summarizes the three characteristics of graph neural network processing bipartite graph data in the recommendation field: interchangeability, Multi-hop transportability, and strong interpretability. The biggest contribution of the full paper is that it summarizes the general framework of graph neural network processing bipartite graph recommendation from the models with the best recommendation effect in the past three years: embedding layer, propagation update layer, and prediction layer. Although there are subtle differences between different models, they are all this framework can be applied, and different models can be regarded as variants of this general model, that is, other models are fine-tuned on the basis of this framework. At the end of the paper, the latest research progress is introduced, and the main challenges and research priorities that will be faced in the future are pointed out.

Download Full-text