Recommendation and visualization of similar movies using minimum spanning dendrograms

2012 ◽  
Vol 12 (1) ◽  
pp. 85-101 ◽  
Author(s):  
Michail Vlachos ◽  
Daniel Svonava

Exploration of graph structures is an important topic in data mining and data visualization. This work presents a novel technique for visualizing neighbourhood and cluster relationships in graphs; we also show how this methodology can be used within the setting of a recommendation system. Our technique works by projecting the original object distances onto two dimensions while carefully retaining the ‘backbone’ of important distances. Cluster information is also overlaid on the same projected space. A significant advantage of our approach is that it can accommodate both metric and non-metric distance functions. Our methodology is applied to a visual recommender system for movies to allow easy exploration of the actor–movie bipartite graph. The work offers intuitive movie recommendations based on a selected pivot movie and allows the interactive discovery of related movies based on both textual and semantic features.
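As a rough illustration of what a distance 'backbone' can look like, the sketch below extracts a minimum spanning tree from a pairwise distance matrix with Prim's algorithm. This is not the authors' implementation: the function name `minimum_spanning_tree` and the toy four-movie distance matrix are assumptions made for the example. The tree only compares distances, so it does not require the triangle inequality, which is consistent with the abstract's point about non-metric distance functions.

```python
def minimum_spanning_tree(dist):
    """Prim's algorithm on a dense, symmetric distance matrix.

    Returns the tree as a list of (parent, child, distance) edges.
    """
    n = len(dist)
    if n == 0:
        return []
    in_tree = [False] * n
    best = [float("inf")] * n   # cheapest known distance from each node to the tree
    parent = [-1] * n
    best[0] = 0.0               # grow the tree from node 0
    edges = []
    for _ in range(n):
        # Pick the node outside the tree that is closest to it.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if parent[u] != -1:
            edges.append((parent[u], u, dist[parent[u]][u]))
        # Update the cheapest connection for the remaining nodes.
        for v in range(n):
            if not in_tree[v] and dist[u][v] < best[v]:
                best[v] = dist[u][v]
                parent[v] = u
    return edges


# Hypothetical pairwise distances between four movies (illustrative values only).
dist = [
    [0.0, 0.2, 0.9, 0.8],
    [0.2, 0.0, 0.4, 0.7],
    [0.9, 0.4, 0.0, 0.3],
    [0.8, 0.7, 0.3, 0.0],
]
backbone = minimum_spanning_tree(dist)
```

A two-dimensional layout would then be asked to preserve these `backbone` distances first, with the remaining pairwise distances treated as secondary.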

2020 ◽  
Author(s):  
Aditeya Pandey ◽  
Sehi L’Yi ◽  
Nils Gehlenborg

Analysis and interpretation of genomics data are the backbones of breakthroughs and discoveries in biomedical research. Visualization tools and techniques play a significant role in the workflow of genomics researchers, and they are regularly employed in the interpretation of genomics data. However, the vast majority of genomics researchers have little or no formal training in data visualization design. Therefore, they require guidance on designing effective visualizations for a given set of data and analysis tasks. In this poster, we present the methodology behind a recommender system for genomics data and our preliminary design of a knowledge-based recommendation system. The system allows genomics researchers to navigate through a selection of visualization options and identify the techniques that meet their preferences and requirements.


2017 ◽  
Vol 4 (2) ◽  
pp. 87-93
Author(s):  
Immanuel Luigi Da Gusta ◽  
Johan Setiawan

The aims of this paper are: to create a data visualization that can assist the Government in evaluating the return on the development of health facilities at the regional and provincial level in terms of human resources for medical personnel; to help the community understand the distribution of hospitals and medical personnel across regions; and to map disease indicators in Indonesia. Health remains a major problem that the Government of Indonesia has not resolved. Three major problems affect the health sector in Indonesia: infrastructure is unevenly distributed and inadequate, there is a shortage of professional health workers, and the number of deaths in outbreaks of infectious diseases is still high. Data for the research were taken from BPS, totalling 10,600 records after the Extract, Transform and Load process. Converting several publications from PDF to CSV and then to MS Excel took 3 weeks. The method used is the Eight-step Data Visualization and Data Mining methodology. Tableau was chosen as the tool for creating the data visualization because it can combine the dashboards into an interactive story, which makes it easier for the user to analyze the data. The result is a story with 3 dashboards that fulfills the requirements of BPS staff and was tested with satisfactory results in the UAT (User Acceptance Test). Index Terms: Dashboard, data visualization, disease, malaria, Tableau


2020 ◽  
Vol 14 ◽  
Author(s):  
Amreen Ahmad ◽  
Tanvir Ahmad ◽  
Ishita Tripathi

The immense growth of information has led to the wide usage of recommender systems for retrieving relevant information. One of the most widely used methods for recommendation is collaborative filtering. However, such methods suffer from two problems: scalability and sparsity. This research addresses both issues and proposes a cluster-based recommender system. To identify potential clusters in the underlying network, the Shapley value concept is used to divide users into different clusters. The recommendation algorithm is then run within each cluster. The proposed system recommends an item to a specific user based on the ratings of the item’s different attributes. This reduces the running time of the overall algorithm, since it avoids the computational overhead of executing the algorithm over the entire dataset. In addition, the security of recommender systems is a major concern nowadays. Attackers can pose as ordinary users and introduce bias to push the system to behave in a way that benefits them. In this paper, we identify different attack models that could compromise the security of the proposed cluster-based recommender system. The efficiency of the proposed approach is validated by experiments on a student dataset.


2016 ◽  
Vol 16 (6) ◽  
pp. 245-255 ◽  
Author(s):  
Li Xie ◽  
Wenbo Zhou ◽  
Yaosen Li

In the era of big data, people face the problem of filtering information. When users do not or cannot express their demands clearly, a recommender system can analyse user information more proactively and intelligently to select what users want. This property gives recommender systems a very important role in fields such as e-commerce and social networks. The collaborative filtering recommendation algorithm based on Alternating Least Squares (ALS) is one of the common matrix factorization algorithms for recommender systems. In this paper, we design a parallel implementation of the recommendation algorithm on the Spark platform and review related research on recommender system technology. To address the shortcomings of the recommendation algorithm based on the ALS model, a new loss function is designed. Before the model is trained, the similarity information of users and items is fused. The experimental results show that the proposed algorithm performs better than the algorithm based on ALS.
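To make the baseline concrete, the following is a minimal single-machine sketch of standard ALS, restricted to rank-1 factors so that each least-squares step has a closed form. It is not the paper's Spark implementation and does not include the new loss function or the fused similarity information; the function name `als_rank1`, the regularisation value, and the toy ratings are all assumptions for illustration.

```python
def als_rank1(ratings, n_users, n_items, reg=0.01, iters=20):
    """Rank-1 ALS on observed ratings.

    ratings: dict mapping (user, item) -> observed rating.
    Returns (u, v) such that u[i] * v[j] approximates the rating of item j by user i.
    """
    u = [1.0] * n_users
    v = [1.0] * n_items
    for _ in range(iters):
        # Hold item factors fixed; each user factor is a 1-D ridge regression.
        for i in range(n_users):
            num = sum(r * v[j] for (a, j), r in ratings.items() if a == i)
            den = reg + sum(v[j] ** 2 for (a, j) in ratings if a == i)
            u[i] = num / den
        # Hold user factors fixed; solve for each item factor the same way.
        for j in range(n_items):
            num = sum(r * u[i] for (i, b), r in ratings.items() if b == j)
            den = reg + sum(u[i] ** 2 for (i, b) in ratings if b == j)
            v[j] = num / den
    return u, v


# Hypothetical ratings: user 1 has not rated item 1.
ratings = {(0, 0): 2.0, (0, 1): 4.0, (1, 0): 1.0}
u, v = als_rank1(ratings, n_users=2, n_items=2)
predicted = u[1] * v[1]   # estimate for the unobserved (user 1, item 1) entry
```

Each user update depends only on that user's own ratings, and likewise for items, which is what makes the two alternating steps straightforward to parallelise on a platform such as Spark.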


2021 ◽  
Author(s):  
Alicia Huidobro Espejel ◽  
Francisco J. Cantu-Ortiz

Webology ◽  
2021 ◽  
Vol 18 (1) ◽  
pp. 92-103
Author(s):  
Vandna Dahiya ◽  
Sandeep Dalal

Utility itemset mining, which finds itemsets based on utility factors, has established itself as an essential form of data mining. Utility is defined in terms of quantity and an interest factor. Researchers have developed various methods to mine these itemsets, but most of them are not scalable. A scalable approach is now required to meet the growing needs of data mining. This paper proposes a novel Spark-based technique, called Absolute High Utility Itemset Mining (AHUIM), for mining the data in a distributed way. The technique is suitable for small as well as large datasets. Its performance is measured for parameters such as speed, scalability, and accuracy.
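The utility definition can be illustrated with a small brute-force sketch, assuming the interest factor is a per-item unit profit. This is not AHUIM and is not distributed: it enumerates every itemset, which is exactly the cost that scalable methods try to avoid. The names `itemset_utility` and `high_utility_itemsets` and the toy transactions are hypothetical.

```python
from itertools import combinations


def itemset_utility(itemset, transactions, unit_profit):
    """Sum of quantity * unit profit over transactions containing every item."""
    total = 0
    for tx in transactions:  # tx maps item -> purchased quantity
        if all(item in tx for item in itemset):
            total += sum(tx[item] * unit_profit[item] for item in itemset)
    return total


def high_utility_itemsets(transactions, unit_profit, min_utility):
    """Exhaustively return every itemset whose utility is at least min_utility."""
    items = sorted(unit_profit)
    result = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            utility = itemset_utility(itemset, transactions, unit_profit)
            if utility >= min_utility:
                result[itemset] = utility
    return result


# Hypothetical transactions and unit profits (illustrative values only).
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"a": 1, "b": 1, "c": 3},
]
unit_profit = {"a": 5, "b": 2, "c": 1}
hui = high_utility_itemsets(transactions, unit_profit, min_utility=12)
# hui == {("a",): 20, ("a", "b"): 16, ("a", "c"): 19}
```

Note that `("a", "b")` qualifies even though `("b",)` alone does not: utility is not anti-monotone the way support is, so frequency-style pruning cannot be applied directly.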


2021 ◽  
Vol 336 ◽  
pp. 05010
Author(s):  
Ziteng Wu ◽  
Chengyun Song ◽  
Yunqing Chen ◽  
Lingxuan Li

The interaction history between users and items is usually stored and displayed in the form of bipartite graphs. Neural network recommendation based on the user-item bipartite graph is effective at alleviating the long-standing data sparsity and cold-start problems of recommendation systems. This paper is based entirely on the bipartite graph. A review of graph-based recommendation systems summarizes three characteristics of graph neural networks processing bipartite graph data in the recommendation field: interchangeability, multi-hop transportability, and strong interpretability. The main contribution of the paper is a general framework for graph neural network processing of bipartite graph recommendation, summarized from the models with the best recommendation performance of the past three years: an embedding layer, a propagation update layer, and a prediction layer. Although there are subtle differences between models, this framework applies to all of them, and the different models can be regarded as variants of this general model, that is, fine-tuned versions of the framework. At the end of the paper, the latest research progress is introduced, and the main challenges and research priorities for the future are pointed out.
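The three-layer framework can be sketched in a few lines, under strong simplifying assumptions: embeddings are fixed toy vectors rather than learned parameters, propagation is a single round of unweighted neighbour averaging, and prediction is a dot product. The names `propagate` and `predict` and all values are hypothetical; the models surveyed in the paper differ in how each layer is defined and trained.

```python
def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[k] for v in vectors) / len(vectors) for k in range(dim)]


def propagate(user_emb, item_emb, interactions):
    """Propagation update layer: each node takes the mean of its neighbours."""
    user_nbrs = {u: [] for u in user_emb}
    item_nbrs = {i: [] for i in item_emb}
    for u, i in interactions:
        user_nbrs[u].append(item_emb[i])
        item_nbrs[i].append(user_emb[u])
    # Nodes without neighbours keep their current embedding.
    new_user = {u: mean_vector(n) if n else list(user_emb[u])
                for u, n in user_nbrs.items()}
    new_item = {i: mean_vector(n) if n else list(item_emb[i])
                for i, n in item_nbrs.items()}
    return new_user, new_item


def predict(user_vec, item_vec):
    """Prediction layer: dot product of the final user and item vectors."""
    return sum(a * b for a, b in zip(user_vec, item_vec))


# Embedding layer: a lookup table per node (fixed toy values, normally learned).
user_emb = {"u1": [1.0, 0.0], "u2": [0.0, 1.0]}
item_emb = {"i1": [1.0, 0.0], "i2": [0.0, 1.0], "i3": [1.0, 1.0]}
# Edges of the user-item bipartite graph.
interactions = [("u1", "i1"), ("u1", "i2"), ("u2", "i2"), ("u2", "i3")]

new_user, new_item = propagate(user_emb, item_emb, interactions)
score = predict(new_user["u1"], new_item["i3"])   # u1 has not interacted with i3
```

Calling `propagate` repeatedly lets information travel across more hops of the graph, which is the intuition behind the multi-hop property mentioned in the abstract.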

