Collaborative filtering recommendation algorithm based on class correlation distance

Author(s):  
Hanfei Zhang ◽  
Yumei Jian ◽  
Ping Zhou

A collaborative filtering recommendation algorithm based on class correlation distance is proposed to address the problems of category judgment and distance metric in traditional collaborative filtering recommendation algorithms. It exploits both the distance between same-class samples and the class correlation distance. First, the class correlation distance between the training samples is calculated and stored. Second, the K nearest neighbor samples are selected, and the class correlation distance of the training samples and the difference ratio between the test samples and training samples are calculated. Finally, samples are classified according to the difference ratio. The experimental results show that the algorithm, combined with user rating preference, achieves a lower MAE value and a better recommendation effect. As the K value changes, the CCDKNN algorithm clearly outperforms the KNN and DWKNN algorithms, and its accuracy is more stable. The algorithm improves the accuracy of similarity and prediction and performs better than the traditional algorithm.
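The abstract compares CCDKNN against plain KNN and distance-weighted KNN (DWKNN) but does not define the class correlation distance itself. The sketch below therefore illustrates only the two baselines, under stated assumptions: Euclidean distance, and inverse-distance vote weights for DWKNN (other weighting schemes exist). The function names and the toy data are hypothetical.

```python
import math
from collections import Counter, defaultdict

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def k_nearest(train, query, k):
    """Return the k (distance, label) pairs closest to the query."""
    scored = [(euclidean(features, query), label) for features, label in train]
    return sorted(scored, key=lambda pair: pair[0])[:k]

def knn_predict(train, query, k):
    """Plain KNN: unweighted majority vote among the k nearest samples."""
    votes = Counter(label for _, label in k_nearest(train, query, k))
    return votes.most_common(1)[0][0]

def dwknn_predict(train, query, k, eps=1e-9):
    """Distance-weighted KNN (assumed weighting: 1 / (distance + eps))."""
    weights = defaultdict(float)
    for dist, label in k_nearest(train, query, k):
        weights[label] += 1.0 / (dist + eps)
    return max(weights, key=weights.get)

# Toy data: two samples of class "a" near the query, three of class "b" far away.
train = [((0, 0), "a"), ((0, 1), "a"), ((5, 5), "b"), ((5, 6), "b"), ((6, 5), "b")]
query = (1, 1)
plain = knn_predict(train, query, k=5)       # the far majority wins the vote
weighted = dwknn_predict(train, query, k=5)  # the close neighbors dominate
```

With a large K the unweighted vote can be swayed by distant samples, which is one reason a method's accuracy may vary with K.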

2019 ◽  
Vol 2019 ◽  
pp. 1-7 ◽  
Author(s):  
Sheng-wei Fei

This study presents a bearing fault diagnosis method, abbreviated VPS-IBPSOKNN, that combines variational mode decomposition (VMD)-phase space reconstruction (PSR)-singular value decomposition (SVD) with improved binary particle swarm optimization (IBPSO)-K-nearest neighbor (KNN). VMD-PSR-SVD (VPS) is used to obtain the features of the bearing vibration signal (BVS), and IBPSO is used to select the parameter K of KNN. In IBPSO, the calculation of the next position of each particle is improved to fit the evolution of the particles. For comparison, traditional KNN with different values of K is trained on samples whose features are based on VMD-SVD (VS-KNN). The experimental results demonstrate that the bearing fault diagnosis ability of VPS-IBPSOKNN is better than that of VS-KNN, and it can be concluded that bearing fault diagnosis based on VPS-IBPSOKNN is effective.


Author(s):  
Tsehay Admassu Assegie*

Phishing causes many problems in the business industry. Electronic commerce and electronic banking, such as mobile banking, involve a large number of online transactions. In such transactions, we have to discriminate between features of legitimate and phishing websites in order to ensure the security of the transaction. In this study, we collected data from the PhishTank public data repository and propose a K-Nearest Neighbors (KNN) based model for phishing attack detection. The proposed model detects phishing attacks through URL classification. The performance of the proposed model is tested empirically and the results are analyzed. Experimental results on the test set reveal that the model is effective at phishing attack detection. Furthermore, the K value that gives the best accuracy is determined in order to improve performance on phishing attack detection. Overall, the average accuracy of the proposed model is 85.08%.


2018 ◽  
Vol 5 (2) ◽  
pp. 328-348
Author(s):  
Muh Subhan ◽  
Amang Sudarsono ◽  
Ali Ridho Barakbah

Radical content, in its procedural meaning, is content that provokes violence and spreads hatred and anti-nationalism. The definition of radical content differs from country to country, and particularly in Indonesia. There, radical content is closely associated with provocation and with ethnic and religious hatred, known as SARA in the Indonesian language. SARA content is very difficult to detect because of its large volume and unstructured form, and because noise can cause multiple interpretations. This problem can threaten religious unity and harmony. A system that can distinguish radical from non-radical content is therefore required. In this system, we propose a text mining approach using DF threshold and Human Brain as the feature extraction. The system is divided into several steps: data collection, which includes preprocessing; text mining; feature selection; classification to group the data by class label; similarity calculation on the training data; and visualization of radical and non-radical content. The experimental results show that combining 10-fold cross-validation with k-Nearest Neighbor (kNN) as the classification method achieves 66.37% accuracy with k = 7 [1].
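As a rough illustration of the DF threshold step mentioned above, the following sketch keeps only terms whose document frequency (the number of documents containing the term) meets a minimum. The threshold value, the function names, and the toy documents are assumptions for illustration; the abstract does not state the settings actually used.

```python
from collections import Counter

def document_frequency(tokenized_docs):
    """Count, for each term, the number of documents that contain it."""
    df = Counter()
    for tokens in tokenized_docs:
        df.update(set(tokens))  # set() so a term counts once per document
    return df

def select_by_df(tokenized_docs, min_df):
    """Keep terms appearing in at least min_df documents (hypothetical threshold)."""
    df = document_frequency(tokenized_docs)
    return sorted(term for term, count in df.items() if count >= min_df)

def to_vector(tokens, vocabulary):
    """Term-count vector of one document over the selected vocabulary."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]

# Toy, already-tokenized documents.
docs = [["a", "b", "a"], ["b", "c"], ["b", "a"]]
vocabulary = select_by_df(docs, min_df=2)   # "c" appears in only one document
vector = to_vector(docs[0], vocabulary)
```

The resulting vectors could then be passed to a kNN classifier; rare terms are dropped on the assumption that they carry little signal and add noise.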


Electronics ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 214
Author(s):  
Lei Chen ◽  
Yuyu Yuan ◽  
Jincui Yang ◽  
Ahmed Zahir

Despite years of evolution of recommender systems, improving prediction accuracy remains one of the core problems for researchers and industry. It is common to use side information to bolster the accuracy of recommender systems. In this work, we focus on using item categories, specifically movie genres, to improve prediction accuracy as well as coverage, precision, and recall. We derive the user's taste for an item using the ratings expressed. Similarly, using the collective ratings given to an item, we identify how much each item belongs to a certain genre. These two vectors are then combined to get a user-item-weight matrix. In contrast to the similarity-based weight matrix in memory-based collaborative filtering, we use the user-item-weight to make predictions. The user-item-weights can also be used to explain to users why certain items have been recommended. We evaluate our proposed method using three real-world datasets. The proposed model performs significantly better than the baseline methods. In addition, we use the user-item-weight matrix to alleviate the sparsity problem associated with correlation-based similarity. Moreover, the proposed model has better computational complexity for making predictions than the k-nearest neighbor (kNN) method.
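The abstract does not give the formulas for the taste and genre vectors, so the sketch below is only one plausible reading, with every formula an assumption: the user's taste for a genre is taken as the membership-weighted average of the user's ratings, item genre memberships are supplied as input rather than derived from collective ratings, and the user-item weight is the dot product of the two vectors. All names and data are hypothetical.

```python
def user_genre_taste(ratings, item_genres, genres):
    """Assumed taste vector: membership-weighted mean rating per genre.

    ratings: {item: rating} for one user
    item_genres: {item: {genre: membership in [0, 1]}}
    """
    taste = []
    for genre in genres:
        weighted = sum(r * item_genres[i].get(genre, 0.0) for i, r in ratings.items())
        total = sum(item_genres[i].get(genre, 0.0) for i in ratings)
        taste.append(weighted / total if total else 0.0)
    return taste

def user_item_weight(taste, item_membership, genres):
    """Assumed combination: dot product of taste and item genre membership."""
    return sum(t * item_membership.get(g, 0.0) for t, g in zip(taste, genres))

genres = ["action", "comedy"]
item_genres = {
    "m1": {"action": 1.0},
    "m2": {"comedy": 1.0},
    "m3": {"action": 0.5, "comedy": 0.5},  # unrated item to score
}
ratings = {"m1": 5, "m2": 1}

taste = user_genre_taste(ratings, item_genres, genres)
weight_m3 = user_item_weight(taste, item_genres["m3"], genres)
```

Under these assumptions the per-genre terms of the dot product show which genres contributed to a recommendation, which is one way such weights could support explanations.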


2014 ◽  
Vol 1044-1045 ◽  
pp. 1484-1488
Author(s):  
Yue Kun Fan ◽  
Xin Ye Li ◽  
Meng Meng Cao

Collaborative filtering is currently widely used in personalized recommendation services for e-commerce, digital libraries, and other areas. The nearest-neighbor algorithm is the earliest proposed and the principal collaborative filtering recommendation algorithm, but data sparsity and cold-start problems seriously affect recommendation quality. To solve these problems, a collaborative filtering recommendation algorithm based on users' social relationships is proposed. On the basis of traditional filtering recommendation technology, it incorporates the objects of interest in a user's social relationships and uses the tags that users and their objects of interest have applied to items to improve the recommendation method. Experimental results measured by MAE (Mean Absolute Error) verify that this method achieves better recommendation quality.
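For reference, MAE is the average absolute difference between predicted and actual ratings, so lower values mean more accurate predictions. A minimal sketch follows; the function name and the sample ratings are illustrative and not taken from the paper.

```python
def mean_absolute_error(predicted, actual):
    """Average of |predicted - actual| over paired ratings."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(predicted)

# Illustrative ratings on a 1-5 scale: errors are 1, 0, and 2.
mae = mean_absolute_error([4, 3, 5], [5, 3, 3])
```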


2020 ◽  
Vol 8 (4) ◽  
pp. 367
Author(s):  
Muhammad Arief Budiman ◽  
Gst. Ayu Vida Mastrika Giri

The music industry is growing rapidly, and millions of musical works continue to be released by various artists. Technology is following these developments, for example through mobile phone applications that offer music subscription services, such as Spotify, Joox, GrooveShark, and others. Application-based services for streaming music, free or paid, are increasingly in demand among users. In this paper, a music recommendation system is proposed that recommends songs based on their similarity to artists the user likes or has listened to. This research uses the Collaborative Filtering method with Cosine Similarity and the K-Nearest Neighbor algorithm. The result is a system that can recommend songs based on artists who are related to one another.
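The combination of cosine similarity and K-Nearest Neighbor described above can be sketched as follows. The sketch assumes each artist is represented by a vector of per-user play counts, which the abstract does not specify; the function names and the data are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def nearest_artists(listens, target, k):
    """Return the k artists most similar to `target` by cosine similarity.

    listens: {artist: [play count per user]} (assumed representation)
    """
    scores = [
        (cosine_similarity(listens[target], vector), name)
        for name, vector in listens.items()
        if name != target
    ]
    scores.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scores[:k]]

# Toy play counts for three users.
listens = {
    "A": [1, 1, 0],
    "B": [2, 2, 0],  # same listening pattern as A, different scale
    "C": [0, 0, 1],  # no listeners in common with A
    "D": [1, 0, 1],
}
neighbors = nearest_artists(listens, "A", k=2)
```

Cosine similarity ignores vector magnitude, so an artist with the same audience pattern but more total plays still counts as a close neighbor.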


2019 ◽  
Vol 1 (3) ◽  
pp. 1-12
Author(s):  
Agus Wahyu Widodo ◽  
Deo Hernando ◽  
Wayan Firdaus Mahmudy

Because of uncontrolled changes in mangrove forests, management and supervision of forest function are required. The form of mangrove forest management carried out in this study is to measure the area of mangrove forests by observing them with drones, that is, crewless aircraft. Drones are used to take photos because they can capture vast mangrove forests at high resolution. The drone was flown over the mangrove forest and took several photos. The method used in this study extracts color features using the mean, standard deviation, and skewness in the HSV color space and extracts texture features using Haralick features. The classification method used is the k-nearest neighbor method. This study conducted three tests: testing the accuracy of the system, testing the distance method used in the k-nearest neighbor classification method, and testing the k value. Based on the results of these three tests, three conclusions were obtained. The first is that the classification system produces an accuracy of 84%. The second is that the distance method used in the k-nearest neighbor classification method influences the accuracy of the system; the Euclidean distance method produces the highest accuracy, 84%. The third is that the k value used in the k-nearest neighbor classification method influences the accuracy of the system; the highest accuracy, 84%, is obtained with k = 3.
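The color features described above (mean, standard deviation, and skewness per HSV channel) can be sketched with the standard library alone. This is a minimal sketch under stated assumptions: pixels are given as RGB floats in [0, 1], skewness is the standardized third central moment (some color-moment formulations use the cube root of the third moment instead), and Haralick texture features are omitted. The function names are hypothetical.

```python
import colorsys
import math

def color_moments(values):
    """Return (mean, standard deviation, skewness) of one channel."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    third = sum((v - mean) ** 3 for v in values) / n
    skewness = third / std ** 3 if std > 0 else 0.0  # assumed definition
    return mean, std, skewness

def hsv_features(rgb_pixels):
    """9-element feature vector: three moments for each of H, S, and V.

    rgb_pixels: list of (r, g, b) tuples with components in [0, 1]
    """
    hsv_pixels = [colorsys.rgb_to_hsv(r, g, b) for r, g, b in rgb_pixels]
    features = []
    for channel in zip(*hsv_pixels):  # iterate over H, then S, then V
        features.extend(color_moments(channel))
    return features

# Two pure-red pixels: every channel is constant, so std and skewness are 0.
features = hsv_features([(1.0, 0.0, 0.0), (1.0, 0.0, 0.0)])
```

A vector like this, concatenated with texture features, is the kind of input a k-nearest neighbor classifier with Euclidean distance would compare.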


Author(s):  
Made Sudarma ◽  
I Gede Harsemadi

Every piece of music conveys its own mood, and much research in the field of Music Information Retrieval (MIR) has therefore addressed mood recognition in music. This research produced software that classifies music by mood using the K-Nearest Neighbor and ID3 algorithms. The research compares accuracy and measures average classification time, based on the values produced by the music feature extraction process. Feature extraction uses 9 types of spectral analysis, and the data consist of 400 training samples and 400 testing samples. The system outputs a mood classification label: contentment, exuberance, depression, or anxious. Classification with the KNN algorithm performs well, with an accuracy of 86.55% at k = 3 and an average processing time of 0.01021, whereas ID3 yields an accuracy of 59.33% and an average processing time of 0.05091 second.


2017 ◽  
Vol 9 (1) ◽  
pp. 1-9
Author(s):  
Fandiansyah Fandiansyah ◽  
Jayanti Yusmah Sari ◽  
Ika Putri Ningrum

Face recognition is one of the biometric systems most widely used for individual recognition in attendance machines and access control. This is because the face is the most visible part of human anatomy and serves as the first distinguishing factor of a human being. Feature extraction and classification are the key to face recognition, as they are to any pattern classification task. In this paper, we describe a face recognition method based on Linear Discriminant Analysis (LDA) and a k-Nearest Neighbor classifier. LDA is used for feature extraction; it directly extracts the proper features from image matrices with the objective of maximizing between-class variation and minimizing within-class variation. The features of a test image are compared with the features of the database images using the k-Nearest Neighbor classifier. The experiments in this paper use 66 face images of 22 different people. The experimental results show that the recognition accuracy is up to 98.33%. Index Terms: face recognition, k nearest neighbor, linear discriminant analysis.

