scholarly journals Further Improvement on Two-Way Cooperative Collaborative Filtering Approaches for the Binary Market Basket Data

2021 ◽  
Vol 11 (19) ◽  
pp. 8977
Author(s):  
Wook-Yeon Hwang ◽  
Jong-Seok Lee

Two-way cooperative collaborative filtering (CF) has been known to be crucial for binary market basket data. We propose an improved two-way logistic regression approach, a Pearson correlation-based score, a random forests (RF) R-square-based score, an RF Pearson correlation-based score, and a CF scheme based on the RF R-square-based score. The main idea is to utilize as much predictive information as possible within the two-way prediction in order to cope with the cold-start problem. All of the proposed methods work better than the existing two-way cooperative CF approach in terms of the experimental results.

2008 ◽  
Vol 17 (01) ◽  
pp. 71-85 ◽  
Author(s):  
XIAOYUAN SU ◽  
TAGHI M. KHOSHGOFTAAR

As one of the most successful recommender systems, collaborative filtering (CF) algorithms are required to deal with high sparsity and high requirement of scalability amongst other challenges. Bayesian networks (BNs), one of the most frequently used classifiers, can be used for CF tasks. Previous works on applying BNs to CF tasks were mainly focused on binary-class data, and used simple or basic Bayesian classifiers.1,2 In this work, we apply advanced BNs models to CF tasks instead of simple ones, and work on real-world multi-class CF data instead of synthetic binary-class data. Empirical results show that with their ability to deal with incomplete data, the extended logistic regression on tree augmented naïve Bayes (TAN-ELR)3 CF model consistently performs better than the traditional Pearson correlation-based CF algorithm for the rating data that have few items or high missing rates. In addition, the ELR-optimized BNs CF models are robust in terms of the ability to make predictions, while the robustness of the Pearson correlation-based CF algorithm degrades as the sparseness of the data increases.


2013 ◽  
Vol 765-767 ◽  
pp. 1218-1222
Author(s):  
Xiang Yun Xiong ◽  
Yu Chen Fu ◽  
Zhao Qing Liu

Personalized recommendation based on bipartite network has attracted more and more attention. Its obviously better than CF (Collaborative Filtering). In this paper, we propose a multi-dimensional recommendation algorithm called BNPM. It combines item-based, user-based and category-based recommendation model to improve recommendation quality. The experimental results show that the algorithm can improve the diversity and reduce the popularity on the base of holding the accuracy of the recommendation


2012 ◽  
Vol 3 (2) ◽  
pp. 14-28 ◽  
Author(s):  
Zainab Khanzadeh ◽  
Mehregan Mahdavi

Internet technology has rapidly grown during the last decades. Presently, users are faced with a great amount of information and they need help to find appropriate items in the shortest possible time. Recommender systems were introduced to overcome this problem of overloaded information. They recommend items of interest to users based on their expressed preferences. Major e-commerce companies try to use this technology to increase their sales. Collaborative Filtering is the most promising technique in recommender systems. It provides personalized recommendations according to user preferences. But one of the problems of Collaborative Filtering is cold-start. The authors provide a novel approach for solving this problem through using the attributes of items in order to recommend items to more people for improving e-business activities. The experimental results show that the proposed method performs better than existing methods in terms of the number of generated recommendations and their quality.


2021 ◽  
Vol 6 (4) ◽  
Author(s):  
Victor T. Odumuyiwa ◽  
Olalekan P. Oloba

Collaborative filtering based recommender systems (RS) are faced with cold start problem. This problem arises when the RS does not have enough information or opinion about a person or about a product and therefore cannot make recommendation for such person. In this paper, the demographic data of the user such as age, gender, and occupation are utilized as additional sources together with existing users’ rating to tackle the cold start problem by employing the entropy-based methodology to determine the degree of predictability.  Experimental results on MovieLens dataset showed that the proposed method gives higher accuracy than other existing demographic based methods. Keywords— Cold Start, Collaborative Filtering, Entropy, Demographic Approach, Recommender Systems


2016 ◽  
Vol 1 (1) ◽  
pp. 45-52
Author(s):  
Palupi Puspitorini

The aim of this study was to select the best sources of auxin of which it can stimulate the growth of shoots Pineapple plant cuttings. This research is compiled in a completely randomized design (CRD) with 4 treatments and 6 replications. The Data were statistically Analyzed by the DMRT. Level of treatment given proves that no treatment 0%, cow urine concentration of 25%, young coconut water concentration of 25% and Rootone F 100 mg / cuttings. The results showed that cow urine concentrations of 25% and Rootone F 100 mg give the best results in stimulating the growth of shoots pineapple stem cuttings. Experimental results concluded that the effect of this natural hormone were better than the shoots without given hormone.           


2019 ◽  
Author(s):  
Oskar Flygare ◽  
Jesper Enander ◽  
Erik Andersson ◽  
Brjánn Ljótsson ◽  
Volen Z Ivanov ◽  
...  

**Background:** Previous attempts to identify predictors of treatment outcomes in body dysmorphic disorder (BDD) have yielded inconsistent findings. One way to increase precision and clinical utility could be to use machine learning methods, which can incorporate multiple non-linear associations in prediction models. **Methods:** This study used a random forests machine learning approach to test if it is possible to reliably predict remission from BDD in a sample of 88 individuals that had received internet-delivered cognitive behavioral therapy for BDD. The random forest models were compared to traditional logistic regression analyses. **Results:** Random forests correctly identified 78% of participants as remitters or non-remitters at post-treatment. The accuracy of prediction was lower in subsequent follow-ups (68%, 66% and 61% correctly classified at 3-, 12- and 24-month follow-ups, respectively). Depressive symptoms, treatment credibility, working alliance, and initial severity of BDD were among the most important predictors at the beginning of treatment. By contrast, the logistic regression models did not identify consistent and strong predictors of remission from BDD. **Conclusions:** The results provide initial support for the clinical utility of machine learning approaches in the prediction of outcomes of patients with BDD. **Trial registration:** ClinicalTrials.gov ID: NCT02010619.


2020 ◽  
Vol 27 (4) ◽  
pp. 329-336 ◽  
Author(s):  
Lei Xu ◽  
Guangmin Liang ◽  
Baowen Chen ◽  
Xu Tan ◽  
Huaikun Xiang ◽  
...  

Background: Cell lytic enzyme is a kind of highly evolved protein, which can destroy the cell structure and kill the bacteria. Compared with antibiotics, cell lytic enzyme will not cause serious problem of drug resistance of pathogenic bacteria. Thus, the study of cell wall lytic enzymes aims at finding an efficient way for curing bacteria infectious. Compared with using antibiotics, the problem of drug resistance becomes more serious. Therefore, it is a good choice for curing bacterial infections by using cell lytic enzymes. Cell lytic enzyme includes endolysin and autolysin and the difference between them is the purpose of the break of cell wall. The identification of the type of cell lytic enzymes is meaningful for the study of cell wall enzymes. Objective: In this article, our motivation is to predict the type of cell lytic enzyme. Cell lytic enzyme is helpful for killing bacteria, so it is meaningful for study the type of cell lytic enzyme. However, it is time consuming to detect the type of cell lytic enzyme by experimental methods. Thus, an efficient computational method for the type of cell lytic enzyme prediction is proposed in our work. Method: We propose a computational method for the prediction of endolysin and autolysin. First, a data set containing 27 endolysins and 41 autolysins is built. Then the protein is represented by tripeptides composition. The features are selected with larger confidence degree. At last, the classifier is trained by the labeled vectors based on support vector machine. The learned classifier is used to predict the type of cell lytic enzyme. Results: Following the proposed method, the experimental results show that the overall accuracy can attain 97.06%, when 44 features are selected. Compared with Ding's method, our method improves the overall accuracy by nearly 4.5% ((97.06-92.9)/92.9%). The performance of our proposed method is stable, when the selected feature number is from 40 to 70. The overall accuracy of tripeptides optimal feature set is 94.12%, and the overall accuracy of Chou's amphiphilic PseAAC method is 76.2%. The experimental results also demonstrate that the overall accuracy is improved by nearly 18% when using the tripeptides optimal feature set. Conclusion: The paper proposed an efficient method for identifying endolysin and autolysin. In this paper, support vector machine is used to predict the type of cell lytic enzyme. The experimental results show that the overall accuracy of the proposed method is 94.12%, which is better than some existing methods. In conclusion, the selected 44 features can improve the overall accuracy for identification of the type of cell lytic enzyme. Support vector machine performs better than other classifiers when using the selected feature set on the benchmark data set.


2013 ◽  
Vol 1 (3) ◽  
pp. 48-65
Author(s):  
Yuting Chen

A concurrent program is intuitively associated with probability: the executions of the program can produce nondeterministic execution program paths due to the interleavings of threads, whereas some paths can always be executed more frequently than the others. An exploration of the probabilities on the execution paths is expected to provide engineers or compilers with support in helping, either at coding phase or at compile time, to optimize some hottest paths. However, it is not easy to take a static analysis of the probabilities on a concurrent program in that the scheduling of threads of a concurrent program usually depends on the operating system and hardware (e.g., processor) on which the program is executed, which may be vary from machine to machine. In this paper the authors propose a platform independent approach, called ProbPP, to analyzing probabilities on the execution paths of the multithreaded programs. The main idea of ProbPP is to calculate the probabilities on the basis of two kinds of probabilities: Primitive Dependent Probabilities (PDPs) representing the control dependent probabilities among the program statements and Thread Execution Probabilities (TEPs) representing the probabilities of threads being scheduled to execute. The authors have also conducted two preliminary experiments to evaluate the effectiveness and performance of ProbPP, and the experimental results show that ProbPP can provide engineers with acceptable accuracy.


Sign in / Sign up

Export Citation Format

Share Document