Further Improvement on Two-Way Cooperative Collaborative Filtering Approaches for the Binary Market Basket Data

Two-way cooperative collaborative filtering (CF) has been known to be crucial for binary market basket data. We propose an improved two-way logistic regression approach, a Pearson correlation-based score, a random forests (RF) R-square-based score, an RF Pearson correlation-based score, and a CF scheme based on the RF R-square-based score. The main idea is to utilize as much predictive information as possible within the two-way prediction in order to cope with the cold-start problem. All of the proposed methods work better than the existing two-way cooperative CF approach in terms of the experimental results.

Download Full-text

COLLABORATIVE FILTERING FOR MULTI-CLASS DATA USING BAYESIAN NETWORKS

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213008003789 ◽

2008 ◽

Vol 17 (01) ◽

pp. 71-85 ◽

Cited By ~ 16

Author(s):

XIAOYUAN SU ◽

TAGHI M. KHOSHGOFTAAR

Keyword(s):

Logistic Regression ◽

Bayesian Networks ◽

Collaborative Filtering ◽

Recommender Systems ◽

Real World ◽

Incomplete Data ◽

Pearson Correlation ◽

Bayesian Classifiers ◽

Rating Data ◽

Better Than

As one of the most successful recommender systems, collaborative filtering (CF) algorithms are required to deal with high sparsity and high requirement of scalability amongst other challenges. Bayesian networks (BNs), one of the most frequently used classifiers, can be used for CF tasks. Previous works on applying BNs to CF tasks were mainly focused on binary-class data, and used simple or basic Bayesian classifiers.1,2 In this work, we apply advanced BNs models to CF tasks instead of simple ones, and work on real-world multi-class CF data instead of synthetic binary-class data. Empirical results show that with their ability to deal with incomplete data, the extended logistic regression on tree augmented naïve Bayes (TAN-ELR)3 CF model consistently performs better than the traditional Pearson correlation-based CF algorithm for the rating data that have few items or high missing rates. In addition, the ELR-optimized BNs CF models are robust in terms of the ability to make predictions, while the robustness of the Pearson correlation-based CF algorithm degrades as the sparseness of the data increases.

Download Full-text

A Network-Based Multi-Dimensional Recommendation Algorithm

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.765-767.1218 ◽

2013 ◽

Vol 765-767 ◽

pp. 1218-1222

Author(s):

Xiang Yun Xiong ◽

Yu Chen Fu ◽

Zhao Qing Liu

Keyword(s):

Collaborative Filtering ◽

Experimental Results ◽

Personalized Recommendation ◽

Bipartite Network ◽

Recommendation Algorithm ◽

Better Than

Personalized recommendation based on bipartite network has attracted more and more attention. Its obviously better than CF (Collaborative Filtering). In this paper, we propose a multi-dimensional recommendation algorithm called BNPM. It combines item-based, user-based and category-based recommendation model to improve recommendation quality. The experimental results show that the algorithm can improve the diversity and reduce the popularity on the base of holding the accuracy of the recommendation

Download Full-text

Utilizing Association Rules for Improving the Performance of Collaborative Filtering

International Journal of E-Entrepreneurship and Innovation ◽

10.4018/jeei.2012040102 ◽

2012 ◽

Vol 3 (2) ◽

pp. 14-28 ◽

Cited By ~ 1

Author(s):

Zainab Khanzadeh ◽

Mehregan Mahdavi

Keyword(s):

Collaborative Filtering ◽

Recommender Systems ◽

Association Rules ◽

Cold Start ◽

User Preferences ◽

Experimental Results ◽

Internet Technology ◽

Promising Technique ◽

Novel Approach ◽

Better Than

Internet technology has rapidly grown during the last decades. Presently, users are faced with a great amount of information and they need help to find appropriate items in the shortest possible time. Recommender systems were introduced to overcome this problem of overloaded information. They recommend items of interest to users based on their expressed preferences. Major e-commerce companies try to use this technology to increase their sales. Collaborative Filtering is the most promising technique in recommender systems. It provides personalized recommendations according to user preferences. But one of the problems of Collaborative Filtering is cold-start. The authors provide a novel approach for solving this problem through using the attributes of items in order to recommend items to more people for improving e-business activities. The experimental results show that the proposed method performs better than existing methods in terms of the number of generated recommendations and their quality.

Download Full-text

Supervised Learning-Based Collaborative Filtering Using Market Basket Data for the Cold-Start Problem

Industrial Engineering & Management Systems ◽

10.7232/iems.2014.13.4.421 ◽

2014 ◽

Vol 13 (4) ◽

pp. 421-431 ◽

Cited By ~ 3

Author(s):

Wook-Yeon Hwang ◽

Chi-Hyuck Jun

Keyword(s):

Collaborative Filtering ◽

Supervised Learning ◽

Cold Start ◽

Market Basket ◽

Cold Start Problem

Download Full-text

Addressing Cold Start Problem in Collaborative Filtering using Demographic Data with Entropy-based Methodology

FUOYE Journal of Engineering and Technology ◽

10.46792/fuoyejet.v6i4.704 ◽

2021 ◽

Vol 6 (4) ◽

Author(s):

Victor T. Odumuyiwa ◽

Olalekan P. Oloba

Keyword(s):

Collaborative Filtering ◽

Recommender Systems ◽

Demographic Data ◽

Cold Start ◽

Experimental Results ◽

Demographic Approach ◽

Cold Start Problem

Collaborative filtering based recommender systems (RS) are faced with cold start problem. This problem arises when the RS does not have enough information or opinion about a person or about a product and therefore cannot make recommendation for such person. In this paper, the demographic data of the user such as age, gender, and occupation are utilized as additional sources together with existing users’ rating to tackle the cold start problem by employing the entropy-based methodology to determine the degree of predictability. Experimental results on MovieLens dataset showed that the proposed method gives higher accuracy than other existing demographic based methods. Keywords— Cold Start, Collaborative Filtering, Entropy, Demographic Approach, Recommender Systems

Download Full-text

A multilevel logistic regression approach to studying faking

PsycEXTRA Dataset ◽

10.1037/e518532013-120 ◽

2007 ◽

Author(s):

David M. LaHuis ◽

Derek A. Copeland

Keyword(s):

Logistic Regression ◽

Multilevel Logistic Regression ◽

Regression Approach

Download Full-text

THE SOURCES OF AUXIN HORMONE TO GROWTH OF SHOOT PINEAPPLE STEM CUTTING (Ananascomosus.L. Merr.)

Journal of Academic Research and Sciences (JARES) ◽

10.30957/jares.v1i1.41 ◽

2016 ◽

Vol 1 (1) ◽

pp. 45-52

Author(s):

Palupi Puspitorini

Keyword(s):

Water Concentration ◽

Urine Concentration ◽

Experimental Results ◽

Stem Cuttings ◽

Randomized Design ◽

Natural Hormone ◽

Cow Urine ◽

Urine Concentrations ◽

Completely Randomized Design ◽

Better Than

The aim of this study was to select the best sources of auxin of which it can stimulate the growth of shoots Pineapple plant cuttings. This research is compiled in a completely randomized design (CRD) with 4 treatments and 6 replications. The Data were statistically Analyzed by the DMRT. Level of treatment given proves that no treatment 0%, cow urine concentration of 25%, young coconut water concentration of 25% and Rootone F 100 mg / cuttings. The results showed that cow urine concentrations of 25% and Rootone F 100 mg give the best results in stimulating the growth of shoots pineapple stem cuttings. Experimental results concluded that the effect of this natural hormone were better than the shoots without given hormone.

Download Full-text

Predictors of remission from body dysmorphic disorder after internet-delivered cognitive behavior therapy: a machine learning approach

10.31234/osf.io/eqcdx ◽

2019 ◽

Author(s):

Oskar Flygare ◽

Jesper Enander ◽

Erik Andersson ◽

Brjánn Ljótsson ◽

Volen Z Ivanov ◽

...

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Random Forests ◽

Clinical Utility ◽

Body Dysmorphic Disorder ◽

Prediction Models ◽

Behavioral Therapy ◽

Learning Approach ◽

Learning Approaches ◽

Machine Learning Approach

**Background:** Previous attempts to identify predictors of treatment outcomes in body dysmorphic disorder (BDD) have yielded inconsistent findings. One way to increase precision and clinical utility could be to use machine learning methods, which can incorporate multiple non-linear associations in prediction models. **Methods:** This study used a random forests machine learning approach to test if it is possible to reliably predict remission from BDD in a sample of 88 individuals that had received internet-delivered cognitive behavioral therapy for BDD. The random forest models were compared to traditional logistic regression analyses. **Results:** Random forests correctly identified 78% of participants as remitters or non-remitters at post-treatment. The accuracy of prediction was lower in subsequent follow-ups (68%, 66% and 61% correctly classified at 3-, 12- and 24-month follow-ups, respectively). Depressive symptoms, treatment credibility, working alliance, and initial severity of BDD were among the most important predictors at the beginning of treatment. By contrast, the logistic regression models did not identify consistent and strong predictors of remission from BDD. **Conclusions:** The results provide initial support for the clinical utility of machine learning approaches in the prediction of outcomes of patients with BDD. **Trial registration:** ClinicalTrials.gov ID: NCT02010619.

Download Full-text

A Computational Method for the Identification of Endolysins and Autolysins

Protein and Peptide Letters ◽

10.2174/0929866526666191002104735 ◽

2020 ◽

Vol 27 (4) ◽

pp. 329-336 ◽

Cited By ~ 1

Author(s):

Lei Xu ◽

Guangmin Liang ◽

Baowen Chen ◽

Xu Tan ◽

Huaikun Xiang ◽

...

Keyword(s):

Support Vector Machine ◽

Cell Wall ◽

Experimental Results ◽

Computational Method ◽

Lytic Enzyme ◽

Support Vector ◽

Lytic Enzymes ◽

Data Set ◽

Optimal Feature ◽

Better Than

Background: Cell lytic enzyme is a kind of highly evolved protein, which can destroy the cell structure and kill the bacteria. Compared with antibiotics, cell lytic enzyme will not cause serious problem of drug resistance of pathogenic bacteria. Thus, the study of cell wall lytic enzymes aims at finding an efficient way for curing bacteria infectious. Compared with using antibiotics, the problem of drug resistance becomes more serious. Therefore, it is a good choice for curing bacterial infections by using cell lytic enzymes. Cell lytic enzyme includes endolysin and autolysin and the difference between them is the purpose of the break of cell wall. The identification of the type of cell lytic enzymes is meaningful for the study of cell wall enzymes. Objective: In this article, our motivation is to predict the type of cell lytic enzyme. Cell lytic enzyme is helpful for killing bacteria, so it is meaningful for study the type of cell lytic enzyme. However, it is time consuming to detect the type of cell lytic enzyme by experimental methods. Thus, an efficient computational method for the type of cell lytic enzyme prediction is proposed in our work. Method: We propose a computational method for the prediction of endolysin and autolysin. First, a data set containing 27 endolysins and 41 autolysins is built. Then the protein is represented by tripeptides composition. The features are selected with larger confidence degree. At last, the classifier is trained by the labeled vectors based on support vector machine. The learned classifier is used to predict the type of cell lytic enzyme. Results: Following the proposed method, the experimental results show that the overall accuracy can attain 97.06%, when 44 features are selected. Compared with Ding's method, our method improves the overall accuracy by nearly 4.5% ((97.06-92.9)/92.9%). The performance of our proposed method is stable, when the selected feature number is from 40 to 70. The overall accuracy of tripeptides optimal feature set is 94.12%, and the overall accuracy of Chou's amphiphilic PseAAC method is 76.2%. The experimental results also demonstrate that the overall accuracy is improved by nearly 18% when using the tripeptides optimal feature set. Conclusion: The paper proposed an efficient method for identifying endolysin and autolysin. In this paper, support vector machine is used to predict the type of cell lytic enzyme. The experimental results show that the overall accuracy of the proposed method is 94.12%, which is better than some existing methods. In conclusion, the selected 44 features can improve the overall accuracy for identification of the type of cell lytic enzyme. Support vector machine performs better than other classifiers when using the selected feature set on the benchmark data set.

Download Full-text

Platform Independent Analysis of Probabilities on Multithreaded Programs

International Journal of Software Innovation ◽

10.4018/ijsi.2013070104 ◽

2013 ◽

Vol 1 (3) ◽

pp. 48-65

Author(s):

Yuting Chen

Keyword(s):

Operating System ◽

Static Analysis ◽

Main Idea ◽

Experimental Results ◽

Machine To Machine ◽

Concurrent Program ◽

Acceptable Accuracy ◽

Multithreaded Programs ◽

Independent Analysis ◽

And Performance

A concurrent program is intuitively associated with probability: the executions of the program can produce nondeterministic execution program paths due to the interleavings of threads, whereas some paths can always be executed more frequently than the others. An exploration of the probabilities on the execution paths is expected to provide engineers or compilers with support in helping, either at coding phase or at compile time, to optimize some hottest paths. However, it is not easy to take a static analysis of the probabilities on a concurrent program in that the scheduling of threads of a concurrent program usually depends on the operating system and hardware (e.g., processor) on which the program is executed, which may be vary from machine to machine. In this paper the authors propose a platform independent approach, called ProbPP, to analyzing probabilities on the execution paths of the multithreaded programs. The main idea of ProbPP is to calculate the probabilities on the basis of two kinds of probabilities: Primitive Dependent Probabilities (PDPs) representing the control dependent probabilities among the program statements and Thread Execution Probabilities (TEPs) representing the probabilities of threads being scheduled to execute. The authors have also conducted two preliminary experiments to evaluate the effectiveness and performance of ProbPP, and the experimental results show that ProbPP can provide engineers with acceptable accuracy.

Download Full-text