Learning Heterogeneous Knowledge Base Embeddings for Explainable Recommendation

Providing model-generated explanations in recommender systems is important to user experience. State-of-the-art recommendation algorithms—especially the collaborative filtering (CF)- based approaches with shallow or deep models—usually work with various unstructured information sources for recommendation, such as textual reviews, visual images, and various implicit or explicit feedbacks. Though structured knowledge bases were considered in content-based approaches, they have been largely ignored recently due to the availability of vast amounts of data and the learning power of many complex models. However, structured knowledge bases exhibit unique advantages in personalized recommendation systems. When the explicit knowledge about users and items is considered for recommendation, the system could provide highly customized recommendations based on users’ historical behaviors and the knowledge is helpful for providing informed explanations regarding the recommended items. A great challenge for using knowledge bases for recommendation is how to integrate large-scale structured and unstructured data, while taking advantage of collaborative filtering for highly accurate performance. Recent achievements in knowledge-base embedding (KBE) sheds light on this problem, which makes it possible to learn user and item representations while preserving the structure of their relationship with external knowledge for explanation. In this work, we propose to explain knowledge-base embeddings for explainable recommendation. Specifically, we propose a knowledge-base representation learning framework to embed heterogeneous entities for recommendation, and based on the embedded knowledge base, a soft matching algorithm is proposed to generate personalized explanations for the recommended items. Experimental results on real-world e-commerce datasets verified the superior recommendation performance and the explainability power of our approach compared with state-of-the-art baselines.

Download Full-text

Improving Natural Language Inference Using External Knowledge in the Science Questions Domain

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33017208 ◽

2019 ◽

Vol 33 ◽

pp. 7208-7215 ◽

Cited By ~ 3

Author(s):

Xiaoyan Wang ◽

Pavan Kapanipathi ◽

Ryan Musa ◽

Mo Yu ◽

Kartik Talamadupula ◽

...

Keyword(s):

Natural Language ◽

Language Processing ◽

Large Scale ◽

Question Answering ◽

State Of The Art ◽

Knowledge Bases ◽

Improve Performance ◽

External Knowledge ◽

Structured Knowledge ◽

Significant Attention

Natural Language Inference (NLI) is fundamental to many Natural Language Processing (NLP) applications including semantic search and question answering. The NLI problem has gained significant attention due to the release of large scale, challenging datasets. Present approaches to the problem largely focus on learning-based methods that use only textual information in order to classify whether a given premise entails, contradicts, or is neutral with respect to a given hypothesis. Surprisingly, the use of methods based on structured knowledge – a central topic in artificial intelligence – has not received much attention vis-a-vis the NLI problem. While there are many open knowledge bases that contain various types of reasoning information, their use for NLI has not been well explored. To address this, we present a combination of techniques that harness external knowledge to improve performance on the NLI problem in the science questions domain. We present the results of applying our techniques on text, graph, and text-and-graph based models; and discuss the implications of using external knowledge to solve the NLI problem. Our model achieves close to state-of-the-art performance for NLI on the SciTail science questions dataset.

Download Full-text

Efficient Graph Collaborative Filtering via Contrastive Learning

Sensors ◽

10.3390/s21144666 ◽

2021 ◽

Vol 21 (14) ◽

pp. 4666

Author(s):

Zhiqiang Pan ◽

Honghui Chen

Keyword(s):

Collaborative Filtering ◽

State Of The Art ◽

Representation Learning ◽

First Order ◽

Satisfactory Performance ◽

Potential Applications ◽

Benchmark Datasets ◽

Bayesian Personalized Ranking ◽

Graph Neural Networks ◽

Training Efficiency

Collaborative filtering (CF) aims to make recommendations for users by detecting user’s preference from the historical user–item interactions. Existing graph neural networks (GNN) based methods achieve satisfactory performance by exploiting the high-order connectivity between users and items, however they suffer from the poor training efficiency problem and easily introduce bias for information propagation. Moreover, the widely applied Bayesian personalized ranking (BPR) loss is insufficient to provide supervision signals for training due to the extremely sparse observed interactions. To deal with the above issues, we propose the Efficient Graph Collaborative Filtering (EGCF) method. Specifically, EGCF adopts merely one-layer graph convolution to model the collaborative signal for users and items from the first-order neighbors in the user–item interactions. Moreover, we introduce contrastive learning to enhance the representation learning of users and items by deriving the self-supervisions, which is jointly trained with the supervised learning. Extensive experiments are conducted on two benchmark datasets, i.e., Yelp2018 and Amazon-book, and the experimental results demonstrate that EGCF can achieve the state-of-the-art performance in terms of Recall and normalized discounted cumulative gain (NDCG), especially on ranking the target items at right positions. In addition, EGCF shows obvious advantages in the training efficiency compared with the competitive baselines, making it practicable for potential applications.

Download Full-text

Efficient Heterogeneous Collaborative Filtering without Negative Sampling for Recommendation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i01.5329 ◽

2020 ◽

Vol 34 (01) ◽

pp. 19-26 ◽

Cited By ~ 5

Author(s):

Chong Chen ◽

Min Zhang ◽

Yongfeng Zhang ◽

Weizhi Ma ◽

Yiqun Liu ◽

...

Keyword(s):

Collaborative Filtering ◽

Real World ◽

Large Scale ◽

State Of The Art ◽

Heterogeneous Data ◽

Model Parameters ◽

Online Systems ◽

Practical Applications ◽

Real World Datasets ◽

Primary Type

Recent studies on recommendation have largely focused on exploring state-of-the-art neural networks to improve the expressiveness of models, while typically apply the Negative Sampling (NS) strategy for efficient learning. Despite effectiveness, two important issues have not been well-considered in existing methods: 1) NS suffers from dramatic fluctuation, making sampling-based methods difficult to achieve the optimal ranking performance in practical applications; 2) although heterogeneous feedback (e.g., view, click, and purchase) is widespread in many online systems, most existing methods leverage only one primary type of user feedback such as purchase. In this work, we propose a novel non-sampling transfer learning solution, named Efficient Heterogeneous Collaborative Filtering (EHCF) for Top-N recommendation. It can not only model fine-grained user-item relations, but also efficiently learn model parameters from the whole heterogeneous data (including all unlabeled data) with a rather low time complexity. Extensive experiments on three real-world datasets show that EHCF significantly outperforms state-of-the-art recommendation methods in both traditional (single-behavior) and heterogeneous scenarios. Moreover, EHCF shows significant improvements in training efficiency, making it more applicable to real-world large-scale systems. Our implementation has been released 1 to facilitate further developments on efficient whole-data based neural methods.

Download Full-text

COMBINING DEFAULT LOGIC DATABASES

International Journal of Cooperative Information Systems ◽

10.1142/s0218215794000181 ◽

1994 ◽

Vol 03 (03) ◽

pp. 319-348 ◽

Cited By ~ 4

Author(s):

CHITTA BARAL ◽

SARIT KRAUS ◽

JACK MINKER ◽

V. S. SUBRAHMANIAN

Keyword(s):

Knowledge Representation ◽

Knowledge Base ◽

Large Scale ◽

Future Generation ◽

Knowledge Bases ◽

Default Logic ◽

Integrity Constraints ◽

Specialized Knowledge ◽

The Past ◽

The Future

During the past decade, it has become increasingly clear that the future generation of large-scale knowledge bases will consist, not of one single isolated knowledge base, but a multiplicity of specialized knowledge bases that contain knowledge about different domains of expertise. These knowledge bases will work cooperatively, pooling together their varied bodies of knowledge, so as to be able to solve complex problems that no single knowledge base, by itself, would have been able to address successfully. In any such situation, inconsistencies are bound to arise. In this paper, we address the question: "Suppose we have a set of knowledge bases, KB1, …, KBn, each of which uses default logic as the formalism for knowledge representation, and a set of integrity constraints IC. What knowledge base constitutes an acceptable combination of KB1, …, KBn?"

Download Full-text

Data-Driven Metaphor Recognition and Explanation

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00235 ◽

2013 ◽

Vol 1 ◽

pp. 379-390 ◽

Cited By ~ 9

Author(s):

Hongsong Li ◽

Kenny Q. Zhu ◽

Haixun Wang

Keyword(s):

Knowledge Base ◽

State Of The Art ◽

Knowledge Bases ◽

Important Task ◽

Data Driven ◽

Web Pages ◽

Inference Mechanism ◽

Art Methods ◽

Data Driven Approach ◽

Machine Reading

Recognizing metaphors and identifying the source-target mappings is an important task as metaphorical text poses a big challenge for machine reading. To address this problem, we automatically acquire a metaphor knowledge base and an isA knowledge base from billions of web pages. Using the knowledge bases, we develop an inference mechanism to recognize and explain the metaphors in the text. To our knowledge, this is the first purely data-driven approach of probabilistic metaphor acquisition, recognition, and explanation. Our results shows that it significantly outperforms other state-of-the-art methods in recognizing and explaining metaphors.

Download Full-text

Neural Binary Representation Learning for Large-Scale Collaborative Filtering

IEEE Access ◽

10.1109/access.2019.2915331 ◽

2019 ◽

Vol 7 ◽

pp. 60752-60763 ◽

Cited By ~ 3

Author(s):

Yujia Zhang ◽

Jun Wu ◽

Haishuai Wang

Keyword(s):

Collaborative Filtering ◽

Large Scale ◽

Representation Learning ◽

Binary Representation

Download Full-text

A Hybrid Two-Phase Recommendation for Group-Buying E-commerce Applications

Applied Sciences ◽

10.3390/app9153141 ◽

2019 ◽

Vol 9 (15) ◽

pp. 3141

Author(s):

Li Bai ◽

Mi Hu ◽

Yunlong Ma ◽

Min Liu

Keyword(s):

Collaborative Filtering ◽

Real World ◽

Large Scale ◽

Recommendation Systems ◽

Personalized Recommendation ◽

Two Phase ◽

Group Buying ◽

Large Scale Data ◽

User Clustering ◽

Scale Data

The last two decades have witnessed an explosive growth of e-commerce applications. Existing online recommendation systems for e-commerce applications, particularly group-buying applications, suffer from scalability and data sparsity problems when confronted with exponentially increasing large-scale data. This leads to a poor recommendation effect of traditional collaborative filtering (CF) methods in group-buying applications. In order to address this challenge, this paper proposes a hybrid two-phase recommendation (HTPR) method which consists of offline preparation and online recommendation, combining clustering and collaborative filtering techniques. The user-item category tendency matrix is constructed after clustering items, and then users are clustered to facilitate personalized recommendation where items are generated by collaborative filtering technology. In addition, a parallelized strategy was developed to optimize the recommendation process. Extensive experiments on a real-world dataset were conducted by comparing HTPR with other three recommendation methods: traditional CF, user-clustering based CF, and item-clustering based CF. The experimental results show that the proposed HTPR method is effective and can improve the accuracy of online recommendation systems for group-buying applications.

Download Full-text

An Analysis and Application of Fast Nonnegative Orthogonal Matching Pursuit for Image Categorization in Deep Networks

Mathematical Problems in Engineering ◽

10.1155/2015/180675 ◽

2015 ◽

Vol 2015 ◽

pp. 1-9

Author(s):

Bo Wang ◽

Jichang Guo ◽

Yan Zhang

Keyword(s):

Large Scale ◽

State Of The Art ◽

Matching Pursuit ◽

Computational Cost ◽

Representation Learning ◽

Orthogonal Matching Pursuit ◽

Image Categorization ◽

Image Patches ◽

Deep Networks ◽

Shape Vector

Nonnegative orthogonal matching pursuit (NOMP) has been proven to be a more stable encoder for unsupervised sparse representation learning. However, previous research has shown that NOMP is suboptimal in terms of computational cost, as the coefficients selection and refinement using nonnegative least squares (NNLS) have been divided into two separate steps. It is found that this problem severely reduces the efficiency of encoding for large-scale image patches. In this work, we study fast nonnegative OMP (FNOMP) as an efficient encoder which can be accelerated by the implementation ofQRfactorization and iterations of coefficients in deep networks for full-size image categorization task. It is analyzed and demonstrated that using relatively simple gain-shape vector quantization for training dictionary, FNOMP not only performs more efficiently than NOMP for encoding but also significantly improves the classification accuracy compared to OMP based algorithm. In addition, FNOMP based algorithm is superior to other state-of-the-art methods on several publicly available benchmarks, that is, Oxford Flowers, UIUC-Sports, and Caltech101.

Download Full-text

Systematic Review and Quantitative Comparison of Cyberattack Scenario Detection and Projection

Electronics ◽

10.3390/electronics9101722 ◽

2020 ◽

Vol 9 (10) ◽

pp. 1722

Author(s):

Ivan Kovačević ◽

Stjepan Groš ◽

Karlo Slovenec

Keyword(s):

Large Scale ◽

State Of The Art ◽

Knowledge Bases ◽

Intrusion Detection Systems ◽

False Negatives ◽

Detection Systems ◽

Malicious Activity ◽

Event Logs ◽

Manual Analysis

Intrusion Detection Systems (IDSs) automatically analyze event logs and network traffic in order to detect malicious activity and policy violations. Because IDSs have a large number of false positives and false negatives and the technical nature of their alerts requires a lot of manual analysis, the researchers proposed approaches that automate the analysis of alerts to detect large-scale attacks and predict the attacker’s next steps. Unfortunately, many such approaches use unique datasets and success metrics, making comparison difficult. This survey provides an overview of the state of the art in detecting and projecting cyberattack scenarios, with a focus on evaluation and the corresponding metrics. Representative papers are collected while using Google Scholar and Scopus searches. Mutually comparable success metrics are calculated and several comparison tables are provided. Our results show that commonly used metrics are saturated on popular datasets and cannot assess the practical usability of the approaches. In addition, approaches with knowledge bases require constant maintenance, while data mining and ML approaches depend on the quality of available datasets, which, at the time of writing, are not representative enough to provide general knowledge regarding attack scenarios, so more emphasis needs to be placed on researching the behavior of attackers.

Download Full-text

Principles and practice in verifying rule-based systems

The Knowledge Engineering Review ◽

10.1017/s026988890000624x ◽

1992 ◽

Vol 7 (2) ◽

pp. 115-141 ◽

Cited By ~ 40

Author(s):

Alun D. Preece ◽

Rajjan Shinghal ◽

Aïda Batarekh

Keyword(s):

Expert System ◽

Knowledge Base ◽

State Of The Art ◽

Knowledge Bases ◽

Order Logic ◽

First Order Logic ◽

Rule Based ◽

First Order ◽

Rule Bases ◽

System Knowledge

AbstractThis paper surveys the verification of expert system knowledge bases by detecting anomalies. Such anomalies are highly indicative of errors in the knowledge base. The paper is in two parts. The first part describes four types of anomaly: redundancy, ambivalence, circularity, and deficiency. We consider rule bases which are based on first-order logic, and explain the anomalies in terms of the syntax and semantics of logic. The second part presents a review of five programs which have been built to detect various subsets of the anomalies. The four anomalies provide a framework for comparing the capabilities of the five tools, and we highlight the strengths and weaknesses of each approach. This paper therefore provides not only a set of underlying principles for performing knowledge base verification through anomaly detection, but also a survey of the state-of-the-art in building practical tools for carrying out such verification. The reader of this paper is expected to be familiar with first-order logic.

Download Full-text