RDFFrames: knowledge graph access for machine learning tools

The VLDB Journal ◽

10.1007/s00778-021-00690-5 ◽

2021 ◽

Author(s):

Aisha Mohamed ◽

Ghadeer Abuoda ◽

Abdurrahman Ghanem ◽

Zoi Kaoudi ◽

Ashraf Aboulnaga

Keyword(s):

Machine Learning ◽

Database Systems ◽

Database System ◽

Sparql Query ◽

Knowledge Graph ◽

Learning Tools ◽

Tabular Format ◽

Knowledge Graphs ◽

Rdf Database ◽

Learning Software

AbstractKnowledge graphs represented as RDF datasets are integral to many machine learning applications. RDF is supported by a rich ecosystem of data management systems and tools, most notably RDF database systems that provide a SPARQL query interface. Surprisingly, machine learning tools for knowledge graphs do not use SPARQL, despite the obvious advantages of using a database system. This is due to the mismatch between SPARQL and machine learning tools in terms of data model and programming style. Machine learning tools work on data in tabular format and process it using an imperative programming style, while SPARQL is declarative and has as its basic operation matching graph patterns to RDF triples. We posit that a good interface to knowledge graphs from a machine learning software stack should use an imperative, navigational programming paradigm based on graph traversal rather than the SPARQL query paradigm based on graph patterns. In this paper, we present RDFFrames, a framework that provides such an interface. RDFFrames provides an imperative Python API that gets internally translated to SPARQL, and it is integrated with the PyData machine learning software stack. RDFFrames enables the user to make a sequence of Python calls to define the data to be extracted from a knowledge graph stored in an RDF database system, and it translates these calls into a compact SPQARL query, executes it on the database system, and returns the results in a standard tabular format. Thus, RDFFrames is a useful tool for data preparation that combines the usability of PyData with the flexibility and performance of RDF database systems.

Download Full-text

Predictive article recommendation using natural language processing and machine learning to support evidence updates in domain-specific knowledge graphs

JAMIA Open ◽

10.1093/jamiaopen/ooaa028 ◽

2020 ◽

Vol 3 (3) ◽

pp. 332-337

Author(s):

Bhuvan Sharma ◽

Van C Willis ◽

Claudia S Huettner ◽

Kirk Beaty ◽

Jane L Snowdon ◽

...

Keyword(s):

Machine Learning ◽

Language Processing ◽

Named Entity Recognition ◽

Entity Recognition ◽

Human Cognition ◽

Current Evidence ◽

Knowledge Graph ◽

Space Modeling ◽

Domain Specific Knowledge ◽

Knowledge Graphs

Abstract Objectives Describe an augmented intelligence approach to facilitate the update of evidence for associations in knowledge graphs. Methods New publications are filtered through multiple machine learning study classifiers, and filtered publications are combined with articles already included as evidence in the knowledge graph. The corpus is then subjected to named entity recognition, semantic dictionary mapping, term vector space modeling, pairwise similarity, and focal entity match to identify highly related publications. Subject matter experts review recommended articles to assess inclusion in the knowledge graph; discrepancies are resolved by consensus. Results Study classifiers achieved F-scores from 0.88 to 0.94, and similarity thresholds for each study type were determined by experimentation. Our approach reduces human literature review load by 99%, and over the past 12 months, 41% of recommendations were accepted to update the knowledge graph. Conclusion Integrated search and recommendation exploiting current evidence in a knowledge graph is useful for reducing human cognition load.

Download Full-text

Swift Logic for Big Data and Knowledge Graphs

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/1 ◽

2017 ◽

Cited By ~ 13

Author(s):

Luigi Bellomarini ◽

Georg Gottlob ◽

Andreas Pieris ◽

Emanuel Sallinger

Keyword(s):

Machine Learning ◽

Big Data ◽

Computational Complexity ◽

Management System ◽

Knowledge Graph ◽

Complex Reasoning ◽

Knowledge Graphs ◽

Reasoning Tasks ◽

Graph Management ◽

The Web

Many modern companies wish to maintain knowledge in the form of a corporate knowledge graph and to use and manage this knowledge via a knowledge graph management system (KGMS). We formulate various requirements for a fully fledged KGMS. In particular, such a system must be capable of performing complex reasoning tasks but, at the same time, achieve efficient and scalable reasoning over Big Data with an acceptable computational complexity. Moreover, a KGMS needs interfaces to corporate databases, the web, and machine-learning and analytics packages. We present KRR formalisms and a system achieving these goals.

Download Full-text

Predicting the relationships between gut microbiota and mental disorders with knowledge graphs

Health Information Science and Systems ◽

10.1007/s13755-020-00128-2 ◽

2020 ◽

Vol 9 (1) ◽

Author(s):

Ting Liu ◽

Xueli Pan ◽

Xu Wang ◽

K. Anton Feenstra ◽

Jaap Heringa ◽

...

Keyword(s):

Mental Disorders ◽

Gut Microbiota ◽

Research Effort ◽

Sparql Query ◽

Knowledge Graph ◽

Test Cases ◽

Relevant Research ◽

Structured Knowledge ◽

Knowledge Graphs ◽

The Relationship

AbstractGut microbiota produce and modulate the production of neurotransmitters which have been implicated in mental disorders. Neurotransmitters may act as ‘matchmaker’ between gut microbiota imbalance and mental disorders. Most of the relevant research effort goes into the relationship between gut microbiota and neurotransmitters and the other between neurotransmitters and mental disorders, while few studies collect and analyze the dispersed research results in systematic ways. We therefore gather the dispersed results that in the existing studies into a structured knowledge base for identifying and predicting the potential relationships between gut microbiota and mental disorders. In this study, we propose to construct a gut microbiota knowledge graph for mental disorder, which named as MiKG4MD. It is extendable by linking to future ontologies by just adding new relationships between existing information and new entities. This extendibility is emphasized for the integration with existing popular ontologies/terminologies, e.g. UMLS, MeSH, and KEGG. We demonstrate the performance of MiKG4MD with three SPARQL query test cases. Results show that the MiKG4MD knowledge graph is an effective method to predict the relationships between gut microbiota and mental disorders.

Download Full-text

Biological Insights Knowledge Graph: an integrated knowledge graph to support drug development

10.1101/2021.10.28.466262 ◽

2021 ◽

Author(s):

David Geleta ◽

Andriy Nikolov ◽

Gavin Edwards ◽

Anna Gogleva ◽

Richard Jackson ◽

...

Keyword(s):

Machine Learning ◽

Drug Development ◽

Use Cases ◽

Knowledge Graph ◽

Multiple Use ◽

Organisational Knowledge ◽

Data Source ◽

The Common ◽

Knowledge Graphs ◽

Use Of Knowledge

The use of knowledge graphs as a data source for machine learning methods to solve complex problems in life sciences has rapidly become popular in recent years. Our Biological Insights Knowledge Graph (BIKG) combines relevant data for drug development from public as well as internal data sources to provide insights for a range of tasks: from identifying new targets to repurposing existing drugs. Besides the common requirements to organisational knowledge graphs such as being able to capture the domain precisely and give the users the ability to search and query the data, the focus on handling multiple use cases and supporting use case-specific machine learning models presents additional challenges: the data models must also be streamlined for the performance of downstream tasks; graph content must be easily customisable for different use cases; different projections of the graph content are required to support a wider range of different consumption modes. In this paper we describe our main design choices in implementation of the BIKG graph and discuss different aspects of its life cycle: from graph construction to exploitation.

Download Full-text

Quantum Machine Learning Algorithm for Knowledge Graphs

ACM Transactions on Quantum Computing ◽

10.1145/3467982 ◽

2021 ◽

Vol 2 (3) ◽

pp. 1-28

Author(s):

Yunpu Ma ◽

Volker Tresp

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Semantic Knowledge ◽

Low Rank ◽

Knowledge Representation And Reasoning ◽

Knowledge Graph ◽

Machine Learning Algorithm ◽

Plausible Assumption ◽

Quantum Machine Learning ◽

Knowledge Graphs

Semantic knowledge graphs are large-scale triple-oriented databases for knowledge representation and reasoning. Implicit knowledge can be inferred by modeling the tensor representations generated from knowledge graphs. However, as the sizes of knowledge graphs continue to grow, classical modeling becomes increasingly computationally resource intensive. This article investigates how to capitalize on quantum resources to accelerate the modeling of knowledge graphs. In particular, we propose the first quantum machine learning algorithm for inference on tensorized data, i.e., on knowledge graphs. Since most tensor problems are NP-hard [18], it is challenging to devise quantum algorithms to support the inference task. We simplify the modeling task by making the plausible assumption that the tensor representation of a knowledge graph can be approximated by its low-rank tensor singular value decomposition, which is verified by our experiments. The proposed sampling-based quantum algorithm achieves speedup with a polylogarithmic runtime in the dimension of knowledge graph tensor.

Download Full-text

A Comparative Study of Different Machine Learning Tools

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v7i4.184190 ◽

2019 ◽

Vol 7 (4) ◽

pp. 184-190

Author(s):

Himani Maheshwari ◽

Pooja Goswami ◽

Isha Rana

Keyword(s):

Machine Learning ◽

Comparative Study ◽

Learning Tools

Download Full-text

Mobile Software Assurance Informed through Knowledge Graph Construction: The OWASP Threat of Insecure Data Storage

Journal of Computer Science Research ◽

10.30564/jcsr.v2i2.1765 ◽

2020 ◽

Vol 2 (2) ◽

Author(s):

Suzanna Schmeelk ◽

Lixin Tao

Keyword(s):

Data Storage ◽

Program Analysis ◽

Web Application ◽

Security Analysis ◽

Knowledge Graph ◽

Healthcare Applications ◽

Sensitive Data ◽

Knowledge Graphs ◽

Mobile Malware Detection ◽

Software Assurance

Many organizations, to save costs, are movinheg to t Bring Your Own Mobile Device (BYOD) model and adopting applications built by third-parties at an unprecedented rate. Our research examines software assurance methodologies specifically focusing on security analysis coverage of the program analysis for mobile malware detection, mitigation, and prevention. This research focuses on secure software development of Android applications by developing knowledge graphs for threats reported by the Open Web Application Security Project (OWASP). OWASP maintains lists of the top ten security threats to web and mobile applications. We develop knowledge graphs based on the two most recent top ten threat years and show how the knowledge graph relationships can be discovered in mobile application source code. We analyze 200+ healthcare applications from GitHub to gain an understanding of their software assurance of their developed software for one of the OWASP top ten moble threats, the threat of “Insecure Data Storage.” We find that many of the applications are storing personally identifying information (PII) in potentially vulnerable places leaving users exposed to higher risks for the loss of their sensitive data.

Download Full-text

Improved nutrient management in cereals using Nutrient Expert and machine learning tools: Productivity, profitability and nutrient use efficiency

Agricultural Systems ◽

10.1016/j.agsy.2021.103181 ◽

2021 ◽

Vol 192 ◽

pp. 103181

Author(s):

Jagadish Timsina ◽

Sudarshan Dutta ◽

Krishna Prasad Devkota ◽

Somsubhra Chakraborty ◽

Ram Krishna Neupane ◽

...

Keyword(s):

Machine Learning ◽

Nutrient Management ◽

Nutrient Use Efficiency ◽

Learning Tools ◽

Nutrient Use ◽

Use Efficiency

Download Full-text

Paper2Wire – A Case Study of User-Centred Development of Machine Learning Tools for UX Designers

i-com ◽

10.1515/icom-2021-0002 ◽

2021 ◽

Vol 20 (1) ◽

pp. 19-32

Author(s):

Daniel Buschek ◽

Charlotte Anlauff ◽

Florian Lachner

Keyword(s):

Machine Learning ◽

Development Process ◽

User Study ◽

Concept Development ◽

Lessons Learned ◽

Design Tool ◽

Learning Tools ◽

Interface Elements ◽

Industry Partner

Abstract This paper reflects on a case study of a user-centred concept development process for a Machine Learning (ML) based design tool, conducted at an industry partner. The resulting concept uses ML to match graphical user interface elements in sketches on paper to their digital counterparts to create consistent wireframes. A user study (N=20) with a working prototype shows that this concept is preferred by designers, compared to the previous manual procedure. Reflecting on our process and findings we discuss lessons learned for developing ML tools that respect practitioners’ needs and practices.

Download Full-text

TransET: Knowledge Graph Embedding with Entity Types

Electronics ◽

10.3390/electronics10121407 ◽

2021 ◽

Vol 10 (12) ◽

pp. 1407

Author(s):

Peng Wang ◽

Jing Zhou ◽

Yuzhang Liu ◽

Xingchen Zhou

Keyword(s):

Link Prediction ◽

State Of The Art ◽

Score Function ◽

Graph Embedding ◽

Vector Spaces ◽

Knowledge Graph ◽

Semantic Features ◽

Knowledge Graphs ◽

Real World Datasets ◽

Low Dimensional

Knowledge graph embedding aims to embed entities and relations into low-dimensional vector spaces. Most existing methods only focus on triple facts in knowledge graphs. In addition, models based on translation or distance measurement cannot fully represent complex relations. As well-constructed prior knowledge, entity types can be employed to learn the representations of entities and relations. In this paper, we propose a novel knowledge graph embedding model named TransET, which takes advantage of entity types to learn more semantic features. More specifically, circle convolution based on the embeddings of entity and entity types is utilized to map head entity and tail entity to type-specific representations, then translation-based score function is used to learn the presentation triples. We evaluated our model on real-world datasets with two benchmark tasks of link prediction and triple classification. Experimental results demonstrate that it outperforms state-of-the-art models in most cases.

Download Full-text