Design of Knowledge Graph in Manufacturing Services Discovery

Author(s):  
Yunqing Li ◽  
Shivakumar Raman ◽  
Paul Cohen ◽  
Binil Starly

Knowledge graph networks powering web search and chatbot agents have shown immense popularity. This paper discusses the first steps towards building a knowledge graph for manufacturing services discoverability. Because the manufacturing services domain lacks a unified, widely adopted schema for structured data, and existing relational database schemas cannot adequately represent manufacturing service definitions, no unified schema connects manufacturing resources, service descriptions, and actual manufacturing service business entities. This gap severely limits the automated discoverability of manufacturing service business organizations. This paper designs a knowledge graph covering more than 8,000 manufacturers, the manufacturing services they provide, and the corresponding linkage with manufacturing service definitions available from Wikidata. In addition, this work proposes extensions to Schema.org that help small business manufacturers embed search engine optimization (SEO) tags for search and discovery through web search engines. Such vocabulary extensions are critical to rapid identification and real-time capability assessment, particularly when the service providers themselves are responsible for updating the tags. A wider-scale effort on manufacturing-specific vocabulary extensions to Schema.org could tremendously benefit small and medium-scale manufacturers. This paper concludes with the additional work that must be done towards a comprehensive manufacturing service graph that spans the entire manufacturing knowledge base.
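To make the proposed markup concrete, here is a minimal sketch of the kind of embedded JSON-LD such Schema.org extensions could yield. It is a sketch under stated assumptions: the `ManufacturingService` type and `manufacturingProcess` property are hypothetical illustrations of the proposed vocabulary rather than official Schema.org terms, and the Wikidata link is a placeholder.

```python
import json

# Minimal sketch of Schema.org-style JSON-LD for a manufacturing service
# provider. "ManufacturingService" and "manufacturingProcess" stand in for
# the paper's proposed vocabulary extensions; they are NOT official
# Schema.org terms, and the business details are invented for illustration.
manufacturer = {
    "@context": "https://schema.org",
    "@type": "Organization",
    "name": "Acme Machining Co.",  # illustrative business entity
    "url": "https://example.com",
    "makesOffer": {
        "@type": "Offer",
        "itemOffered": {
            "@type": "ManufacturingService",        # hypothetical extension type
            "manufacturingProcess": "CNC milling",  # hypothetical extension property
            # Placeholder link to the matching service definition on Wikidata:
            "sameAs": "https://www.wikidata.org/wiki/<service-QID>",
        },
    },
}

# Embedded in a <script type="application/ld+json"> tag, this payload is the
# kind of SEO tag a small manufacturer could maintain itself so that web
# search engines can discover and assess its capabilities.
print(json.dumps(manufacturer, indent=2))
```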

Symmetry ◽  
2021 ◽  
Vol 13 (4) ◽  
pp. 563
Author(s):  
Babu Rajendiran ◽  
Jayashree Kanniappan

Nowadays, many business organizations operate in the cloud environment in order to reduce their operating costs and to select the best services from many cloud providers. The increasing number of cloud services available on the market obliges cloud consumers to be careful in selecting the most apt cloud service provider that satisfies functionality as well as QoS parameters. Many disciplines of computer-based applications use standardized ontologies to represent information in their fields, which indicates the necessity of an ontology-based representation. The proposed generic model can help service consumers identify QoS parameter interrelations in the cloud services selection ontology at run time, and help service providers enhance their business by interpreting the various relations. The ontology has been developed using the intended QoS attributes from various service providers. A generic model has been developed and tested with the developed ontology.
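As an illustration of what an ontology-based representation of QoS interrelations might look like, below is a minimal sketch using Python's rdflib. The namespace and every class and property name in it (CloudService, QoSParameter, influences, availabilityValue) are assumptions made for illustration, not the paper's published vocabulary.

```python
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import RDF, RDFS, XSD

# Illustrative namespace; all terms below are assumptions, not the
# paper's actual ontology.
CSO = Namespace("http://example.org/cloud-qos#")

g = Graph()
g.bind("cso", CSO)

# Classes for cloud services and QoS parameters.
g.add((CSO.CloudService, RDF.type, RDFS.Class))
g.add((CSO.QoSParameter, RDF.type, RDFS.Class))

# A relation capturing interdependencies between QoS parameters,
# e.g. availability influencing reliability.
g.add((CSO.influences, RDF.type, RDF.Property))
g.add((CSO.Availability, RDF.type, CSO.QoSParameter))
g.add((CSO.Reliability, RDF.type, CSO.QoSParameter))
g.add((CSO.Availability, CSO.influences, CSO.Reliability))

# A concrete provider advertising a measured QoS value.
g.add((CSO.ProviderA, RDF.type, CSO.CloudService))
g.add((CSO.ProviderA, CSO.availabilityValue,
       Literal(99.95, datatype=XSD.decimal)))

# At run time a consumer can query the parameter interrelations:
q = "SELECT ?p ?q WHERE { ?p <http://example.org/cloud-qos#influences> ?q }"
for row in g.query(q):
    print(row.p, "influences", row.q)
```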


Author(s):  
Lyubomir Penev ◽  
Teodor Georgiev ◽  
Viktor Senderov ◽  
Mariya Dimitrova ◽  
Pavel Stoev

As one of the first advocates of open access and open data in the field of biodiversity publishing, Pensoft has adopted a multiple data publishing model, resulting in the ARPHA-BioDiv toolbox (Penev et al. 2017). ARPHA-BioDiv consists of several data publishing workflows and tools described in the Strategies and Guidelines for Publishing of Biodiversity Data and elsewhere:
Data underlying research results are deposited in an external repository and/or published as supplementary file(s) to the article and then linked/cited in the article text; supplementary files are published under their own DOIs and bear their own citation details.
Data deposited in trusted repositories and/or supplementary files and described in data papers; data papers may be submitted in text format or converted into manuscripts from Ecological Metadata Language (EML) metadata.
Integrated narrative and data publishing realised by the Biodiversity Data Journal, where structured data are imported into the article text from tables or via web services and downloaded/distributed from the published article.
Data published in structured, semantically enriched full-text XMLs, so that several data elements can thereafter easily be harvested by machines.
Linked Open Data (LOD) extracted from literature, converted into interoperable RDF triples in accordance with the OpenBiodiv-O ontology (Senderov et al. 2018) and stored in the OpenBiodiv Biodiversity Knowledge Graph.
The above-mentioned approaches are supported by a whole ecosystem of additional workflows and tools, for example: (1) pre-publication data auditing, involving both human and machine data quality checks (workflow 2); (2) web-service integration with data repositories and data centres, such as the Global Biodiversity Information Facility (GBIF), Barcode of Life Data Systems (BOLD), Integrated Digitized Biocollections (iDigBio), Data Observation Network for Earth (DataONE), Long Term Ecological Research (LTER), PlutoF, Dryad, and others (workflows 1, 2); (3) semantic markup of the article texts in the TaxPub format facilitating further extraction, distribution and re-use of sub-article elements and data (workflows 3, 4); (4) server-to-server import of specimen data from GBIF, BOLD, iDigBio and PlutoF into manuscript text (workflow 3); (5) automated conversion of EML metadata into data paper manuscripts (workflow 2, sketched below); (6) export of Darwin Core Archives and automated deposition in GBIF (workflow 3); (7) submission of individual images and supplementary data under their own DOIs to the Biodiversity Literature Repository, BLR (workflows 1-3); (8) conversion of key data elements from TaxPub articles and taxonomic treatments extracted by Plazi into RDF handled by OpenBiodiv (workflow 5). These approaches represent different aspects of the prospective scholarly publishing of biodiversity data, which, in combination with text and data mining (TDM) technologies for legacy literature (PDF) developed by Plazi, lay the ground for an entire data publishing ecosystem for biodiversity, supplying FAIR (Findable, Accessible, Interoperable and Reusable) data to several interoperable overarching infrastructures, such as GBIF, BLR, Plazi TreatmentBank, OpenBiodiv, and various end users.
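As a rough illustration of workflow 2, the sketch below pulls the core fields of an EML metadata document into a data-paper manuscript skeleton. The sample document is deliberately simplified (real EML is namespaced and far richer), and a production converter such as ARPHA's handles much more of the schema.

```python
import xml.etree.ElementTree as ET

# Simplified EML fragment; real EML 2.x documents carry XML namespaces
# and many more elements than shown here.
eml = """<eml><dataset>
  <title>Example occurrence dataset</title>
  <creator><individualName><surName>Doe</surName></individualName></creator>
  <abstract><para>Sample abstract text.</para></abstract>
</dataset></eml>"""

root = ET.fromstring(eml)
dataset = root.find("dataset")

# Map the EML metadata fields onto the skeleton of a data paper manuscript.
manuscript = {
    "title": dataset.findtext("title"),
    "authors": [s.text for s in dataset.iter("surName")],
    "abstract": dataset.findtext("abstract/para"),
}
print(manuscript)
```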


Author(s):  
Navin Tatyaba Gopal ◽  
Anish Raj Khobragade

Knowledge graphs (KGs) capture structured data and the relationships among a set of entities and items, and they constitute an attractive source of information that can improve recommender systems. However, existing methods in this area depend on manual feature engineering and do not permit end-to-end training. This article proposes Knowledge Graph with Label Smoothness (KG-LS) to offer better suggestions for recommender systems. The method computes user-specific item representations by first applying a trainable function that identifies important KG relationships for a given user, thereby transforming the KG into a user-specific weighted graph, and then applying a graph neural network to compute personalized entity embeddings. To provide a better inductive bias, label smoothness is introduced, which posits that adjacent items in the KG are likely to have similar user relevance labels/scores. Label smoothness provides regularization over the edge weights, and we show that it is equivalent to a label propagation scheme on the graph. An efficient implementation is also developed that exhibits strong scalability with respect to the size of the knowledge graph. Experiments on four datasets show that the method outperforms state-of-the-art baselines; it also achieves strong performance in cold-start scenarios where user-item interactions are sparse.
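The label-propagation view of label smoothness can be sketched in a few lines of NumPy: items the user is known to like propagate their relevance over the user-specific edge weights, with the observed labels clamped at every step. The toy graph and weights below are invented for illustration; they are not the paper's learned values.

```python
import numpy as np

# User-specific weighted adjacency matrix over 4 items (toy values).
A = np.array([
    [0.0, 0.8, 0.1, 0.0],
    [0.8, 0.0, 0.5, 0.2],
    [0.1, 0.5, 0.0, 0.9],
    [0.0, 0.2, 0.9, 0.0],
])
labels = np.array([1.0, 0.0, 0.0, 0.0])       # item 0 is known to be relevant
known = np.array([True, False, False, False])

# Row-normalise so each item averages its neighbours' labels.
P = A / A.sum(axis=1, keepdims=True)

scores = labels.copy()
for _ in range(50):                 # iterate to (approximate) convergence
    scores = P @ scores             # propagate labels along weighted edges
    scores[known] = labels[known]   # clamp the observed labels

print(np.round(scores, 3))          # smoothed relevance scores for unseen items
```

Under this scheme, edge weights that assign high relevance to items strongly connected to observed relevant items incur a low smoothness penalty, which is exactly what the regularization over edge weights rewards.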


Author(s):  
Rizwan Ur Rahman ◽  
Rishu Verma ◽  
Himani Bansal ◽  
Deepak Singh Tomar

With the explosive expansion of information on the World Wide Web, search engines are becoming more significant in the day-to-day lives of humans. Even though a search engine generally returns a huge number of results for a given query, the majority of search engine users simply view the first few web pages in the result lists. Consequently, ranking position has become a major concern of internet service providers. This article addresses the vulnerabilities, spamming attacks, and countermeasures in blogging sites. The first part explores the spamming types and includes a detailed section on vulnerabilities. The next part presents an attack scenario of form spamming together with a defense approach. The aim of this article is thus to provide a review of the vulnerabilities and spamming threats associated with blogging websites, and of effective measures to counter them.
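The article's own defense approach is not reproduced here, but a common countermeasure against form spamming combines a hidden honeypot field with a minimum time-to-submit check; a minimal sketch follows, with the field name and threshold as illustrative assumptions.

```python
import time

MIN_FILL_SECONDS = 3.0  # assumed threshold; tune per form

def looks_like_spam(form: dict, rendered_at: float) -> bool:
    """Heuristic spam check for a blog comment form."""
    # Bots tend to fill every field, including the honeypot, which is
    # hidden from humans via CSS in the page template.
    if form.get("website_url_hp"):
        return True
    # Humans need at least a few seconds to fill in a real form;
    # bots often post back almost instantly.
    if time.time() - rendered_at < MIN_FILL_SECONDS:
        return True
    return False

# Usage: record time.time() when the form page is served, then check
# the submission against it.
served = time.time() - 10.0  # simulated: page served 10 seconds ago
print(looks_like_spam({"comment": "Nice post!"}, served))                      # False
print(looks_like_spam({"comment": "buy now", "website_url_hp": "x"}, served))  # True
```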


Author(s):  
Decebal Popescu ◽  
Nirvana Popescu ◽  
Ciprian Dobre

Public administration is subject to major changes affecting many countries, such as the need to implement the European Union Services Directive within the entire EU area. This chapter presents theoretical and practical approaches to developing e-Services and e-Government solutions, along with real experiences from two successful projects with great potential to improve complex government procedures. The Point of Single Contact (PSC) is an electronic means through which service providers can find information and complete the formalities necessary for doing business in a given member state. Each EU member state must have its own PSC, which should be a reliable source of electronic processing of information and should facilitate the interaction of citizens with the public administration. The design and implementation details of an e-Framework for optimizing the relationship between governments and citizens using e-Services are presented, together with evaluation results obtained by integrating a real-life workflow for opening a business in the Romanian environment. Finally, an e-Services system is used to optimize the automatic data transfers, document workflows, and business reporting of business organizations.


Author(s):  
Baramee Navanopparatskul ◽  
Sukree Sinthupinyo ◽  
Pirongrong Ramasoota

Following the enactment of the computer crime law in Thailand, online service providers (OSPs) are compelled to control illegal content, including content that is deemed harmful or problematic. This situation leads to self-censorship by intermediaries, often resulting in overblocking to avoid violating the law. Such flawed filtering both infringes users' freedom of expression and impedes the business of OSPs in Thailand. The Innovative Retrieval System (IRS) was thus developed to investigate intermediary censorship on the online discussion forum Pantip.com, as a case study of social media. The results show that there is no consistency in the censorship pattern on the website at all: the censorship criteria depend on the type of content in each forum. Overblocking is also high, at over 70% of removed content, owing to intimidation by governmental agencies, lawsuits from business organizations, and fear of intermediary liability. The website administrator admitted that he would cut off some users to avoid business troubles.


Author(s):  
Antonio Gomez-Skarmeta ◽  
Alejandro Perez Mendez ◽  
Elena Torroglosa Garcia ◽  
Gabriel Lopez Millán

Finally, it is analyzed how the inclusion of identity management (IdM) in business organizations can provide economic benefits. These benefits range from a reduction in resource requirements to an increase in potential clients thanks to the incorporation of the organization into an identity federation. Special attention is paid to the case where the telecommunications operator is established as the main identity provider, a straightforward result of its already established trust relationships with a wide range of parties (clients and service providers).


Information ◽  
2020 ◽  
Vol 11 (4) ◽  
pp. 186 ◽  
Author(s):  
Shuang Liu ◽  
Hui Yang ◽  
Jiayi Li ◽  
Simon Kolmanič

With the continuous improvement of living standards, rapid economic growth, and the rapid advancement of information science and technology, the domestic population has paid increasing attention to ancient Chinese history and culture. The use of information technology has been proven to promote the spread and development of historical culture, and it is becoming a necessary means of promoting traditional culture. This paper builds a knowledge graph of ancient Chinese history and culture in order to help the public understand the relevant knowledge more quickly and accurately. The construction process is as follows. First, crawler technology is used to obtain text and table data related to ancient history and culture from Baidu Encyclopedia (similar to Wikipedia) and from related pages on ancient Chinese history and culture. The crawler harvests the semi-structured data in the information boxes (InfoBoxes) of Baidu Encyclopedia to directly construct the triples required for the knowledge graph, and it crawls the introductory text of Baidu Encyclopedia entries and of specialized historical and cultural websites (History Chunqiu.com, On History.com), from which unstructured entities and relationships are extracted. Second, entity recognition and relationship extraction are performed on the unstructured text. Entity recognition uses the Bidirectional Long Short-Term Memory-Convolutional Neural Network-Conditional Random Field (BiLSTM-CNN-CRF) model, while relationships between entities are extracted with the open-source tool DeepKE (an information extraction tool with language recognition ability developed by Zhejiang University). The extracted entities and relationships are then supplemented with the triples constructed from the existing knowledge base and from the semi-structured data of the Baidu Encyclopedia information boxes. Subsequently, ontology construction and quality evaluation of the entire constructed knowledge graph are performed to form the final knowledge graph of ancient Chinese history and culture.
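The semi-structured step can be pictured with a short scraping sketch that turns encyclopedia infobox rows into (subject, predicate, object) triples. The CSS selectors below are assumptions; Baidu Encyclopedia's real markup differs and changes over time.

```python
import requests
from bs4 import BeautifulSoup

def infobox_triples(url: str, subject: str) -> list[tuple[str, str, str]]:
    """Extract (subject, predicate, object) triples from an infobox.

    The dt/dd class names are assumptions about the page markup,
    not Baidu Encyclopedia's guaranteed structure.
    """
    html = requests.get(url, timeout=10).text
    soup = BeautifulSoup(html, "html.parser")
    triples = []
    # Assume each infobox row is a <dt> attribute-name / <dd> value pair.
    for dt, dd in zip(soup.select("dt.basicInfo-item"),
                      soup.select("dd.basicInfo-item")):
        predicate = dt.get_text(strip=True)
        obj = dd.get_text(strip=True)
        triples.append((subject, predicate, obj))
    return triples

# e.g. infobox_triples("https://baike.baidu.com/item/...", "Tang dynasty")
# might yield [("Tang dynasty", "capital", "Chang'an"), ...]
```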


2019 ◽  
Vol 1 (4) ◽  
pp. 333-349 ◽  
Author(s):  
Peilu Wang ◽  
Hao Jiang ◽  
Jingfang Xu ◽  
Qi Zhang

Knowledge graph (KG) has played an important role in enhancing the performance of many intelligent systems. In this paper, we introduce the solution for building a large-scale multi-source knowledge graph from scratch at Sogou Inc., including its architecture, technical implementation and applications. Unlike previous works that build knowledge graphs with graph databases, we build the knowledge graph on top of SogouQdb, a distributed search engine developed by the Sogou Web Search Department, which can be easily scaled to support petabytes of data. As a supplement to the search engine, we also introduce a series of models to support inference and graph-based querying. Currently, the data of the Sogou knowledge graph, collected from 136 different websites and constantly updated, consist of 54 million entities and over 600 million entity links. We also introduce three applications of the knowledge graph at Sogou Inc.: entity detection and linking, knowledge-based question answering, and a knowledge-based dialog system. These applications have been used in Web search products to help users acquire information more efficiently.


2015 ◽  
Vol 39 (2) ◽  
pp. 197-213 ◽  
Author(s):  
Ahmet Uyar ◽  
Farouk Musa Aliyu

Purpose – The purpose of this paper is to better understand three main aspects of the semantic web search engines Google Knowledge Graph and Bing Satori. The authors investigated: coverage of entity types, the extent of their support for list search services, and the capabilities of their natural language query interfaces.
Design/methodology/approach – The authors manually submitted selected queries to these two semantic web search engines and evaluated the returned results. To test the coverage of entity types, the authors selected the entity types from the Freebase database. To test the capabilities of the natural language query interfaces, the authors used a manually developed query data set about US geography.
Findings – The results indicate that both semantic search engines cover only the very common entity types. In addition, the list search service is provided for only a small percentage of entity types. Moreover, both search engines support queries of very limited complexity and with a limited set of recognised terms.
Research limitations/implications – Both companies are continually working to improve their semantic web search engines. Therefore, the findings show their capabilities at the time of conducting this research.
Practical implications – The results show that in the near future both semantic search engines can be expected to expand their entity databases and improve their natural language interfaces.
Originality/value – As far as the authors know, this is the first study evaluating any aspect of these newly developing semantic web search engines. It shows their current capabilities and limitations, and it provides directions to researchers by pointing out the main problems for semantic web search engines.

