scholarly journals GENERATING KNOWLEDGE STRUCTURES FROM OPEN DATASETS' TAGS - AN APPROACH BASED ON FORMAL CONCEPT ANALYSIS

2021 ◽  
Vol 20 (1) ◽  
pp. 021
Author(s):  
Miloš Bogdanović ◽  
Milena Frtunić Gligorijević ◽  
Nataša Veljković ◽  
Leonid Stoimenov

Under influence of data transparency initiatives, a variety of institutions have published a significant number of datasets. In most cases, data publishers take advantage of open data portals (ODPs) for making their datasets publicly available. To improve the datasets' discoverability, open data portals (ODPs) group open datasets into categories using various criteria like publishers, institutions, formats, and descriptions. For these purposes, portals take advantage of metadata accompanying datasets. However, a part of metadata may be missing, or may be incomplete or redundant. Each of these situations makes it difficult for users to find appropriate datasets and obtain the desired information. As the number of available datasets grows, this problem becomes easy to notice. This paper is focused on the first step towards decreasing this problem by implementing knowledge structures to be used in situations where a part of datasets' metadata is missing. In particular, we focus on developing knowledge structures capable of suggesting the best match for the category where an uncategorized dataset should belong to. Our approach relies on dataset descriptions provided by users within dataset tags. We take advantage of a formal concept analysis to reveal the shared conceptualization originating from the tags' usage by developing a concept lattice per each category of open datasets. Since tags represent free text metadata entered by users, in this paper we will present a method of optimizing their usage through means of semantic similarity measures based on natural language processing mechanisms. Finally, we will demonstrate the advantage of our proposal by comparing concept lattices generated using formal the concept analysis before and after the optimization process. The main experimental research results will show that our approach is capable of reducing the number of nodes within a lattice more than 40%.

2013 ◽  
Vol 760-762 ◽  
pp. 1708-1712
Author(s):  
Ying Fang Li ◽  
Ying Jiang Li ◽  
Yan Li ◽  
Yang Bo

At present, as the number of web services resources on the network drastically increased, how to quickly and efficiently find the needed services from publishing services has become a problem to resolve. Aiming at the problems of low efficiency in service discovery of traditional web service, the formal concept analysis ( FCA) is introduced into the semantic Web service matching, and a Matching Algorithm based semantic web service is proposed. With considering the concept of limited inheritance,this method introduces the concept of limited inheritance to the semantic similarity calculation based on the concept lattice. It is significant in enhancing the service function matching in practical applications through adjust the calculation.


Information ◽  
2018 ◽  
Vol 9 (9) ◽  
pp. 228 ◽  
Author(s):  
Zuping Zhang ◽  
Jing Zhao ◽  
Xiping Yan

Web page clustering is an important technology for sorting network resources. By extraction and clustering based on the similarity of the Web page, a large amount of information on a Web page can be organized effectively. In this paper, after describing the extraction of Web feature words, calculation methods for the weighting of feature words are studied deeply. Taking Web pages as objects and Web feature words as attributes, a formal context is constructed for using formal concept analysis. An algorithm for constructing a concept lattice based on cross data links was proposed and was successfully applied. This method can be used to cluster the Web pages using the concept lattice hierarchy. Experimental results indicate that the proposed algorithm is better than previous competitors with regard to time consumption and the clustering effect.


2012 ◽  
Vol 6-7 ◽  
pp. 625-630 ◽  
Author(s):  
Hong Sheng Xu

In the form of background in the form of concept partial relation to the corresponding concept lattice, concept lattice is the core data structure of formal concept analysis. Association rule mining process includes two phases: first find all the frequent itemsets in data collection, Second it is by these frequent itemsets to generate association rules. This paper analyzes the association rule mining algorithms, such as Apriori and FP-Growth. The paper presents the construction search engine based on formal concept analysis and association rule mining. Experimental results show that the proposed algorithm has high efficiency.


Author(s):  
Nida Meddouri ◽  
Mondher Maddouri

Knowledge discovery in databases (KDD) aims to exploit the large amounts of data collected every day in various fields of computing application. The idea is to extract hidden knowledge from a set of data. It gathers several tasks that constitute a process, such as: data selection, pre-processing, transformation, data mining, visualization, etc. Data mining techniques include supervised classification and unsupervised classification. Classification consists of predicting the class of new instances with a classifier built on learning data of labeled instances. Several approaches were proposed such as: the induction of decision trees, Bayes, nearest neighbor search, neural networks, support vector machines, and formal concept analysis. Learning formal concepts always refers to the mathematical structure of concept lattice. This article presents a state of the art on formal concept analysis classifier. The authors present different ways to calculate the closure operators from nominal data and also present new approach to build only a part of the lattice including the best concepts. This approach is based on Dagging (ensemble method) that generates an ensemble of classifiers, each one represents a formal concept, and combines them by a voting rule. Experimental results are given to prove the efficiency of the proposed method.


2020 ◽  
Vol 39 (3) ◽  
pp. 2783-2790
Author(s):  
Qian Hu ◽  
Ke-Yun Qin

The construction of concept lattices is an important research topic in formal concept analysis. Inspired by multi-granularity rough sets, multi-granularity formal concept analysis has become a new hot research issue. This paper mainly studies the construction methods of concept lattices in multi-granularity formal context. The relationships between concept forming operators under different granularity are discussed. The mutual transformation methods of formal concepts under different granularity are presented. In addition, the approaches of obtaining coarse-granularity concept lattice by fine-granularity concept lattice and fine-granularity concept lattice by coarse-granularity concept lattice are examined. The related algorithms for generating concept lattices are proposed. The practicability of the method is illustrated by an example.


2013 ◽  
Vol 427-429 ◽  
pp. 2536-2539
Author(s):  
Xue Song Dai ◽  
Yuan Ma ◽  
Wen Xue Hong

Formal context is one of the research contents of formal concept analysis theory. In concept lattice, the attributes of the object are equivalent and there is no hierarchy. Facing to this problem, the equivalence relation which is on the attributes' set is defined and the corresponding σ operation is proposed. On this basis, the structure method of attribute hierarchical diagram is presented and attributes' sequences of associated objects are obtained. This conclusion enriches and extends the analysis method of the formal context.


2018 ◽  
Vol 2018 ◽  
pp. 1-11 ◽  
Author(s):  
Huilai Zhi ◽  
Hao Chao

Recently, incomplete formal contexts have received more and more attention from the communities of formal concept analysis. Different from a complete context where the binary relations between all the objects and attribute are known, an incomplete formal context has at least a pair of object and attribute with a completely unknown binary relation. Partially known formal concepts use interval sets to indicate the incompleteness. Three-way formal concept analysis is capable of characterizing a target set by combining positive and negative attributes. However, how to describe target set, by pointing out what attributes it has with certainty and what attributes it has with possibility and what attributes it does not has with certainty and what attributes it does not has with possibility, is still an open problem. This paper combines the ideas of three-way formal concept analysis and partially known formal concepts and presents a framework of approximate three-way concept analysis. At first, approximate object-induced and attribute-induced three-way concept lattices are introduced, respectively. And then, the relationship between approximate three-way concept lattice and classical three-way concept lattice are investigated. Finally, examples are presented to demonstrate and verify the obtained results.


Author(s):  
Jamal Atif ◽  
Isabelle Bloch ◽  
Céline Hudelot

In this paper we extend some previously established links between the derivation operators used in formal concept analysis and some mathematical morphology operators to fuzzy concept analysis. We also propose to use mathematical morphology to navigate in a fuzzy concept lattice and perform operations on it. Links with other lattice-based for malisms such as rough sets and F-transforms are also established. This paper proposes a discussion and new results on such links and their potential interest.


Sign in / Sign up

Export Citation Format

Share Document