Basic Concepts in Data Mining

2015 ◽  
pp. 86-98
Keyword(s):  
Author(s):  
Zhongguang Fu ◽  
Tao Jin ◽  
Kun Yang

Rough set theory is a powerful tool in deal with vagueness and uncertainty. It is particularly suitable to discover hidden and potentially useful knowledge in data and can be used to reduce features and extract rules. This paper introduces the basic concepts and fundamental elements of the rough set theory. A reduction algorithm that integrates a priori with significance is proposed to illustrate how the rough set theory could be used to extract fault features of the condenser in a power plant. Two testing examples are then presented to demonstrate the effectiveness of the theory in fault diagnosis.


Filomat ◽  
2017 ◽  
Vol 31 (19) ◽  
pp. 6175-6183
Author(s):  
Yan-Lan Zhang ◽  
Chang-Qing Li

Rough set theory is an important tool for data mining. Lower and upper approximation operators are two important basic concepts in the rough set theory. The classical Pawlak rough approximation operators are based on equivalence relations and have been extended to relation-based generalized rough approximation operators. This paper presents topological properties of a pair of relation-based generalized rough approximation operators. A topology is induced by the pair of generalized rough approximation operators from an inverse serial relation. Then, connectedness, countability, separation property and Lindel?f property of the topological space are discussed. The results are not only beneficial to obtain more properties of the pair of approximation operators, but also have theoretical and actual significance to general topology.


Author(s):  
Reima Suomi ◽  
Olli Sjöblom

This chapter introduces aviation safety data analysis as an important application area for data mining. In the beginning of the chapter, the reader is introduced to the basic concepts of data mining. After that, the field of aviation safety management is discussed, and in that connection data mining is identified as a key technology to study through flight incidents reports. Afterwards the test runs for four data mining products, for possible use in the Finnish civil aviation authority, are described in detail. However, before the testing of tools the preparation of the test data for the tools is described in detail. The chapter ends with conclusions that tell that even sophisticated data mining tools are just tools: they do not provide any automatic tools, but skilled users can use them for searching clues in the data.


2012 ◽  
Vol 546-547 ◽  
pp. 497-502 ◽  
Author(s):  
Yu Lian Gai

From information technology point of view, analyzed the theoretical and practical background of data mining processing massive amounts of data. Based on the definition and basic concepts of data mining, discussed and analyzed 5 aspects of its main task. Described the data mining process comprehensively. Integrating with the problems within the Apriori Algorithm, described the improvements to the Apriori algorithm. And, in the end, summarized the role of data mining technology in promoting the development of computer technology.


Author(s):  
Mihai Gabroveanu

During the last years the amount of data stored in databases has grown very fast. Data mining, also known as knowledge discovery in databases, represents the discovery process of potentially useful hidden knowledge or relations among data from large databases. An important task in the data mining process is the discovery of the association rules. An association rule describes an interesting relationship between different attributes. There are different kinds of association rules: Boolean (crisp) association rules, quantitative association rules, fuzzy association rules, etc. In this chapter, we present the basic concepts of Boolean and the fuzzy association rules, and describe the methods used to discover the association rules by presenting the most important algorithms.


2014 ◽  
Vol 543-547 ◽  
pp. 2988-2991
Author(s):  
Xiang Min Wu ◽  
Cheng Lin Zhao ◽  
Pan Cao

Customer Relationship Management (CRM) is becoming the focus of enterprise and an active research field of computer science. The ariticle introduces some basic concepts about CRM and data mining, and some benefits brought by data mining in CRM. At the end it points out how to apply data mining applications in CRM.


Author(s):  
Huan Liu ◽  
Lei Yu

The amounts of data have become increasingly large in recent years as the capacity of digital data storage worldwide has significantly increased. As the size of data grows, the demand for data reduction increases for effective data mining. Instance selection is one of the effective means to data reduction. This article introduces the basic concepts of instance selection and its context, necessity, and functionality. The article briefly reviews the state-of-the-art methods for instance selection.


Author(s):  
Andrew Kusiak ◽  
Shital C. Shah

Most processes in pharmaceutical industry are data driven. Company’s ability to capture the data and making use of it will grow in significance and may become the main factor differentiating the industry. Basic concepts of data mining, data warehousing, and data modeling are introduced. These new data-driven concepts lead to a paradigm shift in pharmaceutical industry.


2015 ◽  
Vol 713-715 ◽  
pp. 1970-1973
Author(s):  
Chun Liu ◽  
Dong Xing Wang ◽  
Kun Tan

Concept lattice in essence describe the links between objects and attributes,demonstratesthe generalization and specialization of concepts. The corresponding Hasse diagrams realize the visualization of the data. At present, formal concept analysis has been extensively studied and applied to many areas, such asinformation retrieval, machine learning andsoftware engineering. Based on the above reasons, it is necessary to research the methods of latticeconcept of data mining. This paper is divided into three parts; the first part introduces the basic concepts of data mining. The second part introduces the basic theory of concept lattices. The last part focuses on the application of concept in data mining.


Sign in / Sign up

Export Citation Format

Share Document