Data Mining Integrated with Domain Knowledge

Author(s):  
Anqiang Huang ◽  
Lingling Zhang ◽  
Zhengxiang Zhu ◽  
Yong Shi
Keyword(s):  
2014 ◽  
Vol 496-500 ◽  
pp. 2108-2111
Author(s):  
Jian Hu Zhang ◽  
Lei Lei ◽  
Xin You Cui ◽  
Yong Wu ◽  
Lin Tao Li

Through in-depth understanding of the domain knowledge of insurance and the study of the technology of data warehouse, the paper illustrate the application of data mining technology and data warehouse technology in the insurance clients analysis, and from the basic flow of, discusse the application of data warehouse technology in the field of insurance industry. Then, from the concept of data warehouse, describe the design and implementation of data warehouse concept model and logical model.


Author(s):  
Eric Araka ◽  
Robert Oboko ◽  
Elizaphan Maina ◽  
Rhoda K. Gitonga

Self-regulated learning is attracting tremendous researches from various communities such as information communication technology. Recent studies have greatly contributed to the domain knowledge that the use self-regulatory skills enhance academic performance. Despite these developments in SRL, our understanding on the tools and instruments to measure SRL in online learning environments is limited as the use of traditional tools developed for face-to-face classroom settings are still used to measure SRL on e-learning systems. Modern learning management systems (LMS) allow storage of datasets on student activities. Subsequently, it is now possible to use Educational Data Mining to extract learner patterns which can be used to support SRL. This chapter discusses the current tools for measuring and promoting SRL on e-learning platforms and a conceptual model grounded on educational data mining for implementation as a solution to promoting SRL strategies.


Author(s):  
Longbing Cao ◽  
Chengqi Zhang

Quantitative intelligence based traditional data mining is facing grand challenges from real-world enterprise and cross-organization applications. For instance, the usual demonstration of specific algorithms cannot support business users to take actions to their advantage and needs. We think this is due to Quantitative Intelligence focused data-driven philosophy. It either views data mining as an autonomous data-driven, trial-and-error process, or only analyzes business issues in an isolated, case-by-case manner. Based on experience and lessons learnt from real-world data mining and complex systems, this article proposes a practical data mining methodology referred to as Domain-Driven Data Mining. On top of quantitative intelligence and hidden knowledge in data, domain-driven data mining aims to meta-synthesize quantitative intelligence and qualitative intelligence in mining complex applications in which human is in the loop. It targets actionable knowledge discovery in constrained environment for satisfying user preference. Domain-driven methodology consists of key components including understanding constrained environment, business-technical questionnaire, representing and involving domain knowledge, human-mining cooperation and interaction, constructing next-generation mining infrastructure, in-depth pattern mining and postprocessing, business interestingness and actionability enhancement, and loop-closed human-cooperated iterative refinement. Domain-driven data mining complements the data-driven methodology, the metasynthesis of qualitative intelligence and quantitative intelligence has potential to discover knowledge from complex systems, and enhance knowledge actionability for practical use by industry and business.


Author(s):  
Hung Son Nguyen ◽  
Andrzej Jankowski ◽  
James F. Peters ◽  
Andrzej Skowron ◽  
Jaroslaw Stepaniuk ◽  
...  

The rapid expansion of the Internet has resulted not only in the ever-growing amount of data stored therein, but also in the burgeoning complexity of the concepts and phenomena pertaining to that data. This issue has been vividly compared by the renowned statistician J.F. Friedman (Friedman, 1997) of Stanford University to the advances in human mobility from the period of walking afoot to the era of jet travel. These essential changes in data have brought about new challenges in the discovery of new data mining methods, especially the treatment of these data that increasingly involves complex processes that elude classic modeling paradigms. “Hot” datasets like biomedical, financial or net user behavior data are just a few examples. Mining such temporal or stream data is a focal point in the agenda of many research centers and companies worldwide (see, e.g., (Roddick et al., 2001; Aggarwal, 2007)). In the data mining community, there is a rapidly growing interest in developing methods for process mining, e.g., for discovery of structures of temporal processes from observed sample data. Research on process mining (e.g., (Unnikrishnan et al., 2006; de Medeiros et al., 2007; Wu, 2007; Borrett et al., 2007)) have been undertaken by many renowned centers worldwide1. This research is also related to functional data analysis (see, e.g., (Ramsay & Silverman, 2002)), cognitive networks (see, e.g., (Papageorgiou & Stylios, 2008)), and dynamical system modeling, e.g., in biology (see, e.g., (Feng et al., 2007)). We outline an approach to the discovery of processes from data and domain knowledge. The proposed approach to discovery of process models is based on rough-granular computing. In particular, we discuss how changes along trajectories of such processes can be discovered from sample data and domain knowledge.


2013 ◽  
Vol 645 ◽  
pp. 232-238
Author(s):  
Qing Li ◽  
Wei Yang ◽  
Xiao Nan Ye ◽  
Xiao Xiao Ma

The realization of a device test training system requires the use of a lot of domain knowledge, and building knowledge base will play an important role. In view of the uncertainty, inaccuracy and incompleteness of test data in the testing process, this paper makes the data mining algorithms based on rough set as knowledge acquisition algorithm, and proposes an improved algorithm for insufficient of approximate reduction of rough set knowledge based on the tolerance relation of incomplete information system. The paper studies the design and realization of knowledge base system in the developing of device simulation training system on this basis, and validates the method through a design example of knowledge base of a certain device simulation training system.


Author(s):  
Joseph Tan ◽  
Fuchung Wang

This study examines time-sensitive applications of data mining methods to facilitate claims review processing and provide policy information for insurance decision-making vis-à-vis the Taiwan National Health Insurance (NHI) databases. In order to obtain the best payment management, a hybrid mining (HM) approach, which has been grounded on the extant knowledge of data mining projects and health insurance domain knowledge, is proposed. Through the integration of data warehousing, online analytic processing, data mining techniques and traditional data analysis in the healthcare field, an easy-to-use decision support platform, which will assist in directing the health insurance decision-making process, is built. Drawing from lessons learned within a case study setting, results showed that not only is HM approach a reliable, powerful, and user-friendly platform for diversified payment decision support, but that it also has great relevance for the practice and acceptance of evidence-based medicine. Essentially, HM approach can provide a critical boost to health insurance decision support; hence, future researchers should develop and improve the approach combined with their own application systems.


Sign in / Sign up

Export Citation Format

Share Document