A Query Language for Mobility Data Mining

The technologies of mobile communications and ubiquitous computing pervade society. Wireless networks sense the movement of people and vehicles, generating large volumes of mobility data, such as mobile phone call records and GPS tracks. This data can produce useful knowledge, supporting sustainable mobility and intelligent transportation systems, provided that a suitable knowledge discovery process is enacted for mining this mobility data. In this paper, the authors examine a formal framework, and the associated implementation, for a data mining query language for mobility data, created as a result of a European-wide research project called GeoPKDD (Geographic Privacy-Aware Knowledge Discovery and Delivery). The authors discuss how the system provides comprehensive support for the Mobility Knowledge Discovery process and illustrate its analytical power in unveiling the complexity of urban mobility in a large metropolitan area, based on a massive real life GPS dataset.

Download Full-text

Privacy Preserving Data Mining, Concepts, Techniques, and Evaluation Methodologies

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch143 ◽

2008 ◽

pp. 2379-2401 ◽

Cited By ~ 1

Author(s):

Igor Nai Fovino

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Privacy Preserving ◽

Research Field ◽

Sensitive Information ◽

Discovery Process ◽

Privacy Preserving Data Mining ◽

Mining Technology ◽

Evaluation Methodologies ◽

New Research

Intense work in the area of data mining technology and in its applications to several domains has resulted in the development of a large variety of techniques and tools able to automatically and intelligently transform large amounts of data in knowledge relevant to users. However, as with other kinds of useful technologies, the knowledge discovery process can be misused. It can be used, for example, by malicious subjects in order to reconstruct sensitive information for which they do not have an explicit access authorization. This type of “attack” cannot easily be detected, because, usually, the data used to guess the protected information, is freely accessible. For this reason, many research efforts have been recently devoted to addressing the problem of privacy preserving in data mining. The mission of this chapter is therefore to introduce the reader in this new research field and to provide the proper instruments (in term of concepts, techniques and example) in order to allow a critical comprehension of the advantages, the limitations and the open issues of the Privacy Preserving Data Mining Techniques.

Download Full-text

Recursive data mining strategy using close-degree of concept lattice for knowledge discovery process with granularity

2011 International Conference on Emerging Trends in Networks and Computer Communications (ETNCC) ◽

10.1109/etncc.2011.6255885 ◽

2011 ◽

Cited By ~ 3

Author(s):

Hema Dubey ◽

B. N. Roy

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Concept Lattice ◽

Discovery Process ◽

Recursive Data ◽

Data Mining Strategy

Download Full-text

Privacy Preserving Data Mining, Concepts, Techniques, and Evaluation Methodologies

Successes and New Directions in Data Mining ◽

10.4018/978-1-59904-645-7.ch012 ◽

2008 ◽

pp. 277-301

Author(s):

Igor Nai Fovino

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Privacy Preserving ◽

Research Field ◽

Sensitive Information ◽

Discovery Process ◽

Privacy Preserving Data Mining ◽

Mining Technology ◽

Evaluation Methodologies ◽

New Research

Intense work in the area of data mining technology and in its applications to several domains has resulted in the development of a large variety of techniques and tools able to automatically and intelligently transform large amounts of data in knowledge relevant to users. However, as with other kinds of useful technologies, the knowledge discovery process can be misused. It can be used, for example, by malicious subjects in order to reconstruct sensitive information for which they do not have an explicit access authorization. This type of “attack” cannot easily be detected, because, usually, the data used to guess the protected information, is freely accessible. For this reason, many research efforts have been recently devoted to addressing the problem of privacy preserving in data mining. The mission of this chapter is therefore to introduce the reader in this new research field and to provide the proper instruments (in term of concepts, techniques and example) in order to allow a critical comprehension of the advantages, the limitations and the open issues of the Privacy Preserving Data Mining Techniques.

Download Full-text

VISUALIZATION SUPPORT FOR USER-CENTERED MODEL SELECTION IN KNOWLEDGE DISCOVERY AND DATA MINING

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213001000726 ◽

2001 ◽

Vol 10 (04) ◽

pp. 691-713 ◽

Cited By ~ 8

Author(s):

TUBAO HO ◽

TRONGDUNG NGUYEN ◽

DUCDUNG NGUYEN ◽

SAORI KAWASAKI

Keyword(s):

Data Mining ◽

Model Selection ◽

Knowledge Discovery ◽

Performance Metrics ◽

Model Performance ◽

Discovery Process ◽

Whole Process ◽

Discovery System ◽

Competing Models ◽

Visualization Techniques

The problem of model selection in knowledge discovery and data mining—the selection of appropriate discovered patterns/models or algorithms to achieve such patterns/models—is generally a difficult task for the user as it requires meta-knowledge on algorithms/models and model performance metrics. Viewing knowledge discovery as a human-centered process that requires an effective collaboration between the user and the discovery system, our work aims to make model selection in knowledge discovery easier and more effective. For such a collaboration, our solution is to give the user the ability to try easily various alternatives and to compare competing models quantitatively and qualitatively. The basic idea of our solution is to integrate data and knowledge visualization with the knowledge discovery process in order to the support the participation of the user. We introduce the knowledge discovery system D2MS in which several visualization techniques of data and knowledge are developed and integrated into the steps of the knowledge discovery process. The visualizers in D2MS greatly help the user gain better insight in each step of the knowledge discovery process as well the relationship between data and discovered knowledge in the whole process.

Download Full-text

Data Mining and Knowledge Discovery in Real Life Applications

10.5772/97 ◽

2009 ◽

Cited By ~ 9

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Real Life

Download Full-text

Modelling of e-Governance Framework for Mining Knowledge from Massive Grievance Redressal Data

International Journal of Advances in Applied Sciences ◽

10.11591/ijaas.v6.i1.pp32-41 ◽

2017 ◽

Vol 6 (1) ◽

pp. 32

Author(s):

Sangeetha G ◽

L. Manjunatha Rao

Keyword(s):

Data Mining ◽

Big Data ◽

Knowledge Discovery ◽

Discussion Forums ◽

Discovery Process ◽

Web Based ◽

Governance Framework ◽

Data Mining Algorithms ◽

Massive Number ◽

Mining Algorithms

With the massive proliferation of online applications for the citizens with abundant resources, there is a tremendous hike in usage of e-governance platforms. Right from entrepreneur, players, politicians, students, or anyone who are highly depending on web-based grievance redressal networking sites, which generates loads of massive grievance data that are not only challenging but also highly impossible to understand. The prime reason behind this is grievance data is massive in size and they are highly unstructured. Because of this fact, the proposed system attempts to understand the possibility of performing knowledge discovery process from grievance Data using conventional data mining algorithms. Designed in Java considering massive number of online e-governance framework from civilian’s grievance discussion forums, the proposed system evaluates the effectiveness of performing datamining for Big data.

Download Full-text

Data Mining and Knowledge Discovery in Metabolomics Armin

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch104 ◽

2008 ◽

pp. 1759-1783

Author(s):

Christian Baumgartner ◽

Armin Graber

Keyword(s):

Data Mining ◽

Feature Selection ◽

Knowledge Discovery ◽

Biomedical Application ◽

Life Sciences ◽

Background Information ◽

Data Generation ◽

Discovery Process ◽

Mining Community ◽

Preclinical And Clinical Studies

This chapter provides an overview of the knowledge discovery process in metabolomics, a young discipline in the life sciences arena. It introduces two emerging bioanalytical concepts for generating biomolecular information, followed by various data mining and information retrieval procedures such as feature selection, classification, clustering and biochemical interpretation of mined data, illustrated by real examples from preclinical and clinical studies. The authors trust that this chapter will provide an acceptable balance between bioanalytics background information, essential to understanding the complexity of data generation, and information on data mining principals, specific methods and processes, and biomedical application. Thus, this chapter is anticipated to appeal to those with a metabolomics background as well as to basic researchers within the data mining community who are interested in novel life science applications.

Download Full-text

Actionable Knowledge Discovery

Encyclopedia of Information Science and Technology, Second Edition ◽

10.4018/978-1-60566-026-4.ch002 ◽

2011 ◽

pp. 8-13

Author(s):

Longbing Cao

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Social Intelligence ◽

Pattern Mining ◽

Real Life ◽

Simulated Data ◽

Business Environment ◽

Actionable Knowledge ◽

Real Life Data ◽

Knowledge Discovery In Database

Actionable knowledge discovery is selected as one of the greatest challenges (Ankerst, 2002; Fayyad, Shapiro, & Uthurusamy, 2003) of next-generation knowledge discovery in database (KDD) studies (Han & Kamber, 2006). In the existing data mining, often mined patterns are nonactionable to real user needs. To enhance knowledge actionability, domain-related social intelligence is substantially essential (Cao et al., 2006b). The involvement of domain-related social intelligence into data mining leads to domaindriven data mining (Cao & Zhang, 2006a, 2007a), which complements traditional data-centered mining methodology. Domain-related social intelligence consists of intelligence of human, domain, environment, society and cyberspace, which complements data intelligence. The extension of KDD toward domain-driven data mining involves many challenging but promising research and development issues in KDD. Studies in regard to these issues may promote the paradigm shift of KDD from data-centered interesting pattern mining to domain-driven actionable knowledge discovery, and the deployment shift from simulated data set-based to real-life data and business environment-oriented as widely predicted.

Download Full-text

Ontology-Based Construction of Grid Data Mining Workflows

Quantitative Quality of Service for Grid Computing ◽

10.4018/978-1-60566-370-8.ch021 ◽

2011 ◽

pp. 404-434

Author(s):

Peter Brezany ◽

Ivan Janciak ◽

A Min Tjoa

Keyword(s):

Data Mining ◽

Real Life ◽

Data Exploration ◽

Discovery Process ◽

Grid Data ◽

Industry Standard ◽

Industry Standards ◽

Interactive Data ◽

Workflow Composition ◽

Whole Life Cycle

This chapter introduces an ontology-based framework for automated construction of complex interactive data mining workflows as a means of improving productivity of Grid-enabled data exploration systems. The authors first characterize existing manual and automated workflow composition approaches and then present their solution called GridMiner Assistant (GMA), which addresses the whole life cycle of the knowledge discovery process. GMA is specified in the OWL language and is being developed around a novel data mining ontology, which is based on concepts of industry standards like the predictive model markup language, cross industry standard process for data mining, and Java data mining API. The ontology introduces basic data mining concepts like data mining elements, tasks, services, and so forth. In addition, conceptual and implementation architectures of the framework are presented and its application to an example taken from the medical domain is illustrated. The authors hope that the further research and development of this framework can lead to productivity improvements, which can have significant impact on many real-life spheres. For example, it can be a crucial factor in achievement of scientific discoveries, optimal treatment of patients, productive decision making, cutting costs, and so forth.

Download Full-text