Overview of Predictive Modeling Approaches in Health Care Data Mining

Business Intelligence ◽

10.4018/978-1-4666-9562-7.ch004 ◽

2016 ◽

pp. 73-95 ◽

Cited By ~ 1

Author(s):

Sunita Soni

Keyword(s):

Data Mining ◽

Predictive Modeling ◽

Clinical Information ◽

Ease Of Use ◽

Classification Model ◽

Computational Time ◽

Data Sets ◽

Advantages And Disadvantages ◽

Mining Methods ◽

Predictive Data Mining

Medical data mining has great potential for exploring the hidden pattern in the data sets of the medical domain. A predictive modeling approach of Data Mining has been systematically applied for the prognosis, diagnosis, and planning for treatment of chronic disease. For example, a classification system can assist the physician to predict if the patient is likely to have a certain disease, or by considering the output of the classification model, the physician can make a better decision on the treatment to be applied to the patient. Once the model is evaluated and verified, it may be embedded within clinical information systems. The objective of this chapter is to extensively study the various predictive data mining methods to evaluate their usage in terms of accuracy, computational time, comprehensibility of the results, ease of use of the algorithm, and advantages and disadvantages to relatively naive medical users. The research has shown that there is not a single best prediction tool, but instead, the best performing algorithm will depend on the features of the dataset to be analyzed.

Download Full-text

Stamping Plant 4.0 – Basics for the Application of Data Mining Methods in Manufacturing Car Body Parts

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.639.21 ◽

2015 ◽

Vol 639 ◽

pp. 21-30 ◽

Cited By ~ 7

Author(s):

Stephan Purr ◽

Josef Meinhardt ◽

Arnulf Lipp ◽

Axel Werner ◽

Martin Ostermair ◽

...

Keyword(s):

Data Mining ◽

Data Acquisition ◽

Data Driven ◽

Quality Analysis ◽

Process Conditions ◽

Data Sets ◽

Body Parts ◽

Car Body ◽

Sample Data ◽

Mining Methods

Data-driven quality evaluation in the stamping process of car body parts is quite promising because dependencies in the process have not yet been sufficiently researched. However, the application of data mining methods for the process in stamping plants would require a large number of sample data sets. Today, acquiring these data represents a major challenge, because the necessary data are inadequately measured, recorded or stored. Thus, the preconditions for the sample data acquisition must first be created before being able to investigate any correlations. In addition, the process conditions change over time due to wear mechanisms. Therefore, the results do not remain valid and a constant data acquisition is required. In this publication, the current situation in stamping plants regarding the process robustness will be first discussed and the need for data-driven methods will be shown. Subsequently, the state of technology regarding the possibility of collecting the sample data sets for quality analysis in producing car body parts will be researched. At the end of this work, an overview will be provided concerning how this data collection was implemented at BMW as well as what kind of potential can be expected.

Download Full-text

Exploring the Relationship between Soil Properties and Deterioration of Metallic Pipes Using Predictive Data Mining Methods

Journal of Computing in Civil Engineering ◽

10.1061/(asce)cp.1943-5487.0000032 ◽

2010 ◽

Vol 24 (3) ◽

pp. 289-301 ◽

Cited By ~ 19

Author(s):

Zheng Liu ◽

Rehan Sadiq ◽

Balvant Rajani ◽

Homayoun Najjaran

Keyword(s):

Data Mining ◽

Soil Properties ◽

Mining Methods ◽

Predictive Data Mining ◽

The Relationship

Download Full-text

Analysis of flight delays in aviation system using different classification algorithms and feature selection methods

The Aeronautical Journal ◽

10.1017/aer.2019.72 ◽

2019 ◽

Vol 123 (1267) ◽

pp. 1415-1436 ◽

Cited By ~ 1

Author(s):

A. B. A. Anderson ◽

A. J. Sanjeev Kumar ◽

A. B. Arockia Christopher

Keyword(s):

Data Mining ◽

Feature Selection ◽

Classification Model ◽

System Level ◽

Support Vector ◽

Flight Delays ◽

Data Mining Techniques ◽

Mining Methods ◽

Artificial Neural Network Ann ◽

Aircraft System

ABSTRACTData mining is a process of finding correlations and collecting and analysing a huge amount of data in a database to discover patterns or relationships. Flight delay creates significant problems in the present aviation system. Data mining techniques are desired for analysing the performance in which micro-level causes propagate to make system-level patterns of delay. Analysing flight delays is very difficult – both when looking from a historical view as well as when estimating delays with forecast demand. This paper proposes using Decision Tree (DT), Support Vector Machine (SVM), Naive Bayesian (NB), K-nearest neighbour (KNN) and Artificial Neural Network (ANN) to study and analyse delays among aircrafts. The performance of different data mining methods is found in the different regions of the updated datasets on these classifiers. Finally, the result shows a significant variation in the performance of different data mining methods and feature selection for this problem. This paper aims to deal with how data mining techniques can be used to understand difficult aircraft system delays in aviation. Our aim is to develop a classification model for studying and reducing delay using different data mining methods and, in this manner, to show that DT has a greater classification accuracy. The different feature selectors are used in this study in order to reduce the number of initial attributes. Our results clearly demonstrate the value of DT for analysing and visualising how system-level effects happen from subsystem-level causes.

Download Full-text

Trajectory Pattern Mining: Methods and Applications

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.490-491.1361 ◽

2014 ◽

Vol 490-491 ◽

pp. 1361-1367

Author(s):

Xin Huang ◽

Hui Juan Chen ◽

Mao Gong Zheng ◽

Ping Liu ◽

Jing Qian

Keyword(s):

Data Mining ◽

Social Media ◽

Pattern Mining ◽

Data Sets ◽

Trajectory Data ◽

Data Mining Algorithms ◽

Mining Methods ◽

Trajectory Pattern ◽

Mining Algorithms ◽

Trajectory Pattern Mining

With the advent of location-based social media and locationacquisition technologies, trajectory data are becoming more and more ubiquitous in the real world. A lot of data mining algorithms have been successfully applied to trajectory data sets. Trajectory pattern mining has received a lot of attention in recent years. In this paper, we review the most inuential methods as well as typical applications within the context of trajectory pattern mining.

Download Full-text

Data Mining on Social Interaction Networks

Journal of Data Mining & Digital Humanities ◽

10.46298/jdmdh.7 ◽

2014 ◽

Vol 2014 ◽

Author(s):

Martin Atzmueller

Keyword(s):

Data Mining ◽

Social Interaction ◽

Real World ◽

Social Systems ◽

Interaction Networks ◽

Descriptive Data ◽

Face To Face ◽

The Social ◽

Mining Methods ◽

Predictive Data Mining

Social media and social networks have already woven themselves into the very fabric of everyday life. This results in a dramatic increase of social data capturing various relations between the users and their associated artifacts, both in online networks and the real world using ubiquitous devices. In this work, we consider social interaction networks from a data mining perspective - also with a special focus on real-world face-to-face contact networks: We combine data mining and social network analysis techniques for examining the networks in order to improve our understanding of the data, the modeled behavior, and its underlying emergent processes. Furthermore, we adapt, extend and apply known predictive data mining algorithms on social interaction networks. Additionally, we present novel methods for descriptive data mining for uncovering and extracting relations and patterns for hypothesis generation and exploration, in order to provide characteristic information about the data and networks. The presented approaches and methods aim at extracting valuable knowledge for enhancing the understanding of the respective data, and for supporting the users of the respective systems. We consider data from several social systems, like the social bookmarking system BibSonomy, the social resource sharing system flickr, and ubiquitous social systems: Specifically, we focus on data from the social conference guidance system Conferator and the social group interaction system MyGroup. This work first gives a short introduction into social interaction networks, before we describe several analysis results in the context of online social networks and real-world face-to-face contact networks. Next, we present predictive data mining methods, i.e., for localization, recommendation and link prediction. After that, we present novel descriptive data mining methods for mining communities and patterns.

Download Full-text

Robustness of Predictive Data Mining Methods under the Presence of Measurement Errors in the Context of Production Processes

IOSR Journal of Computer Engineering ◽

10.9790/0661-1901049098 ◽

2017 ◽

Vol 19 (01) ◽

pp. 90-98

Author(s):

Daniela F. Dianda

Keyword(s):

Data Mining ◽

Measurement Errors ◽

Production Processes ◽

Mining Methods ◽

Predictive Data Mining

Download Full-text

Data Mining to Examine the Treatment of Osteomyelitis

Cases on Health Outcomes and Clinical Data Mining ◽

10.4018/978-1-61520-723-7.ch002 ◽

2010 ◽

pp. 15-46

Author(s):

Hamid Zahedi

Keyword(s):

Data Mining ◽

Relevant Information ◽

Background Information ◽

Primary Data ◽

Data Sets ◽

National Inpatient Sample ◽

Mining Methods ◽

Access To Data ◽

The Cost ◽

Online Sources

The purpose of this study is to use data mining methods to investigate the physician decisions specifically in the treatment of osteomyelitis. Two primary data sets have been used in this study; the National Inpatient Sample (NIS) and the Thomson MedStat MarketScan data. We used online sources to obtain background information about the disease and its treatment in the literature. An innovative method was used to capture the information from the web and to cluster or filter that information to find the most relevant information, using SAS Text Miner. Other important innovations in investigating the data include finding the switches of medication and comparing the date of the switch with the date of procedures. We could study these medications switched at deeper levels, but this is not necessary in our study, especially with limited access to data. We also create a model to forecast the cost of hospitalization for patients with osteomyelitis.

Download Full-text

Application of Chosen Data Mining Methods in Predicting Abnormal Blood Pressure in Children and Adolescents

Studies in Logic, Grammar and Rhetoric ◽

10.2478/slgr-2018-0038 ◽

2018 ◽

Vol 56 (1) ◽

pp. 19-28

Author(s):

Anna Sowińska ◽

Izabela Miechowicz

Keyword(s):

Data Mining ◽

Blood Pressure ◽

Heart Rate ◽

Classification Model ◽

Common Disease ◽

Study Group ◽

Age Group ◽

Classification Models ◽

Body Proportions ◽

Mining Methods

Abstract Hypertension is a common disease in highly industrialized societies, more often perceived as a health problem in adults rather than children. However, epidemiologists are currently paying more attention to the possibility of idiopathic hypertension during childhood. This article compares three classification models (logistic regression, classification trees and MARSplines) in order to determine the best classification model and distinguish the parameters that are most important in the detection of abnormal blood pressure in children. The study group consisted of 1,378 children aged between 7 and 18. After making comparisons between the methods, it was determined that MARSplines is the model that best assigns subjects to classes and can be an alternative in cases when traditional statistical methods cannot be used due to a lack of fulfillment of conditions. For prediction of abnormal blood pressure in this age group, the most important parameters were the heart rate and selected indicators of body proportions.

Download Full-text

A COMPARATIVE ANALYSIS OF K-MEANS AND HIERARCHICAL CLUSTERING

EPRA International Journal of Multidisciplinary Research (IJMR) ◽

10.36713/epra8308 ◽

2021 ◽

pp. 412-418

Author(s):

Aastha Gupta ◽

Himanshu Sharma ◽

Anas Akhtar

Keyword(s):

Data Mining ◽

Hierarchical Clustering ◽

Clustering Algorithms ◽

Analytical Techniques ◽

Data Sets ◽

Clustering Methods ◽

Data Set ◽

Advantages And Disadvantages ◽

The Many ◽

Data Elements

Clustering is the process of arranging comparable data elements into groups. One of the most frequent data mining analytical techniques is clustering analysis; the clustering algorithm’s strategy has a direct influence on the clustering results. This study examines the many types of algorithms, such as k-means clustering algorithms, and compares and contrasts their advantages and disadvantages. This paper also highlights concerns with clustering algorithms, such as time complexity and accuracy, in order to give better outcomes in a variety of environments. The outcomes are described in terms of big datasets. The focus of this study is on clustering algorithms with the WEKA data mining tool. Clustering is the process of dividing a big data set into small groups or clusters. Clustering is an unsupervised approach that may be used to analyze big datasets with many characteristics. It’s a data-modeling technique that provides a clear image of your data. Two clustering methods, k-means and hierarchical clustering, are explained in this survey and their analysis using WEKA tool on different data sets. KEYWORDS: data clustering, weka , k-means, hierarchical clustering

Download Full-text