For Better Healthcare Mining Health Data

2017 ◽

pp. 135-158

Author(s):

Güney Gürsel

Keyword(s):

Data Mining ◽

Nearest Neighbor ◽

Healthcare Management ◽

Support Vector ◽

Privacy And Security ◽

Healthcare Data ◽

Management Fraud ◽

Use Of Data ◽

Customer Relation Management ◽

Vector Machines

Data mining has great contributions to the healthcare such as support for effective treatment, healthcare management, customer relation management, fraud and abuse detection and decision making. The common data mining methods used in healthcare are Artificial Neural Network, Decision trees, Genetic Algorithms, Nearest neighbor method, Logistic regression, Fuzzy logic, Fuzzy based Neural Networks, Bayesian Networks and Support Vector Machines. The most used task is classification. Because of the complexity and toughness of medical domain, data mining is not an easy task to accomplish. In addition, privacy and security of patient data is a big issue to deal with because of the sensitivity of healthcare data. There exist additional serious challenges. This chapter is a descriptive study aimed to provide an acquaintance to data mining and its usage and applications in healthcare domain. The use of Data mining in healthcare informatics and challenges will be examined.

Download Full-text

Decision Support System for Diabetes Classification Using Data Mining Techniques

Research Anthology on Decision Support Systems and Decision Management in Healthcare, Business, and Engineering ◽

10.4018/978-1-7998-9023-2.ch053 ◽

2021 ◽

pp. 1091-1113

Author(s):

Ahmad M. Al-Khasawneh

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Algorithms ◽

Use Of Data ◽

Predictive Data Mining ◽

Severity Of The Disease ◽

Using Data

The use of data mining algorithms in health information systems has played a significant role in developing applications that help to diagnose different diseases. The type of the disease determines the selection of the algorithm, parameters to be used, and dataset pre-processing steps, etc. In this chapter, diagnosing diabetes mellitus is the target since it has gained significant attention in the last few decades due to the increased severity of the disease. Four predictive data mining approaches are being used in diagnosing diabetes. Four models were implemented to diagnose diabetes from PIMA dataset: k-nearest neighbor, support vector machine, multilayer perceptron neural network, and naive Bayesian network. Giving the highest classification accuracy, support vector machine technique outperformed the others with a value of 78.83%.

Download Full-text

Decision Support System for Diabetes Classification Using Data Mining Techniques

Advances in Healthcare Information Systems and Administration - Handbook of Research on Emerging Perspectives on Healthcare Information Systems and Informatics ◽

10.4018/978-1-5225-5460-8.ch012 ◽

2018 ◽

pp. 281-303

Author(s):

Ahmad M. Al-Khasawneh

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Algorithms ◽

Use Of Data ◽

Predictive Data Mining ◽

Severity Of The Disease ◽

Using Data

The use of data mining algorithms in health information systems has played a significant role in developing applications that help to diagnose different diseases. The type of the disease determines the selection of the algorithm, parameters to be used, and dataset pre-processing steps, etc. In this chapter, diagnosing diabetes mellitus is the target since it has gained significant attention in the last few decades due to the increased severity of the disease. Four predictive data mining approaches are being used in diagnosing diabetes. Four models were implemented to diagnose diabetes from PIMA dataset: k-nearest neighbor, support vector machine, multilayer perceptron neural network, and naive Bayesian network. Giving the highest classification accuracy, support vector machine technique outperformed the others with a value of 78.83%.

Download Full-text

Data Mining Approach to Analyze COVID-19 Clinical Dataset

10.53350/pjmhs211561812 ◽

2021 ◽

Vol 15 (6) ◽

pp. 1812-1819

Author(s):

Azita Yazdani ◽

Ramin Ravangard ◽

Roxana Sharifian

Keyword(s):

Artificial Intelligence ◽

Data Mining ◽

Support Vector Machine ◽

Nearest Neighbor ◽

Clinical Signs ◽

Study Data ◽

Mining Machine ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Approach

The new coronavirus has been spreading since the beginning of 2020 and many efforts have been made to develop vaccines to help patients recover. It is now clear that the world needs a rapid solution to curb the spread of COVID-19 worldwide with non-clinical approaches such as data mining, enhanced intelligence, and other artificial intelligence techniques. These approaches can be effective in reducing the burden on the health care system to provide the best possible way to diagnose and predict the COVID-19 epidemic. In this study, data mining models for early detection of Covid-19 in patients were developed using the epidemiological dataset of patients and individuals suspected of having Covid-19 in Iran. C4.5, support vector machine, Naive Bayes, logistic regression, Random Forest, and k-nearest neighbor algorithm were used directly on the dataset using Rapid miner to develop the models. By receiving clinical signs, this model diagnosis the risk of contracting the COVID-19 virus. Examination of the models in this study has shown that the support vector machine with 93.41% accuracy is more efficient in the diagnosis of patients with COVID-19 pandemic, which is the best model among other developed models. Keywords: COVID-19, Data mining, Machine Learning, Artificial Intelligence, Classification

Download Full-text

Proposta de metodologia para a criação de etiqueta de classificação – estudo de caso: desempenho escolar

Gestão & Produção ◽

10.1590/0104-530x810-13 ◽

2016 ◽

Vol 23 (1) ◽

pp. 177-191

Author(s):

Anderson Roges Teixeira Góes ◽

Maria Teresinha Arns Steiner

Keyword(s):

Data Mining ◽

Support Vector Machines ◽

Knowledge Discovery ◽

Knowledge Discovery In Databases ◽

Support Vector ◽

Vector Machines

Resumo A qualidade na educação tem sido objeto de muita discussão, seja nas escolas e entre seus gestores, seja na mídia ou na literatura. No entanto, uma análise mais profunda na literatura parece não indicar técnicas que explorem bancos de dados com a finalidade de obter classificações para o desempenho escolar, nem tampouco há um consenso sobre o que seja “qualidade educacional”. Diante deste contexto, neste artigo, é proposta uma metodologia que se enquadra no processo KDD (Knowledge Discovery in Databases, ou seja, Descoberta de Conhecimento em Bases de Dados) para a classificação do desempenho de instituições de ensino, de forma comparativa, com base nas notas obtidas na Prova Brasil, um dos itens integrantes do Índice de Desenvolvimento da Educação Básica (IDEB) no Brasil. Para ilustrar a metodologia, esta foi aplicada às escolas públicas municipais de Araucária, PR, região metropolitana de Curitiba, PR, num total de 17, que, por ocasião da pesquisa, ofertavam Ensino Fundamental, considerando as notas obtidas pela totalidade dos alunos dos anos iniciais (1º. ao 5º. ano do ensino fundamental) e dos anos finais (6º. ao 9º. ano do ensino fundamental). Na etapa de Data Mining, principal etapa do processo KDD, foram utilizadas três técnicas de forma comparativa para o Reconhecimento de Padrões: Redes Neurais Artificiais; Support Vector Machines; e Algoritmos Genéticos. Essas técnicas apresentaram resultados satisfatórios na classificação das escolas, representados por meio de uma “Etiqueta de Classificação do Desempenho”. Por meio desta etiqueta, os gestores educacionais poderão ter melhor base para definir as medidas a serem adotadas junto a cada escola, podendo definir mais claramente as metas a serem cumpridas.

Download Full-text

A Survey on Phishing Detection and The Importance of Feature Selection In Data Mining Classification Algorithms

Issue 4 - Journal of Science and Technology ◽

10.46243/jst.2020.v5.i6.pp11-18 ◽

2020 ◽

pp. 11-18

Keyword(s):

Data Mining ◽

Feature Selection ◽

Support Vector ◽

Classification Algorithms ◽

End User ◽

Preparation Methods ◽

Survey Paper ◽

Vector Machines ◽

Feature Selection Techniques ◽

Phishing Detection

: In this era of Internet, the issue of security of information is at its peak. One of the main threats in this cyber world is phishing attacks which is an email or website fraud method that targets the genuine webpage or an email and hacks it without the consent of the end user. There are various techniques which help to classify whether the website or an email is legitimate or fake. The major contributors in the process of detection of these phishing frauds include the classification algorithms, feature selection techniques or dataset preparation methods and the feature extraction that plays an important role in detection as well as in prevention of these attacks. This Survey Paper studies the effect of all these contributors and the approaches that are applied in the study conducted on the recent papers. Some of the classification algorithms that are implemented includes Decision tree, Random Forest , Support Vector Machines, Logistic Regression , Lazy K Star, Naive Bayes and J48 etc.

Download Full-text

Data Mining with Parallel Support Vector Machines for Classification

Advances in Information Systems - Lecture Notes in Computer Science ◽

10.1007/11890393_21 ◽

2006 ◽

pp. 197-206 ◽

Cited By ~ 4

Author(s):

Tatjana Eitrich ◽

Bruno Lang

Keyword(s):

Data Mining ◽

Support Vector Machines ◽

Support Vector ◽

Vector Machines

Download Full-text

Adaptive Nearest Neighbor Classification using Support Vector Machines

Advances in Neural Information Processing Systems 14 ◽

10.7551/mitpress/1120.003.0090 ◽

2002 ◽

Keyword(s):

Support Vector Machines ◽

Nearest Neighbor ◽

Support Vector ◽

Nearest Neighbor Classification ◽

Vector Machines ◽

Neighbor Classification

Download Full-text

Data Mining for Multicriteria Single Facility Location Problems

Cognitive Analytics ◽

10.4018/978-1-7998-2460-2.ch063 ◽

2020 ◽

pp. 1248-1271

Author(s):

Seda Tolun ◽

Halit Alper Tayalı

Keyword(s):

Data Mining ◽

Decision Analysis ◽

Facility Location ◽

Location Problem ◽

Facility Location Problem ◽

Support Vector ◽

Ranking Svm ◽

Vector Machines ◽

Single Facility

This chapter focuses on available data analysis and data mining techniques to find the optimal location of the Multicriteria Single Facility Location Problem (MSFLP) at diverse business settings. Solving for the optimal of an MSFLP, there exists numerous multicriteria decision analysis techniques. Mainstream models are mentioned in this chapter, while presenting a general classification of the MSFLP and its framework. Besides, topics from machine learning with respect to decision analysis are covered: Unsupervised Principal Components Analysis ranking (PCA-rank) and supervised Support Vector Machines ranking (SVM-rank). This chapter proposes a data mining perspective for the multicriteria single facility location problem and proposes a new approach to the facility location problem with the combination of the PCA-rank and ranking SVMs.

Download Full-text

Nearest Neighbor Classifiers Versus Random Forests and Support Vector Machines

2019 IEEE International Conference on Data Mining (ICDM) ◽

10.1109/icdm.2019.00164 ◽

2019 ◽

Author(s):

Saket Sathe ◽

Charu C. Aggarwal

Keyword(s):

Support Vector Machines ◽

Random Forests ◽

Nearest Neighbor ◽

Support Vector ◽

Vector Machines ◽

Nearest Neighbor Classifiers

Download Full-text