An Ensemble Model for Multiclass Classification and Outlier Detection Method in Data Mining

Both fuzzy c-means (FCM) clustering and outlier detection are useful data mining techniques in real applications. In this paper, we show that the task of outlier detection could be achieved as by-product of fuzzy c-means clustering. The proposed strategy consists of two stages. The first stage consists of purely fuzzy c-means process, while the second stage identifies exceptional objects according to a novel metric based on the entropy of membership values. We provide experimental results to demonstrate the effectiveness of our technique.

Download Full-text

Improving multiclass classification and outlier detection method through ensemble technique

Proceedings of the 4th International Conference on Communication and Information Processing - ICCIP '18 ◽

10.1145/3290420.3290450 ◽

2018 ◽

Author(s):

Dalton Ndirangu ◽

Waweru Mwangi ◽

Lawrence Nderu

Keyword(s):

Outlier Detection ◽

Detection Method ◽

Multiclass Classification ◽

Ensemble Technique

Download Full-text

An Ensemble Filter Feature Selection Method and Outlier Detection Method for Multiclass Classification

Proceedings of the 2019 8th International Conference on Software and Computer Applications - ICSCA '19 ◽

10.1145/3316615.3318223 ◽

2019 ◽

Cited By ~ 1

Author(s):

Dalton Ndirangu ◽

Waweru Mwangi ◽

Lawrence Nderu

Keyword(s):

Feature Selection ◽

Outlier Detection ◽

Detection Method ◽

Feature Selection Method ◽

Multiclass Classification ◽

Selection Method

Download Full-text

Water Quality Data Outlier Detection Method Based on Spatial Series Features

Fuzzy Systems and Data Mining VI - Frontiers in Artificial Intelligence and Applications ◽

10.3233/faia200715 ◽

2020 ◽

Author(s):

Jianzhuo Yan ◽

Ya Gao ◽

Yongchuan Yu

Keyword(s):

Data Mining ◽

Water Quality ◽

Outlier Detection ◽

Detection Method ◽

Quality Data ◽

Nearest Neighbour ◽

Water Quality Data ◽

Spatial Series ◽

Water Field ◽

Major Branch

Outlier detection is one of the major branch in data mining which has been applied in different fields. Researchers have focused on the outlier detection in time series, but rarely spatial series. In this paper, we propose a new outlier detection method based on k-nearest neighbour (KNN) and Mahalanobis distance, which is first applied to the water field. Experimental results verify that the algorithm has good accuracy and effectiveness in outlier detection for water quality spatial series dataset.

Download Full-text

OFCOD: On the Fly Clustering Based Outlier Detection Framework

Data ◽

10.3390/data6010001 ◽

2020 ◽

Vol 6 (1) ◽

pp. 1

Author(s):

Ahmed Elmogy ◽

Hamada Rizk ◽

Amany M. Sarhan

Keyword(s):

Data Mining ◽

Image Processing ◽

Intrusion Detection ◽

Real Time ◽

Outlier Detection ◽

Real World ◽

Medical Data ◽

Experimental Results ◽

Real Time Applications ◽

Real World Datasets

In data mining, outlier detection is a major challenge as it has an important role in many applications such as medical data, image processing, fraud detection, intrusion detection, and so forth. An extensive variety of clustering based approaches have been developed to detect outliers. However they are by nature time consuming which restrict their utilization with real-time applications. Furthermore, outlier detection requests are handled one at a time, which means that each request is initiated individually with a particular set of parameters. In this paper, the first clustering based outlier detection framework, (On the Fly Clustering Based Outlier Detection (OFCOD)) is presented. OFCOD enables analysts to effectively find out outliers on time with request even within huge datasets. The proposed framework has been tested and evaluated using two real world datasets with different features and applications; one with 699 records, and another with five millions records. The experimental results show that the performance of the proposed framework outperforms other existing approaches while considering several evaluation metrics.

Download Full-text