local outlier factor
Recently Published Documents


TOTAL DOCUMENTS

106
(FIVE YEARS 64)

H-INDEX

12
(FIVE YEARS 5)

2021 ◽  
pp. 1-12
Author(s):  
Chunyan She ◽  
Shaohua Zeng

Outlier detection is a hot issue in data mining, which has plenty of real-world applications. LOF (Local Outlier Factor) can capture the abnormal degree of objects in the dataset with different density levels, and many extended algorithms have been proposed in recent years. However, the LOF needs to search the nearest neighborhood of each object on the whole dataset, which greatly increases the time cost. Most of these extended algorithms only consider the distance between an object and its neighborhood, but ignore the local distribution of an object within its neighborhood, resulting in a high false-positive rate. To improve the running speed, a rough clustering based on triple fusion is proposed, which divides a dataset into several subsets and outlier detection is performed only on each subset. Then, considering the local distribution of an object within its neighborhood, a new local outlier factor is constructed to estimate the abnormal degree of each object. Finally, the experimental results indicate that the proposed algorithm has better performance and lower running time than the others.


2021 ◽  
Vol 31 (4) ◽  
pp. 273-278
Author(s):  
Minseok Kim ◽  
Seunghwan Jung ◽  
Jonggeun Kim ◽  
Sungshin Kim

2021 ◽  
Vol 5 (1) ◽  
pp. 56
Author(s):  
Giulia Moschini ◽  
Régis Houssou ◽  
Jérôme Bovay ◽  
Stephan Robert-Nicoud

This paper addresses the problem of the unsupervised approach of credit card fraud detection in unbalanced datasets using the ARIMA model. The ARIMA model is fitted to the regular spending behaviour of the customer and is used to detect fraud if some deviations or discrepancies appear. Our model is applied to credit card datasets and is compared to four anomaly detection approaches, namely, the K-means, box plot, local outlier factor and isolation forest approaches. The results show that the ARIMA model presents better detecting power than that of the benchmark models.


Electronics ◽  
2021 ◽  
Vol 10 (6) ◽  
pp. 704
Author(s):  
Katharina Hofer-Schmitz ◽  
Ulrike Kleb ◽  
Branka Stojanović

This paper investigates the influences of different statistical network traffic feature sets on detecting advanced persistent threats. The selection of suitable features for detecting targeted cyber attacks is crucial to achieving high performance and to address limited computational and storage costs. The evaluation was performed on a semi-synthetic dataset, which combined the CICIDS2017 dataset and the Contagio malware dataset. The CICIDS2017 dataset is a benchmark dataset in the intrusion detection field and the Contagio malware dataset contains real advanced persistent threat (APT) attack traces. Several different combinations of datasets were used to increase variety in background data and contribute to the quality of results. For the feature extraction, the CICflowmeter tool was used. For the selection of suitable features, a correlation analysis including an in-depth feature investigation by boxplots is provided. Based on that, several suitable features were allocated into different feature sets. The influences of these feature sets on the detection capabilities were investigated in detail with the local outlier factor method. The focus was especially on attacks detected with different feature sets and the influences of the background on the detection capabilities with respect to the local outlier factor method. Based on the results, we could determine a superior feature set, which detected most of the malicious flows.


Sign in / Sign up

Export Citation Format

Share Document