local outlier
Recently Published Documents


TOTAL DOCUMENTS

194
(FIVE YEARS 100)

H-INDEX

15
(FIVE YEARS 5)

2021 ◽  
Vol 2138 (1) ◽  
pp. 012013
Author(s):  
Yongzhi Chen ◽  
Ziao Xu ◽  
Chaoqun Niu

Abstract In the research of flash flood disaster monitoring and early warning, the Internet of Things is widely used in real-time information collection. There are abnormal situations such as noise, repetition and errors in a large amount of data collected by sensors, which will lead to false alarm, lower prediction accuracy and other problems. Aiming at the characteristic that outliers flow of sensors will cause obvious fluctuation of information entropy, this paper proposes a local outlier detection method based on information entropy and optimized by sliding window and LOF (Local Outlier Factor). This method can be used to improve the data quality, thus improving the accuracy of disaster prediction. The method is applied to data stream processing of water sensor, and the experimental results show that the method can accurately detect outliers. Compared with the existing detection methods that only use data distance to determine, the test positive rate is improved and the false positive rate is reduced.


2021 ◽  
pp. 1-12
Author(s):  
Chunyan She ◽  
Shaohua Zeng

Outlier detection is a hot issue in data mining, which has plenty of real-world applications. LOF (Local Outlier Factor) can capture the abnormal degree of objects in the dataset with different density levels, and many extended algorithms have been proposed in recent years. However, the LOF needs to search the nearest neighborhood of each object on the whole dataset, which greatly increases the time cost. Most of these extended algorithms only consider the distance between an object and its neighborhood, but ignore the local distribution of an object within its neighborhood, resulting in a high false-positive rate. To improve the running speed, a rough clustering based on triple fusion is proposed, which divides a dataset into several subsets and outlier detection is performed only on each subset. Then, considering the local distribution of an object within its neighborhood, a new local outlier factor is constructed to estimate the abnormal degree of each object. Finally, the experimental results indicate that the proposed algorithm has better performance and lower running time than the others.


2021 ◽  
Vol 31 (4) ◽  
pp. 273-278
Author(s):  
Minseok Kim ◽  
Seunghwan Jung ◽  
Jonggeun Kim ◽  
Sungshin Kim

Author(s):  
Claas Strecker ◽  
Victor Ara

AbstractFood authenticity is becoming increasingly important but challenges existing analytical methods. In this study, we analyze the mango cultivar Alphonso with regard to authenticity using 1H-NMR spectroscopy. This cultivar has been termed “the king of mangoes” due to its unique flavor. Regarding its metabolites however, little is known about unique constellations that allow for differentiation of the Alphonso cultivar. We find that the Alphonso cultivar is distinguished by high levels of niacin, trigonelline, and histidine but features relatively low levels of alanine. Furthermore, we develop a model based on the local outlier factor algorithm that effectively detects admixture of non-Alphonso cultivars to Alphonso purée. This task is highly challenging because we identified no metabolites that are unique or uniquely absent in the Alphonso cultivar compared to other mango cultivars analyzed in this study. Our model shows promising results on a test set: Admixtures consisting of 35% non-Alphonso and 65% Alphonso mango purée were uncovered with a sensitivity of 88%. At the same time, our model verified Alphonso samples with a good specificity of 86%.


Sadhana ◽  
2021 ◽  
Vol 46 (3) ◽  
Author(s):  
P Sharon Femi ◽  
S Ganesh Vaidyanathan ◽  
A Kala

2021 ◽  
Vol 5 (1) ◽  
pp. 56
Author(s):  
Giulia Moschini ◽  
Régis Houssou ◽  
Jérôme Bovay ◽  
Stephan Robert-Nicoud

This paper addresses the problem of the unsupervised approach of credit card fraud detection in unbalanced datasets using the ARIMA model. The ARIMA model is fitted to the regular spending behaviour of the customer and is used to detect fraud if some deviations or discrepancies appear. Our model is applied to credit card datasets and is compared to four anomaly detection approaches, namely, the K-means, box plot, local outlier factor and isolation forest approaches. The results show that the ARIMA model presents better detecting power than that of the benchmark models.


Sign in / Sign up

Export Citation Format

Share Document