Trajectory outlier detection based on DBSCAN clustering algorithm

Based on outlier detection algorithms, a feasible quantification method for supraharmonic emission signals is presented. It is designed to tackle the requirements of high-resolution and low data volume simultaneously in the frequency domain. The proposed method was developed from the skewed distribution data model and the self-tuning parameters of density-based spatial clustering of applications with noise (DBSCAN) algorithm. Specifically, the data distribution of the supraharmonic band was analyzed first by the Jarque–Bera test. The threshold was determined based on the distribution model to filter out noise. Subsequently, the DBSCAN clustering algorithm parameters were adjusted automatically, according to the k-dist curve slope variation and the dichotomy parameter seeking algorithm, followed by the clustering. The supraharmonic emission points were analyzed as outliers. Finally, simulated and experimental data were applied to verify the effectiveness of the proposed method. On the basis of the detection results, a spectrum with the same resolution as the original spectrum was obtained. The amount of data declined by more than three orders of magnitude compared to the original spectrum. The presented method will benefit the analysis of quantification for the amplitude and frequency of supraharmonic emissions.

Download Full-text

Research on Anomaly Detection Method Based on DBSCAN Clustering Algorithm

2020 5th International Conference on Information Science, Computer Technology and Transportation (ISCTT) ◽

10.1109/isctt51595.2020.00083 ◽

2020 ◽

Author(s):

Dingsheng Deng

Keyword(s):

Anomaly Detection ◽

Clustering Algorithm ◽

Detection Method ◽

Dbscan Clustering

Download Full-text

Resolvable Cluster Target Tracking Based on the DBSCAN Clustering Algorithm and Labeled RFS

IEEE Access ◽

10.1109/access.2021.3066629 ◽

2021 ◽

Vol 9 ◽

pp. 43364-43377

Author(s):

Xirui Xue ◽

Shucai Huang ◽

Jiahao Xie ◽

Jiashun Ma ◽

Ning Li

Keyword(s):

Target Tracking ◽

Clustering Algorithm ◽

Dbscan Clustering

Download Full-text

Outlier Detection Method based on Improved K-means Clustering Algorithm

10.1145/3501409.3501648 ◽

2021 ◽

Author(s):

Wenfen Liu ◽

Nan Wang ◽

Yuehua Huang

Keyword(s):

Outlier Detection ◽

Clustering Algorithm ◽

Detection Method

Download Full-text

Ship Trajectory Outlier Detection Service System Based on Collaborative Computing

2018 IEEE World Congress on Services (SERVICES) ◽

10.1109/services.2018.00021 ◽

2018 ◽

Cited By ~ 1

Author(s):

Tao Zhang ◽

Shuai Zhao ◽

Junliang Chen

Keyword(s):

Outlier Detection ◽

Service System ◽

Collaborative Computing ◽

Trajectory Outlier Detection

Download Full-text

Outlier Detection Method based on Improved Two-step Clustering Algorithm and Synthetic Hypothesis Testing

2019 IEEE 8th Joint International Information Technology and Artificial Intelligence Conference (ITAIC) ◽

10.1109/itaic.2019.8785425 ◽

2019 ◽

Cited By ~ 1

Author(s):

Geyu Huang ◽

Zhiming Zhang ◽

Wenxin Yang

Keyword(s):

Hypothesis Testing ◽

Outlier Detection ◽

Clustering Algorithm ◽

Detection Method

Download Full-text

AN EFFICIENT CLUSTERING METHOD FOR DBSCAN GEOGRAPHIC SPATIO-TEMPORAL LARGE DATA WITH IMPROVED PARAMETER OPTIMIZATION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-3-w10-581-2020 ◽

2020 ◽

Vol XLII-3/W10 ◽

pp. 581-584

Author(s):

J. W. Li ◽

X. Q. Han ◽

J. W. Jiang ◽

Y. Hu ◽

L. Liu

Keyword(s):

Parameter Optimization ◽

Clustering Algorithm ◽

Optimal Solution ◽

Large Data ◽

Parameter Selection ◽

Physical Analysis ◽

Clustering Method ◽

K Value ◽

Dbscan Clustering ◽

Spatio Temporal

Abstract. How to establish an effective method of large data analysis of geographic space-time and quickly and accurately find the hidden value behind geographic information has become a current research focus. Researchers have found that clustering analysis methods in data mining field can well mine knowledge and information hidden in complex and massive spatio-temporal data, and density-based clustering is one of the most important clustering methods.However, the traditional DBSCAN clustering algorithm has some drawbacks which are difficult to overcome in parameter selection. For example, the two important parameters of Eps neighborhood and MinPts density need to be set artificially. If the clustering results are reasonable, the more suitable parameters can not be selected according to the guiding principles of parameter setting of traditional DBSCAN clustering algorithm. It can not produce accurate clustering results.To solve the problem of misclassification and density sparsity caused by unreasonable parameter selection in DBSCAN clustering algorithm. In this paper, a DBSCAN-based data efficient density clustering method with improved parameter optimization is proposed. Its evaluation index function (Optimal Distance) is obtained by cycling k-clustering in turn, and the optimal solution is selected. The optimal k-value in k-clustering is used to cluster samples. Through mathematical and physical analysis, we can determine the appropriate parameters of Eps and MinPts. Finally, we can get clustering results by DBSCAN clustering. Experiments show that this method can select parameters reasonably for DBSCAN clustering, which proves the superiority of the method described in this paper.

Download Full-text

Analysis of new nosological models from disease similarities using clustering

10.1101/2020.04.10.035394 ◽

2020 ◽

Author(s):

Lucía Prieto Santamaría ◽

Eduardo P. García del Valle ◽

Gerardo Lagunes García ◽

Massimiliano Zanin ◽

Alejandro Rodríguez González ◽

...

Keyword(s):

Clustering Algorithm ◽

Molecular Data ◽

Modern Medicine ◽

Biological Information ◽

Distributed Data ◽

Feature Vectors ◽

Dbscan Clustering ◽

Disease Similarity ◽

Starting Point ◽

Evaluation Metric

AbstractWhile classical disease nosology is based on phenotypical characteristics, the increasing availability of biological and molecular data is providing new understanding of diseases and their underlying relationships, that could lead to a more comprehensive paradigm for modern medicine. In the present work, similarities between diseases are used to study the generation of new possible disease nosologic models that include both phenotypical and biological information. To this aim, disease similarity is measured in terms of disease feature vectors, that stood for genes, proteins, metabolic pathways and PPIs in the case of biological similarity, and for symptoms in the case of phenotypical similarity. An improvement in similarity computation is proposed, considering weighted instead of Booleans feature vectors. Unsupervised learning methods were applied to these data, specifically, density-based DBSCAN clustering algorithm. As evaluation metric silhouette coefficient was chosen, even though the number of clusters and the number of outliers were also considered. As a results validation, a comparison with randomly distributed data was performed. Results suggest that weighted biological similarities based on proteins, and computed according to cosine index, may provide a good starting point to rearrange disease taxonomy and nosology.

Download Full-text

Lane Formation Beyond Intuition Towards an Automated Characterization of Lanes in Counter-flows

Collective Dynamics ◽

10.17815/cd.2020.29 ◽

2020 ◽

Vol 5 ◽

Author(s):

Luca Crociani ◽

Giuseppe Vizzari ◽

Andrea Gorrini ◽

Stefania Bandini

Keyword(s):

Clustering Algorithm ◽

Variable Density ◽

Automatic Identification ◽

Computing Power ◽

Dbscan Clustering ◽

Significant Difference ◽

Lane Formation ◽

Human Coder ◽

Behavioural Dynamics

Pedestrian behavioural dynamics have been growingly investigated by means of (semi)automated computing techniques for almost two decades, exploiting advancements on computing power, sensor accuracy and availability, computer vision algorithms. This has led to a unique consensus on the existence of significant difference between unidirectional and bidirectional flows of pedestrians, where the phenomenon of lane formation seems to play a major role. The collective behaviour of lane formation emerges in condition of variable density and due to a self-organisation dynamic, for which pedestrians are induced to walk following preceding persons to avoid and minimize conflictual situations. Although the formation of lanes is a well-known phenomenon in this field of study, there is still a lack of methods offering the possibility to provide an (even semi-) automatic identification and a quantitative characterization. In this context, the paper proposes an unsupervised learning approach for an automatic detection of lanes in multi-directional pedestrian flows, based on the DBSCAN clustering algorithm. The reliability of the approach is evaluated through an inter-rater agreement test between the results achieved by a human coder and by the algorithm.

Download Full-text

Based local density trajectory outlier detection with partition-and-detect framework

2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) ◽

10.1109/fskd.2017.8393023 ◽

2017 ◽

Cited By ~ 2

Author(s):

Fangjun Luan ◽

Yunting Zhang ◽

Keyan Cao ◽

Qi Li

Keyword(s):

Outlier Detection ◽

Local Density ◽

Trajectory Outlier Detection

Download Full-text