Urban Hotspot Area Detection Using Nearest-Neighborhood-Related Quality Clustering on Taxi Trajectory Data

An efficient partitional clustering technique, called SAKM-clustering, is proposed in this article. It integrates the power of simulated annealing for obtaining a minimum energy configuration with the searching capability of the K-means algorithm. The clustering methodology is used to search for appropriate clusters in a multidimensional feature space such that a similarity metric of the resulting clusters is optimized. Data points are redistributed among the clusters probabilistically, so that points farther from the cluster center have higher probabilities of migrating to other clusters than points closer to it. The superiority of the SAKM-clustering algorithm over the widely used K-means algorithm is extensively demonstrated for artificial and real-life data sets.
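
The probabilistic redistribution described in this abstract can be illustrated with a minimal sketch. It is not the authors' exact scheme: the Boltzmann-style weights, the cooling schedule, and all names (`sa_kmeans`, `t_start`, `t_end`, `cooling`) are assumptions chosen for illustration.

```python
import math
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def centroid(points):
    """Component-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))


def sa_kmeans(points, k, t_start=10.0, t_end=0.01, cooling=0.8, seed=0):
    """K-means with annealed, probabilistic reassignment (illustrative only)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    t = t_start
    while t > t_end:
        for i, p in enumerate(points):
            d = [sq_dist(p, c) for c in centers]
            d_min = min(d)
            # Assumed Boltzmann weights: the nearest center has weight 1 and
            # farther centers are exponentially less likely, more so as t drops.
            weights = [math.exp(-(dj - d_min) / t) for dj in d]
            labels[i] = rng.choices(range(k), weights=weights)[0]
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = centroid(members)
        t *= cooling
    # Finish with one deterministic nearest-center assignment.
    labels = [min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in points]
    return labels, centers
```

At high temperature the assignment is nearly random, which lets points escape a poor initial partition. As the temperature falls, the procedure approaches ordinary K-means.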

The Impact of the Transportation Network Companies on the Taxi Industry: Evidence from Beijing’s GPS Taxi Trajectory Data

IEEE Access ◽

10.1109/access.2018.2810140 ◽

2018 ◽

Vol 6 ◽

pp. 12438-12450 ◽

Cited By ~ 11

Author(s):

Weiwei Jiang ◽

Lin Zhang

Keyword(s):

Transportation Network ◽

Trajectory Data ◽

Taxi Industry ◽

The Impact ◽

Taxi Trajectory ◽

Transportation Network Companies

UNDERSTANDING URBAN TRAFFIC FLOW CHARACTERISTICS FROM THE NETWORK CENTRALITY PERSPECTIVE AT DIFFERENT GRANULARITIES

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsarchives-xli-b2-263-2016 ◽

2016 ◽

Vol XLI-B2 ◽

pp. 263-268 ◽

Cited By ~ 5

Author(s):

P. X. Zhao ◽

S. M. Zhao

Keyword(s):

Traffic Flow ◽

Flow Characteristics ◽

Research Unit ◽

Urban Traffic ◽

Network Centrality ◽

Trajectory Data ◽

The Road ◽

Analysis Process ◽

Intersection Line ◽

Taxi Trajectory

In this study, we analyze urban traffic flow using taxi trajectory data to understand the characteristics of traffic flow from the network centrality perspective at point (intersection), line (road), and area (community) granularities. The entire analysis process comprises three steps. The first step utilizes the taxi trajectory data to evaluate traffic flow at different granularities. Second, the centrality indices are calculated based on research units at different granularities. Third, correlation analysis between the centrality indices and corresponding urban traffic flow is performed. Experimental results indicate that urban traffic flow is relatively influenced by the road network structure. However, urban traffic flow also depends on the research unit size. Traditional centralities and traffic flow exhibit a low correlation at point granularity but exhibit a high correlation at line and area granularities. Furthermore, the conclusions of this study reflect the universality of the modifiable areal unit problem.
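
The second and third steps of this process (computing a centrality index per research unit, then correlating it with observed flow) can be sketched as follows. The abstract does not say which centrality indices or correlation coefficient were used, so degree centrality and Pearson correlation are assumptions here, and the tiny road graph and flow counts are hypothetical.

```python
import math


def degree_centrality(edges):
    """Degree of each node divided by (number of nodes - 1)."""
    nodes = {n for edge in edges for n in edge}
    degree = {n: 0 for n in nodes}
    for a, b in edges:
        degree[a] += 1
        degree[b] += 1
    scale = len(nodes) - 1
    return {n: d / scale for n, d in degree.items()}


def pearson(xs, ys):
    """Pearson correlation; undefined (division by zero) for constant input."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical intersections (nodes), road segments (edges), and taxi counts.
roads = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C")]
flow = {"A": 120, "B": 80, "C": 75, "D": 20}

centrality = degree_centrality(roads)
units = sorted(flow)
r = pearson([centrality[u] for u in units], [flow[u] for u in units])
print(f"correlation between centrality and flow: {r:.3f}")
```

Repeating the same calculation with intersections, roads, and communities as the units is what would expose the dependence on research unit size that the abstract reports.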

A WEB-BASED PLATFORM FOR VISUALIZING SPATIOTEMPORAL DYNAMICS OF BIG TAXI DATA

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-2-w7-1407-2017 ◽

2017 ◽

Vol XLII-2/W7 ◽

pp. 1407-1412 ◽

Cited By ~ 2

Author(s):

H. Xiong ◽

L. Chen ◽

Z. Gui

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

Trajectory Data ◽

Visualization System ◽

City Life ◽

Interactive Animation ◽

Dbscan Clustering ◽

Main Work ◽

Visualization Systems ◽

Taxi Trajectory

With more and more vehicles equipped with the Global Positioning System (GPS), access to large-scale taxi trajectory data has become increasingly easy. Taxis are valuable sensors, and information associated with taxi trajectories can provide unprecedented insight into many aspects of city life. But analysing these data presents many challenges. Visualization of taxi data is an efficient way to represent its distributions and structures and to reveal hidden patterns in the data. However, most existing visualization systems have shortcomings. On the one hand, the passenger loading status and speed information cannot be expressed. On the other hand, a single visualization form limits the information presented. In view of these problems, this paper designs and implements a visualization system that uses colour and shape to indicate passenger loading status and speed information and integrates various forms of taxi visualization. The main work is as follows: 1. Pre-processing the taxi data and storing it in a MongoDB database. 2. Visualizing hotspots of taxi pickup points: using the DBSCAN clustering algorithm, we cluster the extracted passenger pickup locations to produce passenger hotspots. 3. Visualizing the dynamics of taxi trajectories using interactive animation: we use a thinning algorithm to reduce the amount of data and design a preloading strategy to load the data smoothly. Colour and shape are used to visualize the taxi trajectory data.
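
The hotspot step (item 2) relies on DBSCAN. The following is a minimal pure-Python sketch of the standard algorithm, not the paper's implementation. It assumes pickup locations are already projected to planar coordinates, and the `eps` and `min_pts` values in the usage lines are hypothetical.

```python
import math

NOISE = -1


def region_query(points, i, eps):
    """Indices of all points within eps of point i (including i itself)."""
    return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Return one cluster label per point; NOISE marks points in no hotspot."""
    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in neighbors if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins but is not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:  # core point: keep expanding
                queue.extend(j_neighbors)
    return labels


# Hypothetical pickup points: two dense groups and one isolated pickup.
pickups = [(0, 0), (0, 0.1), (0.1, 0), (5, 5), (5, 5.1), (5.1, 5), (20, 20)]
print(dbscan(pickups, eps=0.5, min_pts=3))  # → [0, 0, 0, 1, 1, 1, -1]
```

This brute-force neighbor search is quadratic in the number of points. For large-scale taxi data a spatial index would be needed, which the sketch omits.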

UNDERSTANDING URBAN TRAFFIC FLOW CHARACTERISTICS FROM THE NETWORK CENTRALITY PERSPECTIVE AT DIFFERENT GRANULARITIES

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xli-b2-263-2016 ◽

2016 ◽

Vol XLI-B2 ◽

pp. 263-268

Author(s):

P. X. Zhao ◽

S. M. Zhao

Keyword(s):

Traffic Flow ◽

Flow Characteristics ◽

Research Unit ◽

Urban Traffic ◽

Network Centrality ◽

Trajectory Data ◽

The Road ◽

Analysis Process ◽

Intersection Line ◽

Taxi Trajectory

In this study, we analyze urban traffic flow using taxi trajectory data to understand the characteristics of traffic flow from the network centrality perspective at point (intersection), line (road), and area (community) granularities. The entire analysis process comprises three steps. The first step utilizes the taxi trajectory data to evaluate traffic flow at different granularities. Second, the centrality indices are calculated based on research units at different granularities. Third, correlation analysis between the centrality indices and corresponding urban traffic flow is performed. Experimental results indicate that urban traffic flow is relatively influenced by the road network structure. However, urban traffic flow also depends on the research unit size. Traditional centralities and traffic flow exhibit a low correlation at point granularity but exhibit a high correlation at line and area granularities. Furthermore, the conclusions of this study reflect the universality of the modifiable areal unit problem.

Discovering the Impact of Urban Traffic Interventions Using Contrast Mining on Vehicle Trajectory Data

Advances in Knowledge Discovery and Data Mining - Lecture Notes in Computer Science ◽

10.1007/978-3-319-18038-0_38 ◽

2015 ◽

pp. 486-497 ◽

Cited By ~ 6

Author(s):

Xiaoting Wang ◽

Christopher Leckie ◽

Hairuo Xie ◽

Tharshan Vaithianathan

Keyword(s):

Urban Traffic ◽

Trajectory Data ◽

Vehicle Trajectory ◽

The Impact

Fuzzy C-Means Clustering Algorithm Based on Coefficient of Variation

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.998-999.873 ◽

2014 ◽

Vol 998-999 ◽

pp. 873-877

Author(s):

Zhen Bo Wang ◽

Bao Zhi Qiu

Keyword(s):

Coefficient Of Variation ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Real Data ◽

Cluster Center ◽

Data Set ◽

Fuzzy C Means ◽

Initial Cluster ◽

Fuzzy C Means Clustering ◽

The Impact

To reduce the impact of irrelevant attributes on clustering results and to increase the influence of relevant attributes, this paper proposes a fuzzy C-means clustering algorithm based on the coefficient of variation (CV-FCM). The algorithm uses the coefficient of variation to assign a different weight to each attribute in the data set, and the magnitude of a weight expresses the importance of that attribute to the clusters. In addition, because fuzzy C-means clustering is sensitive to the initial cluster centers, a maximum-distance method for selecting the initial cluster centers is introduced on the basis of the coefficient-of-variation weighting. Experiments on real data sets show that the algorithm selects cluster centers effectively and that its clustering results are superior to those of general fuzzy C-means clustering algorithms.
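
The attribute-weighting idea can be sketched as follows. The abstract does not give the exact weighting formula, so this sketch assumes each weight is the attribute's coefficient of variation (standard deviation divided by mean) normalized so the weights sum to 1. The function names are hypothetical.

```python
import math
import statistics


def cv_weights(rows):
    """One weight per attribute, proportional to its coefficient of variation."""
    cvs = []
    for column in zip(*rows):
        mean = statistics.fmean(column)
        std = statistics.pstdev(column)
        # Assumption: an attribute with zero mean gets a CV of 0.
        cvs.append(std / abs(mean) if mean != 0 else 0.0)
    total = sum(cvs)
    if total == 0:  # every attribute is constant: fall back to equal weights
        return [1.0 / len(cvs)] * len(cvs)
    return [cv / total for cv in cvs]


def weighted_distance(p, q, weights):
    """Euclidean distance with a per-attribute weight on each squared term."""
    return math.sqrt(sum(w * (a - b) ** 2 for w, a, b in zip(weights, p, q)))


# The second attribute never varies, so it receives zero weight.
data = [(1.0, 5.0), (2.0, 5.0), (3.0, 5.0)]
weights = cv_weights(data)
print(weights)  # → [1.0, 0.0]
```

In a weighted fuzzy C-means, a distance such as `weighted_distance` would replace the plain Euclidean distance in the membership and center updates. That integration is not shown here.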

Deployment Strategy for Car-Sharing Depots by Clustering Urban Traffic Big Data Based on Affinity Propagation

Scientific Programming ◽

10.1155/2018/3907513 ◽

2018 ◽

Vol 2018 ◽

pp. 1-9 ◽

Cited By ~ 2

Author(s):

Zhihan Liu ◽

Yi Jia ◽

Xiaolu Zhu

Keyword(s):

Big Data ◽

Clustering Algorithm ◽

Optimization Method ◽

Urban Traffic ◽

Affinity Propagation ◽

Superior Performance ◽

Trajectory Data ◽

Car Sharing ◽

Gps Trajectory Data ◽

Ap Clustering

Car sharing is a type of car rental service in which consumers rent cars for short periods of time, often charged by the hour. Analysis of urban traffic big data is important for determining the locations of depots for a car-sharing system. Taxi OD (Origin-Destination) data is a typical urban traffic dataset. The volume of the data is so large that traditional data processing applications do not work well. This paper presents an optimization method that determines depot locations by clustering taxi OD points with the AP (Affinity Propagation) clustering algorithm. Based on an analysis of the characteristics of AP clustering, the algorithm is optimized hierarchically using administrative region segmentation. The input parameters of AP clustering are adapted to account for the sparse similarity matrix of taxi OD points. In the case study, we use OD pair information from Beijing’s taxi GPS trajectory data. The number and locations of depots are determined by clustering the OD points with the optimized AP clustering. We describe the experimental results of our approach and compare it with the standard K-means method using quantitative and stationarity indices. Experiments on real datasets show that the proposed method for determining car-sharing depots has superior performance.

An Improved K-means Method with Density Distribution Analysis

MATEC Web of Conferences ◽

10.1051/matecconf/201817601019 ◽

2018 ◽

Vol 176 ◽

pp. 01019

Author(s):

Huiwen Xue ◽

Haochen Li ◽

Yanfei Wang

Keyword(s):

Density Distribution ◽

Clustering Algorithm ◽

Local Density ◽

Synthetic Data ◽

Original Data ◽

Cluster Center ◽

Distribution Analysis ◽

Time Cost ◽

Improved Method ◽

Business Data

In this paper, a novel K-means clustering algorithm is proposed. Before running traditional K-means, the cluster centers must be selected randomly, which influences the time cost and accuracy. To solve this problem, we add density distribution analysis to traditional K-means. A reasonable cluster should have a dense internal structure, meaning that the points in the same cluster tightly surround the center while being well separated from other cluster centers. Based on this assumption, two quantities are first introduced: the local density of a cluster center ρi and its desperation degree δi; reasonable cluster center candidates are then selected from the original data. We ran our algorithm on three synthetic data sets and a real bank business data set to evaluate its accuracy and efficiency. Compared with traditional K-means and K-means++, the results demonstrate that the improved method performs better.
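
The abstract does not define ρi and δi precisely. The sketch below assumes definitions in the style of density-peak clustering: ρi counts the points within a cutoff radius of point i, and δi is the distance from point i to the nearest point of higher density. Ranking candidates by the product ρi·δi and the name `density_seeds` are also assumptions, so this is an illustration of the general idea and not the paper's method.

```python
import math


def density_seeds(points, k, cutoff):
    """Pick k initial centers that are both dense and far from denser points."""
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    # rho[i]: number of other points closer than the cutoff radius.
    rho = [sum(1 for j in range(n) if j != i and dist[i][j] < cutoff)
           for i in range(n)]
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        # The densest point has no denser neighbor; use its largest distance.
        delta.append(min(higher) if higher else max(dist[i]))
    ranked = sorted(range(n), key=lambda i: rho[i] * delta[i], reverse=True)
    return [points[i] for i in ranked[:k]]


# Two hypothetical groups, each with a central point surrounded by neighbors.
data = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1),
        (10, 10), (11, 10), (9, 10), (10, 11)]
print(density_seeds(data, k=2, cutoff=1.2))  # → [(0, 0), (10, 10)]
```

The returned points would then serve as the starting centers for an ordinary K-means run in place of random selection.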
