scholarly journals Tensor Data Imputation by PARAFAC with Updated Chaotic Biases by Adam Optimizer

Author(s):  
Pooja Choudhary ◽  
Kanwal Garg

The big data pattern analysis suffers from incorrect responses due to missing data entries in the real world. Data collected for digital movie platforms like Netflix and intelligent transportation systems is Spatio-temporal data. Extracting the latent and explicit features from this data is a challenge. We present the high dimensional data imputation problem as a higher-order tensor decomposition. The regularized and biased PARAFAC decomposition is proposed to generate the missing data entries. The biases are created and updated by a chaotic exponential factor in Adam's optimization, which reduces the imputation error. This chaotic perturbed exponentially update in the learning rate replaces the fixed learning rate in the bias update by Adam optimization. The idea has experimented with Netflix and traffic datasets from Guangzhou, China.

Traffic data plays a major role in transport related applications. The problem of missing data has greatly impact the performance of Intelligent transportation systems(ITS). In this work impute the missing traffic data with spatio-temporal exploitation for high precision result under various missing rates. Deep learning based stacked denoise autoencoder is proposed with efficient Elu activation function to remove noise and impute the missing value.This imputed value will be used in analyses and prediction of vehicle traffic. Results are discussed that the proposed method outperforms well in state of the art approaches.


Sensors ◽  
2019 ◽  
Vol 19 (12) ◽  
pp. 2813 ◽  
Author(s):  
Kennedy John Offor ◽  
Lubos Vaci ◽  
Lyudmila S. Mihaylova

Intelligent transportation systems require the knowledge of current and forecasted traffic states for effective control of road networks. The actual traffic state has to be estimated as the existing sensors does not capture the needed state. Sensor measurements often contain missing or incomplete data as a result of communication issues, faulty sensors or cost leading to incomplete monitoring of the entire road network. This missing data poses challenges to traffic estimation approaches. In this work, a robust spatio-temporal traffic imputation approach capable of withstanding high missing data rate is presented. A particle based approach with Kriging interpolation is proposed. The performance of the particle based Kriging interpolation for different missing data ratios was investigated for a large road network comprising 1000 segments. Results indicate that the effect of missing data in a large road network can be mitigated by the Kriging interpolation within the particle filter framework.


Author(s):  
Alkiviadis Kyrtsoglou ◽  
Dimara Asimina ◽  
Dimitrios Triantafyllidis ◽  
Stelios Krinidis ◽  
Konstantinos Kitsikoudis ◽  
...  

2020 ◽  
Vol 54 (2) ◽  
pp. 59-73
Author(s):  
Yang Wang ◽  
Yu Xiao ◽  
Jianhui Lai ◽  
Yanyan Chen

Traffic flow is one of the fundamental parameters for traffic analysis and planning. With the rapid development of intelligent transportation systems, a large number of various detectors have been deployed in urban roads and, consequently, huge amount of data relating to the traffic flow are accumulatively available now. However, the traffic flow data detected through various detectors are often degraded due to the presence of a number of missing data, which can even lead to erroneous analysis and decision if no appropriate process is carried out. To remedy this issue, great research efforts have been made and subsequently various imputation techniques have been successively proposed in recent years, among which the k nearest neighbour algorithm (kNN) has received a great popularity as it is easy to implement and impute the missing data effectively. In the work presented in this paper, we firstly analyse the stochastic effect of traffic flow, to which the suffering of the kNN algorithm can be attributed. This motivates us to make an improvement, while eliminating the requirement to predefine parameters. Such a parameter-free algorithm has been realized by introducing a new similarity metric which is combined with the conventional metric so as to avoid the parameter setting, which is often determined with the requirement of adequate domain knowledge. Unlike the conventional version of the kNN algorithm, the proposed algorithm employs the multivariate linear regression model to estimate the weights for the final output, based on a set of data, which is smoothed by a Wavelet technique. A series of experiments have been performed, based on a set of traffic flow data reported from serval different countries, to examine the adaptive determination of parameters and the smoothing effect. Additional experiments have been conducted to evaluate the competent performance for the proposed algorithm by comparing to a number of widely-used imputing algorithms.


Author(s):  
J. Haworth

Traffic congestion and its associated environmental effects pose a significant problem for large cities. Consequently, promoting and investing in green travel modes such as cycling is high on the agenda for many transport authorities. In order to target investment in cycling infrastructure and improve the experience of cyclists on the road, it is important to know where they are. Unfortunately, investment in intelligent transportation systems over the years has mainly focussed on monitoring vehicular traffic, and comparatively little is known about where cyclists are on a day to day basis. In London, for example, there are a limited number of automatic cycle counters installed on the network, which provide only part of the picture. These are supplemented by surveys that are carried out infrequently. Activity tracking apps on smart phones and GPS devices such as Strava have become very popular over recent years. Their intended use is to track physical activity and monitor training. However, many people routinely use such apps to record their daily commutes by bicycle. At the aggregate level, these data provide a potentially rich source of information about the movement and behaviour of cyclists. Before such data can be relied upon, however, it is necessary to examine their representativeness and understand their potential biases. In this study, the flows obtained from Strava Metro (SM) are compared with those obtained during the 2013 London Cycle Census (LCC). A set of linear regression models are constructed to predict LCC flows using SM flows along with a number of dummy variables including road type, hour of day, day of week and presence/absence of cycle lane. Cross-validation is used to test the fitted models on unseen LCC sites. SM flows are found to be a statistically significant (p<0.0001) predictor of total flows as measured by the LCC and the models yield R squared statistics of ~0.7 before considering spatio-temporal variation. The initial results indicate that data collected using fitness tracking apps such as Strava are a promising data source for traffic managers. Future work will incorporate the spatio-temporal structure in the data to better account for the spatial and temporal variation in the ratio of SM flows to LCC flows.


Author(s):  
João Peixoto ◽  
Adriano Moreira

The analysis of urban mobility has been attracting the interest of the research community recently. The research challenges in this domain are diverse and include data acquisition and representation, human movement modeling and the visualization of dynamic geo-referenced data. Some of the direct applications for these studies are urban planning, security, intelligent transportation systems and wireless networks optimization. One of the drivers for recent work in this area is the availability of large datasets representing many aspects of the urban dynamics. Quite often, the proposed approaches are highly dependent on the data type. However, the analysis of urban dynamics could benefit from the combined and simultaneous use of multiple sources of spatio-temporal data. This paper describes the definition of a set of basic concepts for the representation and processing of spatio-temporal data, sufficiently flexible to deal with various types of mobility data and to support multiple forms of processing and visualization of the urban mobility. For this purpose the authors define a set of concepts and describe how real data from heterogeneous sources is mapped into the proposed framework. Available results obtained by the integration of geometric and symbolic data reveal the adequacy of the proposed concepts, and uncover new possibilities for the fusion of heterogeneous datasets.


2018 ◽  
Vol 88 ◽  
pp. 124-139 ◽  
Author(s):  
Bumjoon Bae ◽  
Hyun Kim ◽  
Hyeonsup Lim ◽  
Yuandong Liu ◽  
Lee D. Han ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document