Enhanced Application of Principal Component Analysis in Machine Learning for Imputation of Missing Traffic Data

Missing value imputation approaches have been widely used to support and maintain the quality of traffic data. Although the spatiotemporal dependency-based approaches can improve the imputation performance for large and continuous missing patterns, additionally considering traffic states can lead to more reliable results. In order to improve the imputation performances further, a section-based approach is also needed. This study proposes a novel approach for identifying traffic-states of different spots of road sections that comprise, namely, a section-based traffic state (SBTS), and determining their spatiotemporal dependencies customized for each SBTS, for missing value imputations. A principal component analysis (PCA) was employed, and angles obtained from the first principal component were used to identify the SBTSs. The pre-processing was combined with a support vector machine for developing the imputation model. It was found that the segmentation of the SBTS using the angles and considering the spatiotemporal dependency for each state by the proposed approach outperformed other existing models.

Download Full-text

Quality Reliability Evaluation of the Miner Lamp Power Supply with Principal Component Analysis and Support Vector Machine

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.460-461.716 ◽

2011 ◽

Vol 460-461 ◽

pp. 716-723

Author(s):

Dan Ma ◽

Zhan Qing Chen ◽

Ji Da Huang ◽

Hao Jin Lv

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Power Supply ◽

Gaussian Function ◽

Principal Component ◽

Component Analysis ◽

Measurement Model ◽

Support Vector ◽

Safety Production

The quality of the miner lamp power supply (MLPS) affects the performance of the miner lamp, while the safety performance and quality of the miner lamp are closely related to the safety production in coal mines. The factors, which affect the quality of power supply, are screened through the principal component analysis (PCA). After training the principal extracted component by PCA, the measurement model for the MLPS is set up based on support vector machine (SVM), meanwhile, the Gaussian function, which functions as the kernel function of SVM are selected to simulate, the test results indicate that the measurement model based on PCA-SVM could be used as the detection of the MLPS, which can better ensures the quality reliability of the MLPS.

Download Full-text

Longitudinal Crack Detection Approach Based on Principal Component Analysis and Support Vector Machine for Slab Continuous Casting

steel research international ◽

10.1002/srin.202100168 ◽

2021 ◽

Author(s):

Haiyang Duan ◽

Jingjing Wei ◽

Lin Qi ◽

Xudong Wang ◽

Yu Liu ◽

...

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Continuous Casting ◽

Crack Detection ◽

Longitudinal Crack ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Slab Continuous Casting ◽

Detection Approach

Download Full-text

Prediction of China’s Energy Consumption Based on Robust Principal Component Analysis and PSO-LSSVM Optimized by the Tabu Search Algorithm

Energies ◽

10.3390/en12010196 ◽

2019 ◽

Vol 12 (1) ◽

pp. 196 ◽

Cited By ~ 3

Author(s):

Lihui Zhang ◽

Riletu Ge ◽

Jianxue Chai

Keyword(s):

Principal Component Analysis ◽

Energy Consumption ◽

Tabu Search ◽

Industrial Structure ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Forecasting Model ◽

Robust Principal Component Analysis ◽

Consumption Structure

China’s energy consumption issues are closely associated with global climate issues, and the scale of energy consumption, peak energy consumption, and consumption investment are all the focus of national attention. In order to forecast the amount of energy consumption of China accurately, this article selected GDP, population, industrial structure and energy consumption structure, energy intensity, total imports and exports, fixed asset investment, energy efficiency, urbanization, the level of consumption, and fixed investment in the energy industry as a preliminary set of factors; Secondly, we corrected the traditional principal component analysis (PCA) algorithm from the perspective of eliminating “bad points” and then judged a “bad spot” sample based on signal reconstruction ideas. Based on the above content, we put forward a robust principal component analysis (RPCA) algorithm and chose the first five principal components as main factors affecting energy consumption, including: GDP, population, industrial structure and energy consumption structure, urbanization; Then, we applied the Tabu search (TS) algorithm to the least square to support vector machine (LSSVM) optimized by the particle swarm optimization (PSO) algorithm to forecast China’s energy consumption. We collected data from 1996 to 2010 as a training set and from 2010 to 2016 as the test set. For easy comparison, the sample data was input into the LSSVM algorithm and the PSO-LSSVM algorithm at the same time. We used statistical indicators including goodness of fit determination coefficient (R2), the root means square error (RMSE), and the mean radial error (MRE) to compare the training results of the three forecasting models, which demonstrated that the proposed TS-PSO-LSSVM forecasting model had higher prediction accuracy, generalization ability, and higher training speed. Finally, the TS-PSO-LSSVM forecasting model was applied to forecast the energy consumption of China from 2017 to 2030. According to predictions, we found that China shows a gradual increase in energy consumption trends from 2017 to 2030 and will breakthrough 6000 million tons in 2030. However, the growth rate is gradually tightening and China’s energy consumption economy will transfer to a state of diminishing returns around 2026, which guides China to put more emphasis on the field of energy investment.

Download Full-text

Multi-View Face Detection Based on Kernel Principal Component Analysis and Kernel Support Vector Techniques

International Journal on Soft Computing ◽

10.5121/ijsc.2011.2201 ◽

2011 ◽

Vol 2 (2) ◽

pp. 1-13 ◽

Cited By ~ 5

Author(s):

Muzhir Shaban Al Ani ◽

Alaa Sulaiman Al Waisy

Keyword(s):

Principal Component Analysis ◽

Face Detection ◽

Principal Component ◽

Component Analysis ◽

Kernel Principal Component Analysis ◽

Support Vector

Download Full-text

Spam Detection Approach Based on C-Support Vector Machine and Kernel Principal-Component Analysis

2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing ◽

10.1109/iih-msp.2014.64 ◽

2014 ◽

Author(s):

Shu Geng ◽

Liu Lv ◽

Rongjun Liu

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Principal Component ◽

Component Analysis ◽

Kernel Principal Component Analysis ◽

Support Vector ◽

Spam Detection ◽

Detection Approach

Download Full-text

Multimode Monitoring of Oxy-Gas Combustion Through Flame Imaging, Principal Component Analysis, and Kernel Support Vector Machine

Combustion Science and Technology ◽

10.1080/00102202.2016.1250749 ◽

2016 ◽

Vol 189 (5) ◽

pp. 776-792 ◽

Cited By ~ 3

Author(s):

Xiaojing Bai ◽

Gang Lu ◽

Md Moinul Hossain ◽

Yong Yan ◽

Shi Liu

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Gas Combustion ◽

Kernel Support Vector Machine ◽

Flame Imaging

Download Full-text

Multiclass classification of leukemia cancer data using Fuzzy Support Vector Machine (FSVM) with feature selection using Principal Component Analysis (PCA)

Journal of Physics Conference Series ◽

10.1088/1742-6596/1725/1/012012 ◽

2021 ◽

Vol 1725 ◽

pp. 012012

Author(s):

I R Fauzi ◽

Z Rustam ◽

A Wibowo

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Feature Selection ◽

Principal Component ◽

Component Analysis ◽

Multiclass Classification ◽

Support Vector ◽

Fuzzy Support Vector Machine ◽

Cancer Data

Download Full-text

Face Recognition Based on Principal Component Analysis and Support Vector Machine Algorithms

10.23919/ccc52363.2021.9550727 ◽

2021 ◽

Author(s):

Yanbang Zhang ◽

Fen Zhang ◽

Lei Guo

Keyword(s):

Principal Component Analysis ◽

Support Vector Machine ◽

Face Recognition ◽

Principal Component ◽

Component Analysis ◽

Support Vector

Download Full-text

Batch process monitoring based on global enhanced multiple neighborhoods preserving embedding

Transactions of the Institute of Measurement and Control ◽

10.1177/01423312211044742 ◽

2021 ◽

pp. 014233122110447

Author(s):

Hongjuan Yao ◽

Xiaoqiang Zhao ◽

Wei Li ◽

Yongyong Hui

Keyword(s):

Principal Component Analysis ◽

Fault Detection ◽

Objective Function ◽

Principal Component ◽

Batch Process ◽

Component Analysis ◽

Support Vector ◽

Support Vector Data Description ◽

Order Information ◽

Multiple Neighborhoods

Batch process generally has varying dynamic characteristic that causes low fault detection rate and high false alarm rate, and it is necessary and urgent to monitor batch process. This paper proposes a global enhanced multiple neighborhoods preserving embedding based fault detection strategy for dynamic batch process. Firstly, the angle neighbor is defined and selected to compensate for the insufficient expression for the spatial similarity of samples only by using the distance neighbor, and the time neighbor is introduced to describe the time correlations between samples. These three types of neighbors can fully characterize the similarity of the samples in time and space. Secondly, considering the minimum reconstruction error and the order information of three types of neighbors, an enhanced objective function is constructed to prevent the loss of order information when neighborhood preserving embedding (NPE) calculates the reconstruction weights. Furthermore, the enhanced objective function and a global objective function are organically combined to extract both global and local features, to describe process dynamics and visualize process data in a low-dimensional space. Finally, a monitoring index based on support vector data description is constructed to eliminate adverse effects of non-Gaussian data for monitoring performance. The advantages of the proposed method over principal component analysis, neighborhood preserving embedding, dynamic principal component analysis and time NPE are demonstrated by a numerical example and the penicillin fermentation process simulation.

Download Full-text