scholarly journals CNN-GRU-AM for Shared Bicycles Demand Forecasting

2021 ◽  
Vol 2021 ◽  
pp. 1-14
Author(s):  
Yali Peng ◽  
Ting Liang ◽  
Xiaojiang Hao ◽  
Yu Chen ◽  
Shicheng Li ◽  
...  

The demand forecast of shared bicycles directly determines the utilization rate of vehicles and projects operation benefits. Accurate prediction based on the existing operating data can reduce unnecessary delivery. Since the use of shared bicycles is susceptible to time dependence and external factors, most of the existing works only consider some of the attributes of shared bicycles, resulting in insufficient modeling and unsatisfactory prediction performance. In order to address the aforementioned limitations, this paper establishes a novelty prediction model based on convolutional recurrent neural network with the attention mechanism named as CNN-GRU-AM. There are four parts in the proposed CNN-GRU-AM model. First, a convolutional neural network (CNN) with two layers is used to extract local features from the multiple sources data. Second, the gated recurrent unit (GRU) is employed to capture the time-series relationships of the output data of CNN. Third, the attention mechanism (AM) is introduced to mining the potential relationships of the series features, in which different weights will be assigned to the corresponding features according to their importance. At last, a fully connected layer with three layers is added to learn features and output the prediction results. To evaluate the performance of the proposed method, we conducted massive experiments on two datasets including a real mobile bicycle data and a public shared bicycle data. The experimental results show that the prediction performance of the proposed model is better than other prediction models, indicating the significance of the social benefits.


Energies ◽  
2020 ◽  
Vol 13 (9) ◽  
pp. 2242 ◽  
Author(s):  
Alejandro J. del Real ◽  
Fernando Dorado ◽  
Jaime Durán

This paper investigates the use of deep learning techniques in order to perform energy demand forecasting. To this end, the authors propose a mixed architecture consisting of a convolutional neural network (CNN) coupled with an artificial neural network (ANN), with the main objective of taking advantage of the virtues of both structures: the regression capabilities of the artificial neural network and the feature extraction capacities of the convolutional neural network. The proposed structure was trained and then used in a real setting to provide a French energy demand forecast using Action de Recherche Petite Echelle Grande Echelle (ARPEGE) forecasting weather data. The results show that this approach outperforms the reference Réseau de Transport d’Electricité (RTE, French transmission system operator) subscription-based service. Additionally, the proposed solution obtains the highest performance score when compared with other alternatives, including Autoregressive Integrated Moving Average (ARIMA) and traditional ANN models. This opens up the possibility of achieving high-accuracy forecasting using widely accessible deep learning techniques through open-source machine learning platforms.



2004 ◽  
Vol 8 (4) ◽  
pp. 219-233
Author(s):  
Tarun K. Sen ◽  
Parviz Ghandforoush ◽  
Charles T. Stivason

Neural networks are excellent mapping tools for complex financial data. Their mapping capabilities however do not always result in good generalizability for financial prediction models. Increasing the number of nodes and hidden layers in a neural network model produces better mapping of the data since the number of parameters available to the model increases. This is determinal to generalizabilitiy of the model since the model memorizes idiosyncratic patterns in the data. A neural network model can be expected to be more generalizable if the model architecture is made less complex by using fewer input nodes. In this study we simplify the neural network by eliminating input nodes that have the least contribution to the prediction of a desired outcome. We also provide a theoretical relationship of the sensitivity of output variables to the input variables under certain conditions. This research initiates an effort in identifying methods that would improve the generalizability of neural networks in financial prediction tasks by using mergers and bankruptcy models. The result indicates that incorporating more variables that appear relevant in a model does not necessarily improve prediction performance.



Author(s):  
Alejandro J. del Real ◽  
Fernando Dorado ◽  
Jaime Durán

This paper investigates the use of deep learning techniques to perform energy demand forecasting. Specifically, the authors have adapted a deep neural network originally thought for image classification and composed of a convolutional neural network (CNN) followed by a multilayered fully connected artificial neural network (ANN). The convolutional part of the network was fed with a grid of temperature forecasting data distributed in the area of interest in order to extract a featured temperature. The subsequent ANN is then fed with this calculated temperature along with other data related to the timing of the forecast. The proposed structure was first trained and then used in a real setting aimed to provide the French energy demand forecast using ARPEGE forecasting weather data. The results show that the performance of this approach is in the line of the performance provided by the reference RTE subscription-based service, which opens the possibility to obtain high accuracy forecasting using widely accessible deep learning techniques through open-source machine learning platforms.



2019 ◽  
Author(s):  
Zhonghao Liu ◽  
Jing Jin ◽  
Yuxin Cui ◽  
Zheng Xiong ◽  
Alireza Nasiri ◽  
...  

AbstractHuman leukocyte antigen (HLA) complex molecules play an essential role in immune interactions by presenting peptides on the cell surface to T cells. With significant progress in deep learning, a series of neural network based models have been proposed and demonstrated with their good performances for peptide-HLA class I binding prediction. However, there still lack effective binding prediction models for HLA class II protein binding with peptides due to its inherent challenges. In this work, we present a novel sequence-based pan-specific neural network structure, DeepSeaPanII, for peptide-HLA class II binding prediction. Compared with existing pan-specific models, our model is an end-to-end neural network model without the need for pre- or post-processing on input samples. Besides state-of-the-art peformance in binding affinity prediction, DeepSeqPanII can also extract biological insight on the binding mechanism over the peptide and HLA sequences by its attention mechanism based binding core prediction capability. The leave-one-allele-out cross validation and benchmark evaluation results show that our proposed network model achieved state-of-the-art performance in HLA-II peptide binding. The source code and trained models are freely available at https://github.com/pcpLiu/DeepSeqPanII.



2021 ◽  
Vol 267 ◽  
pp. 02059
Author(s):  
Deyu Xia ◽  
Ning Li ◽  
Pengju Ren ◽  
Xiaodong Wen

Machine learning has brought great convenience to material property prediction. However, most existing models can only predict properties of molecules or crystals with specific size, and usually only local atomic environment or molecular global descriptor representation be used as the characteristics of the model, resulting in poor model versatility and cannot be applied to multiple systems. We propose a method that combines the description of the local atomic environment and the overall structure of the molecule, a fusion model consisting of a graph convolutional neural network and a fully connected neural network is used to predict the properties of molecules or crystals, and successfully applied to QM9 organic molecules and semiconductor crystal materials. Our method is not limited to a specific size of a molecule or a crystal structure. According to the calculation principle of the properties of the material molecules, the influences of the local atomic environment and the overall structure of the molecules on the properties are respectively considered, an appropriate weighting ratio is selected to predict the properties. As a result, the prediction performance has been greatly improved. In fact, the proposed method is not limited to organic molecules and crystals and is also applicable to other structures, such as clusters.



2018 ◽  
Vol 175 ◽  
pp. 03015
Author(s):  
GOU Guohua

This paper studied the coal demand in the prediction accuracy problems. The traditional gray GM(1,1)model has the theoretical prediction problem of poor accuracy which leaded to less accurate prediction. A modified gray BP Neural Network forecasting model was used to predict the residual correction. The total consumption of coal as a major factor in variables was selected to construct forecast of coal demand The simulation results show that the proposed algorithm has better prediction accuracy and is an effective demand forecasting algorithm



2020 ◽  
Vol 2020 ◽  
pp. 1-15 ◽  
Author(s):  
Pan Wu ◽  
Zilin Huang ◽  
Yuzhuang Pian ◽  
Lunhui Xu ◽  
Jinlong Li ◽  
...  

Short-term traffic speed prediction is a promising research topic in intelligent transportation systems (ITSs), which also plays an important role in the real-time decision-making of traffic control and guidance systems. However, the urban traffic speed has strong temporal, spatial correlation and the characteristic of complex nonlinearity and randomness, which makes it challenging to accurately and efficiently forecast short-term traffic speeds. We investigate the relevant literature and found that although most methods can achieve good prediction performance with the complete sample data, when there is a certain missing rate in the database, it is difficult to maintain accuracy with these methods. Recent studies have shown that deep learning methods, especially long short-term memory (LSTM) models, have good results in short-term traffic flow prediction. Furthermore, the attention mechanism can properly assign weights to distinguish the importance of traffic time sequences, thereby further improving the computational efficiency of the prediction model. Therefore, we propose a framework for short-term traffic speed prediction, including data preprocessing module and short-term traffic prediction module. In the data preprocessing module, the missing traffic data are repaired to provide a complete dataset for subsequent prediction. In the prediction module, a combined deep learning method that is an attention-based LSTM (ATT-LSTM) model for predicting short-term traffic speed on urban roads is proposed. The proposed framework was applied to the urban road network in Nanshan District, Shenzhen, Guangdong Province, China, with a 30-day traffic speed dataset (floating car data) used as the experimental sample. Results show that the proposed method outperforms other deep learning algorithms (such as recurrent neural network (RNN) and convolutional neural network (CNN)) in terms of both calculating efficiency and prediction accuracy. The attention mechanism can significantly reduce the error of the LSTM model (up to 12.4%) and improves the prediction performance.



2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Sangwon Chae ◽  
Joonhyeok Shin ◽  
Sungjun Kwon ◽  
Sangmok Lee ◽  
Sungwon Kang ◽  
...  

AbstractIn this paper, we propose a real-time prediction model that can respond to particulate matters (PM) in the air, which are an indication of poor air quality. The model applies interpolation to air quality and weather data and then uses a Convolutional Neural Network (CNN) to predict PM concentrations. The interpolation transforms the irregular spatial data into an equally spaced grid, which the model requires. This combination creates the interpolated CNN (ICNN) model that we use to predict PM10 and PM2.5 concentrations. The PM10 and PM2.5 evaluation results show an effective prediction performance with an R-squared higher than 0.97 and a root mean square error (RMSE) of approximately 16% of the standard deviation. Furthermore, both PM10 and PM2.5 prediction models forecast high concentrations with high reliability, with a probability of detection higher than 0.90 and a critical success index exceeding 0.85. The proposed ICNN prediction model achieves a high prediction performance using spatio-temporal information and presents a new direction in the prediction field.



2019 ◽  
Vol 24 (3) ◽  
pp. 220-228
Author(s):  
Gusti Alfahmi Anwar ◽  
Desti Riminarsih

Panthera merupakan genus dari keluarga kucing yang memiliki empat spesies popular yaitu, harimau, jaguar, macan tutul, singa. Singa memiliki warna keemasan dan tidak memilki motif, harimau memiliki motif loreng dengan garis-garis panjang, jaguar memiliki tubuh yang lebih besar dari pada macan tutul serta memiliki motif tutul yang lebih lebar, sedangkan macan tutul memiliki tubuh yang sedikit lebih ramping dari pada jaguar dan memiliki tutul yang tidak terlalu lebar. Pada penelitian ini dilakukan klasifikasi genus panther yaitu harimau, jaguar, macan tutul, dan singa menggunakan metode Convolutional Neural Network. Model Convolutional Neural Network yang digunakan memiliki 1 input layer, 5 convolution layer, dan 2 fully connected layer. Dataset yang digunakan berupa citra harimau, jaguar, macan tutul, dan singa. Data training terdiri dari 3840 citra, data validasi sebanyak 960 citra, dan data testing sebanyak 800 citra. Hasil akurasi dari pelatihan model untuk training yaitu 92,31% dan validasi yaitu 81,88%, pengujian model menggunakan dataset testing mendapatan hasil 68%. Hasil akurasi prediksi didapatkan dari nilai F1-Score pada pengujian didapatkan sebesar 78% untuk harimau, 70% untuk jaguar, 37% untuk macan tutul, 74% untuk singa. Macan tutul mendapatkan akurasi terendah dibandingkan 3 hewan lainnya tetapi lebih baik dibandingkan hasil penelitian sebelumnya.



Sign in / Sign up

Export Citation Format

Share Document