Study on real-time prediction model of railway passenger flow based on big data technology

In order to solve the limitation of traditional offline forecasting application scenarios, the author uses a variety of big data open source frameworks and tools to combine with railway real-time data, and proposes a real-time prediction model of railway passenger flow. The model architecture is divided into four levels from bottom to top: data source layer, data transmission layer, prediction calculation layer and application layer. The main components of the model are data flow and prediction flow. Through message queue and ETL, the data process part realizes the synchronization of offline data and real-time data; through the big data technology frameworks such as Spark, Redis and Hive and the GBDT (Gradient Boosting Tree) algorithm, the prediction process partially realizes the real-time passenger flow of the train OD section prediction. The experimental results show that the model proposed by the author has certain practicability and accuracy both in performance and prediction accuracy.

Download Full-text

Research on real-time data centre reconstruction technology based on big data technology

Automotive, Mechanical and Electrical Engineering ◽

10.1201/9781315210445-38 ◽

2017 ◽

pp. 203-206

Author(s):

Li Xianhui ◽

Sheng Zhou ◽

Shengpeng Ji ◽

Gang Zhen ◽

Yang He ◽

...

Keyword(s):

Big Data ◽

Real Time ◽

Time Data ◽

Data Centre ◽

Real Time Data ◽

Big Data Technology

Download Full-text

Design and Implementation of Real Time Data Center Access Interface Based on Big Data Technology

2017 International Conference on Computer Technology, Electronics and Communication (ICCTEC) ◽

10.1109/icctec.2017.00125 ◽

2017 ◽

Cited By ~ 1

Author(s):

Feng You ◽

Junning Qin ◽

Keheng Zhang ◽

Xianhui Li ◽

Haiquan Mao ◽

...

Keyword(s):

Big Data ◽

Real Time ◽

Data Center ◽

Time Data ◽

Design And Implementation ◽

Real Time Data ◽

Big Data Technology

Download Full-text

Design and Implementation of Real Time Data Center Access Interface Based on Big Data Technology

2018 IEEE International Conference of Safety Produce Informatization (IICSPI) ◽

10.1109/iicspi.2018.8690415 ◽

2018 ◽

Author(s):

You Feng ◽

Qin Junning ◽

Zhang Keheng ◽

Li Xianhui ◽

Mao Haiquan ◽

...

Keyword(s):

Big Data ◽

Real Time ◽

Data Center ◽

Time Data ◽

Design And Implementation ◽

Real Time Data ◽

Big Data Technology

Download Full-text

Neural network prediction model for a real-time data transmission

Neural Computing and Applications ◽

10.1007/s00521-006-0042-1 ◽

2006 ◽

Vol 15 (3-4) ◽

pp. 373-382 ◽

Cited By ~ 4

Author(s):

Kil To Chong ◽

Sung Goo Yoo

Keyword(s):

Neural Network ◽

Prediction Model ◽

Real Time ◽

Data Transmission ◽

Time Data ◽

Real Time Data ◽

Neural Network Prediction ◽

Network Prediction

Download Full-text

An LSTM-Based Method Considering History and Real-Time Data for Passenger Flow Prediction

Applied Sciences ◽

10.3390/app10113788 ◽

2020 ◽

Vol 10 (11) ◽

pp. 3788 ◽

Cited By ~ 1

Author(s):

Qi Ouyang ◽

Yongbo Lv ◽

Jihui Ma ◽

Jing Li

Keyword(s):

Feature Extraction ◽

Real Time ◽

Short Term Memory ◽

Historical Data ◽

Time Interval ◽

Information Coding ◽

Time Data ◽

Passenger Flow ◽

Flow Prediction ◽

Real Time Data

With the development of big data and deep learning, bus passenger flow prediction considering real-time data becomes possible. Real-time traffic flow prediction helps to grasp real-time passenger flow dynamics, provide early warning for a sudden passenger flow and data support for real-time bus plan changes, and improve the stability of urban transportation systems. To solve the problem of passenger flow prediction considering real-time data, this paper proposes a novel passenger flow prediction network model based on long short-term memory (LSTM) networks. The model includes four parts: feature extraction based on Xgboost model, information coding based on historical data, information coding based on real-time data, and decoding based on a multi-layer neural network. In the feature extraction part, the data dimension is increased by fusing bus data and points of interest to improve the number of parameters and model accuracy. In the historical information coding part, we use the date as the index in the LSTM structure to encode historical data and provide relevant information for prediction; in the real-time data coding part, the daily half-hour time interval is used as the index to encode real-time data and provide real-time prediction information; in the decoding part, the passenger flow data for the next two 30 min interval outputs by decoding all the information. To our best knowledge, it is the first time to real-time information has been taken into consideration in passenger flow prediction based on LSTM. The proposed model can achieve better accuracy compared to the LSTM and other baseline methods.

Download Full-text

Big Data Management in the Context of Real-Time Data Warehousing

Big Data Management, Technologies, and Applications - Advances in Data Mining and Database Management ◽

10.4018/978-1-4666-4699-5.ch007 ◽

2013 ◽

pp. 150-176

Author(s):

M. Asif Naeem ◽

Gillian Dobbie ◽

Gerald Weber

Keyword(s):

Big Data ◽

Data Integration ◽

Real Time ◽

Real Life ◽

Skewed Distribution ◽

Stream Data ◽

Time Data ◽

Master Data ◽

Real Time Data ◽

Resource Aware

In order to make timely and effective decisions, businesses need the latest information from big data warehouse repositories. To keep these repositories up to date, real-time data integration is required. An important phase in real-time data integration is data transformation where a stream of updates, which is huge in volume and infinite, is joined with large disk-based master data. Stream processing is an important concept in Big Data, since large volumes of data are often best processed immediately. A well-known algorithm called Mesh Join (MESHJOIN) was proposed to process stream data with disk-based master data, which uses limited memory. MESHJOIN is a candidate for a resource-aware system setup. The problem that the authors consider in this chapter is that MESHJOIN is not very selective. In particular, the performance of the algorithm is always inversely proportional to the size of the master data table. As a consequence, the resource consumption is in some scenarios suboptimal. They present an algorithm called Cache Join (CACHEJOIN), which performs asymptotically at least as well as MESHJOIN but performs better in realistic scenarios, particularly if parts of the master data are used with different frequencies. In order to quantify the performance differences, the authors compare both algorithms with a synthetic dataset of a known skewed distribution as well as TPC-H and real-life datasets.

Download Full-text

Performance Improvement IoT Applications Through Multimedia Analytics Using Big Data Stream Computing Platforms

Exploring the Convergence of Big Data and the Internet of Things - Advances in Data Mining and Database Management ◽

10.4018/978-1-5225-2947-7.ch015 ◽

2018 ◽

pp. 200-221

Author(s):

Rizwan Patan ◽

Rajasekhara Babu M ◽

Suresh Kallam

Keyword(s):

Big Data ◽

Real Time ◽

Performance Improvement ◽

Data Stream ◽

Real Data ◽

Stream Computing ◽

Time Data ◽

Real Time Data ◽

Computing Platforms ◽

Time And Energy

A Big Data Stream Computing (BDSC) Platform handles real-time data from various applications such as risk management, marketing management and business intelligence. Now a days Internet of Things (IoT) deployment is increasing massively in all the areas. These IoTs engender real-time data for analysis. Existing BDSC is inefficient to handle Real-data stream from IoTs because the data stream from IoTs is unstructured and has inconstant velocity. So, it is challenging to handle such real-time data stream. This work proposes a framework that handles real-time data stream through device control techniques to improve the performance. The frame work includes three layers. First layer deals with Big Data platforms that handles real data streams based on area of importance. Second layer is performance layer which deals with performance issues such as low response time, and energy efficiency. The third layer is meant for Applying developed method on existing BDSC platform. The experimental results have been shown a performance improvement 20%-30% for real time data stream from IoT application.

Download Full-text

Trends and Technologies in Big Data Processing

Advances in Computational Intelligence and Robotics - Innovations, Algorithms, and Applications in Cognitive Informatics and Natural Intelligence ◽

10.4018/978-1-7998-3038-2.ch002 ◽

2020 ◽

pp. 17-42

Author(s):

Amitava Choudhury ◽

Kalpana Rangra

Keyword(s):

Cloud Computing ◽

Big Data ◽

Internet Of Things ◽

Data Processing ◽

Real Time ◽

Computing Technology ◽

Time Data ◽

Big Data Processing ◽

Real Time Data ◽

Real Time Data Processing

Data type and amount in human society is growing at an amazing speed, which is caused by emerging new services such as cloud computing, internet of things, and location-based services. The era of big data has arrived. As data has been a fundamental resource, how to manage and utilize big data better has attracted much attention. Especially with the development of the internet of things, how to process a large amount of real-time data has become a great challenge in research and applications. Recently, cloud computing technology has attracted much attention to high performance, but how to use cloud computing technology for large-scale real-time data processing has not been studied. In this chapter, various big data processing techniques are discussed.

Download Full-text

A novel performance aware real-time data handling for big data platforms on Lambda architecture

International Journal of Computer Aided Engineering and Technology ◽

10.1504/ijcaet.2018.092840 ◽

2018 ◽

Vol 10 (4) ◽

pp. 418 ◽

Cited By ~ 1

Author(s):

Rizwan Patan ◽

M. Rajasekhara Babu

Keyword(s):

Big Data ◽

Real Time ◽

Data Handling ◽

Time Data ◽

Lambda Architecture ◽

Real Time Data

Download Full-text

Customer behavior analysis using real-time data processing

Asia Pacific Journal of Marketing and Logistics ◽

10.1108/apjml-03-2018-0088 ◽

2019 ◽

Vol 31 (1) ◽

pp. 265-290 ◽

Cited By ~ 8

Author(s):

Ganjar Alfian ◽

Muhammad Fazal Ijaz ◽

Muhammad Syafrudin ◽

M. Alex Syaekhoni ◽

Norma Latif Fitriyani ◽

...

Keyword(s):

Data Processing ◽

Real Time ◽

Association Rule ◽

Customer Behavior ◽

Time Data ◽

Content Type ◽

The Real ◽

Real Time Data ◽

Real Time Data Processing ◽

Big Data Technology

PurposeThe purpose of this paper is to propose customer behavior analysis based on real-time data processing and association rule for digital signage-based online store (DSOS). The real-time data processing based on big data technology (such as NoSQL MongoDB and Apache Kafka) is utilized to handle the vast amount of customer behavior data.Design/methodology/approachIn order to extract customer behavior patterns, customers’ browsing history and transactional data from digital signage (DS) could be used as the input for decision making. First, the authors developed a DSOS and installed it in different locations, so that customers could have the experience of browsing and buying a product. Second, the real-time data processing system gathered customers’ browsing history and transaction data as it occurred. In addition, the authors utilized the association rule to extract useful information from customer behavior, so it may be used by the managers to efficiently enhance the service quality.FindingsFirst, as the number of customers and DS increases, the proposed system was capable of processing a gigantic amount of input data conveniently. Second, the data set showed that as the number of visit and shopping duration increases, the chance of products being purchased also increased. Third, by combining purchasing and browsing data from customers, the association rules from the frequent transaction pattern were achieved. Thus, the products will have a high possibility to be purchased if they are used as recommendations.Research limitations/implicationsThis research empirically supports the theory of association rule that frequent patterns, correlations or causal relationship found in various kinds of databases. The scope of the present study is limited to DSOS, although the findings can be interpreted and generalized in a global business scenario.Practical implicationsThe proposed system is expected to help management in taking decisions such as improving the layout of the DS and providing better product suggestions to the customer.Social implicationsThe proposed system may be utilized to promote green products to the customer, having a positive impact on sustainability.Originality/valueThe key novelty of the present study lies in system development based on big data technology to handle the enormous amounts of data as well as analyzing the customer behavior in real time in the DSOS. The real-time data processing based on big data technology (such as NoSQL MongoDB and Apache Kafka) is used to handle the vast amount of customer behavior data. In addition, the present study proposed association rule to extract useful information from customer behavior. These results can be used for promotion as well as relevant product recommendations to DSOS customers. Besides in today’s changing retail environment, analyzing the customer behavior in real time in DSOS helps to attract and retain customers more efficiently and effectively, and retailers can get a competitive advantage over their competitors.

Download Full-text