Data-driven decision support under concept drift in streamed big data

Jie Lu; Anjin Liu; Yiliao Song; Guangquan Zhang

doi:10.1007/s40747-019-00124-4

Data-driven decision support under concept drift in streamed big data

Complex & Intelligent Systems ◽

10.1007/s40747-019-00124-4 ◽

2019 ◽

Vol 6 (1) ◽

pp. 157-163 ◽

Cited By ~ 2

Author(s):

Jie Lu ◽

Anjin Liu ◽

Yiliao Song ◽

Guangquan Zhang

Keyword(s):

Decision Making ◽

Big Data ◽

Real Time ◽

Concept Drift ◽

High Volume ◽

Streaming Data ◽

Data Driven ◽

Research Directions ◽

Decision Outcomes ◽

Past Data

Abstract Data-driven decision-making ($$\mathrm {D^3}$$D3M) is often confronted by the problem of uncertainty or unknown dynamics in streaming data. To provide real-time accurate decision solutions, the systems have to promptly address changes in data distribution in streaming data—a phenomenon known as concept drift. Past data patterns may not be relevant to new data when a data stream experiences significant drift, thus to continue using models based on past data will lead to poor prediction and poor decision outcomes. This position paper discusses the basic framework and prevailing techniques in streaming type big data and concept drift for $$\mathrm {D^3}$$D3M. The study first establishes a technical framework for real-time $$\mathrm {D^3}$$D3M under concept drift and details the characteristics of high-volume streaming data. The main methodologies and approaches for detecting concept drift and supporting $$\mathrm {D^3}$$D3M are highlighted and presented. Lastly, further research directions, related methods and procedures for using streaming data to support decision-making in concept drift environments are identified. We hope the observations in this paper could support researchers and professionals to better understand the fundamentals and research directions of $$\mathrm {D^3}$$D3M in streamed big data environments.

Get full-text (via PubEx)

Big Data-driven Decision-Making Processes, Real-Time Advanced Analytics, and Cyber-Physical Production Networks in Industry 4.0-based Manufacturing Systems

Economics Management and Financial Markets ◽

10.22381/emfm16420216 ◽

2021 ◽

Vol 16 (4) ◽

pp. 84

Keyword(s):

Decision Making ◽

Big Data ◽

Real Time ◽

Industry 4.0 ◽

Manufacturing Systems ◽

Data Driven ◽

Production Networks ◽

Data Driven Decision Making ◽

Decision Making Processes ◽

Advanced Analytics

Get full-text (via PubEx)

Big Data-driven Smart Cities: Computationally Networked Urbanism, Real-Time Decision-Making, and the Cognitive Internet of Things

Geopolitics History and International Relations ◽

10.22381/ghir11220197 ◽

2019 ◽

Vol 11 (2) ◽

pp. 48

Keyword(s):

Decision Making ◽

Big Data ◽

Internet Of Things ◽

Real Time ◽

Smart Cities ◽

Data Driven ◽

Cognitive Internet Of Things

Get full-text (via PubEx)

Industrial Artificial Intelligence, Smart Connected Sensors, and Big Data-driven Decision-Making Processes in Internet of Things-based Real-Time Production Logistics

Economics Management and Financial Markets ◽

10.22381/emfm15320201 ◽

2020 ◽

Vol 15 (3) ◽

pp. 9

Keyword(s):

Artificial Intelligence ◽

Decision Making ◽

Big Data ◽

Internet Of Things ◽

Real Time ◽

Data Driven ◽

Data Driven Decision Making ◽

Production Logistics ◽

Time Production ◽

Decision Making Processes

Get full-text (via PubEx)

Internet of Things-based Real-Time Production Logistics, Big Data-driven Decision-Making Processes, and Industrial Artificial Intelligence in Sustainable Cyber-Physical Manufacturing Systems

Journal of Self-Governance and Management Economics ◽

10.22381/jsme9320215 ◽

2021 ◽

Vol 9 (3) ◽

pp. 61

Keyword(s):

Artificial Intelligence ◽

Decision Making ◽

Big Data ◽

Internet Of Things ◽

Real Time ◽

Manufacturing Systems ◽

Data Driven ◽

Production Logistics ◽

Time Production ◽

Decision Making Processes

Get full-text (via PubEx)

Big Data-driven Algorithmic Decision-Making in Selecting and Managing Employees: Advanced Predictive Analytics, Workforce Metrics, and Digital Innovations for Enhancing Organizational Human Capital Social Sciences, Sociology, Management and complex organi

Psychosociological Issues in Human Resource Management ◽

10.22381/pihrm7220198 ◽

2019 ◽

Vol 7 (2) ◽

pp. 49 ◽

Cited By ~ 2

Keyword(s):

Social Sciences ◽

Decision Making ◽

Human Capital ◽

Big Data ◽

Predictive Analytics ◽

Data Driven ◽

Capital Social

Get full-text (via PubEx)

Cognitive Automation, Big Data-driven Manufacturing, and Sustainable Industrial Value Creation in Internet of Things-based Real-Time Production Logistics

Economics Management and Financial Markets ◽

10.22381/emfm15420204 ◽

2020 ◽

Vol 15 (4) ◽

pp. 39

Keyword(s):

Big Data ◽

Internet Of Things ◽

Real Time ◽

Value Creation ◽

Data Driven ◽

Production Logistics ◽

Time Production ◽

Cognitive Automation

Get full-text (via PubEx)

Big Data-driven Decision-Making Processes, Industry 4.0 Wireless Networks, and Digitized Mass Production in Cyber-Physical System-based Smart Factories

Economics Management and Financial Markets ◽

10.22381/emfm15420202 ◽

2020 ◽

Vol 15 (4) ◽

pp. 19

Keyword(s):

Decision Making ◽

Wireless Networks ◽

Big Data ◽

Mass Production ◽

Physical System ◽

Industry 4.0 ◽

Data Driven ◽

Cyber Physical System ◽

Decision Making Processes ◽

Smart Factories

Get full-text (via PubEx)

Data-Driven Dispatching Rules Mining and Real-Time Decision-Making Methodology in Intelligent Manufacturing Shop Floor with Uncertainty

Sensors ◽

10.3390/s21144836 ◽

2021 ◽

Vol 21 (14) ◽

pp. 4836

Author(s):

Liping Zhang ◽

Yifan Hu ◽

Qiuhua Tang ◽

Jie Li ◽

Zhixiong Li

Keyword(s):

Decision Making ◽

Data Base ◽

Real Time ◽

Manufacturing Industry ◽

Job Shop ◽

Data Driven ◽

Dispatching Rules ◽

Shop Floor ◽

Production Data ◽

Online Decision Making

In modern manufacturing industry, the methods supporting real-time decision-making are the urgent requirement to response the uncertainty and complexity in intelligent production process. In this paper, a novel closed-loop scheduling framework is proposed to achieve real-time decision making by calling the appropriate data-driven dispatching rules at each rescheduling point. This framework contains four parts: offline training, online decision-making, data base and rules base. In the offline training part, the potential and appropriate dispatching rules with managers’ expectations are explored successfully by an improved gene expression program (IGEP) from the historical production data, not just the available or predictable information of the shop floor. In the online decision-making part, the intelligent shop floor will implement the scheduling scheme which is scheduled by the appropriate dispatching rules from rules base and store the production data into the data base. This approach is evaluated in a scenario of the intelligent job shop with random jobs arrival. Numerical experiments demonstrate that the proposed method outperformed the existing well-known single and combination dispatching rules or the discovered dispatching rules via metaheuristic algorithm in term of makespan, total flow time and tardiness.

Get full-text (via PubEx)

A large multi-group decision-making technique for prioritizing the big data-driven circular economy practices in the automobile component manufacturing industry

Technological Forecasting and Social Change ◽

10.1016/j.techfore.2020.120567 ◽

2021 ◽

Vol 165 ◽

pp. 120567

Author(s):

Sachin S. Kamble ◽

Amine Belhadi ◽

Angappa Gunasekaran ◽

L. Ganapathy ◽

Surabhi Verma

Keyword(s):

Decision Making ◽

Big Data ◽

Group Decision Making ◽

Circular Economy ◽

Manufacturing Industry ◽

Group Decision ◽

Data Driven ◽

Component Manufacturing

Get full-text (via PubEx)

Measuring the Effectiveness of Adaptive Random Forest for Handling Concept Drift in Big Data Streams

Entropy ◽

10.3390/e23070859 ◽

2021 ◽

Vol 23 (7) ◽

pp. 859

Author(s):

Abdulaziz O. AlQabbany ◽

Aqil M. Azmi

Keyword(s):

Big Data ◽

Random Forest ◽

Real Time ◽

Data Streams ◽

Learning Algorithm ◽

Concept Drift ◽

The United States ◽

Careful Consideration ◽

Data Sets ◽

Stream Data

We are living in the age of big data, a majority of which is stream data. The real-time processing of this data requires careful consideration from different perspectives. Concept drift is a change in the data’s underlying distribution, a significant issue, especially when learning from data streams. It requires learners to be adaptive to dynamic changes. Random forest is an ensemble approach that is widely used in classical non-streaming settings of machine learning applications. At the same time, the Adaptive Random Forest (ARF) is a stream learning algorithm that showed promising results in terms of its accuracy and ability to deal with various types of drift. The incoming instances’ continuity allows for their binomial distribution to be approximated to a Poisson(1) distribution. In this study, we propose a mechanism to increase such streaming algorithms’ efficiency by focusing on resampling. Our measure, resampling effectiveness (ρ), fuses the two most essential aspects in online learning; accuracy and execution time. We use six different synthetic data sets, each having a different type of drift, to empirically select the parameter λ of the Poisson distribution that yields the best value for ρ. By comparing the standard ARF with its tuned variations, we show that ARF performance can be enhanced by tackling this important aspect. Finally, we present three case studies from different contexts to test our proposed enhancement method and demonstrate its effectiveness in processing large data sets: (a) Amazon customer reviews (written in English), (b) hotel reviews (in Arabic), and (c) real-time aspect-based sentiment analysis of COVID-19-related tweets in the United States during April 2020. Results indicate that our proposed method of enhancement exhibited considerable improvement in most of the situations.

Get full-text (via PubEx)