Big Data Predictive Modeling and Analytics

2017 ◽  
pp. 117-150
Author(s):  
Mydhili K. Nair ◽  
Arjun Rao ◽  
Mipsa Patel
Keyword(s):  
Big Data

2019 ◽  
pp. 089443931988845 ◽  
Author(s):  
Alexander Christ ◽  
Marcus Penthin ◽  
Stephan Kröner

Systematic reviews are the method of choice for synthesizing research evidence. To identify the main topics (so-called hot spots) relevant to large corpora of original publications in need of a synthesis, one must address the “three Vs” of big data (volume, velocity, and variety), especially in loosely defined or fragmented disciplines. Text mining and predictive modeling are very helpful for this purpose. Thus, we applied these methods to a compilation of documents related to digitalization in aesthetic, arts, and cultural education, a prototypical loosely defined, fragmented discipline, and particularly to quantitative research within it (QRD-ACE). By broadly querying the abstract and citation database Scopus with terms indicative of QRD-ACE, we identified a corpus of N = 55,553 publications for the years 2013–2017. As the result of an iterative approach combining text mining, priority screening, and predictive modeling, we identified n = 8,304 potentially relevant publications, of which n = 1,666 were included after priority screening. Analysis of the subject distribution of the included publications revealed video games as a first hot spot of QRD-ACE. Topic modeling revealed aesthetics and cultural activities on social media as a second hot spot, related to 4 of the k = 8 identified topics. In this way, we were able to identify current hot spots of QRD-ACE by screening less than 15% of the corpus. We discuss implications for harnessing text mining, predictive modeling, and priority screening in future research syntheses, as well as avenues for future original research on QRD-ACE.
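For readers curious what such a pipeline looks like in code, the following is a minimal sketch, assuming scikit-learn: topic modeling with k = 8 topics (mirroring the number reported) plus priority screening, i.e. ranking unscreened abstracts by a classifier's predicted relevance. The corpus, labels, and model choices below are placeholders, not the authors' actual setup.

```python
# Placeholder mini-corpus: abstracts already screened by hand.
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.linear_model import LogisticRegression

screened = [
    "video games and learning in arts education",
    "social media aesthetics and cultural participation",
    "unrelated clinical trial of a new drug",
    "museum apps and digital cultural activities",
    "protein folding simulation on supercomputers",
    "music streaming and adolescent cultural practice",
]
labels = [1, 1, 0, 1, 0, 1]  # 1 = relevant to the synthesis, 0 = not
unscreened = [
    "digital storytelling in theater education",
    "compiler optimizations for embedded systems",
]

# Topic modeling over term counts; k = 8 mirrors the abstract's k.
counts = CountVectorizer(stop_words="english")
doc_topics = LatentDirichletAllocation(
    n_components=8, random_state=0
).fit_transform(counts.fit_transform(screened))

# Priority screening: fit a relevance classifier on the screened items,
# then put the highest-scoring unscreened items first in the queue.
tfidf = TfidfVectorizer(stop_words="english")
clf = LogisticRegression(max_iter=1000).fit(tfidf.fit_transform(screened), labels)
scores = clf.predict_proba(tfidf.transform(unscreened))[:, 1]
queue = [doc for _, doc in sorted(zip(scores, unscreened), reverse=True)]
```

In practice the classifier is refit as each newly screened batch adds labels, which is how the study could stop after screening less than 15% of the corpus.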


2017 ◽  
Vol 47 (3) ◽  
pp. 943-961 ◽  
Author(s):  
Yanwei Zhang

While Bayesian methods have attracted considerable interest in actuarial science, they have yet to be embraced in large-scale insurance predictive modeling applications, owing to the inefficiency of Bayesian estimation procedures. The paper presents an efficient method that parallelizes Bayesian computation using distributed computing on Apache Spark across a cluster of computers. The distributed algorithm dramatically boosts the speed of Bayesian computation and expands the scope of applicability of Bayesian methods in insurance modeling. The empirical analysis applies a Bayesian hierarchical Tweedie model to a big data set of 13 million insurance claim records. The distributed algorithm achieves as much as a 65-fold performance gain over the non-parallel method in this application. The analysis demonstrates that Bayesian methods can be of great value to large-scale insurance predictive modeling.
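As an illustration of the data-parallel pattern the abstract describes, here is a minimal PySpark sketch. The paper fits a hierarchical Tweedie model; to keep this self-contained and runnable, the sketch swaps in a toy Gaussian-mean model and consensus Monte Carlo (each partition samples its sub-posterior with the prior tempered by 1/S, and per-draw averages combine the shards). Everything below is an illustrative assumption, not the paper's algorithm.

```python
import numpy as np
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("consensus-mc-sketch").getOrCreate()
sc = spark.sparkContext

S, DRAWS, SIGMA = 8, 1000, 1.0   # shards, draws per shard, known noise sd
MU0, TAU0 = 0.0, 10.0            # N(MU0, TAU0^2) prior on the mean

data = np.random.normal(3.0, SIGMA, 100_000)  # placeholder claim data
rdd = sc.parallelize(data.tolist(), numSlices=S)

def shard_draws(values):
    x = np.fromiter(values, dtype=float)
    # Gaussian mean, known variance, prior tempered by 1/S: the
    # sub-posterior is conjugate, so we can sample it directly.
    prior_prec = 1.0 / (S * TAU0 ** 2)
    like_prec = x.size / SIGMA ** 2
    var = 1.0 / (prior_prec + like_prec)
    mean = var * (prior_prec * MU0 + like_prec * x.mean())
    yield np.random.normal(mean, np.sqrt(var), DRAWS)

sub_draws = rdd.mapPartitions(shard_draws).collect()  # one array per shard
# Consensus step: with balanced shards and equal sub-posterior
# variances, the precision-weighted average reduces to a plain mean.
consensus = np.vstack(sub_draws).mean(axis=0)
print("posterior mean estimate:", consensus.mean())
spark.stop()
```

The speedup comes from each shard touching only its own partition of the data, so per-iteration cost scales with the largest shard rather than the full record count.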


2017 ◽  
Vol 23 (3) ◽  
pp. 1585-1588 ◽  
Author(s):  
Jung-Hyok Kwon ◽  
Hwi-Ho Lee ◽  
Eui-Jik Kim
