scholarly journals Integrated Time Series Summarization and Prediction Algorithm and its Application to COVID-19 Data Mining

Author(s):  
Mogens Graf Plessen
2021 ◽  
Vol 2021 ◽  
pp. 1-8
Author(s):  
Hongxiang Sun ◽  
Zhongkai Yao ◽  
Qingchun Miao

With the rapid development of information technology and globalization of economy, financial data are being generated and collected at an unprecedented rate. Consequently, there has been a dire need of automated methods for effective and proficient utilization of a substantial amount of financial data to help in investment planning and decision-making. Data mining methods have been employed to discover hidden patterns and estimate future tendencies in financial markets. In this article, an improved macroeconomic growth prediction algorithm based on data mining and fuzzy correlation analysis is presented. This study analyzes the sequence of economic characteristics, reorganizes the spatial structure of economic characteristics, and integrates the statistical information of economic data. Using the optimized Apriori algorithm, the association rules between macroeconomic data are generated. Distinct features are extracted according to association rules using the joint distribution characteristic quantity of macroeconomic time series. Moreover, the Doppler parameter of macroeconomic time series growth prediction is calculated, and the residual analysis method of the regression model is used to predict the growth of macroeconomic data. Experimental results show that the proposed algorithm has better adaptability, less computation time, and higher prediction accuracy of economic data mining.


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
J. Nathan Matias ◽  
Kevin Munger ◽  
Marianne Aubin Le Quere ◽  
Charles Ebersole

AbstractThe pursuit of audience attention online has led organizations to conduct thousands of behavioral experiments each year in media, politics, activism, and digital technology. One pioneer of A/B tests was Upworthy.com, a U.S. media publisher that conducted a randomized trial for every article they published. Each experiment tested variations in a headline and image “package,” recording how many randomly-assigned viewers selected each variation. While none of these tests were designed to answer scientific questions, scientists can advance knowledge by meta-analyzing and data-mining the tens of thousands of experiments Upworthy conducted. This archive records the stimuli and outcome for every A/B test fielded by Upworthy between January 24, 2013 and April 30, 2015. In total, the archive includes 32,487 experiments, 150,817 experiment arms, and 538,272,878 participant assignments. The open access dataset is organized to support exploratory and confirmatory research, as well as meta-scientific research on ways that scientists make use of the archive.


Author(s):  
Shadi Aljawarneh ◽  
Aurea Anguera ◽  
John William Atwood ◽  
Juan A. Lara ◽  
David Lizcano

AbstractNowadays, large amounts of data are generated in the medical domain. Various physiological signals generated from different organs can be recorded to extract interesting information about patients’ health. The analysis of physiological signals is a hard task that requires the use of specific approaches such as the Knowledge Discovery in Databases process. The application of such process in the domain of medicine has a series of implications and difficulties, especially regarding the application of data mining techniques to data, mainly time series, gathered from medical examinations of patients. The goal of this paper is to describe the lessons learned and the experience gathered by the authors applying data mining techniques to real medical patient data including time series. In this research, we carried out an exhaustive case study working on data from two medical fields: stabilometry (15 professional basketball players, 18 elite ice skaters) and electroencephalography (100 healthy patients, 100 epileptic patients). We applied a previously proposed knowledge discovery framework for classification purpose obtaining good results in terms of classification accuracy (greater than 99% in both fields). The good results obtained in our research are the groundwork for the lessons learned and recommendations made in this position paper that intends to be a guide for experts who have to face similar medical data mining projects.


Sign in / Sign up

Export Citation Format

Share Document