Unsupervised spatial data mining for the development of future scenarios: a Covid-19 application

2021 ◽  
pp. 173-178
Author(s):  
Yuri Calleo ◽  
Simone Di Zio

In the context of Futures Studies, the scenario development process permits to make assumptions on what the futures can be in order to support better today decisions. In the initial stages of the scenario building (Framing and Scanning phases), the process requires much time and efforts to scanning data and information (reading of documents, literature review and consultation of experts) to understand more about the object of the foresight study. The daily use of social networks causes an exponential increase of data and for this reason here we deal with the problem of speeding up and optimizing the Scanning phase by applying a new combined method based on the analysis of tweets with the use of unsupervised classification models, text-mining and spatial data mining techniques. For the purpose of having a qualitative overview, we applied the bag-of-words model and a Sentiment Analysis with the Afinn and Vader algorithms. Then, in order to extrapolate the influence factors, and the relevant key factors (Kayser and Blind, 2017; 2020) the Latent Dirichlet Allocation (LDA) was used (Tong and Zhang, 2016). Furthermore, to acquire also spatial information we used spatial data mining technique to extract georeferenced data from which it was possible to analyse and obtain a geographic analysis of the data. To showcase our method, we provide an example using Covid-19 tweets (Uhl and Schiebel, 2017), upon which 5 topics and 6 key factors have been extracted. In the last instance, for each influence factor, a cartogram was created through the relative frequencies in order to have a spatial distribution of the users discussing each particular topic. The results fully answer the research objectives and the model used could be a new approach that can offer benefits in the scenario developments process.

Author(s):  
Nguyen Vinh Nam ◽  
Le Hoai Bac

The  unique properties of spatial data provide challenges  and  opportunities  for  researching  new methods  in  spatial  data  mining.  In  this  article,  we propose  an  interoperable  framework  that  integrates Geographic  Information  System  (GIS)  with  the  spatial data  mining  processto  facilitate  spatial  data preparation,  to  extract  spatial  relationships  that  can take  advantage of traditional data  mining toolkits such as Weka, and to reveal significant spatial patterns. With this approach, it’svery straightforward to adopt spatial access methods and spatial query processing algorithms foran  efficient  data  mining  technique.  Moreover,  our framework  visually  supports  the  complete  spatial  data mining process.


Spatial data, also called geospatial data, is term needed to describe data linked to or containing knowledgeable data about a particular location on Earth’s surface. Spatial data mining's primary goal is to uncover hidden complicated information from spatial & non-spatial information in spite of their enormous quantity and find the spatial relations density. Spatial Data Mining techniques, however, continue to be an expansion of individuals utilized in standard data mining. Spatial Data is an extremely challenging area since enormous quantities of spatial data have been obtained from the remote sensed to the GIS (Geographic Information Systems), ecological estimation, computer cartography, planning and many more. In a given paper, we only focus on an essential type of spatial vagueness termed as spatial fuzziness. Spatial fuzziness intakes the property of several spatial objects in certainty which don’t contain boundaries of sharp type and interiors or whose boundaries as well as interiors can't be determined in precise form. This paper provides the method for finding fuzzy spatial data of association rule. Association rules provided valuable data in the assessment of important correlations observed in big databases. Compared to the previous research work, the current approach for there search highlights the superiority over the same dataset in terms of time taken and generated rules. The rules generated tell about the occurrence of attributes. The results show that the current research is more efficient than that of the previous work and also less time-consuming.


2021 ◽  
Vol 10 (2) ◽  
pp. 79
Author(s):  
Ching-Yun Mu ◽  
Tien-Yin Chou ◽  
Thanh Van Hoang ◽  
Pin Kung ◽  
Yao-Min Fang ◽  
...  

Spatial information technology has been widely used for vehicles in general and for fleet management. Many studies have focused on improving vehicle positioning accuracy, although few studies have focused on efficiency improvements for managing large truck fleets in the context of the current complex network of roads. Therefore, this paper proposes a multilayer-based map matching algorithm with different spatial data structures to deal rapidly with large amounts of coordinate data. Using the dimension reduction technique, the geodesic coordinates can be transformed into plane coordinates. This study provides multiple layer grouping combinations to deal with complex road networks. We integrated these techniques and employed a puncture method to process the geometric computation with spatial data-mining approaches. We constructed a spatial division index and combined this with the puncture method, which improves the efficiency of the system and can enhance data retrieval efficiency for large truck fleet dispatching. This paper also used a multilayer-based map matching algorithm with raster data structures. Comparing the results revealed that the look-up table method offers the best outcome. The proposed multilayer-based map matching algorithm using the look-up table method is suited to obtaining competitive performance in identifying efficiency improvements for large truck fleet dispatching.


Sign in / Sign up

Export Citation Format

Share Document