A Trie-based Indexing Scheme for Efficient Retrieval of Massive Spatio-Temporal IoT Sensor Data

2020 ◽  
Vol 47 (12) ◽  
pp. 1199-1207
Author(s):  
Hawon Chu ◽  
Young-Kyoon Suh ◽  
Ryong Lee ◽  
Minwoo Park ◽  
Rae-Young Jang ◽  
...  
2020 ◽  
Vol 12 (22) ◽  
pp. 9727
Author(s):  
Hawon Chu ◽  
Jaeseong Kim ◽  
Seounghyeon Kim ◽  
Young-Kyoon Suh ◽  
Ryong Lee ◽  
...  

Recently, various environmental data, such as microdust pollution, temperature, humidity, etc., have been continuously collected by widely deployed Internet of Things (IoT) sensors. Although these data can provide great insight into developing sustainable application services, it is challenging to rapidly retrieve such data, due to their multidimensional properties and huge growth in volume over time. Existing indexing methods for efficiently locating those data expose several problems, such as high administrative cost, spatial overhead, and slow retrieval performance. To mitigate these problems, we propose a novel indexing scheme termed ST-Trie, for efficient retrieval over spatiotemporal IoT environment data. Given IoT sensor data with latitude, longitude, and time, the proposed scheme first converts the three-dimensional attributes to one-dimensional index keys. The scheme then builds a trie-based index, consisting of internal nodes inserted by the converted keys and leaf nodes containing the keys and pointers to actual IoT data. We leverage this index to process various types of queries. In our experiments with three real-world datasets, we show that the proposed ST-Trie index outperforms existing approaches by a substantial margin regarding response time. Furthermore, we show that the query processing performance via ST-Trie also scales very well with an increasing time interval. Finally, we demonstrate that when compressed, the ST-Trie index can significantly reduce its space overhead by approximately a factor of seven.


Author(s):  
Omar Subhi Aldabbas

Internet of Things (IoT) is a ubiquitous embedded ecosystem known for its capability to perform common application functions through coordinating resources, which are distributed on-object or on-network domains. As new applications evolve, the challenge is in the analysis and usage of multimodal data streamed by diverse kinds of sensors. This paper presents a new service-centric approach for data collection and retrieval. This approach considers objects as highly decentralized, composite and cost effective services. Such services can be constructed from objects located within close geographical proximity to retrieve spatio-temporal events from the gathered sensor data. To achieve this, we advocate Coordination languages and models to fuse multimodal, heterogeneous services through interfacing with every service to achieve the network objective according to the data they gather and analyze. In this paper we give an application scenario that illustrates the implementation of the coordination models to provision successful collaboration among IoT objects to retrieve information. The proposed solution reduced the communication delay before service composition by up to 43% and improved the target detection accuracy by up to 70%, while maintaining energy consumption 20% lower than its best rivals in the literature.


2019 ◽  
Vol 11 (9) ◽  
pp. 1052 ◽  
Author(s):  
Reto Stöckli ◽  
Jędrzej S. Bojanowski ◽  
Viju O. John ◽  
Anke Duguay-Tetzlaff ◽  
Quentin Bourgeois ◽  
...  

Can we build stable Climate Data Records (CDRs) spanning several satellite generations? This study outlines how the ClOud Fractional Cover dataset from METeosat First and Second Generation (COMET) of the EUMETSAT Satellite Application Facility on Climate Monitoring (CM SAF) was created for the 25-year period 1991–2015. Modern multi-spectral cloud detection algorithms cannot be used for historical Geostationary (GEO) sensors due to their limited spectral resolution. We document the innovation needed to create a retrieval algorithm from scratch to provide the required accuracy and stability over several decades. It builds on inter-calibrated radiances now available for historical GEO sensors. It uses spatio-temporal information and a robust clear-sky retrieval. The real strength of GEO observations—the diurnal cycle of reflectance and brightness temperature—is fully exploited instead of just accounting for single “imagery”. The commonly-used naive Bayesian classifier is extended with covariance information of cloud state and variability. The resulting cloud fractional cover CDR has a bias of 1% Mean Bias Error (MBE), a precision of 7% bias-corrected Root-Mean-Squared-Error (bcRMSE) for monthly means, and a decadal stability of 1%. Our experience can serve as motivation for CDR developers to explore novel concepts to exploit historical sensor data.


Author(s):  
J. C. Whittier ◽  
S. Nittel ◽  
I. Subasinghe

With live streaming sensors and sensor networks, increasingly large numbers of individual sensors are deployed in physical space. Sensor data streams are a fundamentally novel mechanism to deliver observations to information systems. They enable us to represent spatio-temporal continuous phenomena such as radiation accidents, toxic plumes, or earthquakes almost as instantaneously as they happen in the real world. Sensor data streams discretely sample an earthquake, while the earthquake is continuous over space and time. Programmers attempting to integrate many streams to analyze earthquake activity and scope need to write code to integrate potentially very large sets of asynchronously sampled, concurrent streams in tedious application code. In previous work, we proposed the field stream data model (Liang et al., 2016) for data stream engines. Abstracting the stream of an individual sensor as a temporal field, the field represents the Earth’s movement at the sensor position as continuous. This simplifies analysis across many sensors significantly. In this paper, we undertake a feasibility study of using the field stream model and the open source Data Stream Engine (DSE) Apache Spark(Apache Spark, 2017) to implement a real-time earthquake event detection with a subset of the 250 GPS sensor data streams of the Southern California Integrated GPS Network (SCIGN). The field-based real-time stream queries compute maximum displacement values over the latest query window of each stream, and related spatially neighboring streams to identify earthquake events and their extent. Further, we correlated the detected events with an USGS earthquake event feed. The query results are visualized in real-time.


Sensors ◽  
2019 ◽  
Vol 19 (3) ◽  
pp. 448 ◽  
Author(s):  
Reza Rawassizadeh ◽  
Chelsea Dobbins ◽  
Mohammad Akbari ◽  
Michael Pazzani

Mobile and wearable devices are capable of quantifying user behaviors based on their contextual sensor data. However, few indexing and annotation mechanisms are available, due to difficulties inherent in raw multivariate data types and the relative sparsity of sensor data. These issues have slowed the development of higher level human-centric searching and querying mechanisms. Here, we propose a pipeline of three algorithms. First, we introduce a spatio-temporal event detection algorithm. Then, we introduce a clustering algorithm based on mobile contextual data. Our spatio-temporal clustering approach can be used as an annotation on raw sensor data. It improves information retrieval by reducing the search space and is based on searching only the related clusters. To further improve behavior quantification, the third algorithm identifies contrasting events withina cluster content. Two large real-world smartphone datasets have been used to evaluate our algorithms and demonstrate the utility and resource efficiency of our approach to search.


2015 ◽  
Vol 764-765 ◽  
pp. 1002-1006
Author(s):  
Hae Jung Baek ◽  
Je Min Kim ◽  
Young Tack Park

This paper describes a method for inferring user destinations and routes based on logs collected by smartphones. A challenging problem is coping with the uncertainty of smartphone sensor data. In this study, we represent a user transportation model with probabilistic models based on temporal smartphone sensor data, including GPS and accelerometer data. In our model, the travel behavior and spatio-temporal information of users are factors that affect route decisions. We propose hierarchical particle filters to enhance the performance and efficiency by sampling the route model based on hierarchical and semantic relationships.


Sign in / Sign up

Export Citation Format

Share Document