scholarly journals A Semantic Inference Based Method for Privacy Measurement

IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 200112-200128
Author(s):  
Baocun Chen ◽  
Nafei Zhu ◽  
Jingsha He ◽  
Peng He ◽  
Shuting Jin ◽  
...  
2021 ◽  
Vol 10 (7) ◽  
pp. 436
Author(s):  
Amerah Alghanim ◽  
Musfira Jilani ◽  
Michela Bertolotto ◽  
Gavin McArdle

Volunteered Geographic Information (VGI) is often collected by non-expert users. This raises concerns about the quality and veracity of such data. There has been much effort to understand and quantify the quality of VGI. Extrinsic measures which compare VGI to authoritative data sources such as National Mapping Agencies are common but the cost and slow update frequency of such data hinder the task. On the other hand, intrinsic measures which compare the data to heuristics or models built from the VGI data are becoming increasingly popular. Supervised machine learning techniques are particularly suitable for intrinsic measures of quality where they can infer and predict the properties of spatial data. In this article we are interested in assessing the quality of semantic information, such as the road type, associated with data in OpenStreetMap (OSM). We have developed a machine learning approach which utilises new intrinsic input features collected from the VGI dataset. Specifically, using our proposed novel approach we obtained an average classification accuracy of 84.12%. This result outperforms existing techniques on the same semantic inference task. The trustworthiness of the data used for developing and training machine learning models is important. To address this issue we have also developed a new measure for this using direct and indirect characteristics of OSM data such as its edit history along with an assessment of the users who contributed the data. An evaluation of the impact of data determined to be trustworthy within the machine learning model shows that the trusted data collected with the new approach improves the prediction accuracy of our machine learning technique. Specifically, our results demonstrate that the classification accuracy of our developed model is 87.75% when applied to a trusted dataset and 57.98% when applied to an untrusted dataset. Consequently, such results can be used to assess the quality of OSM and suggest improvements to the data set.


Author(s):  
Jiaoyan Chen ◽  
Freddy Lecue ◽  
Jeff Z. Pan ◽  
Huajun Chen

Data stream learning has been largely studied for extracting knowledge structures from continuous and rapid data records. In the semantic Web, data is interpreted in ontologies and its ordered sequence is represented as an ontology stream. Our work exploits the semantics of such streams to tackle the problem of concept drift i.e., unexpected changes in data distribution, causing most of models to be less accurate as time passes. To this end we revisited (i) semantic inference in the context of supervised stream learning, and (ii) models with semantic embeddings. The experiments show accurate prediction with data from Dublin and Beijing.


IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 167401-167408
Author(s):  
Jianjun Li ◽  
Hui Guo ◽  
Shichao Chen ◽  
Dan Yang ◽  
Wanyong Tian ◽  
...  

2019 ◽  
Vol 68 (8) ◽  
pp. 7890-7903 ◽  
Author(s):  
Wei Zhang ◽  
Siwang Zhou ◽  
Liang Yang ◽  
Lu Ou ◽  
Zhu Xiao

Sign in / Sign up

Export Citation Format

Share Document