SenseLens: An Efficient Social Signal Conditioning System for True Event Detection

2022 ◽  
Vol 18 (2) ◽  
pp. 1-27
Author(s):  
Hang Cui ◽  
Tarek Abdelzaher

This article narrows the gap between physical sensing systems that measure physical signals and social sensing systems that measure information signals by (i) defining a novel algorithm for extracting information signals (building on results from text embedding) and (ii) showing that it increases the accuracy of truth discovery—the separation of true information from false/manipulated one. The work is applied in the context of separating true and false facts on social media, such as Twitter and Reddit, where users post predominantly short microblogs. The new algorithm decides how to aggregate the signal across words in the microblog for purposes of clustering the miscroblogs in the latent information signal space, where it is easier to separate true and false posts. Although previous literature extensively studied the problem of short text embedding/representation, this article improves previous work in three important respects: (1) Our work constitutes unsupervised truth discovery, requiring no labeled input or prior training. (2) We propose a new distance metric for efficient short text similarity estimation, we call Semantic Subset Matching , that improves our ability to meaningfully cluster microblog posts in the latent information signal space. (3) We introduce an iterative framework that jointly improves miscroblog clustering and truth discovery. The evaluation shows that the approach improves the accuracy of truth-discovery by 6.3%, 2.5%, and 3.8% (constituting a 38.9%, 14.2%, and 18.7% reduction in error, respectively) in three real Twitter data traces.

2020 ◽  
Vol 16 (2) ◽  
pp. 1045-1057 ◽  
Author(s):  
Yang Du ◽  
Yu-E Sun ◽  
He Huang ◽  
Liusheng Huang ◽  
Hongli Xu ◽  
...  

2019 ◽  
Vol 15 (1) ◽  
pp. 1-32 ◽  
Author(s):  
Chenglin Miao ◽  
Wenjun Jiang ◽  
Lu Su ◽  
Yaliang Li ◽  
Suxin Guo ◽  
...  

2013 ◽  
Vol 756-759 ◽  
pp. 1309-1313 ◽  
Author(s):  
Bing Jie Sun ◽  
Zhi Chao Liang ◽  
Qing Tian Zeng ◽  
Hua Zhao ◽  
Wei Jian Ni ◽  
...  

Text similarity computing is the core issue that question-answering system needs to solve. It is mainly used to filter out the existed problems which are similar to the users questions from database. Because of the low recall of domain keywords in domain text similarity computing based on traditional semantic dictionary, this paper proposed a short text similarity computing method in the field of agriculture based on the extended version of <<Tongyicicilin>> which referred to as <<CiLin>>. This paper propose to consider both the similarity and correlation when calculate the words final similarity. The experimental results show that the proposed short text similarity computing method resolve the problem of the low recall of domain words in traditional semantic dictionary well, and improve the similarity calculation performance of high relevant keywords greatly.


2021 ◽  
Vol 2021 ◽  
pp. 1-15
Author(s):  
Taochun Wang ◽  
Chengmei Lv ◽  
Chengtian Wang ◽  
Fulong Chen ◽  
Yonglong Luo

With the rapid development of portable mobile devices, mobile crowd sensing systems (MCS) have been widely studied. However, the sensing data provided by participants in MCS applications is always unreliable, which affects the service quality of the system, and the truth discovery technology can effectively obtain true values from the data provided by multiple users. At the same time, privacy leaks also restrict users’ enthusiasm for participating in the MCS. Based on this, our paper proposes a secure truth discovery for data aggregation in crowd sensing systems, STDDA, which iteratively calculates user weights and true values to obtain real object data. In order to protect the privacy of data, STDDA divides users into several clusters, and users in the clusters ensure the privacy of data by adding secret random numbers to the perceived data. At the same time, the cluster head node uses the secure sum protocol to obtain the aggregation result of the sense data and uploads it to the server so that the server cannot obtain the sense data and weight of individual users, further ensuring the privacy of the user’s sense data and weight. In addition, using the truth discovery method, STDDA provides corresponding processing mechanisms for users’ dynamic joining and exiting, which enhances the robustness of the system. Experimental results show that STDDA has the characteristics of high accuracy, low communication, and high security.


Author(s):  
Kaiping Xue ◽  
Bin Zhu ◽  
Qingyou Yang ◽  
Na Gai ◽  
David S.L. Wei ◽  
...  

2019 ◽  
Vol 68 (4) ◽  
pp. 3854-3865 ◽  
Author(s):  
Guowen Xu ◽  
Hongwei Li ◽  
Sen Liu ◽  
Mi Wen ◽  
Rongxing Lu

2019 ◽  
Vol 16 (1) ◽  
pp. 289-311 ◽  
Author(s):  
Adela Ljajic ◽  
Ulfeta Marovac

The importance of determining sentiment for short text increases with the rise in the number of comments on social networks. The presence of negation in these texts affects their sentiment, because it has a greater range of action in proportion to the length of the text. In this paper, we examine how the treatment of negation impacts the sentiment of tweets in the Serbian language. The grammatical rules that influence the change of polarity are processed. We performed an analysis of the effect of the negation treatment on the overall process of sentiment analysis. A statistically significant relative improvement was obtained (up to 31.16% or up to 2.65%) when the negation was processed using our rules with the lexicon-based approach or machine learning methods. By applying machine learning methods, an accuracy of 68.84% was achieved on a set of positive, negative and neutral tweets, and an accuracy of as much as 91.13% when applied to the set of positive and negative tweets.


Sign in / Sign up

Export Citation Format

Share Document