MapReduce and Semantics Enabled Event Detection using Social Media

2017 ◽  
Vol 7 (3) ◽  
pp. 201-213 ◽  
Author(s):  
Peng Yan

Abstract Social media is playing an increasingly important role in reporting major events happening in the world. However, detecting events from social media is challenging due to the huge magnitude of the data and the complex semantics of the language being processed. This paper proposes MASEED (MapReduce and Semantics Enabled Event Detection), a novel event detection framework that effectively addresses the following problems: 1) traditional data mining paradigms cannot work for big data; 2) data preprocessing requires significant human effort; 3) domain knowledge must be gained before the detection; 4) semantic interpretation of events is overlooked; 5) detection scenarios are limited to specific domains. In this work, we overcome these challenges by embedding semantic analysis into temporal analysis for capturing the salient aspects of social media data, and parallelizing the detection of potential events using the MapReduce methodology. We evaluate the performance of our method using real Twitter data. The results demonstrate that the proposed system outperforms most of the state-of-the-art methods in terms of accuracy and efficiency.
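
As a rough illustration of the MapReduce pattern the abstract above refers to, the following sketch counts how many posts mention each term in each time window, which is one common starting point for temporal burst analysis. It is not the MASEED implementation: the post fields (`timestamp`, `text`), the hourly window, the whitespace tokenizer and all function names are assumptions made for this example, and the three phases run in a single process rather than on a cluster.

```python
from collections import defaultdict
from itertools import chain

WINDOW_SECONDS = 3600  # assumed window size: one hour


def map_post(post):
    """Map step: emit ((window, term), 1) once per distinct term in a post."""
    window = post["timestamp"] // WINDOW_SECONDS
    terms = set(post["text"].lower().split())  # naive tokenizer (assumption)
    return [((window, term), 1) for term in terms]


def shuffle(pairs):
    """Shuffle step: group emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups):
    """Reduce step: sum the values collected for each (window, term) key."""
    return {key: sum(values) for key, values in groups.items()}


def term_window_counts(posts):
    """Run map, shuffle and reduce sequentially over a list of posts."""
    mapped = chain.from_iterable(map_post(p) for p in posts)
    return reduce_counts(shuffle(mapped))
```

Because each post is mapped independently and each key is reduced independently, both steps could in principle be distributed across workers; the sequential driver here only shows the data flow.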

Author(s):  
L. Thapa

Social media have become an instant communication platform for sharing anything, from personal feelings to matters of public concern, and are among the easiest and most concise ways to deliver information to the masses. With the development of Web 2.0 technologies, more and more emphasis has been given to user input on the web; the concept of the Geoweb is taking shape, and in recent years social media such as Twitter and Flickr have become popular location-based social media with locational functionality enabled. Nepal faced a devastating earthquake on 25 April 2015, resulting in the loss of thousands of lives and the destruction of historical and archaeological sites and property. Many countries around the globe offered immediate help, and many NGOs, INGOs and individuals started rescue operations at once; the concerned authorities and the public used different communication media, such as FM radio stations, television, and social media on the World Wide Web, to gather information about the earthquake and to ease the rescue activities. They also initiated campaigns on social media to raise funds and support the victims. Social media companies such as Facebook and Twitter themselves announced campaigns to help rebuild Nepal. In this context, this paper analyzes Twitter data containing hashtags related to the 2015 Nepal earthquake together with their temporal characteristics: when the messages were generated, where they came from, and how they spread spatially over the internet.


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Yasmeen George ◽  
Shanika Karunasekera ◽  
Aaron Harwood ◽  
Kwan Hui Lim

Abstract A key challenge in mining social media data streams is to identify events which are actively discussed by a group of people in a specific local or global area. Such events are useful for early warning for accident, protest, election or breaking news. However, neither the list of events nor the resolution of both event time and space is fixed or known beforehand. In this work, we propose an online spatio-temporal event detection system using social media that is able to detect events at different time and space resolutions. First, to address the challenge related to the unknown spatial resolution of events, a quad-tree method is exploited in order to split the geographical space into multiscale regions based on the density of social media data. Then, a statistical unsupervised approach is performed that involves Poisson distribution and a smoothing method for highlighting regions with unexpected density of social posts. Further, event duration is precisely estimated by merging events happening in the same region at consecutive time intervals. A post processing stage is introduced to filter out events that are spam, fake or wrong. Finally, we incorporate simple semantics by using social media entities to assess the integrity and accuracy of detected events. The proposed method is evaluated using different social media datasets: Twitter and Flickr for different cities: Melbourne, London, Paris and New York. To verify the effectiveness of the proposed method, we compare our results with two baseline algorithms based on fixed split of geographical space and clustering method. For performance evaluation, we manually compute recall and precision. We also propose a new quality measure named strength index, which automatically measures how accurate the reported event is.
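
The two core ideas in the abstract above, density-based quad-tree splitting and a Poisson test for unexpected post counts, can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' system: the cell capacity, maximum depth, significance level `alpha`, the use of a plain historical mean as the Poisson rate (with no smoothing step), and all function names are choices made for this example.

```python
import math


def quadtree_cells(points, box, capacity=4, depth=0, max_depth=8):
    """Split `box` into quadrants until a cell holds at most `capacity` points.

    `box` is (x0, y0, x1, y1), treated as half-open on its upper edges.
    Returns a list of (box, point_count) leaf cells.
    """
    x0, y0, x1, y1 = box
    inside = [(x, y) for x, y in points if x0 <= x < x1 and y0 <= y < y1]
    if len(inside) <= capacity or depth == max_depth:
        return [(box, len(inside))]
    mx, my = (x0 + x1) / 2, (y0 + y1) / 2
    quadrants = [(x0, y0, mx, my), (mx, y0, x1, my),
                 (x0, my, mx, y1), (mx, my, x1, y1)]
    cells = []
    for quadrant in quadrants:
        cells.extend(quadtree_cells(inside, quadrant, capacity, depth + 1, max_depth))
    return cells


def poisson_tail(k, lam):
    """Return P(X >= k) for X ~ Poisson(lam)."""
    if k <= 0:
        return 1.0
    term = math.exp(-lam)  # P(X = 0)
    cdf = 0.0
    for i in range(k):
        cdf += term
        term *= lam / (i + 1)  # P(X = i + 1) from P(X = i)
    return max(0.0, 1.0 - cdf)


def is_unexpected(count, history, alpha=0.01):
    """Flag a cell whose current count is improbably high given past counts."""
    lam = sum(history) / len(history)  # assumed rate estimate: historical mean
    return poisson_tail(count, lam) < alpha
```

In this sketch a dense area is subdivided into small cells while a sparse area stays coarse, and each leaf cell would then be tested with `is_unexpected` against its own history of counts from earlier time intervals.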


2021 ◽  
Author(s):  
Hansi Hettiarachchi ◽  
Mariam Adedoyin-Olowe ◽  
Jagdev Bhogal ◽  
Mohamed Medhat Gaber

Abstract Social media is becoming a primary medium to discuss what is happening around the world. Therefore, the data generated by social media platforms contain rich information which describes the ongoing events. Further, the timeliness associated with these data is capable of facilitating immediate insights. However, considering the dynamic nature and high volume of data production in social media data streams, it is impractical to filter the events manually and therefore, automated event detection mechanisms are invaluable to the community. Apart from a few notable exceptions, most previous research on automated event detection has focused only on statistical and syntactical features in data and lacked the involvement of underlying semantics, which are important for effective information retrieval from text since they represent the connections between words and their meanings. In this paper, we propose a novel method termed Embed2Detect for event detection in social media by combining the characteristics in word embeddings and hierarchical agglomerative clustering. The adoption of word embeddings gives Embed2Detect the capability to incorporate powerful semantic features into event detection and overcome a major limitation inherent in previous approaches. We evaluated our method on two recent real social media data sets which represent the sports and political domains and also compared the results to several state-of-the-art methods. The obtained results show that Embed2Detect is capable of effective and efficient event detection and it outperforms the recent event detection methods. For the sports data set, Embed2Detect achieved 27% higher F-measure than the best-performing baseline and for the political data set, it was an increase of 29%.
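
To make the combination of word embeddings and hierarchical agglomerative clustering described above more concrete, here is a minimal sketch that groups words by the cosine distance between their vectors. It is not the Embed2Detect algorithm: the average-linkage criterion, the stopping threshold, the function names and the tiny hand-written two-dimensional vectors in the usage note are all assumptions for illustration, whereas a real system would use embeddings learned from the data stream.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm


def average_linkage(c1, c2, vectors):
    """Mean pairwise cosine distance between the words of two clusters."""
    total = sum(cosine_distance(vectors[a], vectors[b]) for a in c1 for b in c2)
    return total / (len(c1) * len(c2))


def agglomerative_clusters(vectors, threshold):
    """Merge the closest pair of clusters until none is within `threshold`."""
    clusters = [[word] for word in vectors]
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = average_linkage(clusters[i], clusters[j], vectors)
                if best is None or d < best[0]:
                    best = (d, i, j)
        if best[0] > threshold:
            break  # remaining clusters are too dissimilar to merge
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters
```

For example, with the made-up vectors `{"goal": (1.0, 0.1), "penalty": (0.9, 0.2), "vote": (0.1, 1.0), "ballot": (0.2, 0.9)}` and a threshold of 0.2, the sports terms and the political terms end up in separate clusters because their vectors point in nearly orthogonal directions.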


2012 ◽  
Vol 7 (1) ◽  
pp. 174-197 ◽  
Author(s):  
Heather Small ◽  
Kristine Kasianovitz ◽  
Ronald Blanford ◽  
Ina Celaya

Social networking sites and other social media have enabled new forms of collaborative communication and participation for users, and created additional value as rich data sets for research. Research based on accessing, mining, and analyzing social media data has risen steadily over the last several years and is increasingly multidisciplinary; researchers from the social sciences, humanities, computer science and other domains have used social media data as the basis of their studies. The broad use of this form of data has implications for how curators address preservation, access and reuse for an audience with divergent disciplinary norms related to privacy, ownership, authenticity and reliability. In this paper, we explore how the characteristics of the Twitter platform, coupled with an ambiguous and evolving understanding of privacy in networked communication, and divergent disciplinary understandings of the resulting data, combine to create complex issues for curators trying to ensure broad-based and ethical reuse of Twitter data. We provide a case study of a specific data set to illustrate how data curators can engage with the topics and questions raised in the paper. While some initial suggestions are offered to librarians and other information professionals who are beginning to receive social media data from researchers, our larger goal is to stimulate discussion and prompt additional research on the curation and preservation of social media data.


2021 ◽  
pp. 0739456X2110442
Author(s):  
Yunmi Park ◽  
Minju Kim ◽  
Jiyeon Shin ◽  
Megan E. Heim LaFrombois

This research examined social media’s role in understanding perceptions about the spaces in which individuals interact, what planners can learn from social media data, and how to use social media to inform urban regeneration efforts. Using Twitter data from 2010 to 2018 recorded in one shrinking U.S. city, Detroit, Michigan, this paper longitudinally investigated the topics that people discuss, their emotions, and the neighborhood conditions associated with these topics and sentiments. Findings demonstrate that neighborhood demographic, socioeconomic, and built environment conditions impact people’s sentiments.


2022 ◽  
pp. 188-205
Author(s):  
Erkan Çiçek ◽  
Uğur Gündüz

Social media has become so much a part of our lives that global pandemics, which also now shape our lives, are inevitably reflected in these networks and in what users share on them. The purpose of this hashtag analysis is to reveal differences in discourse and language in Twitter data and to evaluate the effects of a global pandemic crisis on language, message, and crisis management using social media data. This form of analysis is typically carried out by collecting textual data and then investigating the “sentiment” conveyed. Within the scope of the study, 11,300 Twitter messages posted with the #stayhome hashtag between 30 May 2020 and 6 June 2020 were examined. The impact and reliability of social media in disaster management can be examined through a content analysis based on the semantic analysis of the messages in Twitter posts, together with the phrases used and their frequencies.


Author(s):  
Harshala Bhoir ◽  
K. Jayamalini

Visual sentiment analysis automatically recognizes positive and negative emotions from images, videos, graphics, stickers, etc. To estimate the polarity of the sentiment evoked by an image, most state-of-the-art works exploit the text associated with a social post provided by the user. However, such textual data is typically noisy due to the subjectivity of the user, who usually includes text intended to maximize the diffusion of the post. The proposed system will extract and employ an Objective Text description of an image, automatically derived from its visual content, rather than the classic Subjective Text provided by the user. The proposed system will extract three views of a social media image (a visual view, a subjective text view and an objective text view) and will assign a sentiment polarity of positive, negative or neutral based on a hypothesis table.


Author(s):  
Amrita Mishra ◽  

Sentiment Analysis has paved the way for analyzing the opinions of the masses without territorial limits. With the advent and growth of social media like Twitter, Facebook, WhatsApp and Snapchat, stakeholders and the public often take to these platforms to express their opinions and draw conclusions. While social media data are extremely informative and well connected, the major challenge lies in incorporating efficient text classification strategies that not only overcome the unstructured and voluminous nature of the data but also generate the correct polarity of opinions (i.e. positive, negative, and neutral). This paper provides a brief study of various approaches to sentiment analysis (SA), including Machine Learning, Lexicon Based, and Automatic approaches. The paper also compares positive, negative, and neutral tweets about the Sputnik V, Moderna, and Covaxin vaccines used for preventive and emergency use against COVID-19.


2021 ◽  
Vol 10 (1) ◽  
Author(s):  
Tarek Al Baghal ◽  
Alexander Wenz ◽  
Luke Sloan ◽  
Curtis Jessop

Abstract Linked social media and survey data have the potential to be a unique source of information for social research. While the potential usefulness of this methodology is widely acknowledged, very few studies have explored methodological aspects of such linkage. Respondents produce planned amounts of survey data, but highly variant amounts of social media data. This study explores this asymmetry by examining the amount of social media data available to link to surveys. The extent of variation in the amount of data collected from social media could affect the ability to derive meaningful linked indicators and could introduce possible biases. Linked Twitter data from respondents to two longitudinal surveys representative of Great Britain, the Innovation Panel and the NatCen Panel, show that there is indeed substantial variation in the number of tweets posted and the number of followers and friends respondents have. Multivariate analyses of both data sources show that only a few respondent characteristics have a statistically significant effect on the number of tweets posted, with the number of followers being the strongest predictor of posting in both panels, women posting less than men, and some evidence that people with higher education post less, but only in the Innovation Panel. We use sentiment analyses of tweets to provide an example of how the amount of Twitter data collected can impact outcomes using these linked data sources. Results show that the number of negatively coded tweets is related to general happiness, but the number of positive tweets is not. Taken together, the findings suggest that the amount of data collected from social media which can be linked to surveys is an important factor to consider and indicate the potential for such linked data sources in social research.

