The Problem of Data Cleaning for Knowledge Extraction from Social Media

Author(s):  
Emre Calisir ◽  
Marco Brambilla
Author(s):  
Sunil M. E. ◽  
Vinay S.

Opinion mining, also known as sentimental analysis, is the analysis of sentiment (emotion, affection, experience) towards the target object. In the present era, everyone is interested to know the opinions of others before making a decision or performing a task. Hence, it is necessary to collect the information (features) from relatives, friends, or web. These opinions or feedbacks help them to decide their action. With the advent of social media and use of digital technologies, web is a huge resource for data. However, it is time-consuming to read the data collected from the web and analyze it to arrive at informed decisions. This chapter provides complete overview of tools to simplify the operations of opinion mining like data collection, data cleaning, and visualization of predicted sentiment.


IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Marco Mameli ◽  
Marina Paolanti ◽  
Rocco Pietrini ◽  
Giulia Pazzaglia ◽  
Emanuele Frontoni ◽  
...  

Author(s):  
Abhishek Bhattacharya ◽  
Arijit Ghosal ◽  
Ahmed J. Obaid ◽  
Salahddine Krit ◽  
Vinod Kumar Shukla ◽  
...  

Microblogging, where millions of users exchange messages to share their opinions on different trending and non-trending topics, is one of the popular communication media in recent times. Several researchers are concentrating on these data due to a huge source of information exchanges in online social media. In platforms such as Twitter, dataset-generated lacks coherence, and manually extracting meaning or knowledge from them proves to be painstakingly difficult. It opens up the challenges to the researchers for knowledge extraction driven by a summarization approach. Therefore, automated summary generation tools are recommended to get a meaningful summary out of a given topic becomes crucial in the age of big data. In this work, an unsupervised, extractive summarization model has been proposed. For categorization of data, k-means algorithm has been used, and based on scoring of each document in the corpus, summarization model is designed. The proposed methodology achieves an improved outcome over existing methods, such as lexical rank, sum basic, LSA, etc. evaluated by rouge tool.


ASHA Leader ◽  
2015 ◽  
Vol 20 (7) ◽  
Author(s):  
Vicki Clarke
Keyword(s):  

ASHA Leader ◽  
2013 ◽  
Vol 18 (5) ◽  

As professionals who recognize and value the power and important of communications, audiologists and speech-language pathologists are perfectly positioned to leverage social media for public relations.


2013 ◽  
Vol 44 (1) ◽  
pp. 4
Author(s):  
Jane Anderson
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document