A Comparative Study on Data Cleaning Approaches in Sentiment Analysis

Author(s):  
H. Mohamed Zakir ◽  
S. Vinila Jinny
2016 ◽  
Vol 49 (1) ◽  
pp. 1-26 ◽  
Author(s):  
Nadia Felix F. Da Silva ◽  
Luiz F. S. Coletta ◽  
Eduardo R. Hruschka

2022 ◽  
Vol 6 ◽  
pp. 781-791
Author(s):  
John Paul Miranda ◽  

Purpose: The dataset was collected to examine and identify possible key topics within these texts. Method: Data preparation, including data cleaning, transformation, tokenization, removal of English and Filipino stop words, and word stemming, was applied to the dataset before feeding it to sentiment analysis and the LDA model. Results: The most frequently occurring word in the dataset is "development", and there are three (3) likely topics in the speeches of Philippine presidents: economic development, enhancement of public services, and addressing challenges. Conclusion: The dataset was able to provide valuable insights contained in official documents. The study showed that presidents have used their annual address to express their visions for the country. It also showed that the presidents from 1935 to 2016 faced the same problems during their terms. Recommendations: Future researchers may collect other speeches made by presidents during their terms and combine them with the dataset used in this study to further investigate these important texts by subjecting them to the same methodology. The dataset may be requested from the authors and is recommended for further analysis, for example to determine how the speeches of the presidents reflect the preamble or foundations of the Philippine constitution.
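The preparation steps named in the abstract (cleaning, tokenization, stop-word removal in English and Filipino, stemming, then a word-frequency count) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the stop-word lists are tiny hypothetical samples, `crude_stem` is a simple suffix stripper and not the stemmer used in the study, and the two example sentences are invented. The LDA and sentiment analysis stages are not shown.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word lists; the study's lists are not given.
ENGLISH_STOP_WORDS = {"the", "of", "and", "to", "a", "in", "is", "for", "our", "we"}
FILIPINO_STOP_WORDS = {"ang", "ng", "sa", "mga", "at", "na", "ay", "para"}
STOP_WORDS = ENGLISH_STOP_WORDS | FILIPINO_STOP_WORDS


def tokenize(text):
    """Lowercase the text and keep only runs of ASCII letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())


def crude_stem(token):
    """Strip a few common English suffixes; a stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        # Keep at least three characters so short words are left alone.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, drop stop words from both languages, then stem."""
    return [crude_stem(t) for t in tokenize(text) if t not in STOP_WORDS]


def top_words(documents, n=3):
    """Return the n most frequent stems across all documents."""
    counts = Counter()
    for doc in documents:
        counts.update(preprocess(doc))
    return counts.most_common(n)


# Invented example sentences, not taken from the dataset.
speeches = [
    "The development of our economy is the priority.",
    "Ang development ng mga public services at roads.",
]
print(top_words(speeches, n=1))
```

In practice the stemmed token lists would be passed to a topic-modeling library rather than only counted; the frequency count here mirrors the abstract's report of the most frequent word.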


2021 ◽  
Author(s):  
Nabanita Das ◽  
Saloni Gupta ◽  
Srinjoy Das ◽  
Shuvam Yadav ◽  
Trishika Subramanian ◽  
...  

2021 ◽  
pp. 199-211
Author(s):  
Bachchu Paul ◽  
Sanchita Guchhait ◽  
Tanushree Dey ◽  
Debashri Das Adhikary ◽  
Somnath Bera
