scholarly journals Utilizando Análise de Sentimentos e SVM na Classificação de Tweets Depressivos

Author(s):  
Omar Andres Carmona Cortes ◽  
Wesley Eduardo de Oliveira Melo

The number of depression cases has grown worldwide. The WorldHealth Organization estimates that 5.8% of the Brazilian populationalready present depression symptoms. In the world, 4.8% ofthe entire population has presented some symptoms. These dataare alarming because they represent about 12 million people onlyin Brazil and 368 million worldwide. Therefore, it is essential tobuild applications that adequately identify the population’s feelingsabout depression to drive public health policies. Appropriate policiescan save money on public health and keep people active. Thus,this work investigates how to apply machine learning in classifyingdepression posts on Tweeter. The data were extracted from thesocial media network, reaching a total of 31.177 tweets classified asdepressive and non-depressive. The application was implementedin Python with Pandas and SciKit Learning. Results have shownthat SVM overcomes the Naive Bayes algorithm and can reach anaccuracy of 94%, precision of 91%, a recall of 91%, and an F1 Scoreof 91%.

2020 ◽  
Vol 1 (2) ◽  
pp. 61-66
Author(s):  
Febri Astiko ◽  
Achmad Khodar

This study aims to design a machine learning model of sentiment analysis on Indosat Ooredoo service reviews on social media twitter using the Naive Bayes algorithm as a classifier of positive and negative labels. This sentiment analysis uses machine learning to get patterns an model that can be used again to predict new data.


2021 ◽  
Author(s):  
Carlos Eduardo Beluzo ◽  
Luciana Correia Alves ◽  
Natália Martins Arruda ◽  
Cátia Sepetauskas ◽  
Everton Silva ◽  
...  

ABSTRACTReduction in child mortality is one of the United Nations Sustainable Development Goals for 2030. In Brazil, despite recent reduction in child mortality in the last decades, the neonatal mortality is a persistent problem and it is associated with the quality of prenatal, childbirth care and social-environmental factors. In a proper health system, the effect of some of these factors could be minimized by the appropriate number of newborn intensive care units, number of health care units, number of neonatal incubators and even by the correct level of instruction of mothers, which can lead to a proper care along the prenatal period. With the intent of providing knowledge resources for planning public health policies focused on neonatal mortality reduction, we propose a new data-driven machine leaning method for Neonatal Mortality Rate forecasting called NeMoR, which predicts neonatal mortality rates for 4 months ahead, using NeoDeathForecast, a monthly base time series dataset composed by these factors and by neonatal mortality rates history (2006-2016), having 57,816 samples, for all 438 Brazilian administrative health regions. In order to build the model, Extra-Tree, XGBoost Regressor, Gradient Boosting Regressor and Lasso machine learning regression models were evaluated and a hyperparameters search was also performed as a fine tune step. The method has been validated using São Paulo city data, mainly because of data quality. On the better configuration the method predicted the neonatal mortality rates with a Mean Square Error lower than 0.18. Besides that, the forecast results may be useful as it provides a way for policy makers to anticipate trends on neonatal mortality rates curves, an important resource for planning public health policies.Graphical AbstractHighlightsProposition of a new data-driven approach for neonatal mortality rate forecast, which provides a way for policy-makers to anticipate trends on neonatal mortality rates curves, making a better planning of health policies focused on NMR reduction possible;a method for NMR forecasting with a MSE lower than 0.18;an extensive evaluation of different Machine Learning (ML) regression models, as well as hyperparameters search, which accounts for the last stage in NeMoR;a new time series database for NMR prediction problems;a new features projection space for NMR forecasting problems, which considerably reduces errors in NRM prediction.


In this never-ending social media era it is estimated that over 5 billion people use smartphones. Out of these, there are over 1.5 billion active users in the world. In which we all are a major part and before opening our messages we all are curious about what message we have received. No doubt, we all always hope for a good message to be received. So Sentiment analysis on social media data has been seen by many as an effective tool to monitor user preferences and inclination. Finally, we propose a scalable machine learning model to analyze the polarity of a communicative text using Naive Bayes’ Bernoulli classifier. This paper works on only two polarities that is whether the sentence is positive or negative. Bernoulli classifier is used in this paper because it is best suited for binary inputs which in turn enhances the accuracy of up to 97%.


2021 ◽  
Vol 6 (1) ◽  
Author(s):  
Bustami Yusuf ◽  
Muhammad Zaeki ◽  
Hendri Ahmadian ◽  
Khairan Ar ◽  
Sri Wahyuni

Education is one of the sciences that makes humans much better by learning various scientific disciplines. Al-Quran is one of the sources of knowledge that is believed by Muslims around the world. Because technology has penetrated almost every domain of our lives , including the world of education. Thus, the authors make technology as tool  for researching educational topics in Al-Quran by implementing text exploration .The research was carried out by making some basic words that were related to the subject of education as the keywords in this study. The keywords are “Ajar”, “Bicara”, “Cipta”, “Dengar”, “Ingat” and “Lihat”. Then, the authors implemented the Naïve Bayes Classifier algorithm. To test and evaluate the results, the author used two methods, i.e. recall and precision. The study results are the keyword “cipta” by 3.05 %, “Ingat” 2.25 %, “Ajar” 1.96 %,“Lihat” 0.82 %, finally “Dengar” 0.62% and “Bicara” 0.34% with  total  weight of 3,516 words that  have been filtered. The overall percentage of the results is 9.04% of the total number of words 38,761 in the Al-Quran. For the Naïve Bayes algorithm evaluation method,  the recall and precision scores are 0.605 and 0.366, respectively.


2020 ◽  
Vol 32 (4) ◽  
pp. 163-164
Author(s):  
Jeconiah Louis Dreisbach

The 2019 coronavirus disease (COVID-19) presents a great challenge to developing countries with limited access to public health measures in grassroots communities. The World Health Organization lauded the Vietnamese government for its proactive and steady investment in health facilities that mitigate the risk of the infectious disease in Vietnam. This short communication presents cases that could benchmark public health policies in developing countries.


Author(s):  
Ade Riani ◽  
Yessy Susianto ◽  
Nur Rahman

Heart disease is a disease with a high mortality rate in the world of health. The disease is usually rarely realized the cause. However, there are several parameters that can be used to predict whether a person has a risk of heart disease or not. As for this study, researchers will use several indicators including Age, Sex, Chest pain type, Trestbps, Cholesterol, Fasting blood sugar, Resting ECG, Max heart rate, Exercise-induced angina, Oldpeak, Slope, Number of vessels coloured, and Thal This research will perform calculations using the Data Mining method with the Naive Bayes Algorithm. The results of this study get an accuracy of 86% for the 303 datasets tested. 


Author(s):  
Ahmed T. Shawky ◽  
Ismail M. Hagag

In today’s world using data mining and classification is considered to be one of the most important techniques, as today’s world is full of data that is generated by various sources. However, extracting useful knowledge out of this data is the real challenge, and this paper conquers this challenge by using machine learning algorithms to use data for classifiers to draw meaningful results. The aim of this research paper is to design a model to detect diabetes in patients with high accuracy. Therefore, this research paper using five different algorithms for different machine learning classification includes, Decision Tree, Support Vector Machine (SVM), Random Forest, Naive Bayes, and K- Nearest Neighbor (K-NN), the purpose of this approach is to predict diabetes at an early stage. Finally, we have compared the performance of these algorithms, concluding that K-NN algorithm is a better accuracy (81.16%), followed by the Naive Bayes algorithm (76.06%).


2018 ◽  
Vol 2 (3) ◽  
pp. 153 ◽  
Author(s):  
Muhammad Firman Aji Saputra ◽  
Triyanna Widiyaningtyas ◽  
Aji Prasetya Wibawa

Illiteracy is an inability to recognize characters, both in order to read and write. It is a significant problem for countries all around the world including Indonesia. In Indonesia, illiteracy rate is generally set as an indicator to see whether or not education in Indonesia is successful. If this problem is not going to be overcome, it will affect people’s prosperity. One system that has been used to overcome this problem is prioritizing the treatment from areas with the highest illiteracy rate and followed by areas with lower illiteracy rate. The method is going to be a way easier to be applied if it is supported by classification process. Since the classification process needs a class, and there has not been any fine classification of illiteracy rate, there is needed a clustering process before classification process. This research is aimed to get optimal number of classes through clustering process and know the result of illiteracy classification process. The clustering process is conducted by using k means algorithm, and for the classification process is conducted by using Naïve Bayes algorithm. The testing method used to assess the success of classification process is 10-fold method. Based on the research result, it can be concluded that the optimal illiteracy classes are three classes with the classification accuracy value of 96.4912% and error rate value of 3.5088%. Whereas the classification with two classes get the accuracy value of 93.8596% and error rate value of 6.1404%. And for the classification with five classes get the accuracy value of 90.3509% and error rate value of 9.6491%.


Author(s):  
Edouard Lansiaux ◽  
Noe Tchagaspanian ◽  
Juliette Arnaud ◽  
Pierre Durand ◽  
Mark Changizi ◽  
...  

Let us all take a moment to talk, once again, about this new coronavirus pandemic that the world has been facing since November 2019 and about its global response. After a short period marked by the pandemic underestimation risk by most governments, the Western world went nuts and overreacted, most probably so as not to be accused of inaction. In many cases, the overall benefits of the chosen policies were not sufficiently questioned, which resulted in many side effects on global health .The medical motto “primum non nocere”, a moral principle everyone should at least consider following, was evidently not taken into account. It has been overlooked, and the virus has become an obsession, to the extent that nearly everything else, even the most valuable things in life, is still now under appreciated if not simply ignored. This review highlighted facts against this simplistic, one-dimensional view.


Sign in / Sign up

Export Citation Format

Share Document