Sentiment Analysis of Impact of Technology on Employment from Text on Twitter

Author(s):  
Shahzad Qaiser ◽  
Nooraini Yusoff ◽  
Farzana Kabir Ahmad ◽  
Ramsha Ali

Many studies are under way to analyze user-generated content on social media because of its influence and social ripple effect. Such content carries information about social issues and users’ sentiments toward them. This study aims to analyze people’s sentiments about the impact of technology on employment and about advancements in technology, and to build a machine learning classifier to classify those sentiments. People are becoming anxious and depressed, and some are even dying by suicide, because of unemployment; hence, it is essential to explore this relatively new area of research. The study has two main objectives: 1) to preprocess text collected from Twitter concerning the impact of technology on employment and analyze its sentiment, and 2) to evaluate the performance of a machine learning Naïve Bayes (NB) classifier on the text. To achieve this, a methodology is proposed that includes 1) data collection and preprocessing, 2) sentiment analysis, 3) building a machine learning classifier, and 4) comparing the performance of NB and a support vector machine (SVM). NB and SVM achieved 87.18% and 82.05% accuracy, respectively. The study found that 65% of people hold negative sentiment regarding the impact of technology on employment and technological advancements; hence, people must acquire new skills to minimize the effect of structural unemployment.
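The abstract does not say which NB variant, features, or tokenizer the authors used. As an illustration only, the following is a minimal sketch of a multinomial Naïve Bayes text classifier with add-one smoothing and whitespace tokenization. The function names and the toy tweets are invented for this example and are not taken from the study.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs, labels):
    """Fit a multinomial Naive Bayes model (counts only; smoothing is applied at prediction)."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)
    for text, label in zip(docs, labels):
        word_counts[label].update(text.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return class_counts, word_counts, vocab


def predict_nb(model, text):
    """Return the class with the highest log-posterior for one text."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_counts.items():
        score = math.log(n_docs / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocab:
                continue  # ignore words never seen in training
            # Add-one (Laplace) smoothed log likelihood
            score += math.log((word_counts[label][word] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def accuracy(model, docs, labels):
    """Fraction of texts whose predicted label matches the given label."""
    hits = sum(predict_nb(model, d) == y for d, y in zip(docs, labels))
    return hits / len(labels)


# Invented toy data, NOT from the study
train_docs = [
    "robots took my job",
    "automation will destroy jobs",
    "new technology creates new jobs",
    "excited to learn new skills",
]
train_labels = ["negative", "negative", "positive", "positive"]
model = train_nb(train_docs, train_labels)
print(predict_nb(model, "automation took my job"))
```

Accuracy figures such as those reported above would come from calling `accuracy` on a held-out test set rather than on the training texts.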

Author(s):  
Noor Asyikin Sulaiman ◽  
Md Pauzi Abdullah ◽  
Hayati Abdullah ◽  
Muhammad Noorazlan Shah Zainudin ◽  
Azdiana Md Yusop

An air conditioning system is a complex system and consumes the most energy in a building. Any fault in system operation, such as a faulty cooling tower fan, compressor failure, or a stuck damper, could lead to energy wastage and a reduction in the system’s coefficient of performance (COP). Because of the complexity of the air conditioning system, detecting those faults is hard, as it requires exhaustive inspections. This paper consists of two parts: i) to investigate the impact of different faults related to the air conditioning system on COP, and ii) to analyse the performance of machine learning algorithms in classifying those faults. Three supervised learning classifier models were developed: deep learning, support vector machine (SVM), and multi-layer perceptron (MLP). The performance of each classifier was investigated across six different classes of faults. Results showed that different faults have different negative impacts on the COP. In addition, the three supervised learning classifier models were able to classify all faults at rates above 94%, and MLP produced the highest accuracy and precision of the three.


Hoax news on social media has had a dramatic effect on our society in recent years. Its impact is felt by many people in the form of anxiety, financial loss, and damage to reputation. We therefore need a detection system that can help reduce hoax news on social media. Hoax news classification is one of the stages in building such a system: an unsupervised learning algorithm is used to create the hoax news dataset, machine learning tools are used for data processing, and text processing is used for detection. The system then classifies the input text as hoax or not hoax. Hoax news classification in this study uses six algorithms, namely Support Vector Machine, Naïve Bayes, Decision Tree, Logistic Regression, Stochastic Gradient Descent, and Neural Network (MLP). These algorithms are compared to find the one best suited to detecting hoax news, judged by accuracy, F-measure, precision, and recall. Testing shows that the NN-MLP algorithm averages 93% for accuracy, F-measure, and precision, the highest compared with the five other algorithms, while the highest recall, 94%, is obtained by SVM. The results of this experiment show that different classifiers behave differently, and that the more hoax data is used as training data, the more accurate the system becomes.


2021 ◽  
Vol 13 (1) ◽  
pp. 19
Author(s):  
Ola Karajeh ◽  
Dirar Darweesh ◽  
Omar Darwish ◽  
Noor Abu-El-Rub ◽  
Belal Alsinglawi ◽  
...  

Social media sites are considered one of the most important sources of data in many fields, such as health, education, and politics. While surveys provide explicit answers to specific questions, posts in social media have the same answers implicitly occurring in the text. This research aims to develop a method for extracting implicit answers from large tweet collections, and to demonstrate this method for an important concern: the problem of heart attacks. The approach is to collect tweets containing “heart attack” and then select from those the ones with useful information. Informational tweets are those which express real heart attack issues, e.g., “Yesterday morning, my grandfather had a heart attack while he was walking around the garden.” On the other hand, there are non-informational tweets such as “Dropped my iPhone for the first time and almost had a heart attack.” The starting point was to manually classify around 7000 tweets as either informational (11%) or non-informational (89%), thus yielding a labeled dataset to use in devising a machine learning classifier that can be applied to our large collection of over 20 million tweets. Tweets were cleaned and converted to a vector representation suitable for different machine learning algorithms: deep neural networks, support vector machine (SVM), J48 decision tree, and naïve Bayes. Our experimentation aimed to find the best algorithm for building a high-quality classifier. This involved splitting the labeled dataset, with 2/3 used to train the classifier and 1/3 used for evaluation, in addition to cross-validation. The deep neural network (DNN) classifier obtained the highest accuracy (95.2%). It also obtained the highest F1-scores, 73.6% and 97.4% for the informational and non-informational classes, respectively.
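With an 11%/89% class split, overall accuracy and per-class F1 can diverge considerably, which is consistent with the gap between the reported accuracy and the informational-class F1-score. The sketch below shows how per-class precision, recall, and F1 follow from confusion counts. The counts are hypothetical, chosen only to mimic an 11% minority class, and are not the study's results.

```python
def precision_recall_f1(tp, fp, fn):
    """Precision, recall, and F1 for one class from its confusion counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Hypothetical counts for 1000 test tweets, 110 of them informational.
# "Positive" here means the informational class.
tp, fn, fp, tn = 80, 30, 20, 870

overall_accuracy = (tp + tn) / (tp + tn + fp + fn)

# Informational class: its errors are fp (false alarms) and fn (misses)
_, _, f1_informational = precision_recall_f1(tp, fp, fn)

# Non-informational class: roles swap, so tn becomes its true positives
_, _, f1_non_informational = precision_recall_f1(tn, fn, fp)

print(f"accuracy={overall_accuracy:.3f}")
print(f"F1 informational={f1_informational:.3f}")
print(f"F1 non-informational={f1_non_informational:.3f}")
```

The majority class dominates accuracy, so the minority-class F1 is the more informative figure when the goal is to find the rare informational tweets.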


Author(s):  
Noraini Seman ◽  
Nurul Atiqah Razmi

A huge amount of data is generated every minute through social networking and content sharing on social media sites, in structured, unstructured, or semi-structured form. One of the most widely used social media sites is Twitter, where millions of unstructured tweets are generated every day. These tweets, which carry people’s opinions, can be used to extract their sentiments. Sentiment analysis helps organizations improve their products and make changes on demand to increase their profit. In this paper, three machine learning algorithms, Support Vector Machine (SVM), Decision Trees (DT), and Naive Bayes (NB), are applied to classify the sentiments of Twitter data. The purpose of this research is to compare the outcomes of these algorithms and identify the machine learning method that gives the most accurate and efficient results for classifying Twitter data. Our experimental results show that the same preprocessing methods affect classifier performance similarly across different datasets. The results show that SVM achieves 64.96%, 71.26%, and 91.25% precision on the three datasets, which is better than the other two algorithms. The overall recall and F-measure of SVM are also greater than those of NB and DT on the three datasets. However, it is important to study currently available preprocessing techniques further, as they may help improve the results of various classifiers.


Author(s):  
Yuming Zhang ◽  
Fan Yang

Companies use corporate social responsibility (CSR) disclosures to communicate their social and environmental policies, practices, and performance to stakeholders. Although the determinants and outcomes of CSR activities are well understood, we know little about how companies use CSR communication to manage a crisis. The few relevant CSR studies have focused on the pressure on corporations exerted by governments, customers, the media, or the public. Although investors have a significant influence on firm value, this stakeholder group has been neglected in research on CSR disclosure. Grounded in legitimacy theory and agency theory, this study uses a sample of Chinese public companies listed on the Shanghai Stock Exchange to investigate CSR disclosure in response to social media criticism posted by investors. The empirical findings show that investors’ social media criticism not only motivates companies to disclose their CSR activities but also increases the substantiveness of their CSR reports, demonstrating that companies’ CSR communication in response to a crisis is substantive rather than merely symbolic. We also find that the impact of social media criticism on CSR disclosure is heterogeneous. Non-state-owned enterprises, companies in regions with high levels of environmental regulations, and companies in regions with local government concern about social issues are most likely to disclose CSR information and report substantive CSR activities. We provide an in-depth analysis of corporate CSR strategies for crisis management and show that crises initiated by investors on social media provide opportunities for corporations to improve their CSR engagement.


Electronics ◽  
2020 ◽  
Vol 9 (2) ◽  
pp. 374 ◽  
Author(s):  
Sudhanshu Kumar ◽  
Monika Gahalawat ◽  
Partha Pratim Roy ◽  
Debi Prosad Dogra ◽  
Byung-Gyu Kim

Sentiment analysis is a rapidly growing field of research due to the explosive growth in digital information. In the modern world of artificial intelligence, sentiment analysis is one of the essential tools for extracting emotion information from massive data. Sentiment analysis is applied to a variety of user data, from customer reviews to social network posts. To the best of our knowledge, there is little work on sentiment analysis based on the categorization of users by demographics. Demographics play an important role in deciding the marketing strategies for different products. In this study, we explore the impact of age and gender on sentiment analysis, as this can help e-commerce retailers market their products to specific demographics. The dataset is created by collecting book reviews from Facebook users, who were asked to answer a questionnaire about their preferences in books along with their age group and gender. Next, the paper analyzes the segmented data for sentiment by age group and gender. Finally, sentiment analysis is performed using different Machine Learning (ML) approaches, including maximum entropy, support vector machine, convolutional neural network, and long short-term memory, to study the impact of age and gender on user reviews. Experiments have been conducted to identify new insights into the effect of age and gender on sentiment analysis.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Arturo Moncada-Torres ◽  
Marissa C. van Maaren ◽  
Mathijs P. Hendriks ◽  
Sabine Siesling ◽  
Gijs Geleijnse

Abstract Cox Proportional Hazards (CPH) analysis is the standard for survival analysis in oncology. Recently, several machine learning (ML) techniques have been adapted for this task. Although they have shown to yield results at least as good as classical methods, they are often disregarded because of their lack of transparency and little to no explainability, which are key for their adoption in clinical settings. In this paper, we used data from the Netherlands Cancer Registry of 36,658 non-metastatic breast cancer patients to compare the performance of CPH with ML techniques (Random Survival Forests, Survival Support Vector Machines, and Extreme Gradient Boosting [XGB]) in predicting survival using the c-index. We demonstrated that in our dataset, ML-based models can perform at least as good as the classical CPH regression (c-index ∼ 0.63), and in the case of XGB even better (c-index ∼ 0.73). Furthermore, we used Shapley Additive Explanation (SHAP) values to explain the models’ predictions. We concluded that the difference in performance can be attributed to XGB’s ability to model nonlinearities and complex interactions. We also investigated the impact of specific features on the models’ predictions as well as their corresponding insights. Lastly, we showed that explainable ML can generate explicit knowledge of how models make their predictions, which is crucial in increasing the trust and adoption of innovative ML techniques in oncology and healthcare overall.
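For readers unfamiliar with the c-index, the following is a minimal sketch of Harrell's concordance index in plain Python. It assumes that a higher risk score means a shorter expected survival, and it skips pairs with tied event times. It is a simplification for illustration, not the implementation used in the paper, and the toy data are invented.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's c-index: share of comparable pairs that the model ranks correctly.

    times       -- observed follow-up time per subject
    events      -- 1 if the event was observed, 0 if the subject was censored
    risk_scores -- model output, higher meaning higher risk (assumed convention)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when subject i had an observed event
            # strictly before subject j's follow-up time.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Invented toy data: four subjects, the third one censored
times = [1, 2, 3, 4]
events = [1, 1, 0, 1]
print(concordance_index(times, events, [0.9, 0.7, 0.5, 0.1]))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which puts the reported values of roughly 0.63 and 0.73 in context.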


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Aasif Ahmad Mir ◽  
Sevukan Rathinam ◽  
Sumeer Gul

Purpose: Twitter is gaining popularity as a microblogging and social networking service to discuss various social issues. Coronavirus disease 2019 (COVID-19) has become a global pandemic and is discussed worldwide. Social media is an instant platform to deliberate various dimensions of COVID-19. The purpose of the study is to explore and analyze the public sentiments related to COVID-19 vaccines across the Twitter messages (positive, neutral, and negative) and the impact tweets make across digital social circles.
Design/methodology/approach: To fetch the vaccine-related posts, a manual examination of randomly selected 500 tweets was carried out to identify the popular hashtags relevant to the vaccine conversation. It was found that the hashtags “covid19vaccine” and “coronavirusvaccine” were the two popular hashtags used to discuss the communications related to COVID-19 vaccines. 23,575 global tweets available in public domain were retrieved through “Twitter Application Programming Interface” (API), using “Orange Software”, an open-source machine learning, data visualization and data mining toolkit. The study was confined to the tweets posted in English language only. The default data cleaning and preprocessing techniques available in the “Orange Software” were applied to the dataset, which include “transformation”, “tokenization” and “filtering”. The “Valence Aware Dictionary for sEntiment Reasoning” (VADER) tool was used for classification of tweets to determine the tweet sentiments (positive, neutral and negative) as well as the degree of sentiments (compound score also known as sentiment score). To assess the influence/impact of tweets account wise (verified and unverified) and sentiment wise (positive, neutral, and negative), the retweets and likes, which offer a sort of reward or acknowledgment of tweets, were used.
Findings: A gradual decline in the number of tweets over the time is observed. Majority (11,205; 47.52%) of tweets express positive sentiments, followed by neutral (7,948; 33.71%) and negative sentiments (4,422; 18.75%), respectively. The study also signifies a substantial difference between the impact of tweets tweeted by verified and unverified users. The tweets related to verified users have a higher impact both in terms of retweets (65.91%) and likes (84.62%) compared to the tweets tweeted by unverified users. Tweets expressing positive sentiments have the highest impact both in terms of likes (mean = 10.48) and retweets (mean = 3.07) compared to those that express neutral or negative sentiments.
Research limitations/implications: The main limitation of the study is that the sentiments of the people expressed over one single social platform, that is, Twitter have been studied which cannot generalize the global public perceptions. There can be a variation in the results when the datasets from other social media platforms will be studied.
Practical implications: The study will help to know the people's sentiments and beliefs toward the COVID-19 vaccines. Sentiments that people hold about the COVID-19 vaccines are studied, which will help health policymakers understand the polarity (positive, negative, and neutral) of the tweets and thus see the public reaction and reflect the types of information people are exposed to about vaccines. The study can aid the health sectors to intensify positive messages and eliminate negative messages for an enhanced vaccination uptake. The research can also help design more operative vaccine-advocating communication by customizing messages using the obtained knowledge from the sentiments and opinions about the vaccines.
Originality/value: The paper focuses on an essential aspect of COVID-19 vaccines and how people express themselves (positively, neutrally and negatively) on Twitter.
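The abstract does not state how compound scores were mapped to the three sentiment classes. A commonly used convention for VADER is a cut-off of ±0.05, and the sketch below assumes that convention; the function name and the threshold parameter are illustrative, and the study's actual cut-offs may differ. The second part recomputes the class shares from the tweet counts reported in the findings.

```python
def label_from_compound(compound, threshold=0.05):
    """Map a VADER-style compound score in [-1, 1] to a sentiment label.

    The +/-0.05 threshold is an assumed convention, not a value from the study.
    """
    if compound >= threshold:
        return "positive"
    if compound <= -threshold:
        return "negative"
    return "neutral"


# Tweet counts as reported in the findings
counts = {"positive": 11205, "neutral": 7948, "negative": 4422}
total = sum(counts.values())
shares = {label: 100 * n / total for label, n in counts.items()}

for label, share in shares.items():
    print(f"{label}: {counts[label]} tweets, {share:.2f}%")
```

The three counts sum to the 23,575 retrieved tweets, and the recomputed shares agree with the reported percentages to within about 0.01 percentage point.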


2018 ◽  
Vol 34 (3) ◽  
pp. 569-581 ◽  
Author(s):  
Sujata Rani ◽  
Parteek Kumar

Abstract In this article, an innovative approach to sentiment analysis (SA) is presented. The proposed system handles Romanized or abbreviated text and spelling variations when performing sentiment analysis. The training data set of 3,000 movie reviews and tweets has been manually labeled by native speakers of Hindi into three classes, i.e. positive, negative, and neutral. The system uses the WEKA (Waikato Environment for Knowledge Analysis) tool to convert these string data into numerical matrices and applies three machine learning techniques, i.e. Naive Bayes (NB), J48, and support vector machine (SVM). The proposed system has been tested on 100 movie reviews and tweets, and SVM performed best among the classifiers, with an accuracy of 68% for movie reviews and 82% for tweets. The results of the proposed system are very promising and can be used in emerging applications such as SA of product reviews and social media analysis. Additionally, the proposed system can serve other cultural and social purposes, such as predicting and countering riots.


Materials ◽  
2021 ◽  
Vol 14 (21) ◽  
pp. 6713
Author(s):  
Omid Khalaj ◽  
Moslem Ghobadi ◽  
Ehsan Saebnoori ◽  
Alireza Zarezadeh ◽  
Mohammadreza Shishesaz ◽  
...  

Oxide Precipitation-Hardened (OPH) alloys are a new generation of Oxide Dispersion-Strengthened (ODS) alloys recently developed by the authors. The mechanical properties of this group of alloys are significantly influenced by the chemical composition and appropriate heat treatment (HT). The main steps in producing OPH alloys consist of mechanical alloying (MA) and consolidation, followed by hot rolling. Toughness was obtained from standard tensile test results for different variants of OPH alloy to understand their mechanical properties. Three machine learning techniques were developed using experimental data to simulate different outcomes. The effect of each parameter on the toughness of OPH alloys is discussed. Using the authors’ experimental results, the composition of the OPH alloys (Al, Mo, Fe, Cr, Ta, Y, and O), the HT conditions, and the mechanical alloying (MA) were used as inputs to train the models, and toughness was set as the output. The results demonstrated that all three models are suitable for predicting the toughness of OPH alloys, and the models fulfilled all the desired requirements. However, several criteria confirmed that the adaptive neuro-fuzzy inference system (ANFIS) model performs better and simulates the data more accurately. The mean square error (MSE) for the artificial neural network (ANN), ANFIS, and support vector regression (SVR) models was 459.22, 0.0418, and 651.68, respectively. After the sensitivity analysis (SA), an optimized ANFIS model was obtained with an MSE of 0.003, which demonstrated that HT temperature is the most significant of these parameters and plays a critical role in training the data sets.

