scholarly journals Exploring Impact of Age and Gender on Sentiment Analysis Using Machine Learning

Electronics ◽  
2020 ◽  
Vol 9 (2) ◽  
pp. 374 ◽  
Author(s):  
Sudhanshu Kumar ◽  
Monika Gahalawat ◽  
Partha Pratim Roy ◽  
Debi Prosad Dogra ◽  
Byung-Gyu Kim

Sentiment analysis is a rapidly growing field of research due to the explosive growth in digital information. In the modern world of artificial intelligence, sentiment analysis is one of the essential tools to extract emotion information from massive data. Sentiment analysis is applied to a variety of user data from customer reviews to social network posts. To the best of our knowledge, there is less work on sentiment analysis based on the categorization of users by demographics. Demographics play an important role in deciding the marketing strategies for different products. In this study, we explore the impact of age and gender in sentiment analysis, as this can help e-commerce retailers to market their products based on specific demographics. The dataset is created by collecting reviews on books from Facebook users by asking them to answer a questionnaire containing questions about their preferences in books, along with their age groups and gender information. Next, the paper analyzes the segmented data for sentiments based on each age group and gender. Finally, sentiment analysis is done using different Machine Learning (ML) approaches including maximum entropy, support vector machine, convolutional neural network, and long short term memory to study the impact of age and gender on user reviews. Experiments have been conducted to identify new insights into the effect of age and gender for sentiment analysis.

2021 ◽  
Vol 11 (10) ◽  
pp. 4443
Author(s):  
Rokas Štrimaitis ◽  
Pavel Stefanovič ◽  
Simona Ramanauskaitė ◽  
Asta Slotkienė

Financial area analysis is not limited to enterprise performance analysis. It is worth analyzing as wide an area as possible to obtain the full impression of a specific enterprise. News website content is a datum source that expresses the public’s opinion on enterprise operations, status, etc. Therefore, it is worth analyzing the news portal article text. Sentiment analysis in English texts and financial area texts exist, and are accurate, the complexity of Lithuanian language is mostly concentrated on sentiment analysis of comment texts, and does not provide high accuracy. Therefore in this paper, the supervised machine learning model was implemented to assign sentiment analysis on financial context news, gathered from Lithuanian language websites. The analysis was made using three commonly used classification algorithms in the field of sentiment analysis. The hyperparameters optimization using the grid search was performed to discover the best parameters of each classifier. All experimental investigations were made using the newly collected datasets from four Lithuanian news websites. The results of the applied machine learning algorithms show that the highest accuracy is obtained using a non-balanced dataset, via the multinomial Naive Bayes algorithm (71.1%). The other algorithm accuracies were slightly lower: a long short-term memory (71%), and a support vector machine (70.4%).


2020 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Giridhar B. Kamath ◽  
Shirshendu Ganguli ◽  
Simon George

PurposeThis paper tests and validates a conceptual model linking the attachment points, team identification, attitude towards the team sponsors and the behavioural intentions in the context of Indian Premier League (IPL), while testing for the moderating effects of age and gender.Design/methodology/approachData were collected from 1,053 participants through both online and offline survey and then analyzed using exploratory factor analysis (EFA) and structural equation modelling (SEM).FindingsAttachment points influence the formation of team identification, which, in turn, affect the attitude towards the team sponsors. Attitude towards the team sponsors influence the behavioural intentions. Player attachment influences team identification the most. Age and gender have a moderating effect on the constructs of the study. Team identification in females is stronger because of attachment to sports, whereas males have stronger team identification based on player attachment. Males have a stronger intention to spread positive word of mouth (WOM) about sponsor products as compared to the female respondents. The younger age group of less than 21 years has more intention to spread positive WOM compared to the other age groups considered in the study.Practical implicationsThis study contributes towards sports sponsorship research and the paradigms of social identity and attachment theories. Moreover, it will also help the marketers (sponsors) in IPL to strategically market their brands.Originality/valueThis is the first study to investigate the impact of attachment points on sponsorship outcomes in the context of IPL. Further, it is also the first to investigate the purchase intentions and WOM for the team sponsors in IPL. The multi-group analysis results will provide insights into marketers to better understand IPL viewers' segments and their behaviour.


Computers ◽  
2019 ◽  
Vol 8 (1) ◽  
pp. 4 ◽  
Author(s):  
Jurgita Kapočiūtė-Dzikienė ◽  
Robertas Damaševičius ◽  
Marcin Woźniak

We describe the sentiment analysis experiments that were performed on the Lithuanian Internet comment dataset using traditional machine learning (Naïve Bayes Multinomial—NBM and Support Vector Machine—SVM) and deep learning (Long Short-Term Memory—LSTM and Convolutional Neural Network—CNN) approaches. The traditional machine learning techniques were used with the features based on the lexical, morphological, and character information. The deep learning approaches were applied on the top of two types of word embeddings (Vord2Vec continuous bag-of-words with negative sampling and FastText). Both traditional and deep learning approaches had to solve the positive/negative/neutral sentiment classification task on the balanced and full dataset versions. The best deep learning results (reaching 0.706 of accuracy) were achieved on the full dataset with CNN applied on top of the FastText embeddings, replaced emoticons, and eliminated diacritics. The traditional machine learning approaches demonstrated the best performance (0.735 of accuracy) on the full dataset with the NBM method, replaced emoticons, restored diacritics, and lemma unigrams as features. Although traditional machine learning approaches were superior when compared to the deep learning methods; deep learning demonstrated good results when applied on the small datasets.


2008 ◽  
Vol 24 (1) ◽  
pp. 14-21 ◽  
Author(s):  
Sven Brändström ◽  
Sören Sigvardsson ◽  
Per-Olof Nylander ◽  
Jörg Richter

Abstract. In order to establish new norms of the Swedish version of the Temperament and Character Inventory (TCI), data from 2,209 Swedish individuals (age between 13 and 80) was analyzed. The second aim was to evaluate the impact of age and gender on the questionnaire scores. The third aim was to investigate whether the TCI can be meaningfully applied to adolescents in personality assessment as a basis for further research and clinical studies. Age and gender showed independent effects on personality dimensions, which implies that age and gender specific norms have to be established for the TCI. Furthermore, the results in terms of inconsistencies in the correlational and factorial structure, as well as low internal consistency scores in the younger age groups, suggest that the adult version of the TCI should not be applied below the age of 17; for these age groups we recommend the use of the junior TCI (JTCI). The inventory is under further development and several items are in need of revision in order to create less complicated formulations, enabling an improvement in the psychometrics.


Author(s):  
Ergün Yücesoy

In this study, the classification of the speakers according to age and gender was discussed. Age and gender classes were first examined separately, and then by combining these classes a classification with a total of 7 classes was made. Speech signals represented by Mel-Frequency Cepstral Coefficients (MFCC) and delta parameters were converted into Gaussian Mixture Model (GMM) mean supervectors and classified with a Support Vector Machine (SVM). While the GMM mean supervectors were formed according to the Maximum-a-posteriori (MAP) adaptive GMM-Universal Background Model (UBM) configuration, the number of components was changed from 16 to 512, and the optimum number of components was decided. Gender classification accuracy of the system developed using aGender dataset was measured as 99.02% for two classes and 92.58% for three classes and age group classification accuracy was measured as 67.03% for female and 63.79% for male. In the classification of age and gender classes together in one step, an accuracy of 61.46% was obtained. In the study, a two-level approach was proposed for classifying age and gender classes together. According to this approach, the speakers were first divided into three classes as child, male and female, then males and females were classified according to their age groups and thus a 7-class classification was realized. This two-level approach was increased the accuracy of the classification in all other cases except when 32-component GMMs were used. While the highest improvement of 2.45% was achieved with 64 component GMMs, an improvement of 0.79 was achieved with 256 component GMMs.


2022 ◽  
Vol 12 (2) ◽  
pp. 894
Author(s):  
Aušrius Juozapavičius ◽  
Agnė Brilingaitė ◽  
Linas Bukauskas ◽  
Ricardo Gregorio Lugo

Password hygiene plays an essential part in securing systems protected with single-factor authentication. A significant fraction of security incidents happen due to weak or reused passwords. The reasons behind differences in security vulnerable behaviour between various user groups remains an active research topic. The paper aims to identify the impact of age and gender on password strength using a large password dataset. We recovered previously hashed passwords of 102,120 users from a leaked customer database of a car-sharing company. Although the measured effect size was small, males significantly had stronger passwords than females for all age groups. Males aged 26–45 were also significantly different from all other groups, and password complexity decreased with age for both genders equally. Overall, very weak password hygiene was observed, 72% of users based their password on a word or used a simple sequence of digits, and passwords of over 39% of users were found in word lists of previous leaks.


2020 ◽  
Vol 16 ◽  
Author(s):  
Abdullah S. Alghamdi ◽  
Abdulaziz Alqadi ◽  
Richard O. Jenkins ◽  
Parvez I. Haris

Background: Glycated haemoglobin (HbA1c) is the gold standard measurement in the screening, diagnosis and monitoring of diabetes mellitus. Saudi Arabia has a high prevalence of diabetes mellitus that is expected to rise, and the HbA1c test is commonly used in the screening, diagnosis and monitoring of diabetes. Objective: his study aims to assess the impact of age and gender on HbA1c levels, and the influence of menopausal status on HbA1c variation in a large group of Saudis. Method: Age, gender, and HbA1c results of 168,614 Saudi adult individuals were obtained from their medical records. Patients’ records were extracted irrespective of their status regarding presence of diabetes and status of glycaemic control. Linear regression models were used for predicting HbA1c from age and gender, and their interaction term. HbA1c levels were compared between genders in different age groups and different HbA1c categories. Results and Discussion: There was a statistically significant positive correlation between age and HbA1c levels, where for each ten years increase in age HbA1c increased by 0.35%. Although the overall mean HbA1c in women was significantly lower than in men (P < 0.001), women show a significant increase in HbA1c with increased age compared to men (B = 0.014, P < 0.001). Furthermore, the mean HbA1c levels in age group > 50 years was significantly higher than before that age (P < 0.001). Thus, HbA1c increased by 1.118% in age > 50 years group compared to age ≤ 50 years, and this increase in HbA1c was significantly higher in women compared to men (B = 0.495, P < 0.001). Conclusion: HbA1c levels are lower in women before the estimated menopausal age, which should be taken into consideration when using HbA1c for screening, diagnosis, and monitoring of diabetes in Saudi adult women. The short lifespan of red blood cells, due to loss of blood through menstruation, in women before menopause age, is a possible reason for these variations.


2008 ◽  
Vol 16 (1) ◽  
pp. 24-41 ◽  
Author(s):  
Caroline W. Stegink Jansen ◽  
Bruce R. Niebuhr ◽  
Daniel J. Coussirat ◽  
Dana Hawthorne ◽  
Laura Moreno ◽  
...  

This cross-sectional study aimed to assess the impact of age and gender on 4 measures of grip and pinch force of well elderly community dwellers and to provide normative values. The hypotheses were that age and gender affect pinch and grip force and that these 2 factors might interact. Hand strength of 224 seniors 65–92 years old was tested. Grip and pinch force decreased in successively older age groups past 65 years. Men’s grip force exceeded that of women in all age groups. Men’s hand-force decline was steeper than that of women over successive age groups, suggesting that gender differences in force decreased with age. Trends were the same for all 4 types of grip- and pinch-force measurement but were most clearly visible in grip and key-pinch force. Norms were provided for seniors age 65–85+ years in 5-yr increments.


2021 ◽  
Vol 7 ◽  
pp. e813
Author(s):  
Anandan Chinnalagu ◽  
Ashok Kumar Durairaj

Customer satisfaction and their positive sentiments are some of the various goals for successful companies. However, analyzing customer reviews to predict accurate sentiments have been proven to be challenging and time-consuming due to high volumes of collected data from various sources. Several researchers approach this with algorithms, methods, and models. These include machine learning and deep learning (DL) methods, unigram and skip-gram based algorithms, as well as the Artificial Neural Network (ANN) and bag-of-word (BOW) regression model. Studies and research have revealed incoherence in polarity, model overfitting and performance issues, as well as high cost in data processing. This experiment was conducted to solve these revealing issues, by building a high performance yet cost-effective model for predicting accurate sentiments from large datasets containing customer reviews. This model uses the fastText library from Facebook’s AI research (FAIR) Lab, as well as the traditional Linear Support Vector Machine (LSVM) to classify text and word embedding. Comparisons of this model were also done with the author’s a custom multi-layer Sentiment Analysis (SA) Bi-directional Long Short-Term Memory (SA-BLSTM) model. The proposed fastText model, based on results, obtains a higher accuracy of 90.71% as well as 20% in performance compared to LSVM and SA-BLSTM models.


2022 ◽  
Vol 9 (1) ◽  
pp. 33-38
Author(s):  
Hamzullah Khan ◽  
Mohammad Basharat Khan ◽  
Shahtaj Khan ◽  
Saiqa Zahoor ◽  
Anwar Khan Wazir

OBJECTIVES:  To analyze the impact of age and gender on iron stores in a population of the Nowshera region. METHODOLOGY: This cross sectional study was conducted in the Department of Pathology Qazi Hussain Ahmed Medical Complex Nowshera from 1st January 2019 to 31st March 2020. All patients were selected by convenience sampling in the Pathology department irrespective of age and gender.   Both descriptive and inferential statistics were applied to analyze data by the latest SPSS version 25.  RESULTS: Out of the total study population males were 70 (27.1%) and females 188 (77.9%) with median age 30 years.  The median ferritin level was 12.8 ng/ml. Out of total, 142 (55%) of cases were with serum ferritin less than 15ng/ml. A significant (p=0.03) gender based median ferritin level difference was observed with 1.5 times more probability of low iron stores in females as compared to males (OR=1.5). No statistically significant difference in body iron stores exists in different age groups. CONCLUSION:  A significant difference was noted in the iron stores in gender groups and the probability of depleted/low iron stores was higher in female gender as compared to male gender in all age groups in our population.    


Sign in / Sign up

Export Citation Format

Share Document