Identifying top Chinese network buzzwords from social media big data set based on time-distribution features

The term Big Data refers to large data sets that occur in many different forms. In recent years, most organizations have generated vast amounts of data in different forms, which gives rise to the characteristics of volume, variety, velocity, and veracity. The volume aspect of Big Data concerns data set maintenance: the data would normally be processed in a database, but its volume is more than a traditional database can handle. Big Data comprises structured, unstructured, and semi-structured data. Working with Big Data involves programming, data warehousing, computational frameworks, quantitative aptitude and statistics, and business knowledge. Within Big Data analytics, predictive analytics and social media analytics are widely used to determine patterns or trends that are about to emerge. This chapter mainly deals with the tools and techniques of big data analytics for various applications.

Parteikommunikation im Zeitalter von Social Media

10.5771/9783748905424 ◽

2020 ◽

Author(s):

Mario Datts

Keyword(s):

Social Media ◽

Big Data ◽

Political Parties ◽

Explanatory Model ◽

Election Campaigns ◽

Data Set ◽

Comprehensive Data ◽

Party Members ◽

District Offices

How active are the local branches of political parties on social media? Do such parties use social media on the training ground of democracy? This study answers these questions using a comprehensive data set consisting of big data and data from surveys. It identifies political parties’ key reasons for using social media by developing and examining a complex explanatory model, the results of which reveal that the majority of parties’ district offices are active on social media, for example on WhatsApp, Twitter and YouTube in addition to Facebook. One key reason for them using social media sites is their desire to meet the expectations of their own party members. Furthermore, they appear to use social media in their election campaigns and because of their general distrust of the conventional media’s reporting. What is noticeable is that the local divisions of the AfD in particular seem to be extremely successful in using social media.

A comparison of sentiment analysis techniques with a social media big data set

Suid-Afrikaanse Tydskrif vir Natuurwetenskap en Tegnologie ◽

10.4102/satnt.v35i1.1404 ◽

2016 ◽

Vol 35 (1) ◽

Author(s):

I. Van der Linde ◽

E. Kotzé ◽

G. Dollman

Keyword(s):

Social Media ◽

Big Data ◽

Sentiment Analysis ◽

Data Set ◽

Analysis Techniques

No abstract available.

Performance Analysis of Machine Learning Algorithms for Big Data Classification

International Journal of E-Health and Medical Communications ◽

10.4018/ijehmc.20210701.oa4 ◽

2021 ◽

Vol 12 (4) ◽

pp. 60-75

Author(s):

Sanjeev Kumar Punia ◽

Manoj Kumar ◽

Thompson Stephan ◽

Ganesh Gopal Deverajan ◽

Rizwan Patan

Keyword(s):

Machine Learning ◽

Social Media ◽

Big Data ◽

Research Data ◽

Structured Data ◽

Unstructured Data ◽

Support Vector ◽

Classification Algorithms ◽

Data Set ◽

Machine Learning Classification

Broadly, three kinds of machine learning classification algorithms are used to discover correlations, hidden patterns, and other useful information in the varied data sets known as big data. Today, Twitter, Facebook, Instagram, and many other social media networks are used to collect unstructured data. Converting unstructured data into structured data or meaningful information is a very tedious task, and different machine learning classification algorithms are used to perform this conversion. In this paper, the authors first collect unstructured research data from a frequently used social media network (i.e., Twitter) by using a Twitter application program interface (API) stream. Secondly, they apply different machine learning classification algorithms (supervised, unsupervised, and reinforcement), such as decision trees (DT), neural networks (NN), support vector machines (SVM), naive Bayes (NB), linear regression (LR), and k-nearest neighbor (K-NN), to the collected research data set. The paper concludes with a comparison of the different machine learning classification algorithms.
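As a rough illustration of one of the classifier families named in this abstract, the following is a minimal sketch of a multinomial naive Bayes text classifier in plain Python. It is not the authors' implementation: the function names, the toy sentences, and the labels are all hypothetical, and the paper's Twitter API collection step is not reproduced here.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(samples):
    """Count documents per label and words per label from (text, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in samples:
        label_counts[label] += 1
        for word in text.lower().split():
            word_counts[label][word] += 1
            vocab.add(word)
    return label_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log-posterior score for the text."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, doc_count in label_counts.items():
        score = math.log(doc_count / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocab:
                continue  # skip words never seen during training
            # Laplace (add-one) smoothing avoids zero probabilities
            likelihood = (word_counts[label][word] + 1) / (total_words + len(vocab))
            score += math.log(likelihood)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training data (hypothetical, not from the paper)
train = [
    ("great match tonight", "sports"),
    ("team won the match", "sports"),
    ("new phone released today", "tech"),
    ("phone update fixes bugs", "tech"),
]
model = train_naive_bayes(train)
prediction = predict(model, "the team played a great match")
print(prediction)
```

The same train-then-predict shape applies to the other supervised classifiers in the list (DT, SVM, K-NN); only the model fitted inside the training step changes.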

DYNAMIC EXTRACTION AND ANALYTICS OF BIG DATA FROM CLOUD AND SOCIAL MEDIA INTEGRATED PLATFORMS

Advances in Mathematics: Scientific Journal ◽

10.37418/amsj.9.6.48 ◽

2020 ◽

Vol 9 (6) ◽

pp. 3703-3711

Author(s):

N. Oberoi ◽

S. Sachdeva ◽

P. Garg ◽

R. Walia

Keyword(s):

Social Media ◽

Big Data ◽

Dynamic Extraction

A Comprehensive Analysis on the Adoption of Mobile Technology by Using Big Data-Based Social Media Marketing in SME Retailers in Jordan

SSRN Electronic Journal ◽

10.2139/ssrn.2962990 ◽

2016 ◽

Author(s):

Abdallah Al Tawara ◽

Ergun Gide

Keyword(s):

Social Media ◽

Big Data ◽

Mobile Technology ◽

Comprehensive Analysis ◽

Social Media Marketing

What big data reveal about social media use in the hotel industry

International Journal of Tourism Management and Sciences ◽

10.21719/ijtms.33.5.13 ◽

2018 ◽

Vol 33 (5) ◽

pp. 257-273

Author(s):

Woohyuk Kim

Keyword(s):

Social Media ◽

Big Data ◽

Media Use ◽

Hotel Industry ◽

Social Media Use

Mining Social Media Data to Study the Consequences of Dementia Diagnosis on Caregivers and Relatives (Preprint)

10.2196/preprints.10506 ◽

2018 ◽

Author(s):

Anika Oellrich ◽

George Gkotsis ◽

Richard James Butler Dobson ◽

Tim JP Hubbard ◽

Rina Dutta

Keyword(s):

Social Media ◽

Family Relationships ◽

Text Processing ◽

Automated Analysis ◽

Health Concern ◽

Dementia Diagnosis ◽

Data Set ◽

Social Media Data ◽

Real Time Processing ◽

Media Data

BACKGROUND Dementia is a growing public health concern with approximately 50 million people affected worldwide in 2017 and this number is expected to reach more than 131 million by 2050. The toll on caregivers and relatives cannot be underestimated as dementia changes family relationships, leaves people socially isolated, and affects the finances of all those involved.
OBJECTIVE The aim of this study was to explore using automated analysis (i) the age and gender of people who post to the social media forum Reddit about dementia diagnoses, (ii) the affected person and their diagnosis, (iii) relevant subreddits authors are posting to, (iv) the types of messages posted and (v) the content of these posts.
METHODS We analysed Reddit posts concerning dementia diagnoses. We used a previously developed text analysis pipeline to determine attributes of the posts as well as their authors to characterise online communications about dementia diagnoses. The posts were also examined by manual curation for the diagnosis provided and the person affected. Furthermore, we investigated the communities these people engage in and assessed the contents of the posts with an automated topic gathering technique.
RESULTS Our results indicate that the majority of posters in our data set are women, and it is mostly close relatives such as parents and grandparents that are mentioned. Both the communities frequented and topics gathered reflect not only the sufferer's diagnosis but also potential outcomes, e.g. hardships experienced by the caregiver. The trends observed from this dataset are consistent with findings based on qualitative review, validating the robustness of social media automated text processing.
CONCLUSIONS This work demonstrates the value of social media data sources as a resource for in-depth studies of those affected by a dementia diagnosis and the potential to develop novel support systems based on their real time processing in line with the increasing digitalisation of medical care.

Citizens, Elites, and Social Media Methodological Challenges and Opportunities in the Study of Persuasion and Mobilization

The Oxford Handbook of Electoral Persuasion ◽

10.1093/oxfordhb/9780190860806.013.27 ◽

2019 ◽

pp. 1036-1058

Author(s):

Philip Habel ◽

Yannis Theocharis

Keyword(s):

Social Media ◽

Big Data ◽

Supply And Demand ◽

Political Process ◽

The Political ◽

The Novel ◽

Complete Understanding ◽

Social Media Data ◽

Challenges And Opportunities ◽

Media Data

In the last decade, big data, and social media in particular, have seen increased popularity among citizens, organizations, politicians, and other elites, which in turn has created new and promising avenues for scholars studying long-standing questions of communication flows and influence. Studies of social media play a prominent role in our evolving understanding of the supply and demand sides of the political process, including the novel strategies adopted by elites to persuade and mobilize publics, as well as the ways in which citizens react, interact with elites and others, and utilize platforms to persuade audiences. While recognizing some challenges, this chapter speaks to the myriad of opportunities that social media data afford for evaluating questions of mobilization and persuasion, ultimately bringing us closer to a more complete understanding of Lasswell’s (1948) famous maxim: “who, says what, in which channel, to whom, [and] with what effect.”

Investigating the impact of pre-processing techniques and pre-trained word embeddings in detecting Arabic health information on social media

Journal Of Big Data ◽

10.1186/s40537-021-00488-w ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Yahya Albalawi ◽

Jim Buckley ◽

Nikola S. Nikolov

Keyword(s):

Social Media ◽

Deep Learning ◽

Comprehensive Evaluation ◽

Classification Problem ◽

Data Sets ◽

Word Embeddings ◽

Data Set ◽

Lower Accuracy ◽

Health Related ◽

The Impact

This paper presents a comprehensive evaluation of data pre-processing and word embedding techniques in the context of Arabic document classification in the domain of health-related communication on social media. We evaluate 26 text pre-processings applied to Arabic tweets within the process of training a classifier to identify health-related tweets. For this task we use the (traditional) machine learning classifiers KNN, SVM, Multinomial NB and Logistic Regression. Furthermore, we report experimental results with the deep learning architectures BLSTM and CNN for the same text classification problem. Since word embeddings are more typically used as the input layer in deep networks, in the deep learning experiments we evaluate several state-of-the-art pre-trained word embeddings with the same text pre-processing applied. To achieve these goals, we use two data sets: one for both training and testing, and another for testing the generality of our models only. Our results point to the conclusion that only four out of the 26 pre-processings improve the classification accuracy significantly. For the first data set of Arabic tweets, we found that Mazajak CBOW pre-trained word embeddings as the input to a BLSTM deep network led to the most accurate classifier with an F1 score of 89.7%. For the second data set, Mazajak Skip-Gram pre-trained word embeddings as the input to BLSTM led to the most accurate model with an F1 score of 75.2% and accuracy of 90.7%, compared to an F1 score of 90.8% achieved by Mazajak CBOW for the same architecture but with lower accuracy of 70.89%. Our results also show that the performance of the best of the traditional classifiers we trained is comparable to the deep learning methods on the first dataset, but significantly worse on the second dataset.
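Because this abstract reports both F1 score and accuracy, a short worked example of how the two metrics are computed may help. The sketch below uses the standard binary definitions and shows that the two metrics can rank models differently on an imbalanced data set. The confusion-matrix counts are hypothetical and are not taken from the paper, and the sketch makes no claim about how the paper's reported figures were averaged.

```python
def accuracy_and_f1(tp, fp, fn, tn):
    """Return (accuracy, F1 for the positive class) from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return accuracy, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, f1


# Hypothetical test set: 100 positive and 900 negative examples.
# Model A rarely predicts the positive class; model B predicts it often.
acc_a, f1_a = accuracy_and_f1(tp=30, fp=5, fn=70, tn=895)
acc_b, f1_b = accuracy_and_f1(tp=95, fp=150, fn=5, tn=750)

print(f"Model A: accuracy={acc_a:.3f}, F1={f1_a:.3f}")
print(f"Model B: accuracy={acc_b:.3f}, F1={f1_b:.3f}")
```

With these counts, model A has the higher accuracy while model B has the higher F1 score, which is why papers on imbalanced classification tasks commonly report both.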
