Big Data

Author(s):  
Yannick Dufresne ◽  
Brittany I. Davidson

This chapter assesses big data. Within the social sciences, big data could refer to an emerging field of research that brings together academics from a variety of disciplines using and developing tools to widen perspective, to utilize latent data sets, as well as for the generation of new data. Another way to define big data in the social sciences refers to data corresponding to at least one of the three s of big data: volume, variety, or velocity.. These characteristics are widely used by researchers attempting to define and distinguish new types of data from conventional ones. However, there are a number of ethical and consent issues with big data analytics. For example, many studies across the social sciences utilize big data from the web, from social media, online communities, and the darknet, where there is a question as to whether users provided consent to the reuse of their posts, profiles, or other data shared when they signed up, knowing their profiles and information would be public. This has led to a number of issues regarding algorithms making decisions that cannot be explained. The chapter then considers the opportunities and pitfalls that come along with big data.

2021 ◽  
Vol 12 ◽  
Author(s):  
Muhammad Usman Tariq ◽  
Muhammad Babar ◽  
Marc Poulin ◽  
Akmal Saeed Khattak ◽  
Mohammad Dahman Alshehri ◽  
...  

Intelligent big data analysis is an evolving pattern in the age of big data science and artificial intelligence (AI). Analysis of organized data has been very successful, but analyzing human behavior using social media data becomes challenging. The social media data comprises a vast and unstructured format of data sources that can include likes, comments, tweets, shares, and views. Data analytics of social media data became a challenging task for companies, such as Dailymotion, that have billions of daily users and vast numbers of comments, likes, and views. Social media data is created in a significant amount and at a tremendous pace. There is a very high volume to store, sort, process, and carefully study the data for making possible decisions. This article proposes an architecture using a big data analytics mechanism to efficiently and logically process the huge social media datasets. The proposed architecture is composed of three layers. The main objective of the project is to demonstrate Apache Spark parallel processing and distributed framework technologies with other storage and processing mechanisms. The social media data generated from Dailymotion is used in this article to demonstrate the benefits of this architecture. The project utilized the application programming interface (API) of Dailymotion, allowing it to incorporate functions suitable to fetch and view information. The API key is generated to fetch information of public channel data in the form of text files. Hive storage machinist is utilized with Apache Spark for efficient data processing. The effectiveness of the proposed architecture is also highlighted.


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Kiran Chaudhary ◽  
Mansaf Alam ◽  
Mabrook S. Al-Rakhami ◽  
Abdu Gumaei

AbstractSocial media is popular in our society right now. People are using social media platforms to purchase various products. We collected the data from various social media platforms. We analyzed the data for prediction of the consumer behavior on the social media platform. We considered the consumer data from Facebook, Twitter, Linked In and YouTube, Instagram, and Pinterest, etc. There are diverse and high-speed, high volume data which are coming from social media platform, so we used predictive big data analytics. In this paper, we have used the concept of big data technology to process data and analyze it to predict consumer behavior on social media. We have analyzed consumer behavior on social media platforms based on some parameters and criteria. We analyzed the consumer perception, attitude towards the social media platform. To get good quality of result, we pre-process data using various data pre-processing to detect outlier, noises, error, and duplicate record. We developed mathematical modeling using machine learning to predict consumer behavior on the social media platform. This model is a predictive model for predicting consumer behavior on the social media platform. 80% of data are used for training purposes and 20% for testing.


Sociologija ◽  
2018 ◽  
Vol 60 (3) ◽  
pp. 557-582
Author(s):  
Jelisaveta Petrovic

The paper critically examines the attitude of the mainstream sociology towards the study of big data in social sciences. Content analysis of the scientific papers published in the top-tier sociological journals ranked on the Thomson Reuters Impact Factor ssci list (2000-2017) shows that, in the observed period, the issue of big data was largely neglected. This topic is still rather invisible in the mainstream sociological thought, although it draws a lot of attention outside the academia. The analysis of big data within mainstream sociology is dominated by a critical perspective, while the application of the big data analytics is quite rare. In the concluding section, the importance of the big data study for sociology is emphasised. Moreover, it is pointed out at the risk of auto-marginalization in case of neglecting the ?tectonic? changes induced by the big data analytics in the space once dominated by the social sciences. [Project of the Serbian Ministry of Education, Science and Technological Development, Grant no. 179035: Izazovi nove drustvene integracije u Srbiji: koncepti i akteri]


2021 ◽  
pp. 074391562199967
Author(s):  
Raffaello Rossi ◽  
Agnes Nairn ◽  
Josh Smith ◽  
Christopher Inskip

The internet raises substantial challenges for policy makers in regulating gambling harm. The proliferation of gambling advertising on Twitter is one such challenge. However, the sheer scale renders it extremely hard to investigate using conventional techniques. In this paper the authors present three UK Twitter gambling advertising studies using both Big Data analytics and manual content analysis to explore the volume and content of gambling adverts, the age and engagement of followers, and compliance with UK advertising regulations. They analyse 890k organic adverts from 417 accounts along with data on 620k followers and 457k engagements (replies and retweets). They find that around 41,000 UK children follow Twitter gambling accounts, and that two-thirds of gambling advertising Tweets fail to fully comply with regulations. Adverts for eSports gambling are markedly different from those for traditional gambling (e.g. on soccer, casinos and lotteries) and appear to have strong appeal for children, with 28% of engagements with eSports gambling ads from under 16s. The authors make six policy recommendations: spotlight eSports gambling advertising; create new social-media-specific regulations; revise regulation on content appealing to children; use technology to block under-18s from seeing gambling ads; require ad-labelling of organic gambling Tweets; and deploy better enforcement.


2020 ◽  
Vol 4 (2) ◽  
pp. 5 ◽  
Author(s):  
Ioannis C. Drivas ◽  
Damianos P. Sakas ◽  
Georgios A. Giannakopoulos ◽  
Daphne Kyriaki-Manessi

In the Big Data era, search engine optimization deals with the encapsulation of datasets that are related to website performance in terms of architecture, content curation, and user behavior, with the purpose to convert them into actionable insights and improve visibility and findability on the Web. In this respect, big data analytics expands the opportunities for developing new methodological frameworks that are composed of valid, reliable, and consistent analytics that are practically useful to develop well-informed strategies for organic traffic optimization. In this paper, a novel methodology is implemented in order to increase organic search engine visits based on the impact of multiple SEO factors. In order to achieve this purpose, the authors examined 171 cultural heritage websites and their retrieved data analytics about their performance and user experience inside them. Massive amounts of Web-based collections are included and presented by cultural heritage organizations through their websites. Subsequently, users interact with these collections, producing behavioral analytics in a variety of different data types that come from multiple devices, with high velocity, in large volumes. Nevertheless, prior research efforts indicate that these massive cultural collections are difficult to browse while expressing low visibility and findability in the semantic Web era. Against this backdrop, this paper proposes the computational development of a search engine optimization (SEO) strategy that utilizes the generated big cultural data analytics and improves the visibility of cultural heritage websites. One step further, the statistical results of the study are integrated into a predictive model that is composed of two stages. First, a fuzzy cognitive mapping process is generated as an aggregated macro-level descriptive model. Secondly, a micro-level data-driven agent-based model follows up. The purpose of the model is to predict the most effective combinations of factors that achieve enhanced visibility and organic traffic on cultural heritage organizations’ websites. To this end, the study contributes to the knowledge expansion of researchers and practitioners in the big cultural analytics sector with the purpose to implement potential strategies for greater visibility and findability of cultural collections on the Web.


2018 ◽  
Vol 20 (1) ◽  
Author(s):  
Tiko Iyamu

Background: Over the years, big data analytics has been statically carried out in a programmed way, which does not allow for translation of data sets from a subjective perspective. This approach affects an understanding of why and how data sets manifest themselves into various forms in the way that they do. This has a negative impact on the accuracy, redundancy and usefulness of data sets, which in turn affects the value of operations and the competitive effectiveness of an organisation. Also, the current single approach lacks a detailed examination of data sets, which big data deserve in order to improve purposefulness and usefulness.Objective: The purpose of this study was to propose a multilevel approach to big data analysis. This includes examining how a sociotechnical theory, the actor network theory (ANT), can be complementarily used with analytic tools for big data analysis.Method: In the study, the qualitative methods were employed from the interpretivist approach perspective.Results: From the findings, a framework that offers big data analytics at two levels, micro- (strategic) and macro- (operational) levels, was developed. Based on the framework, a model was developed, which can be used to guide the analysis of heterogeneous data sets that exist within networks.Conclusion: The multilevel approach ensures a fully detailed analysis, which is intended to increase accuracy, reduce redundancy and put the manipulation and manifestation of data sets into perspectives for improved organisations’ competitiveness.


2021 ◽  
Author(s):  
Kristia M. Pavlakos

Big Data1is a phenomenon that has been increasingly studied in the academy in recent years, especially in technological and scientific contexts. However, it is still a relatively new field of academic study; because it has been previously considered in mainly technological contexts, more attention needs to be drawn to the contributions made in Big Data scholarship in the social sciences by scholars like Omar Tene and Jules Polonetsky, Bart Custers, Kate Crawford, Nick Couldry, and Jose van Dijk. The purpose of this Major Research Paper is to gain insight into the issues surrounding privacy and user rights, roles, and commodification in relation to Big Data in a social sciences context. The term “Big Data” describes the collection, aggregation, and analysis of large data sets. While corporations are usually responsible for the analysis and dissemination of the data, most of this data is user generated, and there must be considerations regarding the user’s rights and roles. In this paper, I raise three main issues that shape the discussion: how users can be more active agents in data ownership, how consent measures can be made to actively reflect user interests instead of focusing on benefitting corporations, and how user agency can be preserved. Through an analysis of social sciences scholarly literature on Big Data, privacy, and user commodification, I wish to determine how these concepts are being discussed, where there have been advancements in privacy regulation and the prevention of user commodification, and where there is a need to improve these measures. In doing this, I hope to discover a way to better facilitate the relationship between data collectors and analysts, and user-generators. 1 While there is no definitive resolution as to whether or not to capitalize the term “Big Data”, in capitalizing it I chose to conform with such authors as boyd and Crawford (2012), Couldry and Turow (2014), and Dalton and Thatcher (2015), who do so in the scholarly literature.


Sign in / Sign up

Export Citation Format

Share Document