Efficient Computation of the Well-Founded Semantics over Big Data

2014 ◽  
Vol 14 (4-5) ◽  
pp. 445-459 ◽  
Author(s):  
ILIAS TACHMAZIDIS ◽  
GRIGORIS ANTONIOU ◽  
WOLFGANG FABER

Data originating from the Web, sensor readings and social media result in increasingly huge datasets. The so-called Big Data comes with new scientific and technological challenges while creating new opportunities, hence the increasing interest in academia and industry. Traditionally, logic programming has focused on complex knowledge structures/programs, so the question arises whether and how it can work in the face of Big Data. In this paper, we examine how the well-founded semantics can process huge amounts of data through mass parallelization. More specifically, we propose and evaluate a parallel approach using the MapReduce framework. Our experimental results indicate that our approach is scalable and that the well-founded semantics can be applied to billions of facts. To the best of our knowledge, this is the first work that addresses large-scale nonmonotonic reasoning without the restriction of stratification for predicates of arbitrary arity.
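The abstract does not detail the implementation, but the core MapReduce building block behind this kind of parallel rule evaluation is a distributed join on shared rule variables. The following minimal Python emulation of a single map/reduce round is only an illustrative sketch of that building block; the rule, predicate names and facts are hypothetical and not taken from the paper.

# One MapReduce round evaluating the recursive rule
#   reachable(X, Z) :- link(X, Y), reachable(Y, Z).
# The map phase keys every fact by the join variable Y; the reduce phase
# combines matching facts to derive new reachable/2 facts.
from collections import defaultdict

link = [("a", "b"), ("b", "c"), ("c", "d")]        # link(X, Y) facts
reachable = [("b", "c"), ("c", "d"), ("d", "e")]   # reachable(Y, Z) facts

def map_phase():
    for x, y in link:
        yield y, ("link", x)
    for y, z in reachable:
        yield y, ("reachable", z)

def reduce_phase(grouped):
    for values in grouped.values():
        lefts = [v for tag, v in values if tag == "link"]
        rights = [v for tag, v in values if tag == "reachable"]
        for x in lefts:
            for z in rights:
                yield ("reachable", x, z)

grouped = defaultdict(list)
for key, value in map_phase():
    grouped[key].append(value)

print(sorted(set(reduce_phase(grouped))))   # new facts derived in this round

In a full computation of the well-founded model, rounds like this are iterated to a fixpoint; the sketch only shows the shape of one distributed join step.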

2019 ◽  
pp. 1049-1070
Author(s):  
Fabian Neuhaus

User data created in digital contexts has increasingly become of interest for analysis, and for spatial analysis in particular. Large-scale computer user management systems such as digital ticketing and social networking are creating vast amounts of data. Such data systems can contain information generated by potentially millions of individuals. This kind of data has been termed big data. The analysis of big data, in its spatial as well as its temporal and social dimensions, can be of much interest in the context of cities and urban areas. This chapter discusses this potential along with a selection of sample work and an in-depth case study. The focus is mainly on the use of insights gained from social media data, especially from the Twitter platform, with regard to cities and urban environments. The first part of the chapter discusses a range of examples that make use of big data and the mapping of digital social network data. The second part discusses the way the data is collected and processed. An important section is dedicated to ethical considerations. A summary and an outlook are discussed at the end.


Author(s):  
Caio Saraiva Coneglian ◽  
Elvis Fusco

The data available on the Web is growing exponentially, providing information of high added value to organizations. Such information can be stored in diverse bases and in varied formats, such as videos and photos in social media. However, unstructured data presents great difficulty for information retrieval, because the meaning of documents stored on the Web is hard to interpret, so the informational needs of users are not met efficiently. In the context of an Information Retrieval architecture, this research aims to implement a semantic extraction agent for the Web that allows the location, treatment, and retrieval of information in Big Data settings across the most varied informational sources. The agent serves as the basis for building informational environments that aid the Information Retrieval process, using an ontology to add semantics to the retrieval process and to the presentation of the results obtained, so that users' needs can be met.
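As an illustration of how an ontology can add semantics to retrieval, the following sketch expands a user keyword with the labels of ontologically related concepts before the search is run. The tiny ontology, the class names and the rdflib dependency are assumptions made for the example, not details from the chapter.

# Expand a keyword query with subclass labels taken from an ontology,
# so that a search for a broad concept also matches more specific items.
from rdflib import Graph, Literal

ontology_ttl = """
@prefix ex:   <http://example.org/media#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .

ex:SocialMediaItem rdfs:label "social media item" .
ex:Video rdfs:subClassOf ex:SocialMediaItem ; rdfs:label "video" .
ex:Photo rdfs:subClassOf ex:SocialMediaItem ; rdfs:label "photo" .
"""

graph = Graph()
graph.parse(data=ontology_ttl, format="turtle")

QUERY = """
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?subLabel WHERE {
    ?cls rdfs:label ?label .
    ?sub rdfs:subClassOf ?cls ;
         rdfs:label ?subLabel .
    FILTER(LCASE(STR(?label)) = LCASE(STR(?kw)))
}
"""

def expand_query(keyword):
    rows = graph.query(QUERY, initBindings={"kw": Literal(keyword)})
    return [keyword] + [str(row.subLabel) for row in rows]

# Expanded terms, order may vary: ['Social Media Item', 'video', 'photo']
print(expand_query("Social Media Item"))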


Author(s):  
Bunjamin Memishi ◽  
Shadi Ibrahim ◽  
Maria S. Perez ◽  
Gabriel Antoniu

MapReduce has become a relevant framework for Big Data processing in the cloud. In large-scale clouds, failures do occur and may cause unwanted performance degradation in Big Data applications. As the reliability of MapReduce depends on how well it detects and handles failures, this book chapter investigates the problem of failure detection in the MapReduce framework. The case studies of this contribution reveal that the current static timeout value is not adequate and demonstrate significant variations in the application's response time under different timeout values. Arguing that comparatively little attention has been devoted to failure detection in the framework, the chapter presents design ideas for a new adaptive timeout.
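A minimal sketch of the kind of adaptive timeout argued for here, assuming the timeout is derived from recently observed task response times rather than a fixed cluster-wide constant; the heuristic used (mean plus a multiple of the standard deviation) is illustrative and not the chapter's own algorithm.

# Derive a per-application timeout from the response times of recently
# completed tasks instead of using one static value for every workload.
import statistics

def adaptive_timeout(recent_response_times, safety_factor=3.0, floor=30.0):
    """Return a timeout in seconds adapted to recent task behaviour."""
    if len(recent_response_times) < 2:
        return floor                          # not enough history yet
    mean = statistics.mean(recent_response_times)
    spread = statistics.stdev(recent_response_times)
    return max(floor, mean + safety_factor * spread)

# Hypothetical completion times (seconds) of the last few map tasks.
recent = [42.0, 38.5, 51.2, 40.3, 45.7]
print(f"adaptive timeout: {adaptive_timeout(recent):.1f} s")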


PLoS ONE ◽  
2021 ◽  
Vol 16 (4) ◽  
pp. e0249993
Author(s):  
Paul X. McCarthy ◽  
Xian Gong ◽  
Sina Eghbal ◽  
Daniel S. Falster ◽  
Marian-Andrei Rizoiu

Ever since the web began, the number of websites has been growing exponentially. These websites cover an ever-increasing range of online services that fill a variety of social and economic functions across a growing range of industries. Yet the networked nature of the web, combined with the economics of preferential attachment, increasing returns and global trade, suggests that over the long run a small number of competitive giants are likely to dominate each functional market segment, such as search, retail and social media. Here we perform a large-scale longitudinal study to quantify the distribution of attention given in the online environment to competing organisations. In two large online social media datasets, containing more than 10 billion posts and spanning more than a decade, we tally the volume of external links posted towards the organisations’ main domain name as a proxy for the online attention they receive. We also use the Common Crawl dataset—which contains the linkage patterns between more than a billion different websites—to study the patterns of link concentration over the past three years across the entire web. Lastly, we showcase the linking between economic, financial and market data by exploring the relationships between online attention on social media and the growth in enterprise value of the electric carmaker Tesla. Our analysis shows that although we observe consistent growth in all the macro indicators—in the total amount of online attention, in the number of organisations with an online presence, and in the functions they perform—a smaller number of organisations account for an ever-increasing proportion of total user attention, usually with one large player dominating each function. These results highlight how the evolution of the online economy involves innovation, diversity, and then competitive dominance.
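The tallying step described above can be illustrated with a small sketch: count external links per organisation's main domain as a proxy for attention, then look at how concentrated that attention is. The posts below are invented; the study itself works on billions of posts and the Common Crawl link graph.

# Count links to each main domain and report the share of the top domain.
from collections import Counter
from urllib.parse import urlparse

posts = [
    "check this out https://www.example-search.com/results?q=ev",
    "bought it on https://shop.example-retail.com/item/42",
    "great thread https://www.example-search.com/answers/17",
    "news via https://www.example-search.com/news",
]

def main_domain(url):
    host = urlparse(url).netloc.lower()
    parts = host.split(".")
    return ".".join(parts[-2:]) if len(parts) >= 2 else host

counts = Counter(
    main_domain(token)
    for post in posts
    for token in post.split()
    if token.startswith("http")
)

total = sum(counts.values())
top_domain, top_count = counts.most_common(1)[0]
print(counts)
print(f"'{top_domain}' receives {top_count / total:.0%} of the tallied attention")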


Author(s):  
Rasmus Helles ◽  
Jacob Ørmen ◽  
Klaus Bruhn Jensen ◽  
Signe Sophus Lai ◽  
Ericka Menchen-Trevino ◽  
...  

In recent years, large-scale analysis of log data from digital devices - often termed "big data analysis" (Lazer, Kennedy, King, & Vespignani, 2014) - has taken hold in the field of internet research. Through Application Programming Interfaces (APIs) and commercial measurement, scholars have been able to analyze social media users (Freelon, 2014) and web audiences (Taneja, 2016) on an unprecedented scale. And by developing digital research tools, scholars have been able to track individuals across websites (Menchen-Trevino, 2013) and mobile applications (Ørmen & Thorhauge, 2015) in greater detail than ever before. Big data analysis holds unique potential for studying communication in depth and across many individuals (see e.g. Boase & Ling, 2013; Prior, 2013). At the same time, this approach introduces new methodological challenges in the transparency of data collection (Webster, 2014), the sampling of participants and the validity of conclusions (Rieder, Abdulla, Poell, Woltering, & Zack, 2015). Firstly, data aggregation is typically designed for commercial rather than academic purposes. The type of data included, as well as how it is presented, depends in large part on the business interests of measurement and advertisement companies (Webster, 2014). Secondly, when relying on this kind of secondary data it can be difficult to validate the output or the techniques used to generate the data (Rieder, Abdulla, Poell, Woltering, & Zack, 2015). Thirdly, the unit of analysis is often media-centric, taking specific websites or social network pages as the empirical basis instead of individual users (Taneja, 2016). This makes it hard to untangle the behavior of real-world users from the aggregate trends. Lastly, variations in what users do might be so large that it is necessary to move from the aggregate to smaller groups of users to make meaningful inferences (Welles, 2014). Internet research is thus faced with a new research approach in big data analysis, whose potentials and perils need to be discussed in combination with traditional approaches. This panel explores the role of big data analysis in relation to the wider repertoire of methods in internet research. The panel comprises four presentations that each shed light on the complementarity of big data analysis with more traditional qualitative and quantitative methods. The first presentation opens the discussion with an overview of strategies for combining digital traces and commercial audience data with qualitative interviews and quantitative survey methods. The next presentation explores the potential of trace data to improve upon the experimental method: researcher-collected data enables scholars to operate in a real-world setting, in contrast to a research lab, while obtaining informed consent from participants. The third presentation argues that large-scale audience data provide a unique perspective on internet use; by integrating census-level information about users with detailed traces of their behavior across websites, commercial audience data combine the strengths of surveys and digital trace data respectively. Lastly, the fourth presentation shows how multi-institutional collaboration makes it possible to document social media activity (on Twitter) for a whole country (Australia) in a comprehensive manner, a feat not possible through other methods at a similar scale. Through these four presentations, the panel aims to situate big data analysis in the broader repertoire of internet research methods.


Author(s):  
Samir Sellami ◽  
Taoufiq Dkaki ◽  
Nacer Eddine Zarour ◽  
Pierre-Jean Charrel

The diversification of the web into the Web of Data and social media means that companies need to gather all the necessary data to help make the best-informed market decisions. However, data providers on the web publish data in various data models and may equip it with different search capabilities, thus requiring data integration techniques to access them. This work explores the current challenges in this area, discusses the limitations of some existing integration tools, and addresses them by proposing a semantic mediator-based approach to virtually integrate enterprise data with large-scale social and linked data. The implementation of the proposed approach is a configurable middleware application and a user-friendly keyword search interface that retrieves its input from internal enterprise data combined with various SPARQL endpoints and Web APIs. An evaluation study was conducted to compare its features with recent integration approaches. The results illustrate the added value and usability of the contributed approach.
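A minimal sketch of the mediator pattern described above, answering one keyword query from internal enterprise data combined with a public SPARQL endpoint. The internal records, the query shape and the SPARQLWrapper dependency are assumptions made for the example (and the broad label filter shown would be far too slow in production, where an indexed search service would be used); the sketch also needs network access to DBpedia.

# Answer a keyword query by merging internal records with linked data
# retrieved from a SPARQL endpoint.
from SPARQLWrapper import SPARQLWrapper, JSON

internal_crm = {"Tesla, Inc.": {"account_id": "A-1042", "segment": "automotive"}}

def mediated_search(keyword):
    # 1. Internal enterprise data.
    internal_hits = {name: record for name, record in internal_crm.items()
                     if keyword.lower() in name.lower()}

    # 2. External linked data via a public SPARQL endpoint.
    endpoint = SPARQLWrapper("https://dbpedia.org/sparql")
    endpoint.setQuery(f"""
        PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
        SELECT ?entity ?label WHERE {{
            ?entity rdfs:label ?label .
            FILTER(LANG(?label) = "en" && CONTAINS(LCASE(?label), "{keyword.lower()}"))
        }} LIMIT 5
    """)
    endpoint.setReturnFormat(JSON)
    bindings = endpoint.query().convert()["results"]["bindings"]

    # 3. One merged answer for the user.
    return {"internal": internal_hits,
            "external": [b["label"]["value"] for b in bindings]}

print(mediated_search("tesla"))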


Author(s):  
Grigoris Antoniou ◽  
Sotiris Batsakis ◽  
Raghava Mutharaju ◽  
Jeff Z. Pan ◽  
Guilin Qi ◽  
...  

As more and more data is being generated by sensor networks, social media and organizations, the Web interlinking this wealth of information becomes more complex. This is particularly true for the so-called Web of Data, in which data is semantically enriched and interlinked using ontologies. In this large and uncoordinated environment, reasoning can be used to check the consistency of the data and of associated ontologies, or to infer logical consequences which, in turn, can be used to obtain new insights from the data. However, reasoning approaches need to be scalable in order to enable reasoning over the entire Web of Data. To address this problem, several high-performance reasoning systems, which mainly implement distributed or parallel algorithms, have been proposed in the last few years. These systems differ significantly, for instance in terms of reasoning expressivity, computational properties such as completeness, or reasoning objectives. In order to provide a first complete overview of the field, this paper reports a systematic review of such scalable reasoning approaches over various ontological languages, reporting details about the methods and the conducted experiments. We highlight the shortcomings of these approaches and discuss some of the open problems related to performing scalable reasoning.
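As a toy illustration of the kind of inference these systems scale up, the sketch below materialises two standard RDFS rules (type propagation along subClassOf, and subClassOf transitivity) by forward chaining on a single machine; the triples are hypothetical, and the surveyed systems distribute this sort of closure computation over many machines.

# Forward-chain RDFS rules rdfs9 and rdfs11 until no new triples appear.
TYPE, SUBCLASS = "rdf:type", "rdfs:subClassOf"

triples = {
    ("ex:alice", TYPE, "ex:Student"),
    ("ex:Student", SUBCLASS, "ex:Person"),
    ("ex:Person", SUBCLASS, "ex:Agent"),
}

def rdfs_closure(kb):
    kb = set(kb)
    while True:
        sub = {(s, o) for s, p, o in kb if p == SUBCLASS}
        derived = set()
        # rdfs9: (x rdf:type C), (C rdfs:subClassOf D)  =>  (x rdf:type D)
        derived |= {(x, TYPE, d) for x, p, c in kb if p == TYPE
                    for c2, d in sub if c2 == c}
        # rdfs11: subClassOf is transitive.
        derived |= {(c, SUBCLASS, e) for c, d in sub for d2, e in sub if d2 == d}
        if derived <= kb:
            return kb
        kb |= derived

for triple in sorted(rdfs_closure(triples)):
    print(triple)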

