Solving the Fragment Complexity of Official, Social, and Sensorial Urban Data

Cities in the big data era hold the massive urban data to create valuable information and digitally enhanced services. Sources of urban data are generally categorized as one of the three types: official, social, and sensorial, which are from the government and enterprises, social networks of citizens, and the sensor network. These types typically differ significantly from each other but are consolidated together for the smart urban services. Based on the sophisticated consolidation approaches, we argue that a new challenge, fragment complexity that represents a well-integrated data has appropriate but fragmentary schema and difficult to be queried, is ignored in the state-of-art urban data management. Comparing with predefined and rigid schema, fragmentary schema means a dataset contains millions of attributes but nonorthogonally distributed among tables, and of course, values of these attributes are even massive. As far as a query is concerned, locating where these attributes are being stored is the first encountered problem, while traditional value-based query optimization has no contributions. To address this problem, we propose an index on massive attributes as an attributes-oriented optimization, namely, attribute index. Attribute index is a secondary index for locating files in which the target attributes are stored. It contains three parts: ATree for searching keys, DTree for locating keys among files, and ADLinks as a mapping table between ATree and DTree. In this paper, the index architecture, logical structure and algorithms, the implementation details, the creation process, the integration to the existing key-value store, and the urban application scenario are described. Experiments show that, in comparison with B + -Tree, LSM-Tree, and AVL-Tree, the query time of ATree is 1.1x, 1.5x, and 1.2x faster, respectively. Finally, we integrate our proposition with HBase, namely, UrbanBase, whose query performance is 1.3x faster than the original HBase.

Download Full-text

The Analysis of Social Networks of T'ang Dynasty's Poets: An Approach to Their Social Poems on the Basis of Big Data and Infographics

Journal of Chinese Studies ◽

10.35982/jcs.82.7 ◽

2017 ◽

Vol 82 ◽

pp. 163-193

Author(s):

Joonyoun Kim

Keyword(s):

Social Networks ◽

Big Data ◽

Analysis Of Social Networks

Download Full-text

An Optimal Framework for Spatial Query Optimization Using Hadoop in Big Data Analytics

Recent Patents on Computer Science ◽

10.2174/2213275912666190419215231 ◽

2019 ◽

Vol 12 ◽

Author(s):

Pankaj Dadheech ◽

Dinesh Goyal ◽

Sumit Srivastava ◽

Ankit Kumar

Keyword(s):

Big Data ◽

Query Optimization ◽

Spatial Data ◽

Spatial Information ◽

Big Data Analytics ◽

Spatial Query ◽

Data Process ◽

Boolean Queries ◽

Spatial Query Optimization ◽

Hadoop System

Spatial queries frequently used in Hadoop for significant data process. However, vast and massive size of spatial information makes it difficult to process the spatial inquiries proficiently, so they utilized the Hadoop system for process Big Data. We have used Boolean Queries & Geometry Boolean Spatial Data for Query Optimization using Hadoop System. In this paper, we show a lightweight and adaptable spatial data index for big data which will process in Hadoop frameworks. Results demonstrate the proficiency and adequacy of our spatial ordering system for various spatial inquiries.

Download Full-text

A News Big Data Analysis of Issues in Higher Education in Korea amid the COVID-19 Pandemic

Sustainability ◽

10.3390/su13137347 ◽

2021 ◽

Vol 13 (13) ◽

pp. 7347

Author(s):

Jangwan Ko ◽

Seungsu Paek ◽

Seoyoon Park ◽

Jiwoo Park

Keyword(s):

Higher Education ◽

Big Data ◽

Data Analysis ◽

College Education ◽

Big Data Analysis ◽

Second Phase ◽

Public Attention ◽

Job Market ◽

Media Reports ◽

The Government

This paper examines the main issues regarding higher education in Korea—where college education experienced minimal interruptions—during the COVID-19 pandemic through a big data analysis of news articles. By analyzing policy responses from the government and colleges and examining prominent discourses on higher education, it provides a context for discussing the implications of COVID-19 on education policy and what the post-pandemic era would bring. To this end, we utilized BIgKinds, a big data research solution for news articles offered by the Korea Press Foundation, to select a total of 2636 media reports and conducted Topic Modelling based on LDA algorithms using NetMiner. The analyses are split into three distinct periods of COVID-19 spread in the country. Some notable topics from the first phase are remote class, tuition refund, returning Chinese international students, and normalization of college education. Preparations for the College Scholastic Ability Test (CSAT), contact and contactless classes, preparations for early admissions, and supporting job market candidates are extracted for the second phase. For the third phase, the extracted topics include CSAT and college-specific exams, quarantine on campus, social relations on campus, and support for job market candidates. The results confirmed widespread public attention to the relevant issues but also showed empirically that the measures taken by the government and college administrations to combat COVID-19 had limited visibility among media reports. It is important to note that timely and appropriate responses from the government and colleges have enabled continuation of higher education in some capacity during the pandemic. In addition to the media’s role in reporting issues of public interest, there is also a need for continued research and discussion on higher education amid COVID-19 to help effect actual results from various policy efforts.

Download Full-text

DaLiF: a data lifecycle framework for data-driven governments

Journal Of Big Data ◽

10.1186/s40537-021-00481-3 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Syed Iftikhar Hussain Shah ◽

Vassilios Peristeras ◽

Ioannis Magnisalis

Keyword(s):

Big Data ◽

Data Management ◽

Good Governance ◽

Business Community ◽

Data Driven ◽

Governmental Organizations ◽

Data Lifecycle ◽

The Government ◽

Data Ecosystem ◽

Governance Transparency

AbstractThe public sector, private firms, business community, and civil society are generating data that is high in volume, veracity, velocity and comes from a diversity of sources. This kind of data is known as big data. Public Administrations (PAs) pursue big data as “new oil” and implement data-centric policies to transform data into knowledge, to promote good governance, transparency, innovative digital services, and citizens’ engagement in public policy. From the above, the Government Big Data Ecosystem (GBDE) emerges. Managing big data throughout its lifecycle becomes a challenging task for governmental organizations. Despite the vast interest in this ecosystem, appropriate big data management is still a challenge. This study intends to fill the above-mentioned gap by proposing a data lifecycle framework for data-driven governments. Through a Systematic Literature Review, we identified and analysed 76 data lifecycles models to propose a data lifecycle framework for data-driven governments (DaliF). In this way, we contribute to the ongoing discussion around big data management, which attracts researchers’ and practitioners’ interest.

Download Full-text

Big Data in the philippines: How do we actually use them?

Statistical Journal of the IAOS ◽

10.3233/sji-210826 ◽

2021 ◽

pp. 1-30

Author(s):

Lisa Grace S. Bersales ◽

Josefina V. Almeda ◽

Sabrina O. Romasoc ◽

Marie Nadeen R. Martinez ◽

Dannela Jann B. Galias

Keyword(s):

Big Data ◽

High Speed ◽

The Philippines ◽

Complex Data ◽

Official Statistics ◽

Group Discussions ◽

The Public ◽

The Government ◽

The Many ◽

Current Utilization

With the advancement of technology, digitalization, and the internet of things, large amounts of complex data are being produced daily. This vast quantity of various data produced at high speed is referred to as Big Data. The utilization of Big Data is being implemented with success in the private sector, yet the public sector seems to be falling behind despite the many potentials Big Data has already presented. In this regard, this paper explores ways in which the government can recognize the use of Big Data for official statistics. It begins by gathering and presenting Big Data-related initiatives and projects across the globe for various types and sources of Big Data implemented. Further, this paper discusses the opportunities, challenges, and risks associated with using Big Data, particularly in official statistics. This paper also aims to assess the current utilization of Big Data in the country through focus group discussions and key informant interviews. Based on desk review, discussions, and interviews, the paper then concludes with a proposed framework that provides ways in which Big Data may be utilized by the government to augment official statistics.

Download Full-text

Ensembled Adaptive Fuzzy K-Means With Stochastic Extreme Gradient Boost Big Data Clustering on Geo-Social Networks

2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE) ◽

10.1109/icacite51222.2021.9404574 ◽

2021 ◽

Author(s):

M. Anoop ◽

P. Sripriya

Keyword(s):

Social Networks ◽

Big Data ◽

Data Clustering ◽

Adaptive Fuzzy

Download Full-text

LRDM: Local Record-Driving Mechanism for Big Data Privacy Preservation in Social Networks

2016 IEEE First International Conference on Data Science in Cyberspace (DSC) ◽

10.1109/dsc.2016.94 ◽

2016 ◽

Cited By ~ 2

Author(s):

Weihao Li ◽

Hui Li

Keyword(s):

Social Networks ◽

Big Data ◽

Data Privacy ◽

Privacy Preservation ◽

Driving Mechanism ◽

Big Data Privacy

Download Full-text

BIG DATA PROCESSING IN THE DIGITALIZATION OF ENTERPRISE ACTIVITIES

Bulletin Series of Physics & Mathematical Sciences ◽

10.51889/2021-3.1728-7901.09 ◽

2021 ◽

Vol 75 (3) ◽

pp. 76-82

Author(s):

G.T. Balakayeva ◽

◽

D.K. Darkenbayev ◽

M. Turdaliyev ◽

◽

...

Keyword(s):

Social Networks ◽

Growth Rate ◽

Big Data ◽

Data Processing ◽

New Technologies ◽

Large Data ◽

Modern Society ◽

Information Flows ◽

Professional Activities ◽

The Past

The growth rate of these enterprises has increased significantly in the last decade. Research has shown that over the past two decades, the amount of data has increased approximately tenfold every two years - this exceeded Moore's Law, which doubles the power of processors. About thirty thousand gigabytes of data are accumulated every second, and their processing requires an increase in the efficiency of data processing. Uploading videos, photos and letters from users on social networks leads to the accumulation of a large amount of data, including unstructured ones. This leads to the need for enterprises to work with big data of different formats, which must be prepared in a certain way for further work in order to obtain the results of modeling and calculations. In connection with the above, the research carried out in the article on processing and storing large data of an enterprise, developing a model and algorithms, as well as using new technologies is relevant. Undoubtedly, every year the information flows of enterprises will increase and in this regard, it is important to solve the issues of storing and processing large amounts of data. The relevance of the article is due to the growing digitalization, the increasing transition to professional activities online in many areas of modern society. The article provides a detailed analysis and research of these new technologies.

Download Full-text

Prospects for Cooperation Between Russia and the Republic of Korea in the Fields of Big Data, Network and Artificial Intelligence for the Development of the Digital Economy

World Economy and International Relations ◽

10.20542/0131-2227-2021-65-8-51-60 ◽

2021 ◽

Vol 65 (8) ◽

pp. 51-60

Author(s):

Yujeong Kim

Keyword(s):

Artificial Intelligence ◽

Big Data ◽

Large Scale ◽

Technology Development ◽

Technological Development ◽

Digital Economy ◽

Republic Of Korea ◽

Data Network ◽

Leading Position ◽

The Government

Today, each country has interest in digital economy and has established and implemented policies aimed at digital technology development and digital transformation for the transition to the digital economy. In particular, interest in digital technologies such as big data, 5G, and artificial intelligence, which are recognized as important factors in the digital economy, has been increasing recently, and it is a time when the role of the government for technological development and international cooperation becomes important. In addition to the overall digital economic policy, the Russian and Korean governments are also trying to improve their international competitiveness and take a leading position in the new economic order by establishing related technical and industrial policies. Moreover, Republic of Korea often refers to data, network and artificial intelligence as D∙N∙A, and has established policies in each of these areas in 2019. Russia is also establishing and implementing policies in the same field in 2019. Therefore, it is timely to find ways to expand cooperation between Russia and Republic of Korea. In particular, the years of 2020and 2021marks the 30th anniversary of diplomatic relations between the two countries, and not only large-scale events and exchange programs have prepared, but the relationship is deepening as part of the continued foreign policy of both countries – Russia’s Eastern Policy and New Northern Policy of Republic of Korea. Therefore, this paper compares and analyzes the policies of the two countries in big data, 5G, and artificial intelligence to seek long-term sustainable cooperation in the digital economy.

Download Full-text