The VLDB Journal | ScienceGate

Privacy-preserving worker allocation in crowdsourcing

The VLDB Journal ◽

10.1007/s00778-021-00713-1 ◽

2022 ◽

Author(s):

Libin Zheng ◽

Lei Chen ◽

Peng Cheng

Keyword(s):

Privacy Preserving ◽

Worker Allocation

Download Full-text

Information Resilience: the nexus of responsible and agile approaches to information use

The VLDB Journal ◽

10.1007/s00778-021-00720-2 ◽

2022 ◽

Author(s):

Shazia Sadiq ◽

Amir Aryani ◽

Gianluca Demartini ◽

Wen Hua ◽

Marta Indulska ◽

...

Keyword(s):

Case Studies ◽

Data Privacy ◽

Data Science ◽

Information Use ◽

Regulatory Compliance ◽

Future Research ◽

Public And Private ◽

Social Good ◽

Public And Private Sector ◽

Effective Use

AbstractThe appetite for effective use of information assets has been steadily rising in both public and private sector organisations. However, whether the information is used for social good or commercial gain, there is a growing recognition of the complex socio-technical challenges associated with balancing the diverse demands of regulatory compliance and data privacy, social expectations and ethical use, business process agility and value creation, and scarcity of data science talent. In this vision paper, we present a series of case studies that highlight these interconnected challenges, across a range of application areas. We use the insights from the case studies to introduce Information Resilience, as a scaffold within which the competing requirements of responsible and agile approaches to information use can be positioned. The aim of this paper is to develop and present a manifesto for Information Resilience that can serve as a reference for future research and development in relevant areas of responsible data management.

Download Full-text

Opportunities for optimism in contended main-memory multicore transactions

The VLDB Journal ◽

10.1007/s00778-021-00719-9 ◽

2022 ◽

Author(s):

Yihe Huang ◽

William Qian ◽

Eddie Kohler ◽

Barbara Liskov ◽

Liuba Shrira

Keyword(s):

Main Memory

Download Full-text

Payment behavior prediction on shared parking lots with TR-GCN

The VLDB Journal ◽

10.1007/s00778-021-00722-0 ◽

2022 ◽

Author(s):

Qingyu Xu ◽

Feng Zhang ◽

Mingde Zhang ◽

Jidong Zhai ◽

Bingsheng He ◽

...

Keyword(s):

Behavior Prediction ◽

Parking Lots

Download Full-text

On entity alignment at scale

The VLDB Journal ◽

10.1007/s00778-021-00703-3 ◽

2022 ◽

Author(s):

Weixin Zeng ◽

Xiang Zhao ◽

Xinyi Li ◽

Jiuyang Tang ◽

Wei Wang

Download Full-text

MDDE: multitasking distributed differential evolution for privacy-preserving database fragmentation

The VLDB Journal ◽

10.1007/s00778-021-00718-w ◽

2022 ◽

Author(s):

Yong-Feng Ge ◽

Maria Orlowska ◽

Jinli Cao ◽

Hua Wang ◽

Yanchun Zhang

Keyword(s):

Differential Evolution ◽

Privacy Preserving

Download Full-text

Correction to: Data dependencies for query optimization: a survey

The VLDB Journal ◽

10.1007/s00778-021-00710-4 ◽

2021 ◽

Author(s):

Jan Kossmann ◽

Thorsten Papenbrock ◽

Felix Naumann

Keyword(s):

Query Optimization ◽

Data Dependencies

Download Full-text

Efficient exploratory clustering analyses in large-scale exploration processes

The VLDB Journal ◽

10.1007/s00778-021-00716-y ◽

2021 ◽

Author(s):

Manuel Fritz ◽

Michael Behringer ◽

Dennis Tschechlov ◽

Holger Schwarz

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

Comprehensive Evaluation ◽

State Of The Art ◽

Clustering Algorithms ◽

Search Space ◽

Large Datasets ◽

Search Spaces ◽

Multiple Challenges ◽

The One

AbstractClustering is a fundamental primitive in manifold applications. In order to achieve valuable results in exploratory clustering analyses, parameters of the clustering algorithm have to be set appropriately, which is a tremendous pitfall. We observe multiple challenges for large-scale exploration processes. On the one hand, they require specific methods to efficiently explore large parameter search spaces. On the other hand, they often exhibit large runtimes, in particular when large datasets are analyzed using clustering algorithms with super-polynomial runtimes, which repeatedly need to be executed within exploratory clustering analyses. We address these challenges as follows: First, we present LOG-Means and show that it provides estimates for the number of clusters in sublinear time regarding the defined search space, i.e., provably requiring less executions of a clustering algorithm than existing methods. Second, we demonstrate how to exploit fundamental characteristics of exploratory clustering analyses in order to significantly accelerate the (repetitive) execution of clustering algorithms on large datasets. Third, we show how these challenges can be tackled at the same time. To the best of our knowledge, this is the first work which simultaneously addresses the above-mentioned challenges. In our comprehensive evaluation, we unveil that our proposed methods significantly outperform state-of-the-art methods, thus especially supporting novice analysts for exploratory clustering analyses in large-scale exploration processes.

Download Full-text

A survey on semantic schema discovery

The VLDB Journal ◽

10.1007/s00778-021-00717-x ◽

2021 ◽

Author(s):

Kenza Kellou-Menouer ◽

Nikolaos Kardoulakis ◽

Georgia Troullinou ◽

Zoubida Kedad ◽

Dimitris Plexousakis ◽

...

Keyword(s):

Semantic Schema

Download Full-text

Parallel mining of large maximal quasi-cliques

The VLDB Journal ◽

10.1007/s00778-021-00712-2 ◽

2021 ◽

Author(s):

Jalal Khalil ◽

Da Yan ◽

Guimu Guo ◽

Lyuheng Yuan

Keyword(s):

Parallel Mining

Download Full-text

The VLDB Journal
Latest Publications

TOTAL DOCUMENTS

H-INDEX

Published By Springer-Verlag

Privacy-preserving worker allocation in crowdsourcing

Information Resilience: the nexus of responsible and agile approaches to information use

Opportunities for optimism in contended main-memory multicore transactions

Payment behavior prediction on shared parking lots with TR-GCN

On entity alignment at scale

MDDE: multitasking distributed differential evolution for privacy-preserving database fragmentation

Correction to: Data dependencies for query optimization: a survey

Efficient exploratory clustering analyses in large-scale exploration processes

A survey on semantic schema discovery

Parallel mining of large maximal quasi-cliques

Export Citation Format

The VLDB JournalLatest Publications

TOTAL DOCUMENTS

H-INDEX

Published By Springer-Verlag

Privacy-preserving worker allocation in crowdsourcing

Information Resilience: the nexus of responsible and agile approaches to information use

Opportunities for optimism in contended main-memory multicore transactions

Payment behavior prediction on shared parking lots with TR-GCN

On entity alignment at scale

MDDE: multitasking distributed differential evolution for privacy-preserving database fragmentation

Correction to: Data dependencies for query optimization: a survey

Efficient exploratory clustering analyses in large-scale exploration processes

A survey on semantic schema discovery

Parallel mining of large maximal quasi-cliques

The VLDB Journal
Latest Publications