data ecosystem Latest Research Papers

Knowledge-Driven Data Ecosystems Toward Data Transparency

Journal of Data and Information Quality ◽

10.1145/3467022 ◽

2022 ◽

Vol 14 (1) ◽

pp. 1-12

Author(s):

Sandra Geisler ◽

Maria-Esther Vidal ◽

Cinzia Cappiello ◽

Bernadette Farias Lóscio ◽

Avigdor Gal ◽

...

Keyword(s):

Real World ◽

Organizational Management ◽

Data Driven ◽

Data Governance ◽

Ethical Considerations ◽

Shared Data ◽

Data Transparency ◽

Data Ecosystem

A data ecosystem (DE) offers a keystone-player or alliance-driven infrastructure that enables the interaction of different stakeholders and the resolution of interoperability issues among shared data. However, despite years of research in data governance and management, trustability is still affected by the absence of transparent and traceable data-driven pipelines. In this work, we focus on requirements and challenges that DEs face when ensuring data transparency. Requirements are derived from the data and organizational management, as well as from broader legal and ethical considerations. We propose a novel knowledge-driven DE architecture, providing the pillars for satisfying the analyzed requirements. We illustrate the potential of our proposal in a real-world scenario. Last, we discuss and rate the potential of the proposed architecture in the fulfillmentof these requirements.

Download Full-text

Integrating Data and Service Lifecycle for Smart Service Systems Engineering: Compilation of a Lifecycle Model for the Data Ecosystem of Smart Living

10.24251/hicss.2022.254 ◽

2022 ◽

Author(s):

Henrik Kortum ◽

Jonas Rebstadt ◽

Simon Hagen ◽

Oliver Thomas

Keyword(s):

Systems Engineering ◽

Service Systems ◽

Lifecycle Model ◽

Smart Living ◽

Smart Service ◽

Data Ecosystem

Download Full-text

Designing an AI compatible open government data ecosystem for public governance

10.24251/hicss.2022.291 ◽

2022 ◽

Author(s):

Evrim Tan

Keyword(s):

Open Government ◽

Public Governance ◽

Open Government Data ◽

Government Data ◽

Data Ecosystem

Download Full-text

Towards a Standard-based Open Data Ecosystem: Analysis of DCAT-AP use at National and European Level

Electronic Government an International Journal ◽

10.1504/eg.2022.10036962 ◽

2022 ◽

Vol 1 (1) ◽

pp. 1

Author(s):

IOANNIS MAGNISALIS ◽

Vassilios Peristeras ◽

Lina Molinas Comet ◽

Florian Barthelemy ◽

Michael Cochez ◽

...

Keyword(s):

Open Data ◽

European Level ◽

Data Ecosystem

Download Full-text

Forming a data ecosystem: international experience of infrastructure asset management

Fìnansi Ukraïni ◽

10.33763/finukr2021.11.038 ◽

2021 ◽

Vol 2021 (11) ◽

pp. 38-44

Author(s):

Danyila OLIYNYK ◽

Keyword(s):

Asset Management ◽

Ecosystem Model ◽

Added Value ◽

Economic Sustainability ◽

Semantic Technologies ◽

Network Infrastructure ◽

Digital Ecosystem ◽

Data Ecosystem ◽

And Control ◽

The Eu

Based on the research conducted on the European policy of data ecosystem formation, the feasibility of regulatory alignment of the components of the digital ecosystem model in Ukraine to measure and control the parameters on economic sustainability is substantiated. The article presents the approaches of the EU, international standardization organizations and scientists to understanding the essence of the data ecosystem, identifies factors that impact the complexity of network assets administration on the example of infrastructure assets. Emphasis is placed on ensuring sustainability and assurance of existing network infrastructure assets throughout their lifecycle. The problems of digital transformation related to the increasing strain on all infrastructure systems, which are solved by the model of network infrastructure formation, are outlined. The need to accelerate the introduction of semantic technologies in IoT, in particular artificial intelligence, which expands the possibilities of data analysis and control and support of economic indicators of the state and the creation of added value in production and services, is justified.

Download Full-text

Transfer Learning Based Performance Modeling And Effective Storage Management In Big Data Ecosystems

10.32920/17190185 ◽

2021 ◽

Author(s):

Fatimah Alsayoud

Keyword(s):

Big Data ◽

Transfer Learning ◽

Performance Modeling ◽

Storage System ◽

Cost Effective ◽

Storage Management ◽

Management Decisions ◽

Modeling Approach ◽

Data Ecosystem ◽

The Right

Big data ecosystems contain a mix of sophisticated hardware storage components to support heterogeneous workloads. Storage components and the workloads interact and affect each other; therefore, their relationship has to consider when modeling workloads or managing storage. Efficient workload modeling guides optimal storage management decisions, and the right decisions help guarantee the workload’s needs. The first part of this thesis focuses on workload modeling efficiency, and the second part focuses on cost-effective storage management.<div>Workload performance modeling is an essential step in management decisions. The standard modeling approach constructs the model based on a historical dataset collected from one set of setups (scenario). The standard modeling approach requires the model to be reconstructed from scratch with every time the setups changes. To address this issue, we propose a cross-scenario modeling approach that improves the workload’s performance classification accuracy by up to 78% through adopting the Transfer Learning (TL).<br></div><div>The storage system is the most crucial component of the big data ecosystem, where the workload’s execution process starts by fetching data from it and ends by storing data into it. Thus, the workload’s performance is directly affected by storage capability. To provide a high I/O performance in the ecosystems, Solid State Drive (SSD) are utilized as a tier or as a cache on big data distributed ecosystems. SSDs have a short lifespan that is affected by data size and the number of writing operations. Balancing performance requirements and SSD’s lifespan consumption is never easy, and it’s even harder when interacting with a huge amount of data and with heterogeneous I/O patterns. In this thesis, we analysis big data workloads I/O pattern impacts on SSD’s lifespan when SSD is used as a tier or as a cache. Then, we design a Hidden Markov Model (HMM) based I/O pattern controller that manages workload placement and guarantees cost-effective storage that enhances the workload performance by up to 60%, and improves SSD’s lifespan by up to 40%. </div><div>The designed transfer learning modeling approach and the storage management solutions improve workload modeling accuracy, and the quality of the storage management policies while the testing setup changes.<br></div>

Download Full-text

Transfer Learning Based Performance Modeling And Effective Storage Management In Big Data Ecosystems

10.32920/17190185.v1 ◽

2021 ◽

Author(s):

Fatimah Alsayoud

Keyword(s):

Big Data ◽

Transfer Learning ◽

Performance Modeling ◽

Storage System ◽

Cost Effective ◽

Storage Management ◽

Management Decisions ◽

Modeling Approach ◽

Data Ecosystem ◽

The Right

Big data ecosystems contain a mix of sophisticated hardware storage components to support heterogeneous workloads. Storage components and the workloads interact and affect each other; therefore, their relationship has to consider when modeling workloads or managing storage. Efficient workload modeling guides optimal storage management decisions, and the right decisions help guarantee the workload’s needs. The first part of this thesis focuses on workload modeling efficiency, and the second part focuses on cost-effective storage management.<div>Workload performance modeling is an essential step in management decisions. The standard modeling approach constructs the model based on a historical dataset collected from one set of setups (scenario). The standard modeling approach requires the model to be reconstructed from scratch with every time the setups changes. To address this issue, we propose a cross-scenario modeling approach that improves the workload’s performance classification accuracy by up to 78% through adopting the Transfer Learning (TL).<br></div><div>The storage system is the most crucial component of the big data ecosystem, where the workload’s execution process starts by fetching data from it and ends by storing data into it. Thus, the workload’s performance is directly affected by storage capability. To provide a high I/O performance in the ecosystems, Solid State Drive (SSD) are utilized as a tier or as a cache on big data distributed ecosystems. SSDs have a short lifespan that is affected by data size and the number of writing operations. Balancing performance requirements and SSD’s lifespan consumption is never easy, and it’s even harder when interacting with a huge amount of data and with heterogeneous I/O patterns. In this thesis, we analysis big data workloads I/O pattern impacts on SSD’s lifespan when SSD is used as a tier or as a cache. Then, we design a Hidden Markov Model (HMM) based I/O pattern controller that manages workload placement and guarantees cost-effective storage that enhances the workload performance by up to 60%, and improves SSD’s lifespan by up to 40%. </div><div>The designed transfer learning modeling approach and the storage management solutions improve workload modeling accuracy, and the quality of the storage management policies while the testing setup changes.<br></div>

Download Full-text

Has demography witnessed a data revolution? Promises and pitfalls of a changing data ecosystem

Population Studies ◽

10.1080/00324728.2021.1969031 ◽

2021 ◽

Vol 75 (sup1) ◽

pp. 47-75

Author(s):

Ridhi Kashyap

Keyword(s):

Data Ecosystem

Download Full-text

Mapping the education data ecosystem in Sierra Leone

10.53832/edtechhub.0068 ◽

2021 ◽

Author(s):

Chris McBurnie ◽

Iman Beoku-Betts

Keyword(s):

Sierra Leone ◽

Education Data ◽

Data Ecosystem

An output of the EdTech Hub, https://edtechhub.org

Download Full-text

Automating accountability? Privacy policies, data transparency, and the third party problem

University of Toronto Law Journal ◽

10.3138/utlj-2020-0136 ◽

2021 ◽

Author(s):

David Lie ◽

Lisa M. Austin ◽

Peter Yi Ping Sun ◽

Wenjun Qiu

Keyword(s):

Consumer Choice ◽

Third Parties ◽

Policy Implications ◽

Third Party ◽

Privacy Policy ◽

Privacy Policies ◽

The Third ◽

Data Transparency ◽

The Third Party ◽

Data Ecosystem

We have a data transparency problem. Currently, one of the main mechanisms we have to understand data flows is through the self-reporting that organizations provide through privacy policies. These suffer from many well-known problems, problems that are becoming more acute with the increasing complexity of the data ecosystem and the role of third parties – the affiliates, partners, processors, ad agencies, analytic services, and data brokers involved in the contemporary data practices of organizations. In this article, we argue that automating privacy policy analysis can improve the usability of privacy policies as a transparency mechanism. Our argument has five parts. First, we claim that we need to shift from thinking about privacy policies as a transparency mechanism that enhances consumer choice and see them as a transparency mechanism that enhances meaningful accountability. Second, we discuss a research tool that we prototyped, called AppTrans (for Application Transparency), which can detect inconsistencies between the declarations in a privacy policy and the actions the mobile application can potentially take if it is used. We used AppTrans to test seven hundred applications and found that 59.5 per cent were collecting data in ways that were not declared in their policies. The vast majority of the discrepancies were due to third party data collection such as adversiting and analytics. Third, we outline the follow-on research we did to extend AppTrans to analyse the information sharing of mobile applications with third parties, with mixed results. Fourth, we situate our findings in relation to the third party issues that came to light in the recent Cambridge Analytica scandal and the calls from regulators for enhanced technical safeguards in managing these third party relationships. Fifth, we discuss some of the limitations of privacy policy automation as a strategy for enhanced data transparency and the policy implications of these limitations.

Download Full-text

data ecosystem
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Knowledge-Driven Data Ecosystems Toward Data Transparency

Integrating Data and Service Lifecycle for Smart Service Systems Engineering: Compilation of a Lifecycle Model for the Data Ecosystem of Smart Living

Designing an AI compatible open government data ecosystem for public governance

Towards a Standard-based Open Data Ecosystem: Analysis of DCAT-AP use at National and European Level

Forming a data ecosystem: international experience of infrastructure asset management

Transfer Learning Based Performance Modeling And Effective Storage Management In Big Data Ecosystems

Transfer Learning Based Performance Modeling And Effective Storage Management In Big Data Ecosystems

Has demography witnessed a data revolution? Promises and pitfalls of a changing data ecosystem

Mapping the education data ecosystem in Sierra Leone

Automating accountability? Privacy policies, data transparency, and the third party problem

Export Citation Format

data ecosystemRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Knowledge-Driven Data Ecosystems Toward Data Transparency

Integrating Data and Service Lifecycle for Smart Service Systems Engineering: Compilation of a Lifecycle Model for the Data Ecosystem of Smart Living

Designing an AI compatible open government data ecosystem for public governance

Towards a Standard-based Open Data Ecosystem: Analysis of DCAT-AP use at National and European Level

Forming a data ecosystem: international experience of infrastructure asset management

Transfer Learning Based Performance Modeling And Effective Storage Management In Big Data Ecosystems

Transfer Learning Based Performance Modeling And Effective Storage Management In Big Data Ecosystems

Has demography witnessed a data revolution? Promises and pitfalls of a changing data ecosystem

Mapping the education data ecosystem in Sierra Leone

Automating accountability? Privacy policies, data transparency, and the third party problem

data ecosystem
Recently Published Documents