Measurement of Data Consumer Satisfaction with Data Quality for Improvement of Data Utilization

Web portals are Internet-based applications that provide a big amount of data. The data consumer who uses the data given by these applications needs to assess data quality. Due to the relevance of data quality on the Web together with the fact that DQ needs to be assessed within the context in which data are generated, data quality models specific to this context are necessary. In this chapter, we will introduce a model for data quality in Web portals (PDQM). PDQM has been built upon the foundation of three key aspects: (1) a set of Web data quality attributes identified in the literature in this area, (2) data quality expectations of data consumers on the Internet, and (3) the functionalities that a Web portal may offer its users.

Download Full-text

A Data Quality Model for Web Portals

Handbook of Research on Web Information Systems Quality ◽

10.4018/978-1-59904-847-5.ch008 ◽

2010 ◽

pp. 130-144

Author(s):

Angélica Caro ◽

Coral Calero ◽

Mario Piattini

Keyword(s):

Data Quality ◽

The Internet ◽

Web Portals ◽

Web Portal ◽

Quality Model ◽

Quality Models ◽

Key Aspects ◽

Assess Data Quality ◽

The Web ◽

Data Consumer

Web portals are Internet-based applications that provide a big amount of data. The data consumer who uses the data given by these applications needs to assess data quality. Due to the relevance of data quality on the Web together with the fact that DQ needs to be assessed within the context in which data are generated, data quality models specific to this context are necessary. In this chapter, we will introduce a model for data quality in Web portals (PDQM). PDQM has been built upon the foundation of three key aspects: (1) a set of Web data quality attributes identified in the literature in this area, (2) data quality expectations of data consumers on the Internet, and (3) the functionalities that a Web portal may offer its users.

Download Full-text

Get a Sample for a Discount: Sampling-Based XML Data Pricing

10.31219/osf.io/5y84h ◽

2017 ◽

Author(s):

Ruiming Tang ◽

Antoine Amarilli ◽

Pierre Senellart ◽

Stéphane Bressan

Keyword(s):

Data Quality ◽

Polynomial Time ◽

Random Sampling ◽

Xml Data ◽

Trade Off ◽

Quality Dimension ◽

First Case ◽

Xml Document ◽

Uniform Random Sampling ◽

Data Consumer

While price and data quality should define the major trade-off for consumers in data markets, prices are usually prescribed by vendorsand data quality is not negotiable. In this paper we study a modelwhere data quality can be traded for a discount. We focus on the case ofXML documents and consider completeness as the quality dimension. Inour setting, the data provider offers an XML document, and sets boththe price of the document and a weight to each node of the document,depending on its potential worth. The data consumer proposes a price.If the proposed price is lower than that of the entire document, thenthe data consumer receives a sample, i.e., a random rooted subtree ofthe document whose selection depends on the discounted price and theweight of nodes. By requesting several samples, the data consumer caniteratively explore the data in the document. We show that the uniformrandom sampling of a rooted subtree with prescribed weight isunfortunately intractable. However, we are able to identify several practical casesthat are tractable. The first case is uniform random sampling of a rootedsubtree with prescribed size; the second case restricts to binary weights.For both these practical cases we present polynomial-time algorithmsand explain how they can be integrated into an iterative exploratorysampling approach.

Download Full-text

A Framework for Sampling-Based XML Data Pricing

10.31219/osf.io/m3xcn ◽

2017 ◽

Author(s):

Ruiming Tang ◽

Antoine Amarilli ◽

Pierre Senellart ◽

Stéphane Bressan

Keyword(s):

Data Quality ◽

Polynomial Time ◽

Polynomial Time Algorithm ◽

Time Algorithm ◽

Uniform Sampling ◽

First Case ◽

Sampling Problem ◽

Uniform Random Sampling ◽

Data Consumer ◽

Pseudo Polynomial Time

While price and data quality should define the major tradeoff for consumers in data markets, prices are usually prescribed by vendors and data quality is not negotiable. In this paper we study a model where data quality can be traded for a discount. We focus on the case of XML documents and consider completeness as the quality dimension. In our setting, the data provider offers an XML document, and sets both the price of the document and a weight to each node of the document, depending on its potential worth. The data consumer proposes a price. If the proposed price is lower than that of the entire document, then the data consumer receives a sample, i.e., a random rooted subtree of the document whose selection depends on the discounted price and the weight of nodes. By requesting several samples, the data consumer can iteratively explore the data in the document.We present a pseudo-polynomial time algorithm to select a rooted subtree with prescribed weight uniformly at random, but show that this problem is unfortunately intractable. Yet, we are able to identify several practical cases where our algorithm runs in polynomial time. The first case is uniform random sampling of a rooted subtree with prescribed size rather than weights; the second case restricts to binary weights.As a more challenging scenario for the sampling problem, we also study the uniform sampling of a rooted subtree of prescribed weight and prescribed height. We adapt our pseudo-polynomial time algorithm to this setting and identify tractable cases.

Download Full-text

Novel framework of retaining maximum data quality and energy efficiency in reconfigurable wireless sensor network

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v9i4.pp2893-2901 ◽

2019 ◽

Vol 9 (4) ◽

pp. 2893

Author(s):

B. Prathiba ◽

K. Jaya Sankar ◽

V. Sumalatha

Keyword(s):

Energy Efficiency ◽

Wireless Sensor Network ◽

Data Quality ◽

Sensor Network ◽

Wireless Sensor ◽

Data Utilization ◽

Analytical Methodology ◽

Utilization Factor ◽

Aggregated Data ◽

Good Balance

There are various unseen and unpredictable networking states in Wireless Sensor Network (WSN) that adversely affect the aggregated data quality. After reviewing the existing approaches of data quality in WSN, it was found that the solutions are quite symptomatic and they are applicable only in a static environment; however their successful applicability on dynamic and upcoming reconfigurable network is still a big question. Moreover, data quality directly affects energy conservation among the nodes. Therefore, the proposed system introduces a simple and novel framework that jointly addresses the data quality and energy efficiency using probability-based design approach. Using a simplified analytical methodology, the proposed system offers solution in the form of selection transmission of an aggergated data on the basis of message priority in order to offer higher data utilization factor. The study outcome shows proposed system offers a good balance between data quality and energy efficiency in contrast to existing system.

Download Full-text