FROM QoD TO QoS - Data Quality Issues in Cloud Computing

Thinking about police data: Analysts’ perceptions of data quality in Canadian policing

The Police Journal Theory Practice and Principles ◽

10.1177/0032258x211021461 ◽

2021 ◽

pp. 0032258X2110214

Author(s):

Christopher D O’Connor ◽

John Ng ◽

Dallas Hill ◽

Tyler Frederick

Keyword(s):

Big Data ◽

Data Collection ◽

Data Quality ◽

Research Culture ◽

Police Services ◽

Police Data ◽

Data Collection And Analysis ◽

Quality Issues

Policing is increasingly being shaped by data collection and analysis. However, we still know little about the quality of the data police services acquire and utilize. Drawing on a survey of analysts from across Canada, this article examines several data collection, analysis, and quality issues. We argue that as we move towards an era of big data policing it is imperative that police services pay more attention to the quality of the data they collect. We conclude by discussing the implications of ignoring data quality issues and the need to develop a more robust research culture in policing.

Download Full-text

A Power System Disturbance Classification Method Robust to PMU Data Quality Issues

IEEE Transactions on Industrial Informatics ◽

10.1109/tii.2021.3072397 ◽

2021 ◽

pp. 1-1

Author(s):

Zikang Li ◽

Hao Liu ◽

Junbo Zhao ◽

Tianshu Bi ◽

Qixun Yang

Keyword(s):

Power System ◽

Data Quality ◽

Classification Method ◽

Quality Issues

Download Full-text

Between the Spreadsheets

10.29085/9781783305049 ◽

2021 ◽

Author(s):

Susan Walsh

Keyword(s):

Data Quality ◽

Deep Understanding ◽

Data Classification ◽

Dirty Data ◽

Quality Issues ◽

Book Covers ◽

Level Of Experience ◽

The Impact

Dirty data is a problem that costs businesses thousands, if not millions, every year. In organisations large and small across the globe you will hear talk of data quality issues. What you will rarely hear about is the consequences or how to fix it. Between the Spreadsheets: Classifying and Fixing Dirty Data draws on classification expert Susan Walsh's decade of experience in data classification to present a fool-proof method for cleaning and classifying your data. The book covers everything from the very basics of data classification to normalisation, taxonomies and presents the author's proven COAT methodology, helping ensure an organisation's data is Consistent, Organised, Accurate and Trustworthy. A series of data horror stories outlines what can go wrong in managing data, and if it does, how it can be fixed. After reading this book, regardless of your level of experience, not only will you be able to work with your data more efficiently, but you will also understand the impact the work you do with it has, and how it affects the rest of the organisation. Written in an engaging and highly practical manner, Between the Spreadsheets gives readers of all levels a deep understanding of the dangers of dirty data and the confidence and skills to work more efficiently and effectively with it.

Download Full-text

Data Quality: A Negotiator between Paper-based and Digital Records in the Pakistan’s TB Control Program

10.20944/preprints201806.0185.v1 ◽

2018 ◽

Author(s):

Syed Mustafa Ali ◽

Farah Naureen ◽

Arif Noor ◽

Maged Kamel N. Boulos ◽

Javariya Aamir ◽

...

Keyword(s):

Data Quality ◽

Quality Assessment ◽

Control Program ◽

Digital Data ◽

Patient Treatment ◽

Assessment Framework ◽

Healthcare Organizations ◽

Data Quality Assessment ◽

Quality Issues

Background Increasingly, healthcare organizations are using technology for the efficient management of data. The aim of this study was to compare the data quality of digital records with the quality of the corresponding paper-based records by using data quality assessment framework. Methodology We conducted a desk review of paper-based and digital records over the study duration from April 2016 to July 2016 at six enrolled TB clinics. We input all data fields of the patient treatment (TB01) card into a spreadsheet-based template to undertake a field-to-field comparison of the shared fields between TB01 and digital data. Findings A total of 117 TB01 cards were prepared at six enrolled sites, whereas just 50% of the records (n=59; 59 out of 117 TB01 cards) were digitized. There were 1,239 comparable data fields, out of which 65% (n=803) were correctly matched between paper based and digital records. However, 35% of the data fields (n=436) had anomalies, either in paper-based records or in digital records. 1.9 data quality issues were calculated per digital patient record, whereas it was 2.1 issues per record for paper-based record. Based on the analysis of valid data quality issues, it was found that there were more data quality issues in paper-based records (n=123) than in digital records (n=110). Conclusion There were fewer data quality issues in digital records as compared to the corresponding paper-based records. Greater use of mobile data capture and continued use of the data quality assessment framework can deliver more meaningful information for decision making.

Download Full-text

Discovering XML Conditional Dependencies for Data Quality Issues

European Journal of Electrical Engineering and Computer Science ◽

10.24018/ejece.2020.4.1.156 ◽

2020 ◽

Vol 4 (1) ◽

Author(s):

Mohammed Ragheb Hakawati ◽

Yasmin Yacob ◽

Amiza Amir ◽

Jabiry M. Mohammed ◽

Khalid Jamal Jadaa

Keyword(s):

Data Quality ◽

Primary Standard ◽

Markup Language ◽

Document Type ◽

Data Dependencies ◽

Master Data ◽

Xml Document ◽

Extensible Markup ◽

Quality Issues ◽

Mining Algorithms

Extensible Markup Language (XML) is emerging as the primary standard for representing and exchanging data, with more than 60% of the total; XML considered the most dominant document type over the web; nevertheless, their quality is not as expected. XML integrity constraint especially XFD plays an important role in keeping the XML dataset as consistent as possible, but their ability to solve data quality issues is still intangible. The main reason is that old-fashioned data dependencies were basically introduced to maintain the consistency of the schema rather than that of the data. The purpose of this study is to introduce a method for discovering pattern tableaus for XML conditional dependencies to be used for enhancing XML document consistency as a part of data quality improvement phases. The notations of the conditional dependencies as new rules are designed mainly for improving data instance and extended traditional XML dependencies by enforcing pattern tableaus of semantically related constants. Subsequent to this, a set of minimal approximate conditional dependencies (XCFD, XCIND) is discovered and learned from the XML tree using a set of mining algorithms. The discovered patterns can be used as a Master data in order to detect inconsistencies that don’t respect the majority of the dataset.

Download Full-text

The Application of Distributed Computing Based on Cloud Computing in Statistical Work

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.694-697.2374 ◽

2013 ◽

Vol 694-697 ◽

pp. 2374-2377

Author(s):

Wei Li

Keyword(s):

Cloud Computing ◽

Distributed Computing ◽

Data Quality ◽

Social Economy ◽

Implementation Process ◽

Development Prospect ◽

Effective Development ◽

Statistical Work

With the rapid and effective development of social economy, all departments make an increasingly greater demand on statistical work. However, for a variety of reasons, all circles have been questioning the data quality of statistics departments. Specific to deficiencies of statistical work, this paper presents the basic thinking via which the distributed computing is applied to implementing statistical work under cloud computing. Besides, it compares the differences between ordinary cloud computing and distributed computing in the implementation process. Eventually, it mentions the development prospect of the application of distributed computing in statistical work under cloud computing.

Download Full-text

Using regional wildlife surveys to assess the CRP: scale and data-quality issues

Journal of Field Ornithology ◽

10.1111/j.1557-9263.2007.00097.x ◽

2007 ◽

Vol 78 (2) ◽

pp. 140-151 ◽

Cited By ~ 8

Author(s):

John H. Giudice ◽

Kurt J. Haroldson

Keyword(s):

Data Quality ◽

Quality Issues

Download Full-text

Electronic Health Record Data Quality Issues Are Not Remedied by Increasing Granularity of Diagnosis Codes

JAMA Cardiology ◽

10.1001/jamacardio.2019.0830 ◽

2019 ◽

Vol 4 (5) ◽

pp. 465 ◽

Cited By ~ 2

Author(s):

Ann Marie Navar

Keyword(s):

Electronic Health Record ◽

Data Quality ◽

Health Record ◽

Electronic Health Record Data ◽

Diagnosis Codes ◽

Record Data ◽

Quality Issues ◽

Electronic Health

Download Full-text

Numerical, secondary Big Data quality issues, quality threshold establishment, & guidelines for journal policy development

Decision Support Systems ◽

10.1016/j.dss.2019.113135 ◽

2019 ◽

Vol 126 ◽

pp. 113135 ◽

Cited By ~ 2

Author(s):

Anita Lee-Post ◽

Ram Pakath

Keyword(s):

Big Data ◽

Data Quality ◽

Policy Development ◽

Quality Issues ◽

Quality Threshold

Download Full-text

How to Handle Data Quality Issues in EQ-5D-5L Valuation Studies. The Spanish Case

Value in Health ◽

10.1016/j.jval.2016.09.170 ◽

2016 ◽

Vol 19 (7) ◽

pp. A376 ◽

Cited By ~ 2

Author(s):

JM Ramos-Goñi ◽

BM Craig ◽

M Oppe ◽

Y Ramallo-Fariña ◽

JL Pinto-Prades ◽

...

Keyword(s):

Data Quality ◽

Quality Issues

Download Full-text