scholarly journals Comparison of Semirigorous and Empirical Models Derived Using Data Quality Assessment Methods

Minerals ◽  
2021 ◽  
Vol 11 (9) ◽  
pp. 954
Author(s):  
Kevin Brooks ◽  
Derik le le Roux ◽  
Yuri A. W. Shardt ◽  
Chris Steyn

With the increase in available data and the stricter control requirements for mineral processes, the development of automated methods for data processing and model creation are becoming increasingly important. In this paper, the application of data quality assessment methods for the development of semirigorous and empirical models of a primary milling circuit in a platinum concentrator plant is investigated to determine their validity and how best to handle multivariate input data. The data set used consists of both routine operating data and planned step tests. Applying the data quality assessment method to this data set, it was seen that selecting the appropriate subset of variables for multivariate assessment was difficult. However, it was shown that it was possible to identify regions of sufficient value for modeling. Using the identified data, it was possible to fit empirical linear models and a semirigorous nonlinear model. As expected, models obtained from the routine operating data were, in general, worse than those obtained from the planned step tests. However, using the models obtained from routine operating data as the initial seed models for the automated advanced process control methods would be extremely helpful. Therefore, it can be concluded that the data quality assessment method was able to extract and identify regions sufficient and acceptable for modeling.

2009 ◽  
Vol 419-420 ◽  
pp. 445-448 ◽  
Author(s):  
Jun Ting Cheng ◽  
Wei Ling Zhao ◽  
Can Zhao ◽  
Xue Dong Xie

In the field of reverse engineering, data quality assessment is a very important work in the detection, the result of data quality assessment will directly or indirectly affect the detection and the following manufacturing process quality. Data quality assessment can be used in the camera calibration, the model and model reconstruction comparison, and so on. In this paper, on the basis of the existing method of calculating each point error, and multipurpose use of average and standard error and some other concepts of mathematical statistics, and then improve a novel and simple calculating error method. This method is applicable to many groups of one-to-one ideal data and the measured data comparison, and it can be more intuitive to reflect the error of overall data, as well as the error distribution, and it can be more efficient to determine the measured data is reasonable or not. In this paper, the data point quality which is collected in the reverse engineering is assessed, and it can see that the method which is proposed in this article has some advantages in the data point quality assessment field.


2017 ◽  
Vol 9 (1) ◽  
Author(s):  
Sophia Crossen

ObjectiveTo explore the quality of data submitted once a facility is movedinto an ongoing submission status and address the importance ofcontinuing data quality assessments.IntroductionOnce a facility meets data quality standards and is approved forproduction, an assumption is made that the quality of data receivedremains at the same level. When looking at production data qualityreports from various states generated using a SAS data qualityprogram, a need for production data quality assessment was identified.By implementing a periodic data quality update on all productionfacilities, data quality has improved for production data as a whole andfor individual facility data. Through this activity several root causesof data quality degradation have been identified, allowing processesto be implemented in order to mitigate impact on data quality.MethodsMany jurisdictions work with facilities during the onboardingprocess to improve data quality. Once a certain level of data qualityis achieved, the facility is moved into production. At this point thejurisdiction generally assumes that the quality of the data beingsubmitted will remain fairly constant. To check this assumption inKansas, a SAS Production Report program was developed specificallyto look at production data quality.A legacy data set is downloaded from BioSense production serversby Earliest Date in order to capture all records for visits which occurredwithin a specified time frame. This data set is then run through a SASdata quality program which checks specific fields for completenessand validity and prints a report on counts and percentages of null andinvalid values, outdated records, and timeliness of record submission,as well as examples of records from visits containing these errors.A report is created for the state as a whole, each facility, EHR vendor,and HIE sending data to the production servers, with examplesprovided only by facility. The facility, vendor, and HIE reportsinclude state percentages of errors for comparison.The Production Report was initially run on Kansas data for thefirst quarter of 2016 followed by consultations with facilities on thefindings. Monthly checks were made of data quality before and afterfacilities implemented changes. An examination of Kansas’ resultsshowed a marked decrease in data quality for many facilities. Everyfacility had at least one area in need of improvement.The data quality reports and examples were sent to every facilitysending production data during the first quarter attached to an emailrequesting a 30-60 minute call with each to go over the report. Thiscall was deemed crucial to the process since it had been over a year,and in a few cases over two years, since some of the facilities hadlooked at data quality and would need a review of the findings andall requirements, new and old. Ultimately, over half of all productionfacilities scheduled a follow-up call.While some facilities expressed some degree of trepidation, mostfacilities were open to revisiting data quality and to making requestedimprovements. Reasons for data quality degradation included updatesto EHR products, change of EHR product, work flow issues, engineupdates, new requirements, and personnel turnover.A request was made of other jurisdictions (including Arizona,Nevada, and Illinois) to look at their production data using the sameprogram and compare quality. Data was pulled for at least one weekof July 2016 by Earliest Date.ResultsMonthly reports have been run on Kansas Production data bothbefore and after the consultation meetings which indicate a markedimprovement in both completeness of required fields and validityof values in those fields. Data for these monthly reports was againselected by Earliest Date.ConclusionsIn order to ensure production data continues to be of value forsyndromic surveillance purposes, periodic data quality assessmentsshould continue after a facility reaches ongoing submission status.Alterations in process include a review of production data at leasttwice per year with a follow up data review one month later to confirmadjustments have been correctly implemented.


2017 ◽  
Vol 5 (1) ◽  
pp. 47-54
Author(s):  
Puguh Ika Listyorini ◽  
Mursid Raharjo ◽  
Farid Agushybana

Data are the basis to make a decision and policy. The quality of data is going to produce a better policy. The quality assessment methods nowadays do not include all indicators of data quality. If the indicators or assessment criteria in the quality assessment methods are more complete, the level of assessment methods of the data will be higher. The purpose of this study is to develop the method of independent assessment of routine data quality in Surakarta Health Department which is previously performed using the data quality assessment of PMKDR and HMN methods firstly.The design of this study is research and development (R&D) that has been modified into seven steps, namely formulating potential problems, collecting the data, designing the product, validating the design, fixing the design, testing the product, and fixing the product. The subjects consisted of 19 respondents who are managers of data in Surakarta Health Department. Data analysis method used is content analysis.The assessment results show that, in the pilot phase of the development of data quality assessment methods which have been developed, it is basically successful, or it can be used. The results of the assessment of the quality of the data by the developed method is the quality of data collection which is very adequate, the quality of data accuracy which is poor, the quality of data that consistency exists but is inadequate, the quality of the actuality of the data which is very adequate, the quality of periodicity data that is inadequate, the quality of the representation of the data that is very adequate, and sorting the data which is very adequate.It needs a commitment from Surakarta Health Department to take advantage of the development of these methods to assess the quality of data to support the availability of information, decision-making and planning of health programs. It also calls for the development of this research by conducting all stages of the steps of R&D so that the final result of the method development will be better.


2019 ◽  
Vol 181 ◽  
pp. 104954 ◽  
Author(s):  
Carlos Sáez ◽  
Siaw-Teng Liaw ◽  
Eizen Kimura ◽  
Pascal Coorevits ◽  
Juan M Garcia-Gomez

2020 ◽  
Vol 2 (4) ◽  
pp. 529-553
Author(s):  
Li Huang ◽  
Zhenzhen Liu ◽  
Fangfang Xu ◽  
Jinguang Gu

With the rapid growth of the linked data on the Web, the quality assessment of the RDF data set becomes particularly important, especially for the quality and accessibility of the linked data. In most cases, RDF data sets are shared online, leading to a high maintenance cost for the quality assessment. This also potentially pollutes Internet data. Recently blockchain technology has shown the potential in many applications. Using the blockchain storage quality assessment results can reduce the centralization of the authority, and the quality assessment results have characteristics such as non-tampering. To this end, we propose an RDF data quality assessment model in a decentralized environment, pointing out a new dimension of RDF data quality. We use the blockchain to record the data quality assessment results and design a detailed update strategy for the quality assessment results. We have implemented a system DCQA to test and verify the feasibility of the quality assessment model. The proposed method can provide users with better cost-effective results when knowledge is independently protected.


Sign in / Sign up

Export Citation Format

Share Document