Analyst Performance Measures. Volume 3. Information Quality Tools for Persistent Surveillanec Data Sets

2011 ◽  
Author(s):  
John Talburt ◽  
Serhan Dagtas ◽  
Mariofanna Milanova ◽  
Mihail Tudoreanu ◽  
Brian Tsou
2011 ◽  
Author(s):  
Marina Altynova ◽  
Ed Wasser ◽  
Telford Berkey ◽  
Sanjay Boddhu ◽  
Tin Sa ◽  
...  

2019 ◽  
Vol 341 ◽  
pp. 168-182 ◽  
Author(s):  
José-Ramón Cano ◽  
Pedro Antonio Gutiérrez ◽  
Bartosz Krawczyk ◽  
Michał Woźniak ◽  
Salvador García

2013 ◽  
Vol 711 ◽  
pp. 719-721 ◽  
Author(s):  
Agamohamadi Basmenj Fazlollah ◽  
M. Yusuff Rosnah ◽  
Zulkifli Norzima ◽  
Ismaiel Yusof ◽  
Sorooshian Shahryar

This paper considers three factors; the selection of the Performance Measures, selection of the Critical Success Factors, and selection of quality tools as three elements of TQM. The qualitative research of this study tries to find and model the interrelation between elements of Total Quality Management (TQM) practice in companies.


Nowadays, a huge amount of data is generated due to the growth in the technologies. There are different tools used to view this massive amount of data, and these tools contain different data mining techniques which can be applied for the obtained data sets. Classification is required to extract useful information or to predict the result from these enormous amounts of data. For this purpose, there are different classification algorithms. In this paper, we have compared Naive Bayes, K*, and random forest classification algorithm using Weka tool. To analyze the performance of these three algorithms we have considered three data sets. They are diabetes, supermarket and weather data set. In this work, an analysis is made based on the confusion matrix and different performance measures like RMSE, MAE, ROC, etc


Author(s):  
Sherif Ishak ◽  
Ciprian Alecsandru

The characteristics of preincident, postincident, and nonincident traffic conditions on freeways are investigated. The characteristics are defined by second-order statistical measures derived from spatiotemporal speed contour maps. Four performance measures are used to quantify properties such as smoothness, homogeneity, and randomness in traffic conditions in a manner similar to texture characterization of digital images. With real-world incident and traffic data sets, statistical analysis was conducted to seek distinctive characteristics of three groups of traffic operating conditions: preincident, postincident, and nonincident. The study results showed that the spatiotemporal characteristics of each of the three groups were not discernible. Although the distributions of performance measures within each group are statistically different, no consistent pattern was detected to imply that certain characteristics could increase the likelihood of incidents or identify precursory conditions to incidents.


2019 ◽  
Author(s):  
Suranga N Kasthurirathne ◽  
Shaun Grannis ◽  
Paul K Halverson ◽  
Justin Morea ◽  
Nir Menachemi ◽  
...  

BACKGROUND Emerging interest in precision health and the increasing availability of patient- and population-level data sets present considerable potential to enable analytical approaches to identify and mitigate the negative effects of social factors on health. These issues are not satisfactorily addressed in typical medical care encounters, and thus, opportunities to improve health outcomes, reduce costs, and improve coordination of care are not realized. Furthermore, methodological expertise on the use of varied patient- and population-level data sets and machine learning to predict need for supplemental services is limited. OBJECTIVE The objective of this study was to leverage a comprehensive range of clinical, behavioral, social risk, and social determinants of health factors in order to develop decision models capable of identifying patients in need of various wraparound social services. METHODS We used comprehensive patient- and population-level data sets to build decision models capable of predicting need for behavioral health, dietitian, social work, or other social service referrals within a safety-net health system using area under the receiver operating characteristic curve (AUROC), sensitivity, precision, F1 score, and specificity. We also evaluated the value of population-level social determinants of health data sets in improving machine learning performance of the models. RESULTS Decision models for each wraparound service demonstrated performance measures ranging between 59.2%% and 99.3%. These results were statistically superior to the performance measures demonstrated by our previous models which used a limited data set and whose performance measures ranged from 38.2% to 88.3% (behavioural health: F1 score <i>P</i>&lt;.001, AUROC <i>P</i>=.01; social work: F1 score <i>P</i>&lt;.001, AUROC <i>P</i>=.03; dietitian: F1 score <i>P</i>=.001, AUROC <i>P</i>=.001; other: F1 score <i>P</i>=.01, AUROC <i>P</i>=.02); however, inclusion of additional population-level social determinants of health did not contribute to any performance improvements (behavioural health: F1 score <i>P</i>=.08, AUROC <i>P</i>=.09; social work: F1 score <i>P</i>=.16, AUROC <i>P</i>=.09; dietitian: F1 score <i>P</i>=.08, AUROC <i>P</i>=.14; other: F1 score <i>P</i>=.33, AUROC <i>P</i>=.21) in predicting the need for referral in our population of vulnerable patients seeking care at a safety-net provider. CONCLUSIONS Precision health–enabled decision models that leverage a wide range of patient- and population-level data sets and advanced machine learning methods are capable of predicting need for various wraparound social services with good performance.


2019 ◽  
Author(s):  
Anna C. Gilbert ◽  
Alexander Vargo

AbstractHere, we evaluate the performance of a variety of marker selection methods on scRNA-seq UMI counts data. We test on an assortment of experimental and synthetic data sets that range in size from several thousand to one million cells. In addition, we propose several performance measures for evaluating the quality of a set of markers when there is no known ground truth. According to these metrics, most existing marker selection methods show similar performance on experimental scRNA-seq data; thus, the speed of the algorithm is the most important consid-eration for large data sets. With this in mind, we introduce RANKCORR, a fast marker selection method with strong mathematical underpinnings that takes a step towards sensible multi-class marker selection.


2005 ◽  
Vol 16 (2) ◽  
pp. 277-307 ◽  
Author(s):  
Sérgio D. Sousa ◽  
Elaine Aspinwall ◽  
Paulo A. Sampaio ◽  
A. Guimarães Rodrigues

2018 ◽  
Vol 19 (12) ◽  
pp. 780-782
Author(s):  
Zbigniew Łukasik ◽  
Bartłomiej Ulatowski ◽  
Łukasz Łukasik

The following article shows how to use large data sets using logistic data using radio frequency identification (RFID) technology. First of all, the so-called RFID-Cuboids processors are introduced to create a data warehouse so that logistics data managed via RFID technology can be highly integrated in terms of specific logic and operations. Second, the tables are used to combine r data to increase information quality and reduce the data set volume.


10.2196/16129 ◽  
2020 ◽  
Vol 8 (7) ◽  
pp. e16129 ◽  
Author(s):  
Suranga N Kasthurirathne ◽  
Shaun Grannis ◽  
Paul K Halverson ◽  
Justin Morea ◽  
Nir Menachemi ◽  
...  

Background Emerging interest in precision health and the increasing availability of patient- and population-level data sets present considerable potential to enable analytical approaches to identify and mitigate the negative effects of social factors on health. These issues are not satisfactorily addressed in typical medical care encounters, and thus, opportunities to improve health outcomes, reduce costs, and improve coordination of care are not realized. Furthermore, methodological expertise on the use of varied patient- and population-level data sets and machine learning to predict need for supplemental services is limited. Objective The objective of this study was to leverage a comprehensive range of clinical, behavioral, social risk, and social determinants of health factors in order to develop decision models capable of identifying patients in need of various wraparound social services. Methods We used comprehensive patient- and population-level data sets to build decision models capable of predicting need for behavioral health, dietitian, social work, or other social service referrals within a safety-net health system using area under the receiver operating characteristic curve (AUROC), sensitivity, precision, F1 score, and specificity. We also evaluated the value of population-level social determinants of health data sets in improving machine learning performance of the models. Results Decision models for each wraparound service demonstrated performance measures ranging between 59.2%% and 99.3%. These results were statistically superior to the performance measures demonstrated by our previous models which used a limited data set and whose performance measures ranged from 38.2% to 88.3% (behavioural health: F1 score P<.001, AUROC P=.01; social work: F1 score P<.001, AUROC P=.03; dietitian: F1 score P=.001, AUROC P=.001; other: F1 score P=.01, AUROC P=.02); however, inclusion of additional population-level social determinants of health did not contribute to any performance improvements (behavioural health: F1 score P=.08, AUROC P=.09; social work: F1 score P=.16, AUROC P=.09; dietitian: F1 score P=.08, AUROC P=.14; other: F1 score P=.33, AUROC P=.21) in predicting the need for referral in our population of vulnerable patients seeking care at a safety-net provider. Conclusions Precision health–enabled decision models that leverage a wide range of patient- and population-level data sets and advanced machine learning methods are capable of predicting need for various wraparound social services with good performance.


Sign in / Sign up

Export Citation Format

Share Document