scholarly journals Explainable analytics: Understanding causes, correcting errors, and increasingly achieving perfect accuracy from nature of distinguishable patterns

Author(s):  
Hao-Ting Pai ◽  
Chung-Chian Hsu

Abstract In addition to pursue accurate analytics, it is invaluable to clarify how and why inaccuracy exists. We propose a transparent classification method (TC). In training, we discover patterns from positive and negative observations respectively; next, patterns are excluded if they appear in both types. In testing, observations are scored by the pure patterns and connected like social networks. Based on set theory, pure patterns have explanatory power for distinguishing tangled relationship between negative and positive observations. Experimental results demonstrate that TC can identify all positive (e.g., malignant) observations at low ratios of training to testing, e.g., 1:9 in Breast Cancer Wisconsin (Original) and 3:7 in Contraceptive Method Choice dataset. Without fine-tuned parameters and random selection, TC eliminates uncertainty of the methodology. TC can visualize causes, and therefore, prediction errors are traceable and can be corrected. Further, TC shows potential of identifying whether the ground truth is incorrect (e.g., diagnostic errors).

2021 ◽  
Author(s):  
Hao-Ting Pai ◽  
Chung-Chian Hsu

Abstract In addition to pursue accurate analytics, it is invaluable to clarify how and why inaccuracy exists. We propose a transparent classification method (TC). In training, we discover patterns from positive and negative observations respectively; next, patterns are excluded if they appear in both types. In testing, observations are scored by the pure patterns and connected like social networks. Based on set theory, pure patterns have explanatory power for distinguishing tangled relationship between negative and positive observations. Experimental results demonstrate that TC can identify all positive (e.g., malignant) observations at low ratios of training to testing, e.g., 1:9 in Breast Cancer Wisconsin (Original) and 3:7 in Contraceptive Method Choice dataset. Without fine-tuned parameters and random selection, TC eliminates uncertainty of the methodology. TC can visualize causes, and therefore, prediction errors are traceable and can be corrected. Further, TC shows potential of identifying whether the ground truth is incorrect (e.g., diagnostic errors).


2010 ◽  
Vol 81 (3) ◽  
pp. 349-354 ◽  
Author(s):  
Cynthia C. Harper ◽  
Beth A. Brown ◽  
Anne Foster-Rosales ◽  
Tina R. Raine

2020 ◽  
Vol 1 (2) ◽  
pp. 101-123
Author(s):  
Hiroaki Shiokawa ◽  
Yasunori Futamura

This paper addressed the problem of finding clusters included in graph-structured data such as Web graphs, social networks, and others. Graph clustering is one of the fundamental techniques for understanding structures present in the complex graphs such as Web pages, social networks, and others. In the Web and data mining communities, the modularity-based graph clustering algorithm is successfully used in many applications. However, it is difficult for the modularity-based methods to find fine-grained clusters hidden in large-scale graphs; the methods fail to reproduce the ground truth. In this paper, we present a novel modularity-based algorithm, \textit{CAV}, that shows better clustering results than the traditional algorithm. The proposed algorithm employs a cohesiveness-aware vector partitioning into the graph spectral analysis to improve the clustering accuracy. Additionally, this paper also presents a novel efficient algorithm \textit{P-CAV} for further improving the clustering speed of CAV; P-CAV is an extension of CAV that utilizes the thread-based parallelization on a many-core CPU. Our extensive experiments on synthetic and public datasets demonstrate the performance superiority of our approaches over the state-of-the-art approaches.


2010 ◽  
Vol 4 (4) ◽  
pp. 372-380 ◽  
Author(s):  
Jeannette M. Beasley ◽  
Polly A. Newcomb ◽  
Amy Trentham-Dietz ◽  
John M. Hampton ◽  
Rachel M. Ceballos ◽  
...  

2021 ◽  
Author(s):  
Melissa Min-Szu Yao ◽  
Hao Du ◽  
Mikael Hartman ◽  
Wing P. Chan ◽  
Mengling Feng

UNSTRUCTURED Purpose: To develop a novel artificial intelligence (AI) model algorithm focusing on automatic detection and classification of various patterns of calcification distribution in mammographic images using a unique graph convolution approach. Materials and methods: Images from 200 patients classified as Category 4 or 5 according to the American College of Radiology Breast Imaging Reporting and Database System, which showed calcifications according to the mammographic reports and diagnosed breast cancers. The calcification distributions were classified as either diffuse, segmental, regional, grouped, or linear. Excluded were mammograms with (1) breast cancer as a single or combined characterization such as a mass, asymmetry, or architectural distortion with or without calcifications; (2) hidden calcifications that were difficult to mark; or (3) incomplete medical records. Results: A graph convolutional network-based model was developed. 401 mammographic images from 200 cases of breast cancer were divided based on calcification distribution pattern: diffuse (n = 24), regional (n = 111), group (n = 201), linear (n = 8) or segmental (n = 57). The classification performances were measured using metrics including precision, recall, F1 score, accuracy and multi-class area under receiver operating characteristic curve. The proposed achieved precision of 0.483 ± 0.015, sensitivity of 0.606 (0.030), specificity of 0.862 ± 0.018, F1 score of 0.527 ± 0.035, accuracy of 60.642% ± 3.040% and area under the curve of 0.754 ± 0.019, finding method to be superior compared to all baseline models. The predicted linear and diffuse classifications were highly similar to the ground truth, and the predicted grouped and regional classifications were also superior compared to baseline models. Conclusion: The proposed deep neural network framework is an AI solution to automatically detect and classify calcification distribution patterns on mammographic images highly suspected of showing breast cancers. Further study of the AI model in an actual clinical setting and additional data collection will improve its performance.


Contraception ◽  
2018 ◽  
Vol 98 (4) ◽  
pp. 348-349
Author(s):  
K Gifford ◽  
MJ McDuffie ◽  
H Rashid ◽  
E Knight ◽  
M Boudreaux

2020 ◽  
Vol 14 ◽  
pp. 117822342091103
Author(s):  
Sara Dorri ◽  
Asiie Olfatbakhsh ◽  
Farkhondeh Asadi

Introduction: Lymphedema is one of the complications of breast cancer treatment. It has no cure yet and can affect the quality of life. This study aimed to identify and investigate informational needs, preferred delivery methods, and time of receiving information about lymphedema for these patients. Methods: One hundred participants were recruited through Lymphedema Clinic in Motamed Cancer Institute in Tehran, Iran, through convenience sampling and were asked to complete a self-administered survey. Data collection took place on all opening days between October 2018 and mid-March 2019. Results: Most of the participants were above 40 years, have a diploma, homemaker, and the average income of most of the participants (57.2%) was low. The importance of having lymphedema information was very high for them. Most of them wanted detailed information at diagnosis of breast cancer. The preferred information of delivery methods were private sessions and social networks. Conclusions: Patients with breast cancer who have lymphedema have high needs as regards concise lymphedema information. Private sessions with physicians and social networks can provide detailed information for them.


Sign in / Sign up

Export Citation Format

Share Document