scholarly journals Federated Synthetic Learning from Multi-institutional and Heterogeneous Medical Data

Author(s):  
Qi Chang ◽  
Zhennan Yan ◽  
Hui Qu ◽  
Han Zhang ◽  
Lohendran Baskaran ◽  
...  

Abstract Statistically and information-wise adequate data plays a critical role in training a robust deep learning model. However, collecting sufficient medical data to train a centralized model is still challenging due to various constraints such as privacy regulations and security. In this work, we develop a novel privacy-preserving federated-discriminator GAN, named FedD-GAN, that can learn and synthesize high-quality and various medical images regardless of their type, from heterogeneous datasets residing in multiple data centers whose data cannot be transferred or shared. We trained and evaluated FedD-GAN on three essential classes of medical data, each involving different types of medical images: cardiac CTA, brain MRI, and histopathology. We show that the synthesized images using our method have better quality than using a standard federated learning method and are realistic and accurate enough to train accurate segmentation models in downstream tasks. The segmentation model trained on the synthetic data only is comparable to that trained on an all-in-one real-image dataset shared from multiple data centers if possible. FedD-GAN can learn to generate a scalable and diverse synthetic database without compromising data privacy. This synthetic database could help to boost machine learning techniques in medical data analytics.

Author(s):  
Dhamanpreet Kaur ◽  
Matthew Sobiesk ◽  
Shubham Patil ◽  
Jin Liu ◽  
Puran Bhagat ◽  
...  

Abstract Objective This study seeks to develop a fully automated method of generating synthetic data from a real dataset that could be employed by medical organizations to distribute health data to researchers, reducing the need for access to real data. We hypothesize the application of Bayesian networks will improve upon the predominant existing method, medBGAN, in handling the complexity and dimensionality of healthcare data. Materials and Methods We employed Bayesian networks to learn probabilistic graphical structures and simulated synthetic patient records from the learned structure. We used the University of California Irvine (UCI) heart disease and diabetes datasets as well as the MIMIC-III diagnoses database. We evaluated our method through statistical tests, machine learning tasks, preservation of rare events, disclosure risk, and the ability of a machine learning classifier to discriminate between the real and synthetic data. Results Our Bayesian network model outperformed or equaled medBGAN in all key metrics. Notable improvement was achieved in capturing rare variables and preserving association rules. Discussion Bayesian networks generated data sufficiently similar to the original data with minimal risk of disclosure, while offering additional transparency, computational efficiency, and capacity to handle more data types in comparison to existing methods. We hope this method will allow healthcare organizations to efficiently disseminate synthetic health data to researchers, enabling them to generate hypotheses and develop analytical tools. Conclusion We conclude the application of Bayesian networks is a promising option for generating realistic synthetic health data that preserves the features of the original data without compromising data privacy.


Author(s):  
Bruce Mellado ◽  
Jianhong Wu ◽  
Jude Dzevela Kong ◽  
Nicola Luigi Bragazzi ◽  
Ali Asgary ◽  
...  

COVID-19 is imposing massive health, social and economic costs. While many developed countries have started vaccinating, most African nations are waiting for vaccine stocks to be allocated and are using clinical public health (CPH) strategies to control the pandemic. The emergence of variants of concern (VOC), unequal access to the vaccine supply and locally specific logistical and vaccine delivery parameters, add complexity to national CPH strategies and amplify the urgent need for effective CPH policies. Big data and artificial intelligence machine learning techniques and collaborations can be instrumental in an accurate, timely, locally nuanced analysis of multiple data sources to inform CPH decision-making, vaccination strategies and their staged roll-out. The Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC) has been established to develop and employ machine learning techniques to design CPH strategies in Africa, which requires ongoing collaboration, testing and development to maximize the equity and effectiveness of COVID-19-related CPH interventions.


Predictive modelling is a mathematical technique which uses Statistics for prediction, due to the rapid growth of data over the cloud system, data mining plays a significant role. Here, the term data mining is a way of extracting knowledge from huge data sources where it’s increasing the attention in the field of medical application. Specifically, to analyse and extract the knowledge from both known and unknown patterns for effective medical diagnosis, treatment, management, prognosis, monitoring and screening process. But the historical medical data might include noisy, missing, inconsistent, imbalanced and high dimensional data.. This kind of data inconvenience lead to severe bias in predictive modelling and decreased the data mining approach performances. The various pre-processing and machine learning methods and models such as Supervised Learning, Unsupervised Learning and Reinforcement Learning in recent literature has been proposed. Hence the present research focuses on review and analyses the various model, algorithm and machine learning technique for clinical predictive modelling to obtain high performance results from numerous medical data which relates to the patients of multiple diseases.


Author(s):  
Patrícia C. T. Gonçalves ◽  
Ana S. Moura ◽  
M. Natália D. S. Cordeiro ◽  
Pedro Campos

The increasing use of medical software as an interface between patients and medical staff has raised alarming questions on the safety of data privacy and assurance of patients' rights. This issue has reached a new level with the emergent use of medical social networks in Health Information Systems. Medical networks, which work as an interface between the patient medical data and geographical and/or social connections, as well as between the patient individual needs and the attending medical doctor, can allow feasible and fast visualization/information systems. As new models for medical social networks and health data visualization and information systems are planned and presented, the need for protocols regarding data privacy in this context is becoming a subject of analysis and discussion. This chapter reviews the evolution and status quo of prospective medical social networks within data privacy and patients' rights, and discusses the ideal model and its future venues and interaction with ethics in the areas of Law, Health Policies, and Human Rights.


2022 ◽  
pp. 107-131
Author(s):  
Dhruti P. Sharma ◽  
Devesh C. Jinwala

E-health is a cloud-based system to store and share medical data with the stakeholders. From a security perspective, the stored data are in encrypted form that could further be searched by the stakeholders through searchable encryption (SE). Practically, an e-health system with support of multiple stakeholders (that may work as either data owner [writer] or user [reader]) along with the provision of multi-keyword search is desirable. However, the existing SE schemes either support multi-keyword search in multi-reader setting or offer multi-writer, multi-reader mechanism along with single-keyword search only. This chapter proposes a multi-keyword SE for an e-health system in multi-writer multi-reader setting. With this scheme, any registered writer could share data with any registered reader with optimal storage-computational overhead on writer. The proposed scheme offers conjunctive search with optimal search complexity at server. It also ensures security to medical records and privacy of keywords. The theoretical and empirical analysis demonstrates the effectiveness of the proposed work.


Author(s):  
Mahmoud Barhamgi ◽  
Djamal Benslimane ◽  
Chirine Ghedira ◽  
Brahim Medjahed

Recent years have witnessed a growing interest in using Web services as a reliable means for medical data sharing inside and across healthcare organizations. In such service-based data sharing environments, Web service composition emerged as a viable approach to query data scattered across independent locations. Patient data privacy preservation is an important aspect that must be considered when composing medical Web services. In this paper, the authors show how data privacy can be preserved when composing and executing Web services. Privacy constraints are expressed in the form of RDF queries over a mediated ontology. Query rewriting algorithms are defined to process those queries while preserving users’ privacy.


2019 ◽  
Vol 90 (e7) ◽  
pp. A42.3-A42
Author(s):  
Mastura Monif ◽  
Shokoufeh Abdollahi ◽  
Jim Stankovich ◽  
Vicki Maltby ◽  
Jeannette Lechner-Scott ◽  
...  

IntroductionCladribine Tablets (Mavenclad®) is nucleoside analogue of deoxyadenosine, and an oral treatment for relapsing remitting MS (RRMS). In RRMS clinical trials, Cladribine has been shown to reduce brain atrophy, relapse rates, and new lesions on brain MRI. P2X7R is a purinergic receptor expressed in innate immune cells, and is thought to play a critical role in neuroinflammation. The mechanism of action of Cladribine on peripheral innate immune cells (monocytes), and its effect on P2X7R, is unclear, and forms the basis of this study.MethodsThis will be a Phase IV, multi-centre, 3 year, translational trial. Patients who are starting Cladribine as part of their routine clinical care will consent to take part in the study. Monocyte numbers and activation states will be measured at various times prior and after commencement of therapy. In addition, and in an in vitro setting the effect of Cladribine on P2X7R expression and function will be assessed, as well as measuring various cytokines/chemokines in serum. The laboratory data will also be correlated with clinical data from another long-term Cladribine study, CLOBAS.ResultsThis study has been approved by Alfred Health Human Research Ethics Committee. The study is to commence in April 2019.ConclusionThis study will shed light on whether Cladribine is exerting its beneficial effects via action on peripheral monocytes and alterations of their P2X7Rs. The laboratory and clinical data will be analysed to understand the relationship between innate immune parameters and patient outcome.


2013 ◽  
Vol 3 (4) ◽  
pp. 31-46 ◽  
Author(s):  
Hanaa Ismail Elshazly ◽  
Ahmad Taher Azar ◽  
Aboul Ella Hassanien ◽  
Abeer Mohamed Elkorany

Computational intelligence provides the biomedical domain by a significant support. The application of machine learning techniques in medical applications have been evolved from the physician needs. Screening, medical images, pattern classification, prognosis are some examples of health care support systems. Typically medical data has its own characteristics such as huge size and features, continuous and real attributes that refer to patients' investigations. Therefore, discretization and feature selection process are considered a key issue in improving the extracted knowledge from patients' investigations records. In this paper, a hybrid system that integrates Rough Set (RS) and Genetic Algorithm (GA) is presented for the efficient classification of medical data sets of different sizes and dimensionalities. Genetic Algorithm is applied with the aim of reducing the dimension of medical datasets and RS decision rules were used for efficient classification. Furthermore, the proposed system applies the Entropy Gain Information (EI) for discretization process. Four biomedical data sets are tested by the proposed system (EI-GA-RS), and the highest score was obtained through three different datasets. Other different hybrid techniques shared the proposed technique the highest accuracy but the proposed system preserves its place as one of the highest results systems four three different sets. EI as discretization technique also is a common part for the best results in the mentioned datasets while RS as an evaluator realized the best results in three different data sets.


2017 ◽  
Vol 19 (4) ◽  
pp. 593-620
Author(s):  
Vinod K. Aggarwal ◽  
Simon J. Evenett

AbstractDespite initial intentions to better align transatlantic regulation and associated practices in the negotiation of the Transatlantic Trade and Investment Partnership (TTIP), this was not possible for rules concerning genetically modified organisms and data privacy. By 2016 both matters effectively fell off the TTIP negotiating agenda. This paper identifies the factors responsible, specifically the critical role played by independent regulatory agencies and associated bureaucratic politics, transnational coalitions of private sector organizations, and non-government organizations and contingency. These factors are not exclusive to the two salient regulations considered here, with the implication that the identification of cross-border spillovers is at best a necessary condition for the successful negotiation of binding trade rules on behind-the-border government policies.


Sign in / Sign up

Export Citation Format

Share Document