scholarly journals Secure Record Linkage of Large Health Data Sets: Evaluation of a Hybrid Cloud Model (Preprint)

2020 ◽  
Author(s):  
Adrian Paul Brown ◽  
Sean M Randall

BACKGROUND The linking of administrative data across agencies provides the capability to investigate many health and social issues with the potential to deliver significant public benefit. Despite its advantages, the use of cloud computing resources for linkage purposes is scarce, with the storage of identifiable information on cloud infrastructure assessed as high risk by data custodians. OBJECTIVE This study aims to present a model for record linkage that utilizes cloud computing capabilities while assuring custodians that identifiable data sets remain secure and local. METHODS A new hybrid cloud model was developed, including privacy-preserving record linkage techniques and container-based batch processing. An evaluation of this model was conducted with a prototype implementation using large synthetic data sets representative of administrative health data. RESULTS The cloud model kept identifiers on premises and uses privacy-preserved identifiers to run all linkage computations on cloud infrastructure. Our prototype used a managed container cluster in Amazon Web Services to distribute the computation using existing linkage software. Although the cost of computation was relatively low, the use of existing software resulted in an overhead of processing of 35.7% (149/417 min execution time). CONCLUSIONS The result of our experimental evaluation shows the operational feasibility of such a model and the exciting opportunities for advancing the analysis of linkage outputs.

10.2196/18920 ◽  
2020 ◽  
Vol 8 (9) ◽  
pp. e18920
Author(s):  
Adrian Paul Brown ◽  
Sean M Randall

Background The linking of administrative data across agencies provides the capability to investigate many health and social issues with the potential to deliver significant public benefit. Despite its advantages, the use of cloud computing resources for linkage purposes is scarce, with the storage of identifiable information on cloud infrastructure assessed as high risk by data custodians. Objective This study aims to present a model for record linkage that utilizes cloud computing capabilities while assuring custodians that identifiable data sets remain secure and local. Methods A new hybrid cloud model was developed, including privacy-preserving record linkage techniques and container-based batch processing. An evaluation of this model was conducted with a prototype implementation using large synthetic data sets representative of administrative health data. Results The cloud model kept identifiers on premises and uses privacy-preserved identifiers to run all linkage computations on cloud infrastructure. Our prototype used a managed container cluster in Amazon Web Services to distribute the computation using existing linkage software. Although the cost of computation was relatively low, the use of existing software resulted in an overhead of processing of 35.7% (149/417 min execution time). Conclusions The result of our experimental evaluation shows the operational feasibility of such a model and the exciting opportunities for advancing the analysis of linkage outputs.


2012 ◽  
Vol 2 (1) ◽  
pp. 44-57 ◽  
Author(s):  
Yannis Siahos ◽  
Iasonas Papanagiotou ◽  
Alkis Georgopoulos ◽  
Fotis Tsamis ◽  
Ioannis Papaioannou

The authors present their experience and practices of introducing cloud services, as a means to simplify the adoption of ICT (Information Communication and Technology) in education, using Free/Open Source Software. The solution creates a hybrid cloud infrastructure, in order to provide a pre-installed (Ubuntu and Linux Terminal Server Project) virtual machine, acting as a server inside the school, providing desktop environment based on the Software as a Service cloud model, where legacy PCs act as stateless devices. Classroom management is accomplished using the application “Epoptes.” To minimize administration tasks, educational software is provided accordingly, either on-line or through repositories to automate software installation (including patches and updates). The advantages of the hybrid cloud implementation, include services that are not completely dependent on broadband connections’ state, minimal cost, reusability of obsolete equipment, ease of administration, centralized management, patches and educational software provisioning and, above all, facilitation of the educational procedure.


The importance of cloud computing standards is the same as the World Wide Web standardization. There are plenty of prevalent standards around cloud computing that make different aspects of cloud computing possible. Standardization is a key answer and solution to the main question in this book (i.e., whether cloud computing will survive and remain on IT trends track or not). Standardization will bring interoperability, integration, and portability to the cloud computing landscape. With these three features, the main elements of IT (i.e., computation and data) can move from one cloud provider to another. Therefore, it eliminates vendor lock-in that is one of the barriers in cloud adoption. In addition, cloud interoperability will minimize cloud fragmentation. We need interoperability and portability to achieve cloud federation and to build hybrid cloud. In addition, there is still no de facto standard for moving workloads or data among different clouds. Cloud standardization needs to be addressed at various layers of a cloud infrastructure such as: virtual machine format, data, interface, context, and identity layers. This chapter reviews the emerging standards from the perspective of various organizations and standard bodies.


Author(s):  
Yannis Siahos ◽  
Iasonas Papanagiotou ◽  
Alkis Georgopoulos ◽  
Fotis Tsamis ◽  
Lefteris Nikoltsios

In this chapter, the authors present the methodology and the results of their effort towards the introduction of cloud services as a means to simplify the adoption of ICT in education using Free/Open Source Software. A hybrid cloud infrastructure is established in order to provide Linux and optionally MS-Windows desktop environments with the Software as a Service cloud model. Legacy and modern school PCs function as stateless devices. To achieve this, their “Sch-scripts” application performs an unattended installation of the Linux Terminal Server Project software to a school computer that also hosts centrally maintained virtual machines. Classroom management is accomplished using their “Epoptes” application. Administration is only required in the school server while the educational software is provided with the Software as a Service model either in online form or through repositories that automate software installation. Four-hundred-twenty schools have already implemented this architecture and 117 responded to the evaluation survey. The statistical analysis of these answers confirms the design principles, which include minimal cost, as well as reusability of obsolete equipment, ease of administration, centralized management, patches and educational software provisioning, classroom management, and above all, facilitation of the educational procedure.


Web Services ◽  
2019 ◽  
pp. 1563-1587
Author(s):  
Wu He ◽  
Feng-Kwei Wang

As a new IT paradigm for users, cloud computing has the potential to transform the way that IT resources are utilized and consumed. Many multinational enterprises (MNEs) are interested in cloud computing but do not know how to adopt and implement cloud computing in their enterprise settings. In an effort to help MNEs understand cloud computing and develop successful enterprise adoption strategies for cloud computing, the authors propose a hybrid cloud model for MNEs and illustrate the utility of this model by using two case studies. Insights for adopting and implementing this model in international settings are provided as well.


2019 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Balan Sundarakani ◽  
Rukshanda Kamran ◽  
Piyush Maheshwari ◽  
Vipul Jain

PurposeSupply chain is the area that requires effective and integrated means of communication, shared risk, collaboration and orchestration in order to work successfully and the cloud computing has lot to offer to this domain. Cloud computing has appeared as a modern paradigm in supply chain networks for creating intelligent industries of future. The purpose of this paper is to propose a framework that can transform supply chain stakeholders toward Industry 4.0.Design/methodology/approachCloud computing is attributed with increasing competitiveness by focusing on cost reduction, greater elasticity, flexibility and maximum utilization of resources which results in successfully achieving business goals. This paper proposes a Hybrid Supply Chain Cloud model, which integrates the infrastructure, the resources and the configurations of platforms.FindingsThis research paper is aimed at proposing a hybrid cloud that essentially helps in integrating the supply chain network with the flexibility and efficiency. It is important that a supply chain network adds value to ensure customer satisfaction and this can be best achieved by collaborating it with hybrid cloud.Research limitations/implicationsThis research provides a consistent central management and comprehensive view of all computing resources, which gives organizations the knowledge they need to optimize workload placement.Practical implicationsThe findings derived from this research aim to facilitate policy makers and practitioners to develop effective courses of action in current and future supply chain management. Therefore, upon implementation, this model can provide long-term benefits for the organizations by improving the overall efficiency of its supply chain network.Originality/valueThe proposed hybrid cloud will provide deep level of integration in Industry 4.0 situation and thereby brought up portable comprehensive infrastructure based on resources and required configuration in real-time environment.


2018 ◽  
Author(s):  
Li Chen ◽  
Bai Zhang ◽  
Michael Schnaubelt ◽  
Punit Shah ◽  
Paul Aiyetan ◽  
...  

ABSTRACTRapid development and wide adoption of mass spectrometry-based proteomics technologies have empowered scientists to study proteins and their modifications in complex samples on a large scale. This progress has also created unprecedented challenges for individual labs to store, manage and analyze proteomics data, both in the cost for proprietary software and high-performance computing, and the long processing time that discourages on-the-fly changes of data processing settings required in explorative and discovery analysis. We developed an open-source, cloud computing-based pipeline, MS-PyCloud, with graphical user interface (GUI) support, for LC-MS/MS data analysis. The major components of this pipeline include data file integrity validation, MS/MS database search for spectral assignment, false discovery rate estimation, protein inference, determination of protein post-translation modifications, and quantitation of specific (modified) peptides and proteins. To ensure the transparency and reproducibility of data analysis, MS-PyCloud includes open source software tools with comprehensive testing and versioning for spectrum assignments. Leveraging public cloud computing infrastructure via Amazon Web Services (AWS), MS-PyCloud scales seamlessly based on analysis demand to achieve fast and efficient performance. Application of the pipeline to the analysis of large-scale iTRAQ/TMT LC-MS/MS data sets demonstrated the effectiveness and high performance of MS-PyCloud. The software can be downloaded at: https://bitbucket.org/mschnau/ms-pycloud/downloads/


Author(s):  
Saifallah Al Kati ◽  
Muhammad Asif Khan

During the recent Coronavirus (Covid-19) pandemic the traditional education system almost halted throughout the world. However, in order to continue with the education without wasting students time most of the countries transferred their teaching online. Although the online teaching is widely used but there are many challenges and security issues specially when the education is disseminated using education cloud. In this article we examine and review such challenges and security issues that may impact students and teachers in various educational institutions in Saudi Arabia. A review of hybrid cloud model is presented in order to benefit across institutions. The research also articulates different ways which can be adopted by educational institutions to provide smooth online teaching due to pandemic of covid-19. We also present some solutions that may help overcome challenges and secure a robust cloud infrastructure.


Author(s):  
Sahil Kamleshwar

Cloud infrastructure and its extensive set of Internet-enabled resources have the potential to provide significant benefits to robots and flexible systems. We look for robots and data-switching programs or code from the network to support their performance, that is, when not all sense, calculation, and memory are integrated into the standalone system. This survey is designed for four possible Cloud benefits: 1) Big Data: access to photo libraries, maps, trajectories, and descriptive data; 2) Cloud Computing: access to the same grid computer with the demand for mathematical analysis, reading, and movement planning; 3) Integrated Robots Learning: robots that share tracking, control policies, and results; and 4) Census: use of crowdourcing to tap people's skills for image and video analysis, classification, reading, and error retrieval. The cloud can also improve robots and flexible systems by providing access to: a) data sets, publications, models, measurements, and simulation tools; b) open competitions for designs and programs; and c) open source software.


Author(s):  
Suvendu Chandan Nayak ◽  
Chitaranjan Tripathy

In this work, the authors propose Multi-criteria Decision-making to schedule deadline based tasks in cloud computing. The existing backfilling task scheduling algorithm could not handle similar tasks for scheduling. In backfilling algorithm, tasks are backfilled to provide ideal resources to schedule other deadline sensitive tasks. However, the task to be backfilled is selected on first come, first serve (FCFS) basis from scheduling queue. The scheduling performances require to be improved when, there are similar tasks. In this proposed work, the authors propose to implement MCDM technique, TOPSIS (Technique for Order Preference by Similarity to Ideal Solution) to improve the performance of the backfilling algorithm through scheduling deadline sensitive tasks in cloud computing. It resolves the conflicts among the similar tasks that is used as a decision support system. The work is simulated with synthetic data sets based on slack values of the tasks. The performance results affirm the task completion and reduction in task rejection compared to the existing backfilling algorithm.


Sign in / Sign up

Export Citation Format

Share Document