A Parallel Unmixing-Based Content Retrieval System for Distributed Hyperspectral Imagery Repository on Cloud Computing Platforms

2021 ◽  
Vol 13 (2) ◽  
pp. 176
Author(s):  
Peng Zheng ◽  
Zebin Wu ◽  
Jin Sun ◽  
Yi Zhang ◽  
Yaoqin Zhu ◽  
...  

As the volume of remotely sensed data grows significantly, content-based image retrieval (CBIR) becomes increasingly important, especially on cloud computing platforms, which facilitate processing and storing big data in a parallel and distributed way. This paper proposes a novel parallel CBIR system for a hyperspectral image (HSI) repository on cloud computing platforms that retrieves hyperspectral scenes under the guidance of unmixed spectral information, i.e., endmembers and their associated fractional abundances. However, existing unmixing methods suffer from an extremely high computational burden when extracting metadata from large-scale HSI data. To address this limitation, we implement a distributed, parallel unmixing method that operates on cloud computing platforms to accelerate the unmixing processing flow. In addition, we implement a globally standardized, distributed HSI repository equipped with a large spectral library in a software-as-a-service mode, providing users with HSI storage, management, and retrieval services through web interfaces. Furthermore, the parallel implementation of unmixing processing is incorporated into the CBIR system to establish the parallel unmixing-based content retrieval system. The performance of the proposed parallel CBIR system was verified in terms of both unmixing efficiency and accuracy.
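
The abstract does not detail how the unmixed metadata drives retrieval; the following is a minimal sketch of the general idea under stated assumptions, summarizing each scene by its mean fractional-abundance vector and ranking scenes by distance to the query signature (the function names and the NNLS-based unmixing choice are hypothetical, not the paper's method):

```python
# Minimal sketch (not the paper's implementation): unmixing-based retrieval.
# Each scene is summarized by its mean fractional-abundance vector; query
# scenes are then matched by Euclidean distance between these signatures.
import numpy as np
from scipy.optimize import nnls

def abundance_signature(pixels, endmembers):
    """pixels: (n_pixels, n_bands); endmembers: (n_bands, n_endmembers)."""
    abundances = np.array([nnls(endmembers, p)[0] for p in pixels])
    abundances /= abundances.sum(axis=1, keepdims=True) + 1e-12  # sum-to-one
    return abundances.mean(axis=0)  # the scene's retrieval signature

def retrieve(query_sig, repository_sigs, k=5):
    """Rank repository scenes by distance to the query signature."""
    dists = {scene_id: np.linalg.norm(query_sig - sig)
             for scene_id, sig in repository_sigs.items()}
    return sorted(dists, key=dists.get)[:k]
```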

Author(s):  
Natasha Csicsmann ◽  
Victoria McIntyre ◽  
Patrick Shea ◽  
Syed S. Rizvi

Strong authentication and encryption schemes help cloud stakeholders perform robust and accurate auditing of a potential service provider. All security-related issues and challenges therefore need to be addressed before cloud computing can be ubiquitously adopted. In this chapter, the authors provide an overview of existing biometrics-based security technologies and discuss some of the open research issues that must be addressed to make biometric technology an effective tool for cloud computing security. Finally, the chapter provides a performance analysis of large-scale biometrics-based authentication systems on different cloud computing platforms.


Author(s):  
Wagner Al Alam ◽  
Francisco Carvalho Junior

The efforts to make cloud computing suitable for the requirements of HPC applications have motivated us to design HPC Shelf, a cloud computing platform of services for building and deploying parallel computing systems for large-scale parallel processing. We introduce Alite, the system of contextual contracts of HPC Shelf, aimed at selecting component implementations according to the requirements of applications, the features of the target parallel computing platforms (e.g., clusters), QoS (Quality-of-Service) properties, and cost restrictions. It is evaluated through a small-scale case study employing a component-based framework for matrix multiplication based on the BLAS library.
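
Alite's contract language is not specified in this abstract; as a rough illustration only, here is a minimal sketch of contract-based implementation selection under hypothetical names, filtering candidates by platform, QoS, and cost and picking the cheapest feasible one:

```python
# Minimal sketch (hypothetical names, not Alite's actual model): select a
# component implementation whose contract satisfies platform, QoS and cost.
from dataclasses import dataclass

@dataclass
class Contract:
    name: str
    platform: str        # e.g. "cluster", "gpu-cluster"
    gflops: float        # advertised QoS: sustained performance
    cost_per_hour: float

def select(candidates, platform, min_gflops, max_cost):
    """Return the cheapest contract meeting all constraints, or None."""
    feasible = [c for c in candidates
                if c.platform == platform
                and c.gflops >= min_gflops
                and c.cost_per_hour <= max_cost]
    return min(feasible, key=lambda c: c.cost_per_hour, default=None)

impls = [Contract("blas-openmp", "cluster", 120.0, 0.40),
         Contract("blas-mpi", "cluster", 300.0, 0.95)]
print(select(impls, "cluster", min_gflops=200.0, max_cost=1.0))  # blas-mpi
```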


2019 ◽  
Vol 36 (1) ◽  
pp. 1-9 ◽  
Author(s):  
Vahid Jalili ◽  
Enis Afgan ◽  
James Taylor ◽  
Jeremy Goecks

Abstract Motivation Large biomedical datasets, such as those from genomics and imaging, are increasingly being stored on commercial and institutional cloud computing platforms. This is because cloud-scale computing resources, from robust backup to high-speed data transfer to scalable compute and storage, are needed to make these large datasets usable. However, one challenge for large-scale biomedical data on the cloud is providing secure access, especially when datasets are distributed across platforms. While there are open Web protocols for secure authentication and authorization, these protocols are not in wide use in bioinformatics and are difficult to use even for technologically sophisticated users. Results We have developed a generic and extensible approach for securely accessing biomedical datasets distributed across cloud computing platforms. Our approach combines OpenID Connect and OAuth2, best-practice Web protocols for authentication and authorization, with Galaxy (https://galaxyproject.org), a web-based computational workbench used by thousands of scientists across the world. With our enhanced version of Galaxy, users can access and analyze data distributed across multiple cloud computing providers without any special knowledge of access/authorization protocols. Our approach does not require users to share permanent credentials (e.g. username, password, API key), instead relying on automatically generated temporary tokens that refresh as needed. Our approach is generalizable to most identity providers and cloud computing platforms. To the best of our knowledge, Galaxy is the only computational workbench where users can access biomedical datasets across multiple cloud computing platforms using best-practice Web security approaches and thereby minimize risks of unauthorized data access and credential use. Availability and implementation Freely available for academic and commercial use under the open-source Academic Free License (https://opensource.org/licenses/AFL-3.0) from the following GitHub repositories: https://github.com/galaxyproject/galaxy and https://github.com/galaxyproject/cloudauthz.
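
CloudAuthz's internals are not described here; the sketch below illustrates only the general OAuth2 pattern the abstract relies on, exchanging a long-lived refresh token for a short-lived access token at an identity provider's token endpoint (the endpoint URL and client credentials are hypothetical):

```python
# Minimal sketch: refresh a short-lived OAuth2 access token on demand,
# so permanent credentials are never stored by the workbench itself.
import time
import requests

TOKEN_URL = "https://idp.example.org/oauth2/token"  # hypothetical endpoint
_cache = {"access_token": None, "expires_at": 0.0}

def get_access_token(client_id, client_secret, refresh_token):
    """Return a valid access token, refreshing it when near expiry."""
    if _cache["expires_at"] > time.time() + 60:
        return _cache["access_token"]
    resp = requests.post(TOKEN_URL, data={
        "grant_type": "refresh_token",      # standard OAuth2 grant
        "refresh_token": refresh_token,
        "client_id": client_id,
        "client_secret": client_secret,
    })
    resp.raise_for_status()
    tok = resp.json()
    _cache["access_token"] = tok["access_token"]
    _cache["expires_at"] = time.time() + tok.get("expires_in", 3600)
    return _cache["access_token"]
```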


2016 ◽  
Vol 2016 ◽  
pp. 1-9
Author(s):  
Zebin Wu ◽  
Jinping Gu ◽  
Yonglong Li ◽  
Fu Xiao ◽  
Jin Sun ◽  
...  

Due to the increasing dimensionality and volume of remotely sensed hyperspectral data, the development of acceleration techniques for massive hyperspectral image analysis is a very important challenge. Cloud computing offers many possibilities for distributed processing of hyperspectral datasets. This paper proposes a novel distributed parallel endmember extraction method based on iterative error analysis that utilizes cloud computing principles to efficiently process massive hyperspectral data. The proposed method takes advantage of technologies including the MapReduce programming model, the Hadoop Distributed File System (HDFS), and Apache Spark to realize a distributed parallel implementation of hyperspectral endmember extraction, which significantly accelerates hyperspectral processing and provides high-throughput access to large hyperspectral data. The experimental results, obtained by extracting endmembers from hyperspectral datasets on a cloud computing platform built on a cluster, demonstrate the effectiveness and computational efficiency of the proposed method.
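
As an illustration, here is a simplified sketch of the iterative-error-analysis idea on Spark, assuming each iteration promotes the pixel with the largest least-squares reconstruction error under the current endmembers to a new endmember (a simplification of the full IEA algorithm; all names are hypothetical, not the paper's code):

```python
# Minimal sketch (a simplification of IEA, not the paper's code): each
# iteration promotes the worst-reconstructed pixel to a new endmember.
import numpy as np
from pyspark import SparkContext

def recon_error(pixel, E):
    """Unconstrained least-squares reconstruction error for one pixel."""
    a, *_ = np.linalg.lstsq(E, pixel, rcond=None)
    return float(np.linalg.norm(pixel - E @ a))

def iea(sc, pixels_rdd, n_endmembers):
    mean = pixels_rdd.reduce(lambda x, y: x + y) / pixels_rdd.count()
    endmembers = [mean]                     # seed with the scene mean
    for _ in range(n_endmembers):
        E = sc.broadcast(np.column_stack(endmembers))
        worst = pixels_rdd.max(key=lambda p: recon_error(p, E.value))
        endmembers.append(worst)
    return np.column_stack(endmembers[1:])  # drop the mean seed

sc = SparkContext(appName="IEA-sketch")
pixels = sc.parallelize([np.random.rand(50) for _ in range(10_000)], 8)
print(iea(sc, pixels, n_endmembers=5).shape)  # (50, 5)
```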


2014 ◽  
Vol 3 (3) ◽  
pp. 158-171
Author(s):  
Mohamad Masood Javidi ◽  
Najme Mansouri ◽  
Asghar Asadi Karam

Recently, the cloud computing paradigm has been receiving particular excitement and attention in new research. Cloud computing has the potential to change a large part of the IT industry, making software even more attractive as a service and shaping the way IT hardware is designed and purchased. Developers with novel ideas for new Internet services no longer require large capital outlays in hardware to deploy their service or the human expense to operate it. These cloud applications rely on large data centers and powerful servers that host Web applications and Web services. This report presents an overview of what cloud computing means and its history, along with its advantages and disadvantages. In this paper we describe the problems and opportunities of deploying data management on these emerging cloud computing platforms. We argue that large-scale data analysis jobs, decision support systems, and application-specific data marts are more likely to benefit from cloud computing platforms than operational, transactional database systems.


2018 ◽  
Vol 12 (8) ◽  
pp. 69 ◽  
Author(s):  
Faten Hamad

Hadoop is an open-source cloud computing system used in large-scale data processing. It has become a basic computing platform for many Internet companies. With the Hadoop platform, users can develop cloud computing applications and then submit tasks to the platform. Hadoop has strong fault tolerance, and the number of cluster nodes can easily be increased, linearly expanding the cluster size so that clusters can process larger datasets. However, Hadoop has some shortcomings, especially those exposed by the MapReduce scheduler in actual use, which call for more research on Hadoop scheduling algorithms. This survey provides an overview of the default Hadoop scheduler algorithms and the problems they have. It also compares five Hadoop framework scheduling algorithms in terms of the default scheduler algorithm to be enhanced, the proposed scheduler algorithm, the type of cluster targeted (heterogeneous or homogeneous), the methodology, and cluster classification based on performance evaluation. Finally, a new algorithm based on capacity scheduling that uses prospective resource utilization to enhance Hadoop scheduling is proposed.
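
The proposed algorithm is not spelled out in the abstract; the snippet below is only a minimal sketch of the capacity-scheduling idea it builds on, favoring the queue furthest below its guaranteed capacity and then the least-utilized node (all names and numbers are hypothetical):

```python
# Minimal sketch (hypothetical, not the survey's algorithm): capacity-style
# scheduling that prefers the queue with the most unused guaranteed share,
# then places the task on the least-utilized node.
from dataclasses import dataclass, field

@dataclass
class Queue:
    name: str
    capacity: float              # guaranteed share of the cluster (0..1)
    running: int = 0             # tasks currently running
    pending: list = field(default_factory=list)

def pick_queue(queues, cluster_slots):
    """Choose the queue furthest below its guaranteed capacity."""
    def headroom(q):
        return q.capacity - q.running / max(cluster_slots, 1)
    eligible = [q for q in queues if q.pending]
    return max(eligible, key=headroom, default=None)

def pick_node(nodes):
    """nodes: {name: utilization in [0, 1]}; choose the least loaded."""
    return min(nodes, key=nodes.get)

queues = [Queue("analytics", 0.7, running=3, pending=["job-a"]),
          Queue("adhoc", 0.3, running=1, pending=["job-b"])]
print(pick_queue(queues, cluster_slots=10).name)  # analytics
print(pick_node({"node-1": 0.8, "node-2": 0.4}))  # node-2
```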


Author(s):  
Seshu B. Nimmala ◽  
Solomon C. Yim ◽  
Stephan T. Grilli

This paper presents a parallel implementation and validation of an accurate and efficient three-dimensional computational model (3D numerical wave tank), based on fully nonlinear potential flow (FNPF) theory, and its extension to incorporate the motion of a laboratory snake piston wavemaker, as well as an absorbing beach, to simulate experiments in a large-scale 3D wave basin. This work is part of a long-term effort to develop a “virtual” computational wave basin to facilitate and complement large-scale physical wave-basin experiments. The code is based on a higher-order boundary-element method combined with a fast multipole algorithm (FMA). Particular effort was devoted to making the code efficient for large-scale simulations on high-performance computing platforms. The numerical simulation capability can be tailored to serve as an optimization tool at the planning and detailed design stages of large-scale experiments at a specific basin by duplicating its exact physical and algorithmic features. To date, the waves that can be generated in the numerical wave tank (NWT) include solitary, cnoidal, and Airy waves. In this paper we detail the wave-basin model, mathematical formulation, and wave generation, and analyze the performance of the parallelized FNPF-BEM-FMA code as a function of numerical parameters. Experimental or analytical comparisons with NWT results are provided for several cases to assess the accuracy and applicability of the numerical model to practical engineering problems.
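
As a small illustration of the wave-generation side, here is a sketch of the classical first-order (Boussinesq) solitary-wave surface profile that such NWTs commonly generate; the code is illustrative only and not taken from the paper:

```python
# Minimal sketch: first-order (Boussinesq) solitary wave profile,
#   eta(x, t) = H * sech^2( sqrt(3H / (4 h^3)) * (x - c t) ),
# with celerity c = sqrt(g (h + H)).
import numpy as np

def solitary_wave(x, t, H, h, g=9.81):
    """Free-surface elevation of a solitary wave of height H in depth h."""
    c = np.sqrt(g * (h + H))             # wave celerity
    k = np.sqrt(3.0 * H / (4.0 * h**3))  # effective wavenumber
    return H / np.cosh(k * (x - c * t))**2

x = np.linspace(-50.0, 50.0, 1001)           # along-tank coordinate [m]
eta = solitary_wave(x, t=0.0, H=0.5, h=2.0)  # 0.5 m wave in 2 m depth
print(f"crest elevation: {eta.max():.3f} m")
```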


2014 ◽  
Vol 3 (2) ◽  
pp. 440-445
Author(s):  
Atefeh Heydari ◽  
Mohammad Ali Tavakoli ◽  
Mohammad Riazi

Traditionally, the computational needs of organizations were met by purchasing, updating, and maintaining the required equipment. Besides the expensive devices themselves, the physical space to hold them, the technical staff to maintain them, and many other side costs were essential prerequisites. Nowadays, with the development of cloud computing services, a huge number of people and organizations have their computational needs served by large-scale computing platforms. The offer of enormous amounts of economical compute resources on demand motivates organizations to outsource their computational needs incrementally. Public cloud computing vendors offer their infrastructure to customers via the Internet, which means that control of customers’ data is no longer in their own hands. Unfortunately, various security issues emerge from this arrangement. In this paper, the security issues of public cloud computing are reviewed, and the most destructive ones are highlighted so that organizations can make better-informed decisions about moving to the cloud.

