Key based Reducer Placement for Data Analytics across Data Centers Considering Bi-level Resource Provision in Cloud Computing

Dynamic Selection of Optimal Cloud Service Provider for Big Data Applications

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.24.12007 ◽

2018 ◽

Vol 7 (2.24) ◽

pp. 92

Author(s):

B V Ram Naresh Yadav ◽

P Anjaiah

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Analytics ◽

Data Centers ◽

Big Data Analytics ◽

Cloud Service ◽

Resource Reservation ◽

Distributed Applications ◽

It Industry ◽

The Cost

Big data analytics and Cloud computing are the two most imperative innovations in the current IT industry. In a surprise, these technologies come up together to convey the effective outcomes to various business organizations. However, big data analytics require a huge amount of resources for storage and computation. The storage cost is massively increased on the input amounts of data and requires innovative algorithms to reduce the cost to store the data in a specific data centers in a cloud. In Today’s IT Industry, Cloud Computing has emerged as a popular paradigm to host customer, enterprise data and many other distributed applications. Cloud Service Providers (CSPs) store huge amounts of data and numerous distributed applications with different cost. For example Amazon provides storage services at a fraction of TB/month and each CSP having different Service Level Agreements with different storage offers. Customers are interested in reliable SLAs and it increases the cost since the number of replicas are more. The CSPs are attracting the users for initial storage/put operations and get operations from the cloud becomes hurdle and subsequently increases the cost. CSPs provides these services by maintaining multiple datacenters at multiple locations throughout the world. These datacenters provide distinctive get/put latencies and unit costs for resource reservation and utilization. The way of choosing distinctive CSPs data centers, becomes tricky for cloud users those who are using the distributed application globally i.e. online social networks. In has mainly two challenges. Firstly, allocating the data to different datacenters to satisfy the SLO including the latency. Secondly, how one can reserve the remote resource i.e. memory with less cost. In this paper we have derived a new model to minimize the cost by satisfying the SLOs with integer programming. Additionally, we proposed an algorithm to store the data in a data center by minimizing the cost among different data centers and the computation of cost for put/get latencies. Our simulation works shows that the cost is minimized for resource reservation and utilization among different datacenters.

Download Full-text

Key based data analytics across data centers considering bi-level resource provision in cloud computing

Future Generation Computer Systems ◽

10.1016/j.future.2016.03.008 ◽

2016 ◽

Vol 62 ◽

pp. 40-50 ◽

Cited By ~ 5

Author(s):

Jiangtao Zhang ◽

Lingmin Zhang ◽

Hejiao Huang ◽

Zeo L. Jiang ◽

Xuan Wang

Keyword(s):

Cloud Computing ◽

Data Analytics ◽

Data Centers

Download Full-text

Trends and Opportunities in Health Analytics as a Service and Implications for Use in Low Resource Settings: A Literature Review Abstract (Preprint)

10.2196/preprints.15737 ◽

2019 ◽

Author(s):

Meghana Bastwadkar ◽

Carolyn McGregor ◽

S Balaji

Keyword(s):

Data Mining ◽

Cloud Computing ◽

Big Data ◽

Intensive Care ◽

Literature Review ◽

Health Monitoring ◽

Data Analytics ◽

Neonatal Intensive Care ◽

Big Data Analytics ◽

Healthcare Facilities

BACKGROUND This paper presents a systematic literature review of existing remote health monitoring systems with special reference to neonatal intensive care (NICU). Articles on NICU clinical decision support systems (CDSSs) which used cloud computing and big data analytics were surveyed. OBJECTIVE The aim of this study is to review technologies used to provide NICU CDSS. The literature review highlights the gaps within frameworks providing HAaaS paradigm for big data analytics METHODS Literature searches were performed in Google Scholar, IEEE Digital Library, JMIR Medical Informatics, JMIR Human Factors and JMIR mHealth and only English articles published on and after 2015 were included. The overall search strategy was to retrieve articles that included terms that were related to “health analytics” and “as a service” or “internet of things” / ”IoT” and “neonatal intensive care unit” / ”NICU”. Title and abstracts were reviewed to assess relevance. RESULTS In total, 17 full papers met all criteria and were selected for full review. Results showed that in most cases bedside medical devices like pulse oximeters have been used as the sensor device. Results revealed a great diversity in data acquisition techniques used however in most cases the same physiological data (heart rate, respiratory rate, blood pressure, blood oxygen saturation) was acquired. Results obtained have shown that in most cases data analytics involved data mining classification techniques, fuzzy logic-NICU decision support systems (DSS) etc where as big data analytics involving Artemis cloud data analysis have used CRISP-TDM and STDM temporal data mining technique to support clinical research studies. In most scenarios both real-time and retrospective analytics have been performed. Results reveal that most of the research study has been performed within small and medium sized urban hospitals so there is wide scope for research within rural and remote hospitals with NICU set ups. Results have shown creating a HAaaS approach where data acquisition and data analytics are not tightly coupled remains an open research area. Reviewed articles have described architecture and base technologies for neonatal health monitoring with an IoT approach. CONCLUSIONS The current work supports implementation of the expanded Artemis cloud as a commercial offering to healthcare facilities in Canada and worldwide to provide cloud computing services to critical care. However, no work till date has been completed for low resource setting environment within healthcare facilities in India which results in scope for research. It is observed that all the big data analytics frameworks which have been reviewed in this study have tight coupling of components within the framework, so there is a need for a framework with functional decoupling of components.

Download Full-text

Energy and Cost-Aware Workflow Scheduling in Cloud Computing Data Centers Using a Multi-objective Optimization Algorithm

Journal of Network and Systems Management ◽

10.1007/s10922-021-09599-4 ◽

2021 ◽

Vol 29 (3) ◽

Author(s):

Ali Mohammadzadeh ◽

Mohammad Masdari ◽

Farhad Soleimanian Gharehchopogh

Keyword(s):

Cloud Computing ◽

Optimization Algorithm ◽

Data Centers ◽

Workflow Scheduling ◽

Multi Objective Optimization ◽

Multi Objective

Download Full-text

Application of Big Data Analytics via Cloud Computing

2016 World Automation Congress (WAC) ◽

10.1109/wac.2016.7582986 ◽

2016 ◽

Cited By ~ 2

Author(s):

Yunus Yetis ◽

Ruthvik Goud Sara ◽

Berat A. Erol ◽

Halid Kaplan ◽

Abdurrahman Akuzum ◽

...

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics

Download Full-text

Data Analytics and Unsupervised Learning Enabled Proactive Maintenance for Optical Transceivers in Hyperscale Data Centers

10.1364/oecc.2021.m4a.3 ◽

2021 ◽

Author(s):

Chunxiao Wang ◽

Lei Wang ◽

Zhicheng Wang ◽

Qin Chen ◽

Peng Wang ◽

...

Keyword(s):

Unsupervised Learning ◽

Data Analytics ◽

Data Centers ◽

Proactive Maintenance ◽

Optical Transceivers

Download Full-text

Multidimensional Resource Scheduling in Cloud Data Centers

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.1008-1009.1513 ◽

2014 ◽

Vol 1008-1009 ◽

pp. 1513-1516

Author(s):

Hai Na Song ◽

Xiao Qing Zhang ◽

Zhong Tang He

Keyword(s):

Cloud Computing ◽

Data Centers ◽

Bin Packing ◽

Virtual Machines ◽

Three Dimensional ◽

Resource Scheduling ◽

Computing Environment ◽

Cloud Computing Environment ◽

Cloud Data ◽

Cloud Data Centers

Cloud computing environment is regarded as a kind of multi-tenant computing mode. With virtulization as a support technology, cloud computing realizes the integration of multiple workloads in one server through the package and seperation of virtual machines. Aiming at the contradiction between the heterogeneous applications and uniform shared resource pool, using the idea of bin packing, the multidimensional resource scheduling problem is analyzed in this paper. We carry out some example analysis in one-dimensional resource scheduling, two-dimensional resource schduling and three-dimensional resource scheduling. The results shows that the resource utilization of cloud data centers will be improved greatly when the resource sheduling is conducted after reorganizing rationally the heterogeneous demands.

Download Full-text

The Smart Triad : Big Data Analytics , Cloud Computing and Internet of Things to shape the Smart Home, Smart City, Smart Business & Smart Country

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1449.0982s1119 ◽

2019 ◽

Vol 8 (2S11) ◽

pp. 3594-3600 ◽

Cited By ~ 1

Keyword(s):

Cloud Computing ◽

Big Data ◽

Internet Of Things ◽

Data Analytics ◽

Smart Home ◽

Service Providers ◽

Irrigation System ◽

Electronic Devices ◽

Big Data Analytics ◽

User Privacy

Big data analytics, cloud computing & internet of things are a smart triad which have started shaping our future towards smart home, city, business, country. Internet of things is a convergence of intelligent networks, electronic devices, and cloud computing. The source of big data at different connected electronic devices is stored on cloud server for analytics. Cloud provides the readymade infrastructure, remote processing power to consumers of internet of things. Cloud computing also gives device manufacturers and service providers access to ―advanced analytics and monitoring‖, ―communication between services and devices‖, ―user privacy and security‖. This paper, presents an overview of internet of things, role of cloud computing & big data analytics towards IoT. In this paper IoT enabled automatic irrigation system is proposed that saves data over ―ThingSpeak‖ database an IoT analytics platform through ESP8266 wifi module. This paper also summarizes the application areas and discusses the challenges of IoT.

Download Full-text

Embarrassingly Parallel Search in Constraint Programming

Journal of Artificial Intelligence Research ◽

10.1613/jair.5247 ◽

2016 ◽

Vol 57 ◽

pp. 421-464 ◽

Cited By ~ 13

Author(s):

Arnaud Malapert ◽

Jean-Charles Régin ◽

Mohamed Rezgui

Keyword(s):

Cloud Computing ◽

Efficient Method ◽

Data Centers ◽

Optimization Problems ◽

State Of The Art ◽

Parallel Search ◽

Simple Method ◽

Constraint Solver ◽

Average Performance ◽

Depth Analysis

We introduce an Embarrassingly Parallel Search (EPS) method for solving constraint problems in parallel, and we show that this method matches or even outperforms state-of-the-art algorithms on a number of problems using various computing infrastructures. EPS is a simple method in which a master decomposes the problem into many disjoint subproblems which are then solved independently by workers. Our approach has three advantages: it is an efficient method; it involves almost no communication or synchronization between workers; and its implementation is made easy because the master and the workers rely on an underlying constraint solver, but does not require to modify it. This paper describes the method, and its applications to various constraint problems (satisfaction, enumeration, optimization). We show that our method can be adapted to different underlying solvers (Gecode, Choco2, OR-tools) on different computing infrastructures (multi-core, data centers, cloud computing). The experiments cover unsatisfiable, enumeration and optimization problems, but do not cover first solution search because it makes the results hard to analyze. The same variability can be observed for optimization problems, but at a lesser extent because the optimality proof is required. EPS offers good average performance, and matches or outperforms other available parallel implementations of Gecode as well as some solvers portfolios. Moreover, we perform an in-depth analysis of the various factors that make this approach efficient as well as the anomalies that can occur. Last, we show that the decomposition is a key component for efficiency and load balancing.

Download Full-text

Actionable Knowledge As A Service (AKAAS): Leveraging big data analytics in cloud computing environments

Journal Of Big Data ◽

10.1186/s40537-015-0023-2 ◽

2015 ◽

Vol 2 (1) ◽

Cited By ~ 9

Author(s):

Audrey Depeige ◽

Dimitri Doyencourt

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Actionable Knowledge ◽

Computing Environments

Download Full-text