Performance Monitoring for Distributed Service Oriented Grid Architecture

Author(s):  
Liang Peng ◽  
Melvin Koh ◽  
Jie Song ◽  
Simon See
Author(s):  
Yang Zhang

In IoT (Internet of Things) scenarios, lots of things and services are connected and coordinated each other. In our work, we first propose a service-oriented publish/subscribe middleware as a construction base of distributed, ultra-scale, and elastic service bus for IoT applications. The IoT services in our solution are then aware of underpinning service communication fabric, where they are event-driven, their interfaces are defined by underlying event topics, their behaviors are specified by event relations, and they can cooperate with the service communication fabric to complete distributed service coordination.


Inventions ◽  
2018 ◽  
Vol 3 (3) ◽  
pp. 62
Author(s):  
Dimosthenis Kyriazis

The emergence of service-oriented architectures has driven the shift towards a service-oriented paradigm, which has been adopted in several application domains. The advent of cloud computing facilities and recently of edge computing environments has increased the aforementioned paradigm shift towards service provisioning. In this context, various “traditional” critical infrastructure components have turned to services, being deployed and managed on top of cloud and edge computing infrastructures. However, the latter poses a specific challenge: the services of the critical infrastructures within and across application verticals/domains (e.g., transportation, health, industrial venues, etc.) need to be continuously available with near-zero downtime. In this context, this paper presents an approach for high-performance monitoring and failure detection of critical infrastructure services that are deployed in virtualized environments. The failure detection framework consists of distributed agents (i.e., monitoring services) to ensure timely collection of monitoring data, while it is enhanced with a voting algorithm to minimize the case of false positives. The goal of the proposed approach is to detect failures in datacenters that support critical infrastructures by targeting both the acquisition of monitoring data in a performant way and the minimization of false positives in terms of potential failure detection. The specific approach is the baseline towards decision making and triggering of actions in runtime to ensure service high availability, given that it provides the required data for decision making on time with high accuracy.


Sign in / Sign up

Export Citation Format

Share Document