Parallel Computing for Mining Association Rules in Distributed P2P Networks

E-Activity and Intelligent Web Construction - Advances in Web Technologies and Engineering ◽

10.4018/978-1-61520-871-5.ch005 ◽

2011 ◽

pp. 47-62

Author(s):

Huiwei Guan

Keyword(s):

Data Mining ◽

Parallel Computing ◽

Distributed Computing ◽

Parallel Algorithm ◽

Association Rules ◽

Distributed Databases ◽

Computing System ◽

Computing Systems ◽

P2p Computing ◽

P2p Systems

Distributed computing and peer-to-peer (P2P) systems have emerged as an active research field that combines techniques from networking, distributed computing, distributed databases, and a variety of distributed applications. Such systems support information systems that scale to large volumes of data across very large numbers of participating nodes. Data mining on large distributed databases is an important research area. Most recent work on mining association rules has focused on a single machine or a client-server network model. This traditional approach does not meet the requirements of large distributed databases and applications in a P2P computing system. Two challenges arise: how to implement data mining for large distributed databases in P2P computing systems, and how to develop parallel data mining algorithms and tools that improve efficiency in such systems. This chapter designs and implements a parallel association rule mining approach for a P2P computing system, one that fits the distributed nature of the system and makes parallel computation practical. The parallel algorithm is analyzed and evaluated against the sequential algorithm; the evaluation indicates that the parallel algorithm offers consistent implementation, higher performance, and good scalability.
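
The abstract does not give the chapter's algorithm, so the following is only a minimal Python sketch of the general idea of parallel support counting: each peer counts itemsets in its own local transactions, and the partial counts are merged against a global threshold. The names `local_counts`, `frequent_itemsets`, and `peers`, the toy transactions, and the use of a thread pool as a stand-in for networked peers are all assumptions made for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from itertools import combinations


def local_counts(transactions, k):
    """Count every k-itemset that occurs in one peer's local transactions."""
    counts = Counter()
    for transaction in transactions:
        # Sort so the same itemset always maps to the same tuple key.
        for itemset in combinations(sorted(set(transaction)), k):
            counts[itemset] += 1
    return counts


def frequent_itemsets(partitions, k, min_support):
    """Merge per-peer counts and keep itemsets that meet the global threshold."""
    # Threads stand in for peers here; a real P2P system would exchange
    # the partial counts over the network instead.
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(lambda part: local_counts(part, k), partitions))
    total = Counter()
    for partial in partials:
        total.update(partial)
    return {itemset: n for itemset, n in total.items() if n >= min_support}


# Hypothetical data: two peers, each holding its own transactions.
peers = [
    [["bread", "milk"], ["bread", "butter", "milk"]],
    [["milk", "butter"], ["bread", "milk"]],
]
result = frequent_itemsets(peers, k=2, min_support=3)
print(result)
```

Only the per-peer counts travel between nodes in this sketch, not the transactions themselves, which is the usual motivation for counting locally before merging.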

Parallel Computing for Mining Association Rules in Distributed P2P Networks

Data Mining ◽

10.4018/978-1-4666-2455-9.ch006 ◽

2013 ◽

pp. 107-124

Author(s):

Huiwei Guan

Keyword(s):

Data Mining ◽

Parallel Computing ◽

Distributed Computing ◽

Parallel Algorithm ◽

Association Rules ◽

Distributed Databases ◽

Computing System ◽

Computing Systems ◽

P2p Computing ◽

P2p Systems

Security Issues in Distributed Computing System Models

Security Solutions for Hyperconnectivity and the Internet of Things - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-5225-0741-3.ch009 ◽

2017 ◽

pp. 211-259 ◽

Cited By ~ 1

Author(s):

Ghada Farouk Elkabbany ◽

Mohamed Rasslan

Keyword(s):

Distributed Systems ◽

Distributed Computing ◽

Computing System ◽

Computing Environment ◽

Distributed Computing Systems ◽

Computing Systems ◽

Research Directions ◽

System Models ◽

Security Issues ◽

Systems Security

Distributed computing systems allow homogeneous or heterogeneous computers and workstations to act as a computing environment. In this environment, users can uniformly access local and remote resources in order to run processes. Users are not aware of which computers their processes are running on, which can pose complicated security problems. This chapter provides a security review of distributed systems. It begins with a survey of the diverse definitions of distributed computing systems in the literature. Different systems are then discussed, with emphasis on the most recent. Finally, different aspects of distributed systems security and prominent research directions are explored.

A Novel Efficient Mining Association Rules Algorithm for Distributed Databases

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.108-111.50 ◽

2010 ◽

Vol 108-111 ◽

pp. 50-56 ◽

Cited By ~ 2

Author(s):

Liang Zhong Shen

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Association Rules ◽

Association Rule ◽

Association Rule Mining ◽

Distributed Databases ◽

New Method ◽

Rule Mining ◽

Mining Association Rules

Due to the popularity of knowledge discovery and data mining, both in practice and among academic and corporate professionals, association rule mining is receiving increasing attention. Data mining technology is applied to analyze the data held in databases. This paper puts forward a new method that is suited to designing distributed databases.

Apriori, Association Rules, Data Mining, Frequent Itemsets Mining (FIM), Parallel Computing

Fourth International Conference on Software Engineering Research, Management and Applications (SERA'06) ◽

10.1109/sera.2006.17 ◽

2006 ◽

Cited By ~ 1

Author(s):

M. Yoshikawa ◽

H. Terai

Keyword(s):

Data Mining ◽

Parallel Computing ◽

Association Rules ◽

Frequent Itemsets ◽

Frequent Itemsets Mining ◽

Mining Frequent Itemsets

On the topological compatibility of parallel tasks and computing systems

International Journal of Modern Physics C ◽

10.1142/s0129183122500401 ◽

2021 ◽

Author(s):

A. F. Zadorozhny ◽

V. A. Melent’ev

Keyword(s):

Parallel Computing ◽

Comparative Analysis ◽

Computing System ◽

Topological Model ◽

Present Contribution ◽

Computing Systems ◽

Parallel Tasks ◽

Different Types ◽

Types Of Information

This contribution investigates aspects of the topological compatibility of parallel computing systems and tasks. Based on the original topological model of parallel computations and on an unconventional description of a graph by its projections, appropriate indexes are proposed and explained. Using the example of a hypercubic computing system (CS) and tasks with ring and star information topologies, we demonstrate how the indexes are determined and how they are used in a comparative analysis of the applicability of an interconnect with a given topology to tasks with the same and with different types of information topologies.

Particle Swarm Optimization for Load Balancing in Distributed Computing Systems – A Survey

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i1s.1766 ◽

2021 ◽

Vol 12 (1S) ◽

pp. 257-265

Author(s):

Vidya S. Handur et al.

Keyword(s):

Particle Swarm Optimization ◽

Distributed Computing ◽

Load Balancing ◽

Particle Swarm ◽

Optimal Solution ◽

Computing System ◽

Optimization Approach ◽

Distributed Computing Systems ◽

Computing Systems ◽

Swarm Optimization

The development and widespread use of technologies such as cloud computing has led to an exponential increase in the volume of traffic. With this increase, the resources in the network may be insufficient to handle the traffic, or some resources may become overutilized while others are underutilized. This condition reduces the performance of the system. To improve performance, the traffic needs to be regulated so that all resources are utilized according to their capacity, which is known as load balancing. Load balancing has been a concern in distributed computing systems, where the computing nodes do not have a global view of the network. There have been constant efforts to provide an efficient solution for load balancing through approaches such as game theory, fuzzy logic, heuristics, and metaheuristics. Even though various solutions exist for balancing the load, the issue remains challenging because no single best-fit solution exists. The paper studies how the Particle Swarm Optimization approach is used to achieve an optimal solution for load balancing in distributed computing systems.

A Survey of Tasks Scheduling Algorithms in Distributed Computing Systems

Research Anthology on Architectures, Frameworks, and Integration Strategies for Distributed and Cloud Computing ◽

10.4018/978-1-7998-5339-8.ch061 ◽

2021 ◽

pp. 1269-1281

Author(s):

Nutan Kumari Chauhan ◽

Harendra Kumar

Keyword(s):

Distributed Computing ◽

Scheduling Algorithms ◽

Computing System ◽

Communication Link ◽

Scheduling Problems ◽

Distributed Computing Systems ◽

Computing Systems ◽

Tasks Scheduling ◽

Scheduling Strategies ◽

User Tasks

Distributed computing systems (DCS) are a very popular field of computer science. A DCS consists of various computers (processors), possibly located at different sites, connected by a communication link in such a manner that they appear as one system to the user. Task scheduling is an active field of research in DCS. The main objectives of task scheduling problems are balancing the load across processors, maximizing system reliability, minimizing system cost, and minimizing response time. It is very complicated to satisfy all of these objectives simultaneously, so most researchers have solved the task scheduling problem with one or more objectives. The purpose of this chapter is to give an overview of many (certainly not all) task scheduling algorithms. The chapter covers a brief survey, task scheduling strategies, and different approaches used for task scheduling with one or more objectives.

Heuristic algorithms for optimization of task allocation and result distribution in peer-to-peer computing systems

International Journal of Applied Mathematics and Computer Science ◽

10.2478/v10006-012-0055-0 ◽

2012 ◽

Vol 22 (3) ◽

pp. 733-748 ◽

Cited By ~ 11

Author(s):

Grzegorz Chmaj ◽

Krzysztof Walkowiak ◽

Michał Tarnawski ◽

Michał Kucharzak

Keyword(s):

Linear Problem ◽

Heuristic Algorithms ◽

Computing System ◽

Peer To Peer ◽

Network Computing ◽

Computing Systems ◽

P2p Computing ◽

Global Computing ◽

Public Resource ◽

P2p File Sharing

Recently, distributed computing systems have been gaining much attention due to a growing demand for various kinds of effective computations in both industry and academia. In this paper, we focus on Peer-to-Peer (P2P) computing systems, also called public-resource computing systems or global computing systems. P2P computing systems, contrary to grids, use personal computers and other relatively simple electronic equipment (e.g., the PlayStation console) to process sophisticated computational projects. A significant example of the P2P computing idea is the BOINC (Berkeley Open Infrastructure for Network Computing) project. To improve the performance of the computing system, we propose to use the P2P approach to distribute results of computational projects, i.e., results are transmitted in the system as in P2P file sharing systems (e.g., BitTorrent). In this work, we concentrate on offline optimization of the P2P computing system including two elements: scheduling of computations and data distribution. The objective is to minimize the system OPEX cost related to data processing and data transmission. We formulate an Integer Linear Problem (ILP) to model the system and apply this formulation to obtain optimal results using the CPLEX solver. Next, we propose two heuristic algorithms that provide results very

Multicore workload scheduling in JUNO

EPJ Web of Conferences ◽

10.1051/epjconf/201921403048 ◽

2019 ◽

Vol 214 ◽

pp. 03048

Author(s):

Xiaomei Zhang ◽

Kang Li ◽

Xiang Hu Zhao ◽

Tian Yan ◽

Yong Sun

Keyword(s):

Parallel Computing ◽

Distributed Computing ◽

Data Processing ◽

Computing System ◽

Job Descriptions ◽

Workload Scheduling ◽

Complicated Part ◽

The Way

The Jiangmen Underground Neutrino Observatory (JUNO) is going to apply parallel computing in its software to accelerate JUNO data processing and make full use of the capabilities of multi-core and many-core CPUs. It is therefore necessary for the JUNO distributed computing system to find a way to support single-core and multi-core jobs consistently. To support multi-core jobs, a series of changes to job descriptions, scheduling, and monitoring need to be considered, of which pilot-based scheduling for a mix of single-core and multi-core jobs is the most complicated part. This paper presents and compares two scheduling modes and their efficiency, and also provides a way to optimize efficiency.

Research and Organization of Priority Modes in a Network Distributed Computing System with Cloud Service Architecture

Proceedings of Southwest State University ◽

10.21869/2223-1560-2019-23-2-153-173 ◽

2019 ◽

Vol 23 (2) ◽

pp. 153-173

Author(s):

M. Sadeq Jaafar

Keyword(s):

Distributed Computing ◽

Petri Nets ◽

Storage System ◽

Cloud Service ◽

Computing System ◽

Cloud Services ◽

Service Architecture ◽

Distributed Computing Systems ◽

Computing Systems ◽

Replicated Databases

Purpose of research. The object of the study is a network cloud service built on the basis of a replicated database. Data in distributed computing systems are replicated to ensure reliable storage, to facilitate access to data, and to improve the performance of the storage system. This raises the problem of analyzing how effectively queries to replicated databases are processed in a network-based cloud environment and, in particular, the problem of organizing priority queues for requests that update database copies (update-requests) and for requests that search and read information in databases (query-requests). The purpose of this work is to study and organize priority modes in a network distributed computing system with cloud service architecture. Methods. The study was conducted on the basis of two types of behavioural models: models based on Petri nets, which describe and verify the functioning of a distributed computing system with replicated databases represented as a pool of resource units with several units, and models based on the GPSS simulation language, which estimate the time that requests of each type spend in queues depending on their priority. Results. Based on two simulation methods, the operation of a cloud system with database replicas was analyzed. In this system two distributed cloud computing systems interact: MANET Cloud, based on a wireless network, and Internet Cloud, based on the Internet. These databases together are the basis of the DBaaSoD (Data Bases as a Service on Demand) cloud service (databases as a service organized at the user's request). To study this system, models of two classes were developed. The model based on Petri nets is designed to test the simulated distributed application for proper functioning. The decisions on mapping Petri nets onto the architecture of computer networks are discussed. The statistical simulation model is used to compare the priority and non-priority service modes for query-requests and update-requests by the criterion of the average time that requests spend in queues. Conclusion. System models based on Petri nets were tested, which showed their liveness and security, which makes it possible to move from models to building formalized specifications of network applications for network cloud services in distributed computing systems with replicated databases. The study of the GPSS model showed that with priority service of update-requests, their time in the queue is reduced by about 2 to 4 times compared with query-requests, depending on the intensity of the query-requests. In the non-priority mode, the service conditions for update-requests deteriorate and their time in the queue increases by about 2 to 6 times compared with query-requests, depending on the intensity of the query-requests.
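
The priority and non-priority service modes compared in the abstract can be illustrated with a toy single-server queue. This is a minimal Python sketch under strong simplifying assumptions (every request is already queued at time 0 and takes one time unit to serve); it is not the paper's GPSS model and does not reproduce its figures. The function `average_waits` and the `arrivals` list are hypothetical names and data chosen for the example.

```python
import heapq


def average_waits(requests, prioritize_updates):
    """Serve unit-time requests that are all queued at time 0.

    requests: list of "update" / "query" labels in arrival order.
    Returns the mean waiting time for each request type.
    """
    heap = []
    for seq, kind in enumerate(requests):
        # Rank 0 is served before rank 1; seq keeps arrival order within a rank.
        rank = 0 if (prioritize_updates and kind == "update") else 1
        heapq.heappush(heap, (rank, seq, kind))

    waits = {"update": [], "query": []}
    clock = 0
    while heap:
        _, _, kind = heapq.heappop(heap)
        waits[kind].append(clock)  # time spent waiting before service starts
        clock += 1                 # assumed unit service time
    return {kind: sum(w) / len(w) for kind, w in waits.items() if w}


# Hypothetical arrival order of requests.
arrivals = ["query", "query", "update", "query", "update"]
fifo = average_waits(arrivals, prioritize_updates=False)
prio = average_waits(arrivals, prioritize_updates=True)
print("non-priority:", fifo)
print("priority:", prio)
```

With priority enabled, update-requests jump ahead of query-requests that arrived earlier, so their mean wait falls while the mean wait of query-requests rises.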
