High Performance Hierarchical Torus Network Under Adverse Traffic Patterns

As internet traffic rapidly increases, fast and accurate network classification is becoming essential for high quality of service control and early detection of network traffic abnormalities. Machine learning techniques based on statistical features of packet flows have recently become popular for network classification partly because of the limitations of traditional port- and payload-based methods. In this paper, we propose a Markov model-based network classification with a Kullback-Leibler divergence criterion. Our study is mainly focused on hard-to-classify (or overlapping) traffic patterns of network applications, which current techniques have difficulty dealing with. The results of simulations conducted using our proposed method indicate that the overall accuracy reaches around 90% with a reasonable group size ofn=100.

Download Full-text

Torus network labeling in High Performance computing

2016 International Conference on Computing Communication Control and automation (ICCUBEA) ◽

10.1109/iccubea.2016.7859992 ◽

2016 ◽

Cited By ~ 1

Author(s):

Mayuresh Dhanak ◽

Parikshit D. Godbole ◽

R. A. Patil

Keyword(s):

High Performance Computing ◽

High Performance ◽

Performance Computing ◽

Torus Network

Download Full-text

High and stable performance under adverse traffic patterns of tori-connected torus network

Computers & Electrical Engineering ◽

10.1016/j.compeleceng.2012.12.014 ◽

2013 ◽

Vol 39 (3) ◽

pp. 973-983 ◽

Cited By ~ 8

Author(s):

M.M. Hafizur Rahman ◽

Yukinori Sato ◽

Yasushi Inoguchi

Keyword(s):

Traffic Patterns ◽

Stable Performance ◽

Torus Network

Download Full-text

Inter-Processor Communication Performance of a Hierarchical Torus Network under Bit-Flip Traffic Patterns

2006 International Conference on Electrical and Computer Engineering ◽

10.1109/icece.2006.355696 ◽

2006 ◽

Cited By ~ 1

Author(s):

M.M. Hafizur Rahman ◽

Manas Ghosh ◽

Susumu Horiguchi

Keyword(s):

Traffic Patterns ◽

Communication Performance ◽

Bit Flip ◽

Torus Network

Download Full-text

Low latency Path Aware XY-X Routing Algorithm for NoC Architectures

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.7.10941 ◽

2018 ◽

Vol 7 (2.7) ◽

pp. 763

Author(s):

Venkateswara Rao Musala ◽

T V Rama Krishna

Keyword(s):

High Performance ◽

Interconnection Network ◽

Routing Algorithm ◽

Adaptive Routing ◽

Destination Node ◽

Specific Information ◽

Traffic Patterns ◽

Traffic Conditions ◽

Trade Offs ◽

On Chip

Route specific information with the SoC needs a great deal of wiring, which increases the Resistance & Capacitance (RC) component of the system. Network on Chip (NoC) is utilized as the interface to address the problems in SoC, On-chip interconnection network in NoC has gained more consideration over steadfast wiring and buses, like lower latency, scalability and high performance. Present routing algorithms in NoC is suffered from load balancing at incarnation networks under non-uniform traffic conditions, causes increase the NoC trade-offs (latency and throughput). Adaptive routing is a technique to progress the load balance, but previous adaptive routing techniques used uniform traffic patterns to form the routing decisions. This paper proposes a new approach at non- uniform traffic patterns in channel state and path specific, Path Aware Routing (PAR XY-X) uses a timeout piggybacking for acknowledgement and load shedding to avoid congestion which choose optimistic path calculation unit to connect the destination node without glue logic decisions in routing. PAR XY-X outperforms the Normal XY routing by 20% and 33% with respect to Avg.latency and throughput.

Download Full-text

On Solving the Decycling Problem in a Torus Network

Wireless Communications and Mobile Computing ◽

10.1155/2021/5598173 ◽

2021 ◽

Vol 2021 ◽

pp. 1-6

Author(s):

Antoine Bossard

Keyword(s):

Parallel Processing ◽

High Performance ◽

Massively Parallel ◽

Processing Capacity ◽

3 Dimensional ◽

Massively Parallel Systems ◽

The World ◽

High Performance Systems ◽

Computing Performance ◽

Torus Network

Modern supercomputers are massively parallel systems: they embody thousands of computing nodes and sometimes several millions. The torus topology has proven very popular for the interconnect of these high-performance systems. Notably, this network topology is employed by the supercomputer ranked number one in the world as of November 2020, the supercomputer Fugaku. Given the high number of compute nodes in such systems, efficient parallel processing is critical to maximise the computing performance. It is well known that cycles harm the parallel processing capacity of systems: for instance, deadlocks and starvations are two notorious issues of parallel computing that are directly linked to the presence of cycles. Hence, network decycling is an important issue, and it has been extensively discussed in the literature. We describe in this paper a decycling algorithm for the 3-dimensional k -ary torus topology and compare it with established results, both theoretically and experimentally. (This paper is a revised version of Antoine Bossard (2020)).

Download Full-text

Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors

Cybernetics and Information Technologies ◽

10.2478/cait-2020-0064 ◽

2020 ◽

Vol 20 (6) ◽

pp. 94-104

Author(s):

Ivan Lirkov

Keyword(s):

Computer Architecture ◽

High Performance ◽

Performance Study ◽

Scalable Algorithm ◽

Practical Applications ◽

Sine Transform ◽

Discrete Transforms ◽

High Performance Computer ◽

2D Data ◽

Torus Network

AbstractPractical realizations of 3D forward/inverse separable discrete transforms, such as Fourier transform, cosine/sine transform, etc. are frequently the principal limiters that prevent many practical applications from scaling to a large number of processors. Existing approaches, which are based primarily on 1D or 2D data decompositions, prevent the 3D transforms from effectively scaling to the maximum (possible/available) number of computer nodes. A highly scalable approach to realize forward/inverse 3D transforms has been proposed. It is based on a 3D decomposition of data and geared towards a torus network of computer nodes. The proposed algorithms requires compute-and-roll time-steps, where each step consists of an execution of multiple GEMM operations and concurrent movement of cubical data blocks between nearest neighbors. The aim of this paper is to present an experimental performance study of an implementation on high performance computer architecture.

Download Full-text

Dynamic Communication Performance of a Modified Hierarchical 3D-Torus Network under Non-uniform Traffic Patterns

2010 First International Conference on Networking and Computing ◽

10.1109/ic-nc.2010.27 ◽

2010 ◽

Cited By ~ 1

Author(s):

M.M. Hafizur Rahman ◽

Yukinori Sato ◽

Yasushi Inoguchi

Keyword(s):

Traffic Patterns ◽

Communication Performance ◽

Torus Network

Download Full-text