Congestion aware adaptive routing for network-on-chip communication

10.32920/ryerson.14645025 ◽

2021 ◽

Author(s):

Stephen Chui

Keyword(s):

Embedded Systems ◽

High Performance ◽

Data Transfer ◽

Adaptive Routing ◽

Network On Chip ◽

Message Routing ◽

Data Packets ◽

Novel Approach ◽

Communication Links ◽

On Chip

Network-On-Chip (NoC) has surpassed the traditional bus based on-chip communication in offering better performance for data transfers among many processing, peripheral and other cores of high performance embedded systems. Adaptive routing provides an effective way of efficient on-chip communication among NoC cores. The message routing efficiency can further improve the performance of NoC based embedded systems on a chip. Congestion awareness has been applied to adaptive routing for achieving better data throughput and latency. This thesis presents a novel approach of analyzing congestion to improve NoC throughput by improving packet allocation in NoC routers. The routers would have the knowledge of the traffic conditions around themselves by utilizing the congestion information. We employ header flits to store the congestion information that does not require any additional communication links between the routers. By prioritizing data packets that are likely to suffer the worst congestion would improve overall NoC data transfer latency.

Download Full-text

High performance adaptive routing for Network-on-Chip systems with express highway mechanism

2014 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS) ◽

10.1109/apccas.2014.7032704 ◽

2014 ◽

Cited By ~ 1

Author(s):

Shih-Chieh Lin ◽

En-Jui Chang ◽

Yu-Yin Chen ◽

Hsien-Kai Hsin ◽

An-Yeu Andy Wu

Keyword(s):

High Performance ◽

Adaptive Routing ◽

Network On Chip ◽

On Chip

Download Full-text

Deadlock Free Load Balanced Adaptive Routing for Network on Chip (NoC) Systems

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2016.5757 ◽

2016 ◽

Vol 13 (10) ◽

pp. 7592-7598

Author(s):

J Kalaivani ◽

B Vinayagasundaram

Keyword(s):

Packet Loss ◽

Probability Function ◽

Adaptive Routing ◽

Network On Chip ◽

Communication Architecture ◽

Data Packets ◽

Oblivious Routing ◽

Simulation Results ◽

On Chip ◽

Load Balanced

The Network-on-Chip (NoC) systems have emerged in on-chip communication architecture in various fields. To achieve excellent results in Network on Chip (NoC) systems application, the routing must eliminate the deadlock issues from the network. To overcome this issue in the network, in this paper, we propose Deadlock Free Load Balanced Adaptive Routing. In this approach, Oblivious Routing (OR) algorithm is implemented on the channel by using the probability function. The network considers the capacity of the node and tries to maximize the throughput based on the connectivity between the data packets flow and minimize the channel load. A Reconfiguration Protocol is used for the data packets to choose other channel in the network if the deadlock occurs. Simulation results show that this approach reduces the delay and packet loss in the network.

Download Full-text

Efficient Instruction and Data Caching for High Performance Embedded Processors

Jornada de Jóvenes Investigadores del I3A ◽

10.26754/jji-i3a.201201788 ◽

1970 ◽

pp. 9

Author(s):

A. Ferrerón Labari ◽

D. Suárez Gracia ◽

V. Viñals Yúfera

Keyword(s):

Embedded Systems ◽

Power Consumption ◽

Low Power ◽

Interconnection Networks ◽

High Performance ◽

Critical Issue ◽

Content Management ◽

Structure Design ◽

Portable Devices ◽

On Chip

In the last years, embedded systems have evolved so that they offer capabilities we could only find before in high performance systems. Portable devices already have multiprocessors on-chip (such as PowerPC 476FP or ARM Cortex A9 MP), usually multi-threaded, and a powerful multi-level cache memory hierarchy on-chip. As most of these systems are battery-powered, the power consumption becomes a critical issue. Achieving high performance and low power consumption is a high complexity challenge where some proposals have been already made. Suarez et al. proposed a new cache hierarchy on-chip, the LP-NUCA (Low Power NUCA), which is able to reduce the access latency taking advantage of NUCA (Non-Uniform Cache Architectures) properties. The key points are decoupling the functionality, and utilizing three specialized networks on-chip. This structure has been proved to be efficient for data hierarchies, achieving a good performance and reducing the energy consumption. On the other hand, instruction caches have different requirements and characteristics than data caches, contradicting the low-power embedded systems requirements, especially in SMT (simultaneous multi-threading) environments. We want to study the benefits of utilizing small tiled caches for the instruction hierarchy, so we propose a new design, ID-LP-NUCAs. Thus, we need to re-evaluate completely our previous design in terms of structure design, interconnection networks (including topologies, flow control and routing), content management (with special interest in hardware/software content allocation policies), and structure sharing. In CMP environments (chip multiprocessors) with parallel workloads, coherence plays an important role, and must be taken into consideration.

Download Full-text

Self-Healing Router Approach for High-Performance Network-on-Chip

IEEE Open Journal of Circuits and Systems ◽

10.1109/ojcas.2021.3095000 ◽

2021 ◽

Vol 2 ◽

pp. 485-496

Author(s):

Kasem Khalil ◽

Omar Eldash ◽

Ashok Kumar ◽

Magdy Bayoumi

Keyword(s):

High Performance ◽

Network On Chip ◽

Self Healing ◽

On Chip

Download Full-text

Hybrid path-diversity-aware adaptive routing with latency prediction model in Network-on-Chip systems

2013 International Symposium onVLSI Design, Automation, and Test (VLSI-DAT) ◽

10.1109/vldi-dat.2013.6533884 ◽

2013 ◽

Cited By ~ 5

Author(s):

Po-An Tsai ◽

Yu-Hsin Kuo ◽

En-Jui Chang ◽

Hsien-Kai Hsin ◽

An-Yeu Wu

Keyword(s):

Prediction Model ◽

Adaptive Routing ◽

Network On Chip ◽

Path Diversity ◽

On Chip

Download Full-text

A Novel Adaptive Routing Algorithm for Network-On-Chip

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.474-476.413 ◽

2011 ◽

Vol 474-476 ◽

pp. 413-416

Author(s):

Jia Jia ◽

Duan Zhou ◽

Jian Xian Zhang

Keyword(s):

Routing Algorithm ◽

Adaptive Routing ◽

Network On Chip ◽

Experimental Results ◽

Average Latency ◽

On Chip

In this paper, we propose a novel adaptive routing algorithm to solve the communication congestion problem for Network-on-Chip (NoC). The strategy competing for output ports in both X and Y directions is employed to utilize the output ports of the router sufficiently, and to reduce the transmission latency and improve the throughput. Experimental results show that the proposed algorithm is very effective in relieving the communication congestion, and a reduction in average latency by 45.7% and an improvement in throughput by 44.4% are achieved compared with the deterministic XY routing algorithm and the simple XY adaptive routing algorithm.

Download Full-text

Compiler-directed scratchpad memory data transfer optimization for multithreaded applications on a heterogeneous many-core architecture

The Journal of Supercomputing ◽

10.1007/s11227-021-03853-x ◽

2021 ◽

Author(s):

Xiaohan Tao ◽

Jianmin Pang ◽

Jinlong Xu ◽

Yu Zhu

Keyword(s):

Energy Consumption ◽

High Performance ◽

Scientific Computing ◽

Data Transfer ◽

Performance Model ◽

Experimental Result ◽

Transfer Model ◽

Scratchpad Memory ◽

On Chip ◽

Many Core

AbstractThe heterogeneous many-core architecture plays an important role in the fields of high-performance computing and scientific computing. It uses accelerator cores with on-chip memories to improve performance and reduce energy consumption. Scratchpad memory (SPM) is a kind of fast on-chip memory with lower energy consumption compared with a hardware cache. However, data transfer between SPM and off-chip memory can be managed only by a programmer or compiler. In this paper, we propose a compiler-directed multithreaded SPM data transfer model (MSDTM) to optimize the process of data transfer in a heterogeneous many-core architecture. We use compile-time analysis to classify data accesses, check dependences and determine the allocation of data transfer operations. We further present the data transfer performance model to derive the optimal granularity of data transfer and select the most profitable data transfer strategy. We implement the proposed MSDTM on the GCC complier and evaluate it on Sunway TaihuLight with selected test cases from benchmarks and scientific computing applications. The experimental result shows that the proposed MSDTM improves the application execution time by 5.49$$\times$$ × and achieves an energy saving of 5.16$$\times$$ × on average.

Download Full-text

High performance Architectural Design and Analysis of Network on Chip Systems

International Journal of Communications Network and System Sciences ◽

10.4236/ijcns.2010.31006 ◽

2010 ◽

Vol 03 (01) ◽

Author(s):

EZHUMALAI

Keyword(s):

High Performance ◽

Architectural Design ◽

Network On Chip ◽

On Chip

Download Full-text

Merging Plasmonics and Silicon Photonics Towards Greener and Faster “Network-on-Chip” Solutions for Data Centers and High-Performance Computing Systems

Plasmonics - Principles and Applications ◽

10.5772/51853 ◽

2012 ◽

Cited By ~ 3

Author(s):

Sotirios Papaioannou ◽

Konstantinos Vyrsokinos ◽

Dimitrios Kalavrouziotis ◽

Giannis Giannoulis ◽

Dimitrios Apostolopoulos ◽

...

Keyword(s):

High Performance Computing ◽

Silicon Photonics ◽

High Performance ◽

Data Centers ◽

Network On Chip ◽

Computing Systems ◽

On Chip ◽

Performance Computing

Download Full-text