Reducing Total Power Consumption and Total Area Techniques for Network-on-Chip Through Disable Cores and Routers Based on Clustering Method

Ng Yen Phing; M.N. Mohd Warip; Phaklen Ehkan; S.Y. Teo

doi:10.11591/ijeecs.v10.i2.pp514-520

Reducing Total Power Consumption and Total Area Techniques for Network-on-Chip Through Disable Cores and Routers Based on Clustering Method

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v10.i2.pp514-520 ◽

2018 ◽

Vol 10 (2) ◽

pp. 514

Author(s):

Ng Yen Phing ◽

M.N. Mohd Warip ◽

Phaklen Ehkan ◽

S.Y. Teo

Keyword(s):

Power Consumption ◽

Large Scale ◽

Routing Algorithm ◽

Network On Chip ◽

Total Power ◽

Clustering Method ◽

Mesh Topology ◽

Total Power Consumption ◽

Promising Solution ◽

On Chip

<span lang="EN-US">Network-on-Chip (NoC) is a promising solution to overcome the communication problem of System-on-Chip (SoC) architecture. The execution of topology, routing algorithm and switching technique is significant because it powerfully affects the overall performance of NoC. In the Network-on-Chip, the total power consumption increasing due to the large scale of network. In order to solve it, a clustering method and disable cores and routers based on clustering method is apply onto mesh based NoC architecture. In the proposed approach, the optimization of total area and total power consumption are the major concern. Experiment results show that the proposed method outperformas the existing work. The clustering-mesh based method reduced the total area by 22% to 40 % and total power consumption by 22% to 56% compare to mesh topology. In addition, the proposed method by disable cores and routers based on clustering-mesh based method has decrease the total area by 45% to 87% and total power consumption by 33% to 75% compare to mesh topology.</span>

Download Full-text

Parallel overloaded CDMA crossbar for network on chip

Facta universitatis - series Electronics and Energetics ◽

10.2298/fuee1901105k ◽

2019 ◽

Vol 32 (1) ◽

pp. 105-118

Author(s):

Ashok Kumar ◽

P. Dananjayan

Keyword(s):

Power Consumption ◽

High Performance ◽

Routing Algorithm ◽

Network On Chip ◽

Large Network ◽

Network Systems ◽

Mesh Topology ◽

Field Programmable ◽

On Chip ◽

Code Division Multiple

For high performance of Network on Chip (NoC), Code Division Multiple Access (CDMA) technique is used recently due to its fixed communication delay, reduced area utilisation and low power consumption. The CDMA system uses Walsh based spreading code which improves the bandwidth efficiency. On the contrary, it is not effective when the number of nodes present in the system increases. Overloaded CDMA (OCDMA) is presented for such large network systems. In this paper, OCDMA crossbar is modified and advanced with parallel encoding and decoding operation using orthogonal gold codes for improving the speed of crossbar thereby obtaining high performance in NoC switch. A modified crossbar consisting of extra processing elements is used to enhance the performance of NoC based System on Chip (SoC) system. This work is simulated on Xilinx tool and implemented in Vertex-6 (XC6VLX760) Field Programmable Gate Array (FPGA) device. The proposed work is implemented for four ports, eight ports and sixteen ports with deterministic X-Y routing algorithm in 3 3 NoC design with mesh topology. This NoC switch shows 9.79% improvement in delay and shows 20.76% improvement in power consumption when compared to the existing CDMA NoCs for 8 bit data packet.

Download Full-text

Stable Matching based Virtual Channels Allocation Scheme in Network-on-Chip

Journal of Interconnection Networks ◽

10.1142/s0219265920500085 ◽

2020 ◽

Vol 20 (02) ◽

pp. 2050008

Author(s):

BANSIDHAR JOSHI ◽

MANISH K. THAKUR

Keyword(s):

Power Consumption ◽

Network On Chip ◽

Total Power ◽

Allocation Scheme ◽

Trade Off ◽

Routing Schemes ◽

Virtual Channels ◽

Total Power Consumption ◽

On Chip ◽

The One

While designing router micro-architecture of an On-Chip network, a good allocation of virtual channels (VCs) governs an effective resources utilization which essentially results in an optimized number of packets received at destination(s). Generally, the VC allocation schemes deal with the one-way approach of VC allocation to the contending flits. However, this approach produces non-optimal matching of flits to the available VCs on next routers, and therefore leads to the under-utilization of these VCs. This paper proposes a 2-Way VC Allocation scheme to map input VCs (requestors) to output VCs (resources). The proposed scheme is compared with the conventional VC allocation scheme under two different mesh configurations with a 100% channel load. Simulations performed under two different routing schemes in diverse traffic scenarios demonstrate an increase in the number of packets received at destinations by up to 76%. Also, the network’s latency exhibits trade-off with total power consumption while reducing hotspots.

Download Full-text

Comparative Performance Analysis of  Selected Routing Algorithms by Load Variation of 2-Dimensional Mesh Topology Based  Network-On-Chip

ELEKTRIKA- Journal of Electrical Engineering ◽

10.11113/elektrika.v20n3.249 ◽

2021 ◽

Vol 20 (3) ◽

pp. 1-6

Author(s):

Mohammed Shaba Saliu ◽

Muyideen Omuya Momoh ◽

Pascal Uchenna Chinedu ◽

Wilson Nwankwo ◽

Aliu Daniel

Keyword(s):

Routing Algorithm ◽

Adaptive Routing ◽

Injection Rate ◽

Network On Chip ◽

Routing Algorithms ◽

Comparative Performance ◽

Traffic Pattern ◽

Mesh Topology ◽

Similar Vein ◽

On Chip

Network-on-Chip (NoC) has been proposed as a viable solution to the communication challenges on System-on-Chips (SoCs). As the communication paradigm of SoC, NoCs performance depends mainly on the type of routing algorithm chosen. In this paper different categories of routing algorithms were compared. These include XY routing, OE turn model adaptive routing, DyAD routing and Age-Aware adaptive routing. By varying the load at different Packet Injection Rate (PIR) under random traffic pattern, comparison was conducted using a 4 × 4 mesh topology. The Noxim simulator, a cycle accurate systemC based simulator was employed. The packets were modeled as a Poisson distribution; first-in-first-out (FIFO) input buffer channel with a depth of five (5) flits and a flit size of 32 bits; and a packet size of 3 flits respectively. The simulation time was 10,000 cycles. The findings showed that the XY routing algorithm performed better when the PIR is low. In a similar vein, the DyAD routing and Age-aware algorithms performed better when the load i.e. PIR is high.

Download Full-text

Network-on-Chip Especially for Video Coding Applications Using Multilayer Mesh Topology

Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) ◽

10.2174/2352096511666180525124330 ◽

2019 ◽

Vol 12 (3) ◽

pp. 247-256

Author(s):

Kamel Messaoudi ◽

Salah Toumi ◽

El-Bay Bourennane

Keyword(s):

Video Coding ◽

Communication System ◽

Data Transfer ◽

Routing Algorithm ◽

Network On Chip ◽

Mesh Topology ◽

Global Interconnects ◽

And Performance ◽

On Chip ◽

Scalable Communication

Background: Network on chip is proposed as new reusable and scalable communication system for applications with important number of IPs. The NoC architecture characteristics are based on several factors: the implementation strategy of IPs, the power dissipation, the placement of IPs, data transfer time, the requirements of the given application, etc. The N×M Mesh topology combined with the XY routing algorithm are generally chosen in many studies. Hardware IPs proposed in the literature, for various applications as example video encoders, operates at different frequencies and generally implemented according to several strategies and different bus sizes. Connecting these IPs using the same communication system is very difficult. Methods: In this paper, we present a new topology based on multi-layer mesh topology and adapted for video coding applications. The proposed topology exploits the video coding information regarding groups of cores that communicate through two cores only. The idea is to use a specific NoC for each group of cores and connect the NoCs with bridge in the positions of two communication cores. The choice of parameters in each NoC depends on the characteristic of IPs in the same group in order to maximize communication adaptivity and performance. Results: Synthesis results show that the proposed multi-layer mesh topology NoC uses much less resources than the traditional NxM mesh topology NoC. Conclusion: This reduction in term of resources is assured by the considerable reduction in the length and number of global interconnects, resulting in an increase in the performance and decrease in the power consumption and area of wire limited circuits.

Download Full-text

A SAT-Based Methodology for Effective Clock Gating for Power Minimization

Journal of Circuits System and Computers ◽

10.1142/s0218126619500117 ◽

2018 ◽

Vol 28 (01) ◽

pp. 1950011

Author(s):

Khushbu Chandrakar ◽

Suchismita Roy

Keyword(s):

Power Consumption ◽

Optimization Techniques ◽

Total Power ◽

Clock Signal ◽

Clock Gating ◽

Clock Tree ◽

Synchronous Circuits ◽

Design And Optimization ◽

Total Power Consumption ◽

On Chip

A possible solution to handle the rising complexity of modern Systems-on-Chip (SoCs) is to raise the level of abstraction for the design and optimization. A better optimization of performance and power can be achieved at higher abstraction levels by applying suitable optimization techniques. Insertion of clock gating logic into the generated Register-Transfer Level (RTL) would facilitate lowering dynamic power consumption by switching off the clock signal to portions of the circuit not currently in use and thereby reducing unnecessary toggling. In this work, we have tried to minimize the power consumption of synchronous circuits by reducing the number of activity string patterns. Activity-driven clock trees have been used wherein sections of the clock tree are turned off by gating the clock signals. Since gating the clock signal implies additional control signals and gates, there is always a trade-off existing between the logic circuit area overhead and the total power consumption of the clock tree. A pseudo-Boolean satisfiability (PB-SAT)-based approach is proposed in this work which focuses on the reduction of power consumption by reducing the activity pattern of the clock tree which will reduce the power consumption with appropriate module-binding solutions.

Download Full-text

A Deterministic-Path Routing Algorithm for Tolerating Many Faults on Very-Large-Scale Network-on-Chip

ACM Transactions on Design Automation of Electronic Systems ◽

10.1145/3414060 ◽

2021 ◽

Vol 26 (1) ◽

pp. 1-26

Author(s):

Ying Zhang ◽

Xinpeng Hong ◽

Zhongsheng Chen ◽

Zebo Peng ◽

Jianhui Jiang

Keyword(s):

Large Scale ◽

Routing Algorithm ◽

Network On Chip ◽

Large Scale Network ◽

Scale Network ◽

On Chip

Download Full-text

Performances analysis of reducing router in ring and mesh topology for network-on-chip (NoC) architecture

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v14.i2.pp802-809 ◽

2019 ◽

Vol 14 (2) ◽

pp. 802

Author(s):

Ng Yen Phing ◽

M.N.Mohd Warip ◽

Phaklen Ehkan ◽

R Badlishah Ahmad ◽

F.W. Zulkefli

Keyword(s):

Power Consumption ◽

Small Area ◽

Network On Chip ◽

Low Power Consumption ◽

Cross Link ◽

Ring Topology ◽

Mesh Topology ◽

Average Area ◽

On Chip ◽

Point To Point

<span>The size of the transistor has reached physical processor limitation in particular for traditional bus-based and point-to-point architecture in system-on-chip (SoC). Therefore, network-on-chip (NoC) was proposed as a solution. The performances required for the optimization of the NoC are low network latency, low power consumption, small area, and high throughput. However, recently the size of the NoC architecture has increased and the communication between cores to core become complicated. To overcome this disadvantages, topology plays an important role. In this paper, we reduce the number of the router in the 16 cores and 64 cores ring and mesh topologies by connected more numbers of node in each router. Result shows that reducing the number of the router in 64 cores ring topology outperforms the conventional topologies in term of area, power consumption, latency, and accepted packet rate. Reducing router in 64 cores ring topology decrease the average area, power consumption, latency, and increase the average accepted packet rate by 160.45%, 23.88%, 54.76%, and 223.88% over the 64 cores mesh, reducing router in mesh, ring, and cross-link mesh topologies.</span>

Download Full-text

Applying Genetic Algorithm to Solve Partitioning and Mapping Problem for Mesh Network-on-Chip Systems

International Journal of Computer Science and Information Technology ◽

10.5121/ijcsit.2021.13103 ◽

2021 ◽

Vol 13 (1) ◽

pp. 33-47

Author(s):

Walid Mokthar Salh ◽

Azeddien M. Sllame

Keyword(s):

Genetic Algorithm ◽

Power Consumption ◽

Design Process ◽

Network On Chip ◽

Experimental Results ◽

Mesh Network ◽

Data Dependencies ◽

Mapping Problem ◽

Mesh Topology ◽

On Chip

This paper presents a genetic based approach to the partitioning and mapping of multicore SoC cores over a NoC system that uses mesh topology. The proposed algorithm performs the partitioning and mapping by reducing communication cost and minimizing power consumption by placing those intercommunicated cores as close as possible together. A program developed in C++ in which the provided specification of the multicore MPSoC system captures all data dependencies before any start of the design process. Experimental results of several multimedia benchmarks demonstrates that the genetic-based approach able to find different satisfied implementations to the problem of partitioning and mapping of MPSoC cores over mesh-based NoC system that satisfies design goals.

Download Full-text

YaNoC: Yet Another Network-on-Chip Simulation Acceleration Engine Supporting Congestion-Aware Adaptive Routing Using FPGAs

Journal of Circuits System and Computers ◽

10.1142/s0218126619502025 ◽

2019 ◽

Vol 28 (12) ◽

pp. 1950202 ◽

Cited By ~ 2

Author(s):

Khyamling Parane ◽

B. M. Prabhu Prasad ◽

Basavaraj Talawar

Keyword(s):

Design Space Exploration ◽

Routing Algorithm ◽

Adaptive Routing ◽

Network On Chip ◽

Routing Algorithms ◽

Tree Topology ◽

Traffic Pattern ◽

Mesh Topology ◽

On Chip ◽

Congestion Aware

Many-core systems employ the Network on Chip (NoC) as the underlying communication architecture. To achieve an optimized design for an application under consideration, there is a need for fast and flexible NoC simulator. This paper presents an FPGA-based NoC simulation acceleration framework supporting design space exploration of standard and custom NoC topologies considering a full set of microarchitectural parameters. The framework is capable of designing custom routing algorithms, various traffic patterns such as uniform random, transpose, bit complement and random permutation are supported. For conventional NoCs, the standard minimal routing algorithms are supported. For designing the custom topologies, the table-based routing has been implemented. A custom topology called diagonal mesh has been evaluated using table-based and novel shortest path routing algorithm. A congestion-aware adaptive routing has been proposed to route the packets along the minimally congested path. The congestion-aware adaptive routing algorithm has negligible FPGA area overhead compared to the conventional XY routing. Employing the congestion-aware adaptive routing, network latency is reduced by 55% compared to the XY routing algorithm. The microarchitectural parameters such as buffer depth, traffic pattern and flit width have been varied to observe the effect on NoC behavior. For the [Formula: see text] mesh topology, the LUT and FF usages will be increased from 32.23% to 34.45% and from 12.62% to 15% considering the buffer depth of 4 and flit widths of 16 bits, and 32 bits, respectively. Similar behavior has been observed for other configurations of buffer depth and flit width. The torus topology consumes 24% more resources than the mesh topology. The 56-node fat tree topology consumes 27% and 2.2% more FPGA resources than the [Formula: see text] mesh and torus topologies. The 56-node fat tree topology with buffer depth of 8 and 16 flits saturates at the injection rates of 40% and 45%, respectively.

Download Full-text

Comparison Research between XY and Odd-Even Routing Algorithm of a 2-Dimension 3X3 Mesh Topology Network-on-Chip

2009 WRI Global Congress on Intelligent Systems ◽

10.1109/gcis.2009.110 ◽

2009 ◽

Cited By ~ 46

Author(s):

Wang Zhang ◽

Ligang Hou ◽

Jinhui Wang ◽

Shuqin Geng ◽

Wuchen Wu

Keyword(s):

Routing Algorithm ◽

Network On Chip ◽

Mesh Topology ◽

Comparison Research ◽

On Chip

Download Full-text

Reducing Total Power Consumption and Total Area Techniques for Network-on-Chip Through Disable Cores and Routers Based on Clustering Method

Parallel overloaded CDMA crossbar for network on chip

Stable Matching based Virtual Channels Allocation Scheme in Network-on-Chip

Comparative Performance Analysis of Selected Routing Algorithms by Load Variation of 2-Dimensional Mesh Topology Based Network-On-Chip

Network-on-Chip Especially for Video Coding Applications Using Multilayer Mesh Topology

A SAT-Based Methodology for Effective Clock Gating for Power Minimization

A Deterministic-Path Routing Algorithm for Tolerating Many Faults on Very-Large-Scale Network-on-Chip

Performances analysis of reducing router in ring and mesh topology for network-on-chip (NoC) architecture

Applying Genetic Algorithm to Solve Partitioning and Mapping Problem for Mesh Network-on-Chip Systems

YaNoC: Yet Another Network-on-Chip Simulation Acceleration Engine Supporting Congestion-Aware Adaptive Routing Using FPGAs

Comparison Research between XY and Odd-Even Routing Algorithm of a 2-Dimension 3X3 Mesh Topology Network-on-Chip

Comparative Performance Analysis of  Selected Routing Algorithms by Load Variation of 2-Dimensional Mesh Topology Based  Network-On-Chip