Adaptive flow control in high-performance interconnection networks

2013 ◽  
Vol 68 (1) ◽  
pp. 315-338
Author(s):  
Plamenka Borovska ◽  
Dragi Kimovski
2006 ◽  
Vol 07 (04) ◽  
pp. 535-548 ◽  
Author(s):  
Shihang Yan ◽  
Geyong Min ◽  
Irfan Awan

Credit-based flow control scheme that can be used to support both end-to-end and link-level flow control is becoming increasingly popular in high speed system area networks (SAN), e.g. InfiniBand networks where multiple processor nodes and I/O devices are interconnected using switched point-to-point links. By virtue of such a scheme, the downstream node sends credits to the upstream node indicating the availability of buffer spaces. Upon receiving credits, the upstream node injects packets into the networks. Performance analysis of credit-based flow control scheme plays an important role for the design and optimization of InfiniBand interconnection networks which have been widely used in many high-performance cluster, Grid and P2P computing systems. This study develops a new queueing network model for performance evaluation of credit-based flow control in InfiniBand networks. The performance metrics to be derived include the mean queue length, throughput and response time of the system. Simulation experiments have been used to validate the accuracy of the queueing network model. Results obtained from the analytical model have showed that this model can effectively evaluate the performance of credit-based flow control in InfiniBand networks.


Author(s):  
A. Ferrerón Labari ◽  
D. Suárez Gracia ◽  
V. Viñals Yúfera

In the last years, embedded systems have evolved so that they offer capabilities we could only find before in high performance systems. Portable devices already have multiprocessors on-chip (such as PowerPC 476FP or ARM Cortex A9 MP), usually multi-threaded, and a powerful multi-level cache memory hierarchy on-chip. As most of these systems are battery-powered, the power consumption becomes a critical issue. Achieving high performance and low power consumption is a high complexity challenge where some proposals have been already made. Suarez et al. proposed a new cache hierarchy on-chip, the LP-NUCA (Low Power NUCA), which is able to reduce the access latency taking advantage of NUCA (Non-Uniform Cache Architectures) properties. The key points are decoupling the functionality, and utilizing three specialized networks on-chip. This structure has been proved to be efficient for data hierarchies, achieving a good performance and reducing the energy consumption. On the other hand, instruction caches have different requirements and characteristics than data caches, contradicting the low-power embedded systems requirements, especially in SMT (simultaneous multi-threading) environments. We want to study the benefits of utilizing small tiled caches for the instruction hierarchy, so we propose a new design, ID-LP-NUCAs. Thus, we need to re-evaluate completely our previous design in terms of structure design, interconnection networks (including topologies, flow control and routing), content management (with special interest in hardware/software content allocation policies), and structure sharing. In CMP environments (chip multiprocessors) with parallel workloads, coherence plays an important role, and must be taken into consideration.


Author(s):  
Satya R. T. Peddada ◽  
Daniel R. Herber ◽  
Herschel C. Pangborn ◽  
Andrew G. Alleyne ◽  
James T. Allison

High-performance cooling is often necessary for thermal management of high power density systems. Both human intuition and vast experience may not be adequate to identify optimal thermal management designs as systems increase in size and complexity. This paper presents a design framework supporting comprehensive exploration of a class of single phase fluid-based cooling architectures. The candidate cooling system architectures are represented using labeled rooted tree graphs. Dynamic models are automatically generated from these trees using a graph-based thermal modeling framework. Optimal performance is determined by solving an appropriate fluid flow control problem, handling temperature constraints in the presence of exogenous heat loads. Rigorous case studies are performed in simulation, with components having variable sets of heat loads and temperature constraints. Results include optimization of thermal endurance for an enumerated set of 4,051 architectures. In addition, cooling system architectures capable of steady-state operation under a given loading are identified.


2020 ◽  
Vol 20 (20) ◽  
pp. 12435-12446
Author(s):  
Cheonyong Kim ◽  
Sangdae Kim ◽  
Kwansoo Jung

Sign in / Sign up

Export Citation Format

Share Document