Plasticine: A Cross-layer Approximation Methodology for Multi-kernel Applications through Minimally Biased, High-throughput, and Energy-efficient SIMD Soft Multiplier-divider

Zahra Ebrahimi; Dennis Klar; Mohammad Aasim Ekhtiyar; Akash Kumar

doi:10.1145/3486616

Plasticine: A Cross-layer Approximation Methodology for Multi-kernel Applications through Minimally Biased, High-throughput, and Energy-efficient SIMD Soft Multiplier-divider

ACM Transactions on Design Automation of Electronic Systems ◽

10.1145/3486616 ◽

2022 ◽

Vol 27 (2) ◽

pp. 1-33

Author(s):

Zahra Ebrahimi ◽

Dennis Klar ◽

Mohammad Aasim Ekhtiyar ◽

Akash Kumar

Keyword(s):

High Throughput ◽

Performance Metrics ◽

Synergistic Effects ◽

Rapid Evolution ◽

Future Research ◽

Cross Layer ◽

Approximate Computing ◽

Approximation Techniques ◽

End To End ◽

A Chain

The rapid evolution of error-resilient programs intertwined with their quest for high throughput has motivated the use of Single Instruction, Multiple Data (SIMD) components in Field-Programmable Gate Arrays (FPGAs). Particularly, to exploit the error-resiliency of such applications, Cross-layer approximation paradigm has recently gained traction, the ultimate goal of which is to efficiently exploit approximation potentials across layers of abstraction. From circuit- to application-level, valuable studies have proposed various approximation techniques, albeit linked to four drawbacks: First, most of approximate multipliers and dividers operate only in SISD mode. Second, imprecise units are often substituted, merely in a single kernel of a multi-kernel application, with an end-to-end analysis in Quality of Results (QoR) and not in the gained performance. Third, state-of-the-art (SoA) strategies neglect the fact that each kernel contributes differently to the end-to-end QoR and performance metrics. Therefore, they lack in adopting a generic methodology for adjusting the approximation knobs to maximize performance gains for a user-defined quality constraint. Finally, multi-level techniques lack in being efficiently supported, from application-, to architecture-, to circuit-level, in a cohesive cross-layer hierarchy. In this article, we propose Plasticine , a cross-layer methodology for multi-kernel applications, which addresses the aforementioned challenges by efficiently utilizing the synergistic effects of a chain of techniques across layers of abstraction. To this end, we propose an application sensitivity analysis and a heuristic that tailor the precision at constituent kernels of the application by finding the most tolerable degree of approximations for each of consecutive kernels, while also satisfying the ultimate user-defined QoR. The chain of approximations is also effectively enabled in a cross-layer hierarchy, from application- to architecture- to circuit-level, through the plasticity of SIMD multiplier-dividers, each supporting dynamic precision variability along with hybrid functionality. The end-to-end evaluations of Plasticine on three multi-kernel applications employed in bio-signal processing, image processing, and moving object tracking for Unmanned Air Vehicles (UAV) demonstrate 41%–64%, 39%–62%, and 70%–86% improvements in area, latency, and Area-Delay-Product (ADP), respectively, over 32-bit fixed precision, with negligible loss in QoR. To springboard future research in reconfigurable and approximate computing communities, our implementations will be available and open-sourced at https://cfaed.tu-dresden.de/pd-downloads.

Download Full-text

Throughput Dependence on SNR in IEEE802.11 WLAN Systems

Advances in Computer and Electrical Engineering - Advanced Methodologies and Technologies in Network Architecture, Mobile Computing, and Data Analytics ◽

10.4018/978-1-5225-7598-6.ch096 ◽

2019 ◽

pp. 1307-1323

Author(s):

Ikponmwosa Oghogho

Keyword(s):

High Throughput ◽

Signal To Noise Ratio ◽

Physical Layer ◽

Future Research ◽

Cross Layer ◽

Signal To Noise ◽

Research Directions ◽

Data Rates ◽

Research Findings ◽

Future Research Directions

This chapter seeks to present the dependence of throughput on signal to noise ratio (SNR) in IEEE802.11 WLAN systems. High throughput and low delays are presented as the requirements for indicating good performance of WLAN systems. The multiple communication data rates specified by the physical layer of IEEE802.11 WLANs which vary depending on the SNR observed is shown to appreciably influence the throughput experienced by the users. Cross-layer modelling principles which simplifies the process of estimating the dependence of throughput on SNR is presented. Recent research findings which apply cross-layer modelling principles to model the dependence of throughput on SNR only is presented along with future research directions.

Download Full-text

Design Space Exploration on High-Order QAM Demodulation Circuits: Algorithms, Arithmetic and Approximation Techniques

Electronics ◽

10.3390/electronics11010039 ◽

2021 ◽

Vol 11 (1) ◽

pp. 39

Author(s):

Ioannis Stratakos ◽

Vasileios Leon ◽

Giorgos Armeniakos ◽

George Lentaris ◽

Dimitrios Soudris

Keyword(s):

Fixed Point ◽

Orthogonal Frequency Division Multiplexing ◽

Design Space Exploration ◽

Performance Metrics ◽

Circuit Complexity ◽

Error Rates ◽

High Order ◽

Approximate Computing ◽

Clock Frequency ◽

Approximation Techniques

Every new generation of wireless communication standard aims to improve the overall performance and quality of service (QoS), compared to the previous generations. Increased data rates, numbers and capabilities of connected devices, new applications, and higher data volume transfers are some of the key parameters that are of interest. To satisfy these increased requirements, the synergy between wireless technologies and optical transport will dominate the 5G network topologies. This work focuses on a fundamental digital function in an orthogonal frequency-division multiplexing (OFDM) baseband transceiver architecture and aims at improving the throughput and circuit complexity of this function. Specifically, we consider the high-order QAM demodulation and apply approximation techniques to achieve our goals. We adopt approximate computing as a design strategy to exploit the error resiliency of the QAM function and deliver significant gains in terms of critical performance metrics. Particularly, we take into consideration and explore four demodulation algorithms and develop accurate floating- and fixed-point circuits in VHDL. In addition, we further explore the effects of introducing approximate arithmetic components. For our test case, we consider 64-QAM demodulators, and the results suggest that the most promising design provides bit error rates (BER) ranging from 10−1 to 10−4 for SNR 0–14 dB in terms of accuracy. Targeting a Xilinx Zynq Ultrascale+ ZCU106 (XCZU7EV) FPGA device, the approximate circuits achieve up to 98% reduction in LUT utilization, compared to the accurate floating-point model of the same algorithm, and up to a 122% increase in operating frequency. In terms of power consumption, our most efficient circuit configurations consume 0.6–1.1 W when operating at their maximum clock frequency. Our results show that if the objective is to achieve high accuracy in terms of BER, the prevailing solution is the approximate LLR algorithm configured with fixed-point arithmetic and 8-bit truncation, providing 81% decrease in LUTs and 13% increase in frequency and sustains a throughput of 323 Msamples/s.

Download Full-text

Throughput Dependence on SNR in IEEE802.11 WLAN Systems

Encyclopedia of Information Science and Technology, Fourth Edition ◽

10.4018/978-1-5225-2255-3.ch574 ◽

2018 ◽

pp. 6618-6629

Author(s):

Ikponmwosa Oghogho

Keyword(s):

High Throughput ◽

Signal To Noise Ratio ◽

Physical Layer ◽

Future Research ◽

Cross Layer ◽

Signal To Noise ◽

Research Directions ◽

Data Rates ◽

Research Findings ◽

Future Research Directions

This chapter seeks to present the dependence of Throughput on Signal to Noise Ratio (SNR) in IEEE802.11 WLAN Systems. High throughput and low delays are presented as the requirements for indicating good performance of WLAN systems. The multiple communication data rates specified by the physical layer of IEEE802.11 WLANs which vary depending on the SNR observed is shown to appreciably influence the throughput experienced by the users. Cross layer modelling principles which simplifies the process of estimating the dependence of throughput on SNR is presented. Recent research findings which apply cross layer modelling principles to model the dependence of throughput on SNR only is presented along with future research directions.

Download Full-text

High Throughput via Cross-Layer Interference Alignment for Mobile Ad Hoc Networks

10.21236/ada596282 ◽

2013 ◽

Author(s):

Jr Heath ◽

Robert W.

Keyword(s):

Ad Hoc Networks ◽

Mobile Ad Hoc Networks ◽

High Throughput ◽

Ad Hoc ◽

Interference Alignment ◽

Cross Layer ◽

Mobile Ad Hoc ◽

Hoc Networks

Download Full-text

A Built-in Hash Permutation Assisted Cross-layer Secure Transport in End-to-End FlexE over WDM Networks

GLOBECOM 2020 - 2020 IEEE Global Communications Conference ◽

10.1109/globecom42002.2020.9322235 ◽

2020 ◽

Author(s):

Pengfei Zhu ◽

Jiabin Cui ◽

Yuefeng Ji

Keyword(s):

Wdm Networks ◽

Cross Layer ◽

End To End

Download Full-text

Shall I Work with Them? A Knowledge Graph-Based Approach for Predicting Future Research Collaborations

Entropy ◽

10.3390/e23060664 ◽

2021 ◽

Vol 23 (6) ◽

pp. 664

Author(s):

Nikos Kanakaris ◽

Nikolaos Giarelis ◽

Ilias Siachos ◽

Nikos Karacapilidis

Keyword(s):

Language Processing ◽

Scientific Knowledge ◽

Link Prediction ◽

Performance Metrics ◽

Future Research ◽

Knowledge Graph ◽

Prediction Problem ◽

Textual Information ◽

Research Collaborations ◽

Processing Techniques

We consider the prediction of future research collaborations as a link prediction problem applied on a scientific knowledge graph. To the best of our knowledge, this is the first work on the prediction of future research collaborations that combines structural and textual information of a scientific knowledge graph through a purposeful integration of graph algorithms and natural language processing techniques. Our work: (i) investigates whether the integration of unstructured textual data into a single knowledge graph affects the performance of a link prediction model, (ii) studies the effect of previously proposed graph kernels based approaches on the performance of an ML model, as far as the link prediction problem is concerned, and (iii) proposes a three-phase pipeline that enables the exploitation of structural and textual information, as well as of pre-trained word embeddings. We benchmark the proposed approach against classical link prediction algorithms using accuracy, recall, and precision as our performance metrics. Finally, we empirically test our approach through various feature combinations with respect to the link prediction problem. Our experimentations with the new COVID-19 Open Research Dataset demonstrate a significant improvement of the abovementioned performance metrics in the prediction of future research collaborations.

Download Full-text

Efficient AI System Design With Cross-Layer Approximate Computing

Proceedings of the IEEE ◽

10.1109/jproc.2020.3029453 ◽

2020 ◽

Vol 108 (12) ◽

pp. 2232-2250

Author(s):

Swagath Venkataramani ◽

Xiao Sun ◽

Naigang Wang ◽

Chia-Yu Chen ◽

Jungwook Choi ◽

...

Keyword(s):

System Design ◽

Cross Layer ◽

Approximate Computing

Download Full-text

Opportunistic Large Array Propagation Models: A Comprehensive Survey

Sensors ◽

10.3390/s21124206 ◽

2021 ◽

Vol 21 (12) ◽

pp. 4206

Author(s):

Farhan Nawaz ◽

Hemant Kumar ◽

Syed Ali Hassan ◽

Haejoon Jung

Keyword(s):

Energy Efficient ◽

Large Scale ◽

Performance Metrics ◽

Experimental Studies ◽

Cooperative Transmission ◽

Analytical Models ◽

Future Research ◽

Large Array ◽

Propagation Models ◽

Comprehensive Survey

Enabled by the fifth-generation (5G) and beyond 5G communications, large-scale deployments of Internet-of-Things (IoT) networks are expected in various application fields to handle massive machine-type communication (mMTC) services. Device-to-device (D2D) communications can be an effective solution in massive IoT networks to overcome the inherent hardware limitations of small devices. In such D2D scenarios, given that a receiver can benefit from the signal-to-noise-ratio (SNR) advantage through diversity and array gains, cooperative transmission (CT) can be employed, so that multiple IoT nodes can create a virtual antenna array. In particular, Opportunistic Large Array (OLA), which is one type of CT technique, is known to provide fast, energy-efficient, and reliable broadcasting and unicasting without prior coordination, which can be exploited in future mMTC applications. However, OLA-based protocol design and operation are subject to network models to characterize the propagation behavior and evaluate the performance. Further, it has been shown through some experimental studies that the most widely-used model in prior studies on OLA is not accurate for networks with networks with low node density. Therefore, stochastic models using quasi-stationary Markov chain are introduced, which are more complex but more exact to estimate the key performance metrics of the OLA transmissions in practice. Considering the fact that such propagation models should be selected carefully depending on system parameters such as network topology and channel environments, we provide a comprehensive survey on the analytical models and framework of the OLA propagation in the literature, which is not available in the existing survey papers on OLA protocols. In addition, we introduce energy-efficient OLA techniques, which are of paramount importance in energy-limited IoT networks. Furthermore, we discuss future research directions to combine OLA with emerging technologies.

Download Full-text

Cross-Layer Design for End-to-End Throughput Maximization and Fairness in MIMO Multihop Wireless Networks

EURASIP Journal on Wireless Communications and Networking ◽

10.1155/2010/515609 ◽

2010 ◽

Vol 2010 (1) ◽

Cited By ~ 3

Author(s):

Jain-Shing Liu ◽

Chun-Hung Richard Lin ◽

Kuang-Yuan Tung

Keyword(s):

Wireless Networks ◽

Multihop Wireless Networks ◽

Throughput Maximization ◽

Cross Layer ◽

Cross Layer Design ◽

Multihop Wireless ◽

End To End

Download Full-text

Cross-Layer Analysis of the End-to-End Delay Distribution in Wireless Sensor Networks

2009 30th IEEE Real-Time Systems Symposium ◽

10.1109/rtss.2009.27 ◽

2009 ◽

Cited By ~ 20

Author(s):

Yunbo Wang ◽

Mehmet C. Vuran ◽

Steve Goddard

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Wireless Sensor ◽

Cross Layer ◽

Delay Distribution ◽

End To End Delay ◽

Layer Analysis ◽

End To End

Download Full-text