memory scheduling
Recently Published Documents


TOTAL DOCUMENTS

44
(FIVE YEARS 1)

H-INDEX

8
(FIVE YEARS 0)

Electronics ◽  
2021 ◽  
Vol 10 (8) ◽  
pp. 972
Author(s):  
Mehdi Pirahandeh ◽  
Shan Ullah ◽  
Deok-Hwan Kim

The gradual increase in latency-sensitive, real-time applications for embedded systems encourages users to share sensor data simultaneously, yet streamed sensor data suffer from poor performance. In this paper, we propose a new high-bandwidth, edge-based scheduling method for decreasing driver-profiling latency. The proposed multi-level memory scheduling method places data in key-value storage, flushes sensor data when the edge memory is full, and reduces the number of I/O operations, the network latency, and the number of REST API calls in the edge cloud. As a result, the proposed method provides a significant read/write performance enhancement for real-time embedded systems: it improves the number of requests per second by 3.5, 5, and 4 times, and the bandwidth by 5.89, 5.58, and 4.16 times, compared with the existing light-weight FCN-LSTM, FCN-LSTM, and DeepConvRNN Attention solutions, respectively.
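The flush-on-full buffering the abstract describes can be sketched as follows. This is a minimal illustration, not the authors' implementation: the class name, the capacity policy, and the `flush_fn` sink are all assumptions. The point it shows is that staging readings in an in-memory key-value store and flushing them in batches replaces many small I/O operations or REST calls with one bulk write.

```python
from collections import OrderedDict

class EdgeKVBuffer:
    """Hypothetical sketch: stage sensor readings in an in-memory
    key-value store and flush them as one batch when the edge memory
    budget is exhausted, trading many small writes for one bulk call."""

    def __init__(self, capacity, flush_fn):
        self.capacity = capacity   # max entries held in edge memory
        self.flush_fn = flush_fn   # bulk sink, e.g. one REST call
        self.store = OrderedDict()

    def put(self, key, value):
        self.store[key] = value
        if len(self.store) >= self.capacity:   # edge memory full
            self.flush()

    def flush(self):
        if self.store:
            self.flush_fn(dict(self.store))    # one batched write
            self.store.clear()

# usage: 10 readings with a 4-entry budget reach the sink in 3 calls
batches = []
buf = EdgeKVBuffer(capacity=4, flush_fn=batches.append)
for i in range(10):
    buf.put(f"sensor-{i}", i * 0.1)
buf.flush()   # drain the remainder
```

Batching is what drives down the I/O and REST-call counts the abstract reports; the key-value layout additionally lets later writes to the same key overwrite stale readings before they ever hit the network.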



2020 ◽  
Vol 76 (4) ◽  
pp. 3129-3154
Author(s):  
Juan Fang ◽  
Mengxuan Wang ◽  
Zelin Wei

Multiple CPUs and GPUs are integrated on the same chip to share memory, and access requests from different cores interfere with one another. Memory requests from the GPU seriously degrade CPU memory access performance, requests from multiple CPUs become intertwined when accessing memory, and the difference in access latency between GPU cores increases the average memory access latency. To solve these problems in the shared memory of heterogeneous multi-core systems, we propose a step-by-step memory scheduling strategy that improves system performance. First, when the memory controller receives a memory request, the strategy creates a new request queue based on the request source and isolates CPU requests from GPU requests, preventing GPU requests from interfering with CPU requests. Second, for the CPU request queue, a dynamic bank partitioning strategy maps applications to different bank sets according to their memory access characteristics, eliminating memory request interference between CPU applications without sacrificing bank-level parallelism. Finally, for the GPU request queue, criticality is introduced to measure the difference in memory access latency between cores; on top of the first-ready, first-come first-served (FR-FCFS) policy, we implement criticality-aware memory scheduling to balance the locality and criticality of application accesses.
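The three steps above can be sketched in a toy scheduler. This is an illustrative sketch only: the class name, the round-robin bank assignment, and the request fields are assumptions, not the paper's design. It shows (1) CPU/GPU traffic isolated into separate queues, (2) CPU applications mapped to disjoint bank sets, and (3) GPU requests ordered row-buffer-hit first (FR-FCFS), with criticality breaking ties.

```python
from collections import deque

class StepwiseScheduler:
    """Hypothetical sketch of a step-by-step memory scheduler:
    source-split queues, per-application bank partitioning for CPUs,
    and criticality-aware FR-FCFS ordering for GPU requests."""

    def __init__(self, n_banks, cpu_apps):
        self.cpu_q, self.gpu_q = deque(), deque()
        # step 2: partition banks across CPU applications; a simple
        # round-robin assignment stands in for dynamic partitioning
        self.bank_of = {
            app: [b for b in range(n_banks) if b % len(cpu_apps) == i]
            for i, app in enumerate(cpu_apps)
        }

    def enqueue(self, req):
        # step 1: isolate CPU traffic from GPU traffic at the controller
        (self.cpu_q if req["src"] == "cpu" else self.gpu_q).append(req)

    def next_gpu(self, open_row):
        # step 3: prefer row-buffer hits (first-ready), then higher
        # criticality; FCFS order breaks any remaining ties
        best = max(self.gpu_q,
                   key=lambda r: (r["row"] == open_row, r["criticality"]))
        self.gpu_q.remove(best)
        return best
```

A real controller would re-evaluate the bank partition periodically as application behavior changes; the fixed round-robin split here only illustrates the isolation property (no two applications share a bank set).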





Electronics ◽  
2019 ◽  
Vol 8 (4) ◽  
pp. 371 ◽  
Author(s):  
Qinyu Chen ◽  
Yuxiang Fu ◽  
Wenqing Song ◽  
Kaifeng Cheng ◽  
Zhonghai Lu ◽  
...  

Convolutional Neural Networks (CNNs) have been widely applied in various fields, such as image recognition and speech processing, as well as in many big-data analysis tasks. However, their large size and intensive computation hinder their deployment in hardware, especially on embedded systems with stringent latency, power, and area requirements. To address this issue, low bit-width CNNs are proposed as a highly competitive candidate. In this paper, we propose an efficient, scalable accelerator for low bit-width CNNs based on a parallel streaming architecture. With a novel coarse-grained task partitioning (CGTP) strategy, the proposed accelerator with heterogeneous computing units, supporting multi-pattern dataflows, can nearly double the throughput for various CNN models on average. Besides, a hardware-friendly algorithm is proposed to simplify the activation and quantification process, which reduces the power dissipation and area overhead. Based on the optimized algorithm, an efficient reconfigurable three-stage activation-quantification-pooling (AQP) unit with a low-power staged blocking strategy is developed, which can process activation, quantification, and max-pooling operations simultaneously. Moreover, an interleaving memory scheduling scheme is proposed to support the streaming architecture well. The accelerator is implemented in TSMC 40 nm technology with a core size of 0.17 mm². It achieves 7.03 TOPS/W energy efficiency and 4.14 TOPS/mm² area efficiency at 100.1 mW, which makes it a promising design for embedded devices.
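The interleaving idea behind the memory scheduling scheme can be illustrated with a low-order address interleaving function. This is a generic sketch of bank interleaving, not the paper's specific scheme: mapping consecutive addresses to consecutive banks lets a streaming unit reading a contiguous feature map cycle through all banks instead of serializing on one.

```python
def interleave(addr, n_banks):
    """Low-order bank interleaving: consecutive addresses land on
    consecutive banks, spreading a contiguous stream across the
    whole memory so accesses proceed without bank conflicts."""
    return addr % n_banks, addr // n_banks   # (bank, offset in bank)

# a contiguous 8-word stream spread over 4 banks:
banks = [interleave(a, 4)[0] for a in range(8)]
# each bank serves every 4th word, so 4 accesses can overlap
```

With the stream spread this way, the producer and consumer stages of a streaming pipeline can access different banks in the same cycle, which is the property an interleaving scheduler exploits.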





2018 ◽  
Vol 17 (4) ◽  
pp. 1-25
Author(s):  
Guan Wang ◽  
Chuanqi Zang ◽  
Lei Ju ◽  
Mengying Zhao ◽  
Xiaojun Cai ◽  
...  


2017 ◽  
Vol 11 (4) ◽  
pp. 2839-2851 ◽  
Author(s):  
Gangyong Jia ◽  
Guangjie Han ◽  
Aohan Li ◽  
Jaime Lloret




Author(s):  
Gustavo A. Chaparro-Baquero ◽  
Shi Sha ◽  
Soamar Homsi ◽  
Wujie Wen ◽  
Gang Quan

