TRIPWIRE: A SYNCHRONISATION PRIMITIVE FOR VIRTUAL MEMORY MAPPED COMMUNICATION

Existing user-level network interfaces deliver high bandwidth, low latency performance to applications, but are typically unable to support diverse styles of communication and are unsuitable for use in multiprogrammed environments. Often this is because the network abstraction is presented at too high a level, and support for synchronisation is inflexible. In this paper we present a new primitive for in-band synchronisation: the Tripwire. Tripwires provide a flexible, efficient and scalable means for synchronisation that is orthogonal to data transfer. We describe the implementation of a non-coherent distributed shared memory network interface, with Tripwires for synchronisation. This interface provides a low-level communications model with gigabit class bandwidth and very low overhead and latency. We show how it supports a variety of communication styles, including remote procedure call, message passing and streaming.

Download Full-text

A shared virtual memory network with fast remote direct memory access and message passing

2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935) ◽

10.1109/clustr.2004.1392660 ◽

2005 ◽

Author(s):

Gang Shi ◽

Mingchang Hu ◽

Hongda Yin ◽

Weiwu Hu ◽

Zhimin Tang

Keyword(s):

Message Passing ◽

Direct Memory Access ◽

Virtual Memory ◽

Memory Access ◽

Memory Network ◽

Shared Virtual Memory

Download Full-text

Controller for Network Interface Card on FPGA

International Journal of Reconfigurable and Embedded Systems (IJRES) ◽

10.11591/ijres.v1.i2.pp55-58 ◽

2012 ◽

Vol 1 (2) ◽

pp. 55

Author(s):

Suchita Kamble ◽

N. N. Mhala

Keyword(s):

Operating System ◽

Data Transfer ◽

Direct Memory Access ◽

Main Memory ◽

Network Interface ◽

Medium Access ◽

Hardware Interface ◽

Underlying Network ◽

Peripheral Component Interconnect ◽

Network Interfaces

The continuing advances in the performance of network servers make it essential for network interface cards (NICs) to provide more sophisticated services and data processing. Modern network interfaces provide fixed functionality and are optimized for sending and receiving large packets. Network interface cards allow the operating system to send and receive packets through the main memory to the network. The operating system stores and retrieves data from the main memory and communicates with the NIC over the local interconnect, usually a peripheral component interconnect bus (PCI). Most NICs have a PCI hardware interface to the host server, use a device driver to communicate with the operating system and use local receive and transmit storage buffers. NICs typically have a direct memory access (DMA) engine to transfer data between host memory and the network interface memory. In addition, NICs include a medium access control (MAC) unit to implement the link level protocol for the underlying network such as Ethernet, and use a signal processing hardware to implement the physical (PHY) layer defined in the network. To execute and synchronize the above operations NICs also contents controller whose architecture is customized for network data transfer. In this paper we present the architecture of application specific controller that can be used in NICs.

Download Full-text

Improving data transfer for model coupling

Geoscientific Model Development Discussions ◽

10.5194/gmdd-8-8981-2015 ◽

2015 ◽

Vol 8 (10) ◽

pp. 8981-9020 ◽

Cited By ~ 2

Author(s):

C. Zhang ◽

L. Liu ◽

G. Yang ◽

R. Li ◽

B. Wang

Keyword(s):

Performance Improvement ◽

Message Passing ◽

Message Passing Interface ◽

Data Transfer ◽

The Public ◽

Message Size ◽

Abstract Data ◽

Point To Point ◽

Component Models ◽

Size Variable

Abstract. Data transfer, which means transferring data fields between two component models or rearranging data fields among processes of the same component model, is a fundamental operation of a coupler. Most of state-of-the-art coupler versions currently use an implementation based on the point-to-point (P2P) communication of the Message Passing Interface (MPI) (call such an implementation "P2P implementation" for short). In this paper, we reveal the drawbacks of the P2P implementation, including low communication bandwidth due to small message size, variable and big number of MPI messages, and jams during communication. To overcome these drawbacks, we propose a butterfly implementation for data transfer. Although the butterfly implementation can outperform the P2P implementation in many cases, it degrades the performance in some cases because the total message size transferred by the butterfly implementation is larger than that by the P2P implementation. To make the data transfer completely improved, we design and implement an adaptive data transfer library that combines the advantages of both butterfly implementation and P2P implementation. Performance evaluation shows that the adaptive data transfer library significantly improves the performance of data transfer in most cases and does not decrease the performance in any cases. Now the adaptive data transfer library is open to the public and has been imported into a coupler version C-Coupler1 for performance improvement of data transfer. We believe that it can also improve other coupler versions.

Download Full-text

Characterization of MPI Communication Primitives on a Heterogeneous Cluster

Scientific Research Journal ◽

10.24191/srj.v6i2.5631 ◽

2009 ◽

Vol 6 (2) ◽

pp. 23

Author(s):

Siti Arpah Ahmad ◽

Mohamed Faidz Mohamed Said ◽

Norazan Mohamed Ramli ◽

Mohd Nasir Taib

Keyword(s):

Message Passing ◽

High Performance ◽

Data Transfer ◽

The Other ◽

Small Cluster ◽

Heterogeneous Cluster ◽

Processing Power ◽

Basic Performance ◽

Point To Point

This paper focuses on the performance of basic communication primitives, namely the overlap of message transfer with computation in the point-to-point communication within a small cluster of four nodes. The mpptest has been implemented to measure the basic performance of MPI message passing routines with a variety of message sizes. The mpptest is capable of measuring performance with many participating processes thus exposing contention and scalability problems. This enables programmers to select message sizes in order to isolate and evaluate sudden changes in performance. Investigating these matters is interesting in that non-blocking calls have the advantage of allowing the system to schedule communications even when many processes are running simultaneously. On the other hand, understanding the characteristics of computation and communication overlap is significant, because high- performance kernels often strive to achieve this, since it is both advantageous with respect to data transfer and latency hiding. The results indicate that certain overlap sizes utilize greater node processing power either in blocking send and receive operations or non-blocking send and receive operations. The results have elucidated a detailed MPI characterization of the performance regarding the overlap of message transfer with computation in a small cluster system.

Download Full-text

Integrated network interfaces for high-bandwidth TCP/IP

ACM SIGPLAN Notices ◽

10.1145/1168918.1168897 ◽

2006 ◽

Vol 41 (11) ◽

pp. 315-324 ◽

Cited By ~ 2

Author(s):

Nathan L. Binkert ◽

Ali G. Saidi ◽

Steven K. Reinhardt

Keyword(s):

Integrated Network ◽

High Bandwidth ◽

Network Interfaces

Download Full-text

REGION SPECIFIC WAVELET COMPRESSION FOR 4K SURVEILLANCE IMAGES

Asian Journal of Pharmaceutical and Clinical Research ◽

10.22159/ajpcr.2017.v10s1.19650 ◽

2017 ◽

Vol 10 (13) ◽

pp. 247

Author(s):

Ankush Rai ◽

Jagadeesh Kannan R

Keyword(s):

Image Compression ◽

Information Flow ◽

Information Transfer ◽

Data Transfer ◽

Image Sequences ◽

Successful Transmission ◽

Specific Approach ◽

Time Compression ◽

High Bandwidth ◽

Transfer Cost

For successful transmission of massively sequenced images during 4K surveillance operations large amount of data transfer cost high bandwidth, latency and delay of information transfer. Thus, there lies a need for real-time compression of this image sequences. In this study we present a region specific approach for wavelet based image compression to enable management of huge chunks of information flow by transforming Harr wavelets in hierarchical order.

Download Full-text