processing elements
Recently Published Documents


TOTAL DOCUMENTS

317
(FIVE YEARS 76)

H-INDEX

14
(FIVE YEARS 1)

2022 ◽  
Vol 18 (2) ◽  
pp. 1-22
Author(s):  
João Paulo Cardoso de Lima ◽  
Marcelo Brandalero ◽  
Michael Hübner ◽  
Luigi Carro

Accelerating finite-state automata benefits several emerging application domains that are built on pattern matching. In-memory architectures, such as the Automata Processor (AP), are efficient to speed them up, at least for outperforming traditional von-Neumann architectures. In spite of the AP’s massive parallelism, current APs suffer from poor memory density, inefficient routing architectures, and limited capabilities. Although these limitations can be lessened by emerging memory technologies, its architecture is still the major source of huge communication demands and lack of scalability. To address these issues, we present STAP , a Scalable TCAM-based architecture for Automata Processing . STAP adopts a reconfigurable array of processing elements, which are based on memristive Ternary CAMs (TCAMs), to efficiently implement Non-deterministic finite automata (NFAs) through proper encoding and mapping methods. The CAD tool for STAP integrates the design flow of automata applications, a specific mapping algorithm, and place and route tools for connecting processing elements by RRAM-based programmable interconnects. Results showed 1.47× higher throughput when processing 16-bit input symbols, and improvements of 3.9× and 25× on state and routing densities over the state-of-the-art AP, while preserving 10 4 programming cycles.


Micromachines ◽  
2021 ◽  
Vol 13 (1) ◽  
pp. 67
Author(s):  
Oscar Camps ◽  
Mohamad Moner Al Chawa ◽  
Stavros G. Stavrinides ◽  
Rodrigo Picos

Cellular Nonlinear Networks (CNN) are a concept introduced in 1988 by Leon Chua and Lin Yang as a bio-inspired architecture capable of massively parallel computation. Since then, CNN have been enhanced by incorporating designs that incorporate memristors to profit from their processing and memory capabilities. In addition, Stochastic Computing (SC) can be used to optimize the quantity of required processing elements; thus it provides a lightweight approximate computing framework, quite accurate and effective, however. In this work, we propose utilization of SC in designing and implementing a memristor-based CNN. As a proof of the proposed concept, an example of application is presented. This application combines Matlab and a FPGA in order to create the CNN. The implemented CNN was then used to perform three different real-time applications on a 512 × 512 gray-scale and a 768 × 512 color image: storage of the image, edge detection, and image sharpening. It has to be pointed out that the same CNN was used for the three different tasks, with the sole change of some programmable parameters. Results show an excellent capability with significant accompanying advantages, such as the low number of needed elements further allowing for a low cost FPGA-based system implementation, something confirming the system’s capacity for real time operation.


2021 ◽  
Author(s):  
Dariusz Ruciński

The article is an attempt of the methodological approach to the proposed quantum-inspired method of neural modeling of prices quoted on the Day-Ahead Market operating at TGE S.A. In the proposed quantum-inspired neural model it was assumed, inter alia, that it is composed of 12 parallel Perceptron ANNs with one hidden layer. Moreover, it was assumed that weights and biases as processing elements are described by density matrices, and the values flowing through the Artificial Neural Network of Signals are represented by qubits. Calculations checking the correctness of the adopted method and model were carried out with the use of linear algebra and vector-matrix calculus in MATLAB and Simulink environments. The obtained research results were compared to the results obtained from the neural model with the use of a comparative model.


Electronics ◽  
2021 ◽  
Vol 10 (23) ◽  
pp. 2989
Author(s):  
Peng Liu ◽  
Yan Song

Vision processing chips have been widely used in image processing and recognition tasks. They are conventionally designed based on the image signal processing (ISP) units directly connected with the sensors. In recent years, convolutional neural networks (CNNs) have become the dominant tools for many state-of-the-art vision processing tasks. However, CNNs cannot be processed by a conventional vision processing unit (VPU) with a high speed. On the other side, the CNN processing units cannot process the RAW images from the sensors directly and an ISP unit is required. This makes a vision system inefficient with a lot of data transmission and redundant hardware resources. Additionally, many CNN processing units suffer from a low flexibility for various CNN operations. To solve this problem, this paper proposed an efficient vision processing unit based on a hybrid processing elements array for both CNN accelerating and ISP. Resources are highly shared in this VPU, and a pipelined workflow is introduced to accelerate the vision tasks. We implement the proposed VPU on the Field-Programmable Gate Array (FPGA) platform and various vision tasks are tested on it. The results show that this VPU achieves a high efficiency for both CNN processing and ISP and shows a significant reduction in energy consumption for vision tasks consisting of CNNs and ISP. For various CNN tasks, it maintains an average multiply accumulator utilization of over 94% and achieves a performance of 163.2 GOPS with a frequency of 200 MHz.


Mathematics ◽  
2021 ◽  
Vol 9 (23) ◽  
pp. 3012
Author(s):  
Amir Masoud Rahmani ◽  
Rizwan Ali Naqvi ◽  
Saqib Ali ◽  
Seyedeh Yasaman Hosseini Mirmahaleh ◽  
Mohammed Alswaitti ◽  
...  

In deploying the Internet of Things (IoT) and Internet of Medical Things (IoMT)-based applications and infrastructures, the researchers faced many sensors and their output’s values, which have transferred between service requesters and servers. Some case studies addressed the different methods and technologies, including machine learning algorithms, deep learning accelerators, Processing-In-Memory (PIM), and neuromorphic computing (NC) approaches to support the data processing complexity and communication between IoMT nodes. With inspiring human brain structure, some researchers tackled the challenges of rising IoT- and IoMT-based applications and neural structures’ simulation. A defective device has destructive effects on the performance and cost of the applications, and their detection is challenging for a communication infrastructure with many devices. We inspired astrocyte cells to map the flow (AFM) of the Internet of Medical Things onto mesh network processing elements (PEs), and detect the defective devices based on a phagocytosis model. This study focuses on an astrocyte’s cholesterol distribution into neurons and presents an algorithm that utilizes its pattern to distribute IoMT’s dataflow and detect the defective devices. We researched Alzheimer’s symptoms to understand astrocyte and phagocytosis functions against the disease and employ the vaccination COVID-19 dataset to define a set of task graphs. The study improves total runtime and energy by approximately 60.85% and 52.38% after implementing AFM, compared with before astrocyte-flow mapping, which helps IoMT’s infrastructure developers to provide healthcare services to the requesters with minimal cost and high accuracy.


2021 ◽  
Author(s):  
Alejandro Moran ◽  
Vincent Canals ◽  
Plamen P. Angelov ◽  
Christian F. Frasser ◽  
Erik S. Skibinsky-Gitlin ◽  
...  

Author(s):  
Miriam Leeser ◽  
Suranga Handagala ◽  
Michael Zink

As cloud computing grows,  the types of computational hardware available in the cloud are diversifying. Field Programmable Gate Arrays (FPGAs) are a relatively new addition to high-performance computing in the cloud, with the ability to accelerate a range of different applications, and the flexibility to offer different cloud computing models. A new and growing configuration is to have the FPGAs directly connected to the network and thus reduce the latency in delivering data to processing elements. We survey the state-of-the-art in FPGAs in the cloud and present the Open Cloud Testbed (OCT), a testbed for research and experimentation into new cloud platforms, which includes network-attached FPGAs in the cloud.


Electronics ◽  
2021 ◽  
Vol 10 (21) ◽  
pp. 2724
Author(s):  
Nandakishor Yadav ◽  
Youngbae Kim ◽  
Shuai Li ◽  
Kyuwon Ken Choi

The machine learning and convolutional neural network (CNN)-based intelligent artificial accelerator needs significant parallel data processing from the cache memory. The separate read port is mostly used to design built-in computational memory (CRAM) to reduce the data processing bottleneck. This memory uses multi-port reading and writing operations, which reduces stability and reliability. In this paper, we proposed a self-adaptive 12T SRAM cell to increase the read stability for multi-port operation. The self-adaptive technique increases stability and reliability. We increased the read stability by refreshing the storing node in the read mode of operation. The proposed technique also prevents the bit-interleaving problem. Further, we offered a butterfly-inspired SRAM bank to increase the performance and reduce the power dissipation. The proposed SRAM saves 12% more total power than the state-of-the-art 12T SRAM cell-based SRAM. We improve the write performance by 28.15% compared with the state-of-the-art 12T SRAM design. The total area overhead of the proposed architecture compared to the conventional 6T SRAM cell-based SRAM is only 1.9 times larger than the 6T SRAM cell.


Author(s):  
Oscar Camps ◽  
Mohamed-Moner al Chawa ◽  
Stavros G. Stavrinides ◽  
Rodrigo Picos

Cellular Nonlinear Networks (CNN) are a concept introduced in 1988 by Leon Chua and Lin Yang as a bio-inspired architecture, capable of massively parallel computation. Later on, CNN have been enhanced by incorporating designs that incorporate memristors to profit from their processing and memory capabilities. In addition, Stochastic Computing (SC) can be used to optimize the quantity of required processing elements; thus it provides a lightweight approximate computing framework, quite accurate and effective, though. In this work, we propose utilization of SC in designing and implementing a memristor-based CNN. As a proof of the proposed concept, an example of application is presented. This application combines Matlab and a FPGA in order to create the CNN. The implemented CNN has then been used to perform three different real-time applications on a 512x512 gray-scale and a 768x512 color image: storage of the image, edge detection, and image sharpening. It has to be pointed out that the same CNN has been used for the three different tasks, with the sole change of some programmable parameters. Results show an excellent capability with significant accompanying advantages, like the low number of needed elements further allowing for a low cost FPGA-based system implementation, something confirming the system’s ability for real time operation.


2021 ◽  
Author(s):  
Viorica Cazac-Scobioala ◽  
◽  
Jana Cirja ◽  
Elena Ursu ◽  
◽  
...  

The paper presents the results of the study of the techniques of joining the parts of traditional shirts. The analysis of museum pieces presents various techniques for combining landmarks with a functional and aesthetic role. In traditional holiday women’s shirts, the joining of of parts on the lines arranged in the accessible visual perception area techniques of joining by keys of different complexity, integrated solutions of edge processing elements by crocheting and joining techniques by keys („cheițe”) were used. The joining lines of the parts arranged in areas with low visibility show applications of simple joining techniques such as stitches before the needle, after the needle, the processing of the edges of the parts with scalloped points. Traditional men’s holiday shirts used the joining of keys in some models, but shirts were also made in which the application of the techniques of joining by keys is missing. In the usual traditional shirts, the techniques of joining by keys are very rare. The terminal lines of the parts of the holiday shirts, as well, present various processing and decoration techniques applying the processing with «brezărău», the processing with crocheted lace.


Sign in / Sign up

Export Citation Format

Share Document