An FPGA-Based Quantum Computing Emulation Framework Based on Serial-Parallel Architecture

International Journal of Reconfigurable Computing ◽

10.1155/2016/5718124 ◽

2016 ◽

Vol 2016 ◽

pp. 1-18 ◽

Cited By ~ 11

Author(s):

Y. H. Lee ◽

M. Khalil-Hani ◽

M. N. Marsono

Keyword(s):

Processing Speed ◽

Large Scale ◽

Quantum Algorithms ◽

Parallel Architecture ◽

Quantum Systems ◽

Quantum Circuits ◽

Proof Of Concept ◽

Hardware Emulation ◽

Field Programmable ◽

Speed Up

Hardware emulation of quantum systems can mimic more efficiently the parallel behaviour of quantum computations, thus allowing higher processing speed-up than software simulations. In this paper, an efficient hardware emulation method that employs a serial-parallel hardware architecture targeted for field programmable gate array (FPGA) is proposed. Quantum Fourier transform and Grover’s search are chosen as case studies in this work since they are the core of many useful quantum algorithms. Experimental work shows that, with the proposed emulation architecture, a linear reduction in resource utilization is attained against the pipeline implementations proposed in prior works. The proposed work contributes to the formulation of a proof-of-concept baseline FPGA emulation framework with optimization on datapath designs that can be extended to emulate practical large-scale quantum circuits.
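As a point of reference for the software simulations the abstract compares against, the quantum Fourier transform on a small state vector can be sketched in a few lines. This is a minimal dense-matrix illustration, not the paper's serial-parallel FPGA datapath; the function name `qft` and the 3-qubit input are assumptions made for the example.

```python
import cmath

def qft(state):
    """Apply the quantum Fourier transform to a state vector of length 2**n.

    Plain O(N^2) software simulation (hypothetical helper, not the paper's
    hardware architecture).
    """
    n_amp = len(state)
    omega = cmath.exp(2j * cmath.pi / n_amp)  # primitive N-th root of unity
    scale = 1 / n_amp ** 0.5                  # keeps the transform unitary
    return [
        scale * sum(state[j] * omega ** (j * k) for j in range(n_amp))
        for k in range(n_amp)
    ]

# Basis state |001> on 3 qubits (8 amplitudes).
state = [0j] * 8
state[1] = 1 + 0j
out = qft(state)
probs = [abs(a) ** 2 for a in out]  # measurement probabilities after the QFT
```

The work grows quadratically in the number of amplitudes, which itself doubles with every added qubit; that cost is the motivation for parallel hardware emulation.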

An Efficient Software Architecture for Automated Coupling of Convection and Thermal Radiation Tools

Heat Transfer: Volume 3 ◽

10.1115/ht2008-56303 ◽

2008 ◽

Cited By ~ 1

Author(s):

Christian Rauch ◽

Thomas Hörmann ◽

Sebastian Jagsch ◽

Raimund Almbauer

Keyword(s):

Software Architecture ◽

Large Scale ◽

Exhaust System ◽

Industrial Applications ◽

Proof Of Concept ◽

Speed Up ◽

Simulation Results ◽

Improved Performance ◽

New System ◽

Efficient Software

Research and development engineers have recently paid much attention to multi-physics calculations. One way to perform them is to couple commercial tools for examining complex systems. Since a software architecture for coupling programs was proposed in a previous paper, significant changes have led to improved performance for large-scale industrial applications. This architecture is described and, as a proof of concept, a simulation is conducted by coupling two commercial solvers. The speed-up of the new system is presented. The simulation results are then compared with measurements of surface temperatures of an exhaust system of an actual sport utility vehicle (SUV), and conclusions are drawn. The proposed architecture is easily adaptable to various programs, as it is implemented in C++ and changes for a specific code can be restricted to a few classes.

Advanced exact synthesis of Clifford+T circuits

Quantum Information Processing ◽

10.1007/s11128-020-02816-0 ◽

2020 ◽

Vol 19 (9) ◽

Author(s):

Philipp Niemann ◽

Robert Wille ◽

Rolf Drechsler

Keyword(s):

Large Scale ◽

Fault Tolerant ◽

Logic Synthesis ◽

Research Problem ◽

Quantum Systems ◽

Quantum Circuits ◽

Research Attention ◽

Small Quantum ◽

Significant Research ◽

Important Research Problem

Quantum systems provide a new way of conducting computations based on so-called qubits. Due to the potential for significant speed-ups, this field has received significant research attention in recent years. The Clifford+T library is a very promising and popular gate library for these kinds of computations. Unlike other libraries considered so far, it consists of only a small number of gates, for all of which robust, fault-tolerant realizations are known for many technologies that seem to be promising for large-scale quantum computing. As a consequence, (logic) synthesis of Clifford+T quantum circuits became an important research problem. However, previous work in this area has several drawbacks: corresponding approaches are either only applicable to very small quantum systems or lead to circuits that are far from optimal. The latter is mainly caused by the fact that current synthesis realizes the desired circuit by a local, i.e., column-wise, consideration of the underlying unitary transformation matrix to be synthesized. In this paper, we analyze the conceptual drawbacks of this approach and propose to overcome them by taking a global view of the matrices and performing a separation of concerns regarding individual synthesis steps. We precisely describe a corresponding algorithm as well as its efficient implementation on top of decision diagrams. Experimental results confirm the resulting benefits and show improvements of up to several orders of magnitude in costs compared to previous work.

Fisher Information in Noisy Intermediate-Scale Quantum Applications

Quantum ◽

10.22331/q-2021-09-09-539 ◽

2021 ◽

Vol 5 ◽

pp. 539

Author(s):

Johannes Jakob Meyer

Keyword(s):

Fisher Information ◽

Quantum Algorithms ◽

Quantum Fisher Information ◽

Quantum Systems ◽

Quantum Circuits ◽

Quantum Computers ◽

Quantum Devices ◽

Intermediate Scale ◽

Quantum Sensing ◽

Near Term

The recent advent of noisy intermediate-scale quantum devices, especially near-term quantum computers, has sparked extensive research efforts concerned with their possible applications. At the forefront of the considered approaches are variational methods that use parametrized quantum circuits. The classical and quantum Fisher information are firmly rooted in the field of quantum sensing and have proven to be versatile tools to study such parametrized quantum systems. Their utility in the study of other applications of noisy intermediate-scale quantum devices, however, has only been discovered recently. Hoping to stimulate more such applications, this article aims to further popularize classical and quantum Fisher information as useful tools for near-term applications beyond quantum sensing. We start with a tutorial that builds an intuitive understanding of classical and quantum Fisher information and outlines how both quantities can be calculated on near-term devices. We also elucidate their relationship and how they are influenced by noise processes. Next, we give an overview of the core results of the quantum sensing literature and proceed to a comprehensive review of recent applications in variational quantum algorithms and quantum machine learning.

A LOGIC FOR QUANTUM COMPUTATION AND CLASSICAL SIMULATION OF QUANTUM ALGORITHMS

International Journal of Quantum Information ◽

10.1142/s0219749908003554 ◽

2008 ◽

Vol 06 (02) ◽

pp. 255-280

Author(s):

M. K. PATRA

Keyword(s):

Quantum Computation ◽

General Formula ◽

Quantum Algorithms ◽

Quantum Systems ◽

Quantum Circuits ◽

Finite Dimensional ◽

Classical Simulation ◽

Complex Extension

The semantics of a language for reasoning about finite-dimensional quantum systems is presented. This language can express most important classes of assertions about quantum systems, including formulas for outputs of all combinational quantum circuits/algorithms. The main result of this paper is an algorithm for efficient translation of a formula of this language into an equivalent formula in another decidable language ℝℂ, which is the language of reals and its complex extension. An important consequence is a descriptive characterization of quantum circuits that can be efficiently simulated classically. We illustrate this with examples of two classes of quantum circuits which are known to have efficient classical simulation. The algorithm for deciding the satisfiability of a general formula can be adapted for the simulation.

A method for synthesis and optimization for linear nearest neighbor quantum circuits by parallel processing

Quantum Information and Computation ◽

10.26421/qic18.13-14-2 ◽

2018 ◽

Vol 18 (13&14) ◽

pp. 1095-1114

Author(s):

Zongyuan Zhang ◽

Zhijin Guan ◽

Hong Zhang ◽

Haiying Ma ◽

Weiping Ding

Keyword(s):

Parallel Processing ◽

Large Scale ◽

Nearest Neighbor ◽

Matrix Multiplication ◽

Quantum Circuit ◽

Quantum Circuits ◽

Quantum Cost ◽

The Matrix ◽

Speed Up ◽

Serial Algorithm

In order to realize the linear nearest neighbor (LNN) architecture of quantum circuits and reduce the quantum cost of linear reversible quantum circuits, a method for synthesizing and optimizing linear reversible quantum circuits based on matrix multiplication of the structure of the quantum circuit is proposed. This method obtains the matrix representation of linear quantum circuits by multiplying matrices of different parts of the whole circuit. The LNN realization by adding SWAP gates is proposed, and the equivalence of two ways of adding the SWAP gates is proved. The elimination rules of the SWAP gates between two overlapped adjacent quantum gates in different cases are proposed, which reduce the quantum cost of quantum circuits after realizing the LNN architecture. We propose an algorithm based on parallel processing in order to effectively reduce the time consumption for large-scale quantum circuits. Experiments show that the quantum cost can be improved by 34.31% on average and that the GPU-based algorithm can reach a speed-up ratio of 4 compared with the CPU-based algorithm. The average time optimization ratio of the benchmark large-scale circuits in RevLib processed by the parallel algorithm is 95.57% compared with the serial algorithm.
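The idea of reaching an LNN realization by adding SWAP gates can be illustrated with a small sketch. It assumes a single CNOT on a one-dimensional line of qubits and moves only the control toward the target, then restores the original order; the function name and the tuple-based gate encoding are hypothetical, and the paper's elimination rules and matrix-based synthesis are not reproduced here.

```python
def make_nearest_neighbor(control, target):
    """Return a gate list realizing CNOT(control, target) using only
    gates on adjacent qubits of a line (illustrative strategy only)."""
    if control == target:
        raise ValueError("control and target must differ")
    step = 1 if target > control else -1
    swaps = []
    pos = control
    # Move the control's state next to the target, one wire at a time.
    while abs(target - pos) > 1:
        swaps.append(("SWAP", pos, pos + step))
        pos += step
    # Apply the now-adjacent CNOT, then undo the SWAPs in reverse order.
    return swaps + [("CNOT", pos, target)] + swaps[::-1]

gates = make_nearest_neighbor(0, 3)
```

For a control and target that are d wires apart, this naive scheme adds 2(d - 1) SWAP gates, which is the kind of overhead that SWAP elimination rules aim to reduce.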

Cost function dependent barren plateaus in shallow parametrized quantum circuits

Nature Communications ◽

10.1038/s41467-021-21728-w ◽

2021 ◽

Vol 12 (1) ◽

Cited By ~ 3

Author(s):

M. Cerezo ◽

Akira Sone ◽

Tyler Volkoff ◽

Lukasz Cincio ◽

Patrick J. Coles

Keyword(s):

Cost Function ◽

Large Scale ◽

Quantum Algorithms ◽

Quantum Circuit ◽

Quantum Circuits ◽

Quantum Computers ◽

Heuristic Methods ◽

Practical Applications ◽

Local Observables ◽

Large Scale Simulations

Variational quantum algorithms (VQAs) optimize the parameters θ of a parametrized quantum circuit V(θ) to minimize a cost function C. While VQAs may enable practical applications of noisy quantum computers, they are nevertheless heuristic methods with unproven scaling. Here, we rigorously prove two results, assuming V(θ) is an alternating layered ansatz composed of blocks forming local 2-designs. Our first result states that defining C in terms of global observables leads to exponentially vanishing gradients (i.e., barren plateaus) even when V(θ) is shallow. Hence, several VQAs in the literature must revise their proposed costs. On the other hand, our second result states that defining C with local observables leads to at worst a polynomially vanishing gradient, so long as the depth of V(θ) is $\mathcal{O}(\log n)$. Our results establish a connection between locality and trainability. We illustrate these ideas with large-scale simulations, up to 100 qubits, of a quantum autoencoder implementation.

Direct measurement of electron-diffraction-pattern intensities using an energy loss spectrometer

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100155311 ◽

1989 ◽

Vol 47 ◽

pp. 668-669

Author(s):

A. G. Jackson ◽

M. Rowe

Keyword(s):

Thermal Effects ◽

Dynamic Range ◽

Scattering Amplitude ◽

Dynamical Theory ◽

Site Symmetry ◽

Proof Of Concept ◽

Parallel Acquisition ◽

Kinematic Approximation ◽

Speed Up ◽

Structure Calculations

Diffraction intensities from intermetallic compounds are, in the kinematic approximation, proportional to the scattering amplitude from the element doing the scattering. More detailed calculations have shown that site symmetry and occupation by various atom species also affect the intensity in a diffracted beam. [1] Hence, by measuring the intensities of beams, or their ratios, the occupancy can be estimated. Measurement of the intensity values also allows structure calculations to be made to determine the spatial distribution of the potentials doing the scattering. Thermal effects are also present as a background contribution. Inelastic effects such as loss or absorption/excitation complicate the intensity behavior, and dynamical theory is required to estimate the intensity value. The dynamic range of currents in diffracted beams can be 10^4 or 10^5:1. Hence, detection of such information requires a means for collecting the intensity over a signal-to-noise range beyond that obtainable with a single film plate, which has a S/N of about 10^3:1. Although such a collection system is not available currently, a simple system consisting of instrumentation on an existing STEM can be used as a proof of concept which has a S/N of about 255:1, limited by the 8 bit pixel attributes used in the electronics. Use of 24 bit pixel attributes would easily allow the desired noise range to be attained in the processing instrumentation. The S/N of the scintillator used by the photoelectron sensor is about 10^6 to 1, well beyond the S/N goal. The trade-off that must be made is the time for acquiring the signal, since the pattern can be obtained in seconds using film plates, compared to 10 to 20 minutes for a pattern to be acquired using the digital scan. Parallel acquisition would, of course, speed up this process immensely.
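The link between pixel bit depth and the quoted ratios can be checked with a short calculation. The sketch below assumes the attainable ratio is limited purely by quantization, i.e. the largest code value over the smallest nonzero one, which matches the 255:1 figure given for 8 bit pixels; the helper names are hypothetical.

```python
def max_ratio(bits):
    """Largest-to-smallest nonzero code value for an unsigned pixel of `bits` bits."""
    return 2 ** bits - 1

def bits_needed(ratio):
    """Smallest bit depth whose quantization-limited ratio reaches `ratio`:1."""
    bits = 1
    while max_ratio(bits) < ratio:
        bits += 1
    return bits

eight_bit = max_ratio(8)          # the proof-of-concept system
for_1e5 = bits_needed(10 ** 5)    # upper end of the diffracted-beam dynamic range
twenty_four_bit = max_ratio(24)   # the proposed upgrade
```

Under this assumption, a 10^5:1 range needs at least 17 bits, so 24 bit pixel attributes leave ample headroom.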

Sub-triangle opacity masks for faster ray tracing of transparent objects

Proceedings of the ACM on Computer Graphics and Interactive Techniques ◽

10.1145/3406180 ◽

2020 ◽

Vol 3 (2) ◽

pp. 1-12

Author(s):

Holger Gruen ◽

Carsten Benthin ◽

Sven Woop

Keyword(s):

Ray Tracing ◽

Integrate Approach ◽

Proof Of Concept ◽

Test Operation ◽

Transparent Objects ◽

Speed Up

We propose an easy and simple-to-integrate approach to accelerate ray tracing of alpha-tested transparent geometry with a focus on Microsoft® DirectX® or Vulkan® ray tracing extensions. Pre-computed bit masks are used to quickly determine fully transparent and fully opaque regions of triangles thereby skipping the more expensive alpha-test operation. These bit masks allow us to skip up to 86% of all transparency tests, yielding up to 40% speed up in a proof-of-concept DirectX® software only implementation.
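The mask lookup described above can be sketched as follows. This assumes each triangle is split into numbered sub-triangle cells and that the cell index of a hit is already known; the names `build_masks` and `classify_hit`, the two-mask layout, and the 0.5 alpha threshold are illustrative assumptions, not the authors' DirectX implementation.

```python
def build_masks(cell_alphas, threshold=0.5):
    """Pre-compute per-triangle bit masks from non-empty alpha samples per cell."""
    opaque_mask = 0
    transparent_mask = 0
    for cell, alphas in enumerate(cell_alphas):
        if all(a >= threshold for a in alphas):
            opaque_mask |= 1 << cell       # every sample passes the alpha test
        elif all(a < threshold for a in alphas):
            transparent_mask |= 1 << cell  # every sample fails the alpha test
    return opaque_mask, transparent_mask

def classify_hit(opaque_mask, transparent_mask, cell, alpha_test):
    """Return True if the ray hit counts, calling alpha_test only when needed."""
    bit = 1 << cell
    if transparent_mask & bit:
        return False         # fully transparent cell: ignore the hit
    if opaque_mask & bit:
        return True          # fully opaque cell: accept the hit
    return alpha_test()      # mixed cell: fall back to the expensive test

# Three cells: fully opaque, fully transparent, mixed.
opaque, transparent = build_masks([[1.0, 0.9], [0.0, 0.1], [0.2, 0.8]])
```

Only hits in mixed cells reach the texture lookup, which is where the reported savings in transparency tests would come from.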

Practical Security of RSA Against NTC-Architecture Quantum Computing Attacks

International Journal of Theoretical Physics ◽

10.1007/s10773-021-04789-x ◽

2021 ◽

Author(s):

Kai Li ◽

Qing-yu Cai

Keyword(s):

Quantum Computing ◽

Ion Trap ◽

Quantum Algorithms ◽

Coherence Time ◽

Quantum Computers ◽

Quantum Time ◽

Speed Up ◽

Key Length ◽

Concurrent Architecture ◽

Qubit Gate

Quantum algorithms can greatly speed up computation in solving some classical problems, while the computational power of quantum computers should also be restricted by the laws of physics. Due to the quantum time-energy uncertainty relation, there is a lower limit on the evolution time for a given quantum operation, and therefore the time complexity must be considered when the number of serial quantum operations is particularly large. When the key length is at about the level of KB (encryption and decryption can be completed in a few minutes by using standard programs), it will take at least 50-100 years for NTC (Neighbor-only, Two-qubit gate, Concurrent) architecture ion-trap quantum computers to execute Shor’s algorithm. For NTC architecture superconducting quantum computers with a code distance of 27 for error correction, when the key length is increased to 16 KB, the cracking time will also increase to 100 years, which far exceeds the coherence time. This shows the robustness of the updated RSA against practical quantum computing attacks.

Bloom Filter-Based Parallel Architecture for Accelerating Equi-Join Operation on FPGA

Electronics ◽

10.3390/electronics10151778 ◽

2021 ◽

Vol 10 (15) ◽

pp. 1778

Author(s):

Binhao He ◽

Meiting Xue ◽

Shubiao Liu ◽

Wei Luo

Keyword(s):

Relational Databases ◽

Parallel Architecture ◽

Operation Time ◽

Bloom Filter ◽

Search Tree ◽

Binary Search Tree ◽

Maximum Acceleration ◽

Data Intensive ◽

Match Rate ◽

Field Programmable

As one of the most important operations in relational databases, the join is data-intensive and time-consuming. Thus, offloading this operation using field-programmable gate arrays (FPGAs) has attracted much interest and has been broadly researched in recent years. However, the available SRAM-based join architectures are often resource-intensive, power-consuming, or low-throughput. Besides, a lower match rate does not lead to a shorter operation time. To address these issues, a Bloom filter (BF)-based parallel join architecture is presented in this paper. This architecture first leverages the BF to discard the tuples that are not in the join result and classifies the remaining tuples into different channels. Second, a binary search tree is used to reduce the number of comparisons. The proposed method was implemented on a Xilinx FPGA, and the experimental results show that under a match rate of 50%, our architecture achieved a high join throughput of 145.8 million tuples per second and a maximum acceleration factor of 2.3 compared to the existing SRAM-based join architectures.
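The two stages described above, Bloom-filter pre-filtering followed by a search-based match, can be sketched in software. This is an illustration under stated assumptions: the filter size, the SHA-256-derived hash positions, and the use of binary search over a sorted key list (standing in for the paper's binary search tree) are choices made for the example, and the multi-channel FPGA parallelism is not modeled.

```python
import hashlib
from bisect import bisect_left

class BloomFilter:
    """Small Bloom filter backed by a Python int used as a bit array."""

    def __init__(self, n_bits=1024, n_hashes=3):
        self.n_bits = n_bits
        self.n_hashes = n_hashes
        self.bits = 0

    def _positions(self, key):
        # Derive n_hashes bit positions by salting the key with a seed.
        for seed in range(self.n_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.n_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits |= 1 << pos

    def might_contain(self, key):
        # False means definitely absent; True may be a false positive.
        return all((self.bits >> pos) & 1 for pos in self._positions(key))

def equi_join(build, probe):
    """Join two lists of (key, payload) tuples on equal keys."""
    bf = BloomFilter()
    for key, _ in build:
        bf.add(key)
    build_sorted = sorted(build, key=lambda t: t[0])
    keys = [k for k, _ in build_sorted]
    result = []
    for key, payload in probe:
        if not bf.might_contain(key):
            continue  # discarded early: cannot be in the join result
        i = bisect_left(keys, key)  # binary search among surviving tuples
        while i < len(keys) and keys[i] == key:
            result.append((key, build_sorted[i][1], payload))
            i += 1
    return result

joined = equi_join([(1, "a"), (2, "b"), (2, "c")], [(2, "x"), (3, "y"), (1, "z")])
```

Because the filter never reports a false negative, the exact search only has to clean up occasional false positives, so a lower match rate means fewer tuples reach the comparison stage.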