Joint Program and Layout Transformations to Enable Convolutional Operators on Specialized Hardware Based on Constraint Programming

2022 ◽  
Vol 19 (1) ◽  
pp. 1-26
Author(s):  
Dennis Rieber ◽  
Axel Acosta ◽  
Holger Fröning

The success of Deep Artificial Neural Networks (DNNs) in many domains created a rich body of research concerned with hardware accelerators for compute-intensive DNN operators. However, implementing such operators efficiently with complex hardware intrinsics such as matrix multiply is a task not yet automated gracefully. Solving this task often requires joint program and data layout transformations. First solutions to this problem have been proposed, such as TVM, UNIT, or ISAMIR, which work on a loop-level representation of operators and specify data layout and possible program transformations before the embedding into the operator is performed. This top-down approach creates a tension between exploration range and search space complexity, especially when also exploring data layout transformations such as im2col, channel packing, or padding. In this work, we propose a new approach to this problem. We created a bottom-up method that allows the joint transformation of both computation and data layout based on the found embedding. By formulating the embedding as a constraint satisfaction problem over the scalar dataflow, every possible embedding solution is contained in the search space. Adding further constraints and optimization targets to the solver generates the subset of preferable solutions. An evaluation using the VTA hardware accelerator with the Baidu DeepBench inference benchmark shows that our approach can automatically generate code competitive with reference implementations. Further, we show that dynamically determining the data layout based on intrinsic and workload is beneficial for hardware utilization and performance. In cases where the reference implementation has low hardware utilization due to its fixed deployment strategy, we achieve a geomean speedup of up to 2.813×, while individual operators can improve by as much as 170×.
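The im2col transformation named in the abstract can be illustrated with a minimal sketch. It is not the authors' constraint-programming method; it only shows how a data layout change lets a convolution be computed by a matrix-multiply style operation, the kind of intrinsic the abstract mentions. The function names are hypothetical, and the sketch assumes a single channel, stride 1, and no padding.

```python
def im2col(image, kh, kw):
    """Unfold every kh x kw window of a 2D image into one row (stride 1, no padding)."""
    h, w = len(image), len(image[0])
    rows = []
    for i in range(h - kh + 1):
        for j in range(w - kw + 1):
            rows.append([image[i + di][j + dj] for di in range(kh) for dj in range(kw)])
    return rows


def conv2d_direct(image, kernel):
    """Reference: sliding-window cross-correlation, as used in DNN convolutions."""
    kh, kw = len(kernel), len(kernel[0])
    h, w = len(image), len(image[0])
    return [
        [
            sum(image[i + di][j + dj] * kernel[di][dj] for di in range(kh) for dj in range(kw))
            for j in range(w - kw + 1)
        ]
        for i in range(h - kh + 1)
    ]


def conv2d_via_im2col(image, kernel):
    """Same result, expressed as a matrix-vector product over the im2col layout."""
    kh, kw = len(kernel), len(kernel[0])
    out_w = len(image[0]) - kw + 1
    flat_kernel = [k for row in kernel for k in row]
    cols = im2col(image, kh, kw)
    flat_out = [sum(a * b for a, b in zip(row, flat_kernel)) for row in cols]
    # Fold the flat result back into the 2D output shape.
    return [flat_out[r:r + out_w] for r in range(0, len(flat_out), out_w)]
```

Both functions return the same output for any image and kernel of compatible size. The im2col variant trades extra memory (overlapping windows are duplicated) for a regular dot-product structure that a matrix-multiply unit can consume.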

Author(s):  
Naomi A. Weiss

The Music of Tragedy offers a new approach to the study of classical Greek theater by examining the use of musical language, imagery, and performance in the late work of Euripides. Drawing on the ancient conception of mousikē, in which words, song, dance, and instrumental accompaniment were closely linked, Naomi Weiss emphasizes the interplay of performance and imagination—the connection between the chorus’s own live singing and dancing in the theater and the images of music-making that frequently appear in their songs. Through detailed readings of four plays, she argues that the mousikē referred to and imagined in these plays is central to the progression of the dramatic action and to ancient audiences’ experiences of tragedy itself. She situates Euripides’s experimentation with the dramaturgical effects of mousikē within a broader cultural context, and in doing so, she shows how he both continues the practices of his tragic predecessors and also departs from them, reinventing traditional lyric styles and motifs for the tragic stage.


2015 ◽  
Vol 24 (05) ◽  
pp. 1550017 ◽  
Author(s):  
Aderemi Oluyinka Adewumi ◽  
Akugbe Martins Arasomwan

This paper presents an improved particle swarm optimization (PSO) technique for global optimization. Many variants of the technique have been proposed in the literature. However, two features characterize many of these variants, namely static search space and velocity limits, which restrict their flexibility in obtaining optimal solutions for many optimization problems. Furthermore, the problem of premature convergence persists in many variants despite the introduction of additional parameters such as inertia weight and extra computation ability. This paper proposes an improved PSO algorithm without inertia weight. The proposed algorithm dynamically adjusts the search space and velocity limits for the swarm in each iteration by picking the highest and lowest values among all the dimensions of the particles, calculating their absolute values, and then using the higher of the two values to define a new search range and velocity limits for the next iteration. The efficiency and performance of the proposed algorithm were shown using popular benchmark global optimization problems with low and high dimensions. Results obtained demonstrate better convergence speed and precision, stability, and robustness, with better global search ability, when compared with six recent variants of the original algorithm.
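The dynamic range adjustment described in this abstract can be sketched as follows. This is one reading of the abstract, not the authors' implementation. The names `adaptive_limits` and `pso_step`, the `velocity_fraction` parameter, and the acceleration coefficients are assumptions. "Without inertia weight" is interpreted here as carrying the previous velocity over unweighted, which the abstract does not confirm.

```python
import random


def adaptive_limits(positions, velocity_fraction=1.0):
    """Derive a symmetric search range and velocity limit from the current swarm.

    Takes the highest and lowest coordinate over all particles and dimensions
    and uses the larger absolute value as the new bound.
    velocity_fraction is an assumption; the abstract gives no ratio.
    """
    highest = max(max(p) for p in positions)
    lowest = min(min(p) for p in positions)
    bound = max(abs(highest), abs(lowest))
    return bound, velocity_fraction * bound


def pso_step(positions, velocities, pbest, gbest, c1=2.0, c2=2.0, rng=random):
    """One in-place swarm update using limits recomputed for this iteration."""
    bound, vmax = adaptive_limits(positions)
    for x, v, pb in zip(positions, velocities, pbest):
        for d in range(len(x)):
            # Previous velocity is kept unweighted (assumed reading of "no inertia weight").
            v[d] += c1 * rng.random() * (pb[d] - x[d]) + c2 * rng.random() * (gbest[d] - x[d])
            v[d] = max(-vmax, min(vmax, v[d]))           # clamp velocity
            x[d] = max(-bound, min(bound, x[d] + v[d]))  # clamp position
    return bound, vmax
```

Because the bound is derived from the swarm itself, the range can only stay the same or shrink from one iteration to the next in this sketch, so the search narrows as the particles converge.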


2020 ◽  
Vol 15 (4) ◽  
pp. 1-35
Author(s):  
Charilaos Skandylas ◽  
Narges Khakpour ◽  
Jesper Andersson

Modern software systems and their corresponding architectures are increasingly decentralized, distributed, and dynamic. As a consequence, decentralized mechanisms are required to ensure security in such architectures. Decentralized Information Flow Control (DIFC) is a mechanism to control information flow in distributed systems. This article presents and discusses several improvements to an adaptive decentralized information flow approach that incorporates trust for decentralized systems to provide security. Adaptive Trust-Aware Decentralized Information Flow (AT-DIFC+) combines decentralized information flow control mechanisms, trust-based methods, and decentralized control architectures to control and enforce information flow in an open, decentralized system. We strengthen our approach against newly discovered attacks and provide additional information about its reconfiguration, decentralized control architectures, and reference implementation. We evaluate the effectiveness and performance of AT-DIFC+ on two case studies and perform additional experiments to gauge the mitigations’ effectiveness against the identified attacks.


2014 ◽  
Vol 1018 ◽  
pp. 571-579
Author(s):  
Günther Schuh ◽  
Thomas Gartzen ◽  
Felix Basse

Reliable and accurate predictions on future states of production systems are the objective of production theories. In this paper, the authors determined shortcomings of current deterministic models and traced them back to the poor theoretical basis of scientific research in the area. The observations resulted in the development of the concept Scientific Management 2.0 as an appropriate research methodology for production management. This new empirical approach takes into account three requirements for scientifically precise investigations: It expands existing theory by socio-technical aspects, uses embedded experiments as a profound basis for investigation and provides a design that warrants the methodical exactness required. RWTH Aachen’s Demonstration Factory represents an adequate infrastructure to prove feasibility and performance of the new approach.


2005 ◽  
Vol 868 ◽  
Author(s):  
K. Endo ◽  
P. Badica ◽  
H. Sato ◽  
H. Akoh

Abstract High quality thin films of HTS have been grown by MOCVD on substrates with artificial steps of predefined height and width. The surface of the films grown on the steps having width equal to the ‘double of the migration length’ of the atomic species depositing on the substrate is totally free of precipitates: precipitates are gathered at the step edges where the free energy is lowest. The method has several advantages: it is simple, universal (it is independent of the materials, substrates, deposition technique or application) and allows control of precipitate segregation so that the quality and growth conditions of the films are the same as for the films grown on conventional substrates. The method is expected to result in new opportunities for device fabrication, design and performance. As an example we present successful fabrication of a mesa structure showing intrinsic Josephson effect. We have used thin films of Bi-2212/Bi-2223 superstructure grown on (001) SrTiO3 single crystal substrates with artificial steps of 20 μm.


2019 ◽  
Vol 2 ◽  
pp. 11-18
Author(s):  
Ali M. Fayad ◽  
Bassam Hussein ◽  
Mohamad Hajj-Hassan ◽  
Amin Haj-Ali

The earned value management method is commonly used for monitoring and assessing project budget and schedule performance. Although widely used, this method does not consider the time value of money. This paper suggests the inclusion of inflation as a factor affecting project performance and gives a methodology to do so using the net present value. This is done by adjusting the present value and actual cost of the work packages. The result is more realistic progress measurements and performance indicators.
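The idea of folding the time value of money into earned value indicators can be sketched as below. The abstract does not give the paper's formulas, so this combines the standard present-value discounting formula with the standard cost and schedule performance indices (CPI = EV/AC, SPI = EV/PV). The work-package schema and the choice to discount all three quantities by the package's period are assumptions made for illustration.

```python
def discount(amount, rate, periods):
    """Present value of an amount that occurs `periods` periods from now."""
    return amount / (1.0 + rate) ** periods


def evm_indices(work_packages, rate):
    """Compute CPI and SPI from discounted work-package values.

    work_packages: list of dicts with keys 'planned', 'earned', 'actual',
    and 'period' (a hypothetical schema, not taken from the paper).
    rate: per-period inflation or discount rate.
    """
    pv = sum(discount(w["planned"], rate, w["period"]) for w in work_packages)
    ev = sum(discount(w["earned"], rate, w["period"]) for w in work_packages)
    ac = sum(discount(w["actual"], rate, w["period"]) for w in work_packages)
    return {"CPI": ev / ac, "SPI": ev / pv}


packages = [
    {"planned": 100.0, "earned": 100.0, "actual": 90.0, "period": 1},
    {"planned": 200.0, "earned": 150.0, "actual": 220.0, "period": 4},
]
plain = evm_indices(packages, rate=0.0)      # classic EVM, no inflation
adjusted = evm_indices(packages, rate=0.05)  # 5% inflation per period
```

With a rate of zero the function reduces to the classic indices. With a positive rate, later work packages weigh less in the totals, so an overrun late in the schedule affects the aggregate indices less than the same overrun early on.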


Author(s):  
Robert Chee Choong Gan ◽  
Christina May May Chin

Due to alarmingly high failure rates attributed to either a lack of project implementation or, if implemented, poor results in organizations, many PM consulting organizations have begun developing their own PM maturity models (PM3) to assess organization maturity level, to identify their clients' PM maturity gap, and to provide a pathway by which their clients could move up the maturity scale and performance. Despite the many claims of PM3 assessment capabilities, the lack of success in market adoption of PM3 models suggests the need for more studies to identify if these are due to the many definitions of project success, the lack of consensus on what the components of PM3 should be, or the increasing expectations of the PM community. Thus, this chapter aims to identify the reasons behind differing organizations' views on the dimension of project success, components of PM3's direct impact on organizational performance, and how PM maturity can be measured and correlated to the various levels of organizational success with a new approach known as DPM3.


Crime Science ◽  
2019 ◽  
Vol 8 (1) ◽  
Author(s):  
Tarah Hodgkinson ◽  
Tullio Caputo ◽  
Michael L. McIntyre

Abstract In this conceptual piece, we argue that the current approach to police performance measurement typically based on the use of traditional police metrics has failed to achieve the desired results and that a different strategy is required. Traditional police metrics have a narrow focus on crime and the police response to it. They provide little information on how well police organizations are performing. Importantly, traditional police metrics do not incorporate input from police stakeholders in goal identification, nor do they use specifically designed indicators to assess progress towards achieving these goals. Following an analysis of the criticisms levelled at the use of traditional police metrics, and subsequent attempts to address these issues, we argue that a networked governance approach represents a more promising foundation for undertaking police organizational performance assessment. Such an approach would engage stakeholders more directly in goal identification and performance assessment, and potentially lead to more successful, responsive and accountable policing.


Complexity ◽  
2018 ◽  
Vol 2018 ◽  
pp. 1-16 ◽  
Author(s):  
Amer Awad Alzaidi ◽  
Musheer Ahmad ◽  
Hussam S. Ahmed ◽  
Eesa Al Solami

This paper proposes a novel method of constructing strong substitution-boxes (S-boxes) of order n (4 ≤ n ≤ 8) based on a recent optimization algorithm known as the sine-cosine algorithm (SCA). The paper also proposes a new 1D chaotic map, which has enhanced dynamics compared to conventional chaotic maps, for generating the initial population of S-boxes and facilitating the optimization mechanism of SCA. The proposed method applies the SCA with the enhanced chaotic map to explore and exploit the search space for obtaining optimized S-boxes on the basis of maximization of nonlinearity as the fitness function. The S-box construction involves three phases: initialization of population, optimization, and adjustment. The simulation and performance analyses are done using standard measures of nonlinearity, strict avalanche criterion, bits independence criterion, differential uniformity, linear approximation probability, and autocorrelation function. The obtained experimental results are compared with some recent optimization-based and other S-boxes to show the strength of the proposed method for constructing bijective S-boxes with salient cryptographic features.
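The nonlinearity fitness function mentioned in this abstract can be sketched with the standard Walsh-spectrum definition: for an n-bit S-box, nonlinearity is 2^(n-1) minus half of the largest absolute Walsh coefficient over all nonzero output masks. The code below is a brute-force illustration with hypothetical function names. It does not include the paper's chaotic map or its SCA optimizer, neither of which is specified in the abstract.

```python
def parity(x):
    """Parity (XOR of all bits) of a non-negative integer."""
    return bin(x).count("1") & 1


def is_bijective(sbox):
    """True if the S-box is a permutation of 0 .. len(sbox) - 1."""
    return sorted(sbox) == list(range(len(sbox)))


def nonlinearity(sbox):
    """Nonlinearity of an S-box given as a list of 2**n integer outputs.

    Brute force over all input masks a and all nonzero output masks b;
    fine for n = 4, slow in pure Python for n = 8.
    """
    size = len(sbox)
    max_abs = 0
    for b in range(1, size):      # nonzero output masks
        for a in range(size):     # all input masks
            # Walsh coefficient: agreements minus disagreements between
            # the linear function a.x and the component function b.S(x).
            w = sum(1 if parity(a & x) == parity(b & sbox[x]) else -1
                    for x in range(size))
            max_abs = max(max_abs, abs(w))
    return (size - max_abs) // 2


identity = list(range(16))
linear_score = nonlinearity(identity)  # a linear map has no nonlinearity
```

An optimizer such as the one the abstract describes would call a function like `nonlinearity` on each candidate S-box and keep those with the highest score, rejecting candidates that fail `is_bijective`.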

