Building Domain Specific Dictionaries of Verb-Object Relation from Source Code

Polyhedral+Dataflow Graphs

10.18122/td/1654/boisestate ◽

2020 ◽

Author(s):

Eddie C. Davis

Keyword(s):

Differential Equation ◽

Sparse Matrix ◽

Source Code ◽

Tensor Decomposition ◽

Domain Specific Language ◽

Intermediate Representation ◽

Specific Language ◽

Domain Specific ◽

Temporary Storage ◽

Matrix Operations

This research presents an intermediate compiler representation that is designed for optimization, and emphasizes the temporary storage requirements and execution schedule of a given computation to guide optimization decisions. The representation is expressed as a dataflow graph that describes computational statements and data mappings within the polyhedral compilation model. The targeted applications include both the regular and irregular scientific domains. The intermediate representation can be integrated into existing compiler infrastructures. A specification language implemented as a domain specific language in C++ describes the graph components and the transformations that can be applied. The visual representation allows users to reason about optimizations. Graph variants can be translated into source code or other representation. The language, intermediate representation, and associated transformations have been applied to improve the performance of differential equation solvers, or sparse matrix operations, tensor decomposition, and structured multigrid methods.

Download Full-text

Development in Vulkan: a domain-specific approach

Proceedings of the Institute for System Programming of RAS ◽

10.15514/ispras-2021-33(5)-11 ◽

2021 ◽

Vol 33 (5) ◽

pp. 181-204

Author(s):

Vladimir Frolov ◽

Vadim Sanzharov ◽

Vladimir Galaktionov ◽

Alexander Shcherbakov

Keyword(s):

Image Processing ◽

Ray Tracing ◽

Code Generation ◽

Source Code ◽

General Purpose ◽

Software Systems ◽

Specific Class ◽

Domain Specific ◽

Architectural Patterns ◽

C Program

In this paper we propose a high-level approach to developing GPU applications based on the Vulkan API. The purpose of the work is to reduce the complexity of developing and debugging applications that implement complex algorithms on the GPU using Vulkan. The proposed approach uses the technology of code generation by translating a C++ program into an optimized implementation in Vulkan, which includes automatic shader generation, resource binding, and the use of synchronization mechanisms (Vulkan barriers). The proposed solution is not a general-purpose programming technology, but specializes in specific tasks. At the same time, it has extensibility, which allows to adapt the solution to new problems. For single input C++ program, we can generate several implementations for different cases (via translator options) or different hardware. For example, a call to virtual functions can be implemented either through a switch construct in a kernel, or through sorting threads and an indirect dispatching via different kernels, or through the so-called callable shaders in Vulkan. Instead of creating a universal programming technology for building various software systems, we offer an extensible technology that can be customized for a specific class of applications. Unlike, for example, Halide, we do not use a domain-specific language, and the necessary knowledge is extracted from ordinary C++ code. Therefore, we do not extend with any new language constructs or directives and the input source code is assumed to be normal C++ source code (albeit with some restrictions) that can be compiled by any C++ compiler. We use pattern matching to find specific patterns (or patterns) in C++ code and convert them to GPU efficient code using Vulkan. Pattern are expressed through classes, member functions, and the relationship between them. Thus, the proposed technology makes it possible to ensure a cross-platform solution by generating different implementations of the same algorithm for different GPUs. At the same time, due to this, it allows you to provide access to specific hardware functionality required in computer graphics applications. Patterns are divided into architectural and algorithmic. The architectural pattern defines the domain and behavior of the translator as a whole (for example, image processing, ray tracing, neural networks, computational fluid dynamics and etc.). Algorithmic pattern express knowledge of data flow and control and define a narrower class of algorithms that can be efficiently implemented in hardware. Algorithmic patterns can occur within architectural patterns. For example, parallel reduction, compaction (parallel append), sorting, prefix sum, histogram calculation, map-reduce, etc. The proposed generator works on the principle of code morphing. The essence of this approach is that, having a certain class in the program and transformation rules, one can automatically generate another class with the desired properties (for example, the implementation of the algorithm on the GPU). The generated class inherits from the input class and thus has access to all data and functions of the input class. Overriding virtual functions in generated class helps user to carefully connect generated code to the other Vulkan code written by hand. Shaders can be generated in two variants: OpenCL shaders for google “clspv” compiler and GLSL shaders for an arbitrary GLSL compiler. Clspv variant is better for code which intensively uses pointers and the GLSL generator is better if specific HW features are used (like hardware ray tracing acceleration). We have demonstrated our technology on several examples related to image processing and ray tracing on which we get 30-100 times acceleration over multithreaded CPU implementation.

Download Full-text

Emendo – A Toolchain for Creating Gamified Learning Arrangements for Online Learning Settings

Proceedings of the 4th International Conference on Higher Education Advances (HEAd'18) ◽

10.4995/head18.2018.8045 ◽

2018 ◽

Author(s):

Alexander Bartel ◽

Georg Hagel ◽

Christian Wolff

Keyword(s):

Online Learning ◽

Management System ◽

Teaching And Learning ◽

Source Code ◽

Qualitative Evaluation ◽

Modeling Language ◽

Domain Specific ◽

Domain Specific Modeling ◽

Gamified Learning ◽

Specific Modeling

This contribution describes the Emendo toolchain which enables the effective creation and implementation of gamified learning arrangements for online learning settings based on the domain-specific modeling approach. The components of Emendo are a domain-specific modeling language, a generator which transforms models based on the language into source code as well as the embedding of the latter in a learning management system. Scenarios for the usage of Emendo for teaching and learning are presented with respect to the functionalities of the toolchain. In addition, a qualitative evaluation concerning Emendo’s goals, concept and insights on the results is given. The evaluation shows that Emendo reaches high acceptance for teaching purposes and can serve as a promising means for the digitisation of teaching and learning.

Download Full-text

A Reuse Definition, Assessment, and Analysis Framework for UML

Advances in UML and XML-Based Software Evolution ◽

10.4018/978-1-59140-621-1.ch013 ◽

2011 ◽

pp. 286-307 ◽

Cited By ~ 1

Author(s):

Donald Needham ◽

Rodrigo Caballero ◽

Steven Demurjian ◽

Felix Eickhoff ◽

Yi Zhang

Keyword(s):

Development Time ◽

Source Code ◽

Use Case ◽

Control Center ◽

Improve Design ◽

Domain Specific ◽

Software Artifacts ◽

Formal Framework ◽

Software Engineers ◽

Class Diagrams

This chapter examines a formal framework for reusability assessment of development-time components and classes via metrics, refactoring guidelines, and algorithms. It argues that software engineers seeking to improve design reusability stand to benefit from tools that precisely measure the potential and actual reuse of software artifacts to achieve domain-specific reuse for an organization’s current and future products. The authors consider the reuse definition, assessment, and analysis of a UML design prior to the existence of source code, and include dependency tracking for use case and class diagrams in support of reusability analysis and refactoring for UML. The integration of these extensions into the UML tool Together Control Center to support reusability measurement from design to development is also considered.

Download Full-text

Statistical Unigram Analysis for Source Code Repository

International Journal of Semantic Computing ◽

10.1142/s1793351x18400123 ◽

2018 ◽

Vol 12 (02) ◽

pp. 237-260

Author(s):

Weifeng Xu ◽

Dianxiang Xu ◽

Abdulrahman Alatawi ◽

Omar El Ariss ◽

Yunkai Liu

Keyword(s):

Natural Language Processing ◽

Empirical Study ◽

Natural Language ◽

Programming Languages ◽

Language Processing ◽

Probabilistic Model ◽

Source Code ◽

Code Analysis ◽

Domain Specific ◽

Language Corpus

Unigram is a fundamental element of [Formula: see text]-gram in natural language processing. However, unigrams collected from a natural language corpus are unsuitable for solving problems in the domain of computer programming languages. In this paper, we analyze the properties of unigrams collected from an ultra-large source code repository. Specifically, we have collected 1.01 billion unigrams from 0.7 million open source projects hosted at GitHub.com. By analyzing these unigrams, we have discovered statistical properties regarding (1) how developers name variables, methods, and classes, and (2) how developers choose abbreviations. We describe a probabilistic model which relies on these properties for solving a well-known problem in source code analysis: how to expand a given abbreviation to its original indented word. Our empirical study shows that using the unigrams extracted from source code repository outperforms the using of the natural language corpus by 21% when solving the domain specific problems.

Download Full-text

DAEH: A Tool for Specifying and Monitoring the Exception Handling Policy

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194015400306 ◽

2015 ◽

Vol 25 (09n10) ◽

pp. 1515-1530

Author(s):

Joilson Abrantes ◽

Roberta Coelho ◽

Rodrigo Bonifácio

Keyword(s):

Large Scale ◽

Source Code ◽

Exception Handling ◽

Runtime Monitoring ◽

Domain Specific Language ◽

Monitoring Tool ◽

Web Based ◽

Domain Specific ◽

Lack Of Information ◽

Contract Language

The exception handling policy of a system comprises the set of design rules that specify its exception handling behavior (how exceptions should be handled and thrown). Such policy is usually undocumented and implicitly defined by the system architect. For this reason, developers often consider that by just including catch-blocks in the code they are dealing with exceptional conditions. This lack of information may turn the exception handling into a generalized “goto” mechanism making the program more complex and less reliable. This work presents a domain-specific language called ECL (Exception Contract Language) to specify the exception handling policy and a runtime monitoring tool which dynamically checks this policy. The monitoring tool is implemented in the form of an aspect library, which can be added to any Java system without the need to change the application source code. We applied this approach to two large-scale web-based systems and to a set of versions of the well-known JUnit framework. The results indicate that this approach can be used to express and to automatically check the exception handling policy of a system, and consequently support the development of more robust Java systems.

Download Full-text

A domain-specific language for defining static structure of database applications

Computer Science and Information Systems ◽

10.2298/csis090203002d ◽

2010 ◽

Vol 7 (3) ◽

pp. 409-440 ◽

Cited By ~ 8

Author(s):

Igor Dejanovic ◽

Gordana Milosavljevic ◽

Branko Perisic ◽

Maja Tumbas

Keyword(s):

Source Code ◽

Domain Specific Language ◽

Static Structure ◽

Code Generator ◽

Specific Language ◽

Meta Model ◽

Model Driven ◽

Domain Specific ◽

Definition Of ◽

Execution Semantics

In this paper we present DOMMLite - an extensible domain specific language (DSL) for static structure definition of data base oriented applications. The model-driven engineering (MDE) approach, an emerging software development paradigm, has been used. The language structure is defined by the means of a meta model supplemented by validation rules based on Check language and extensions based on Extend language, which are parts of the openArchitectureWare framework [1]. The meta model has been defined along with the textual syntax, which enables creation, update and persistence of DOMMLite models using a common text editor. DSL execution semantics has been defined by the specification and implementation of the source code generator for a target platform with an already defined execution semantics. In order to enable model editing, a textual Eclipse editor has also been developed. DSL, defined in this way, has the capability of generating complete source code for GUI forms with CRUDS (Create-Read-Update-Delete-Search) and navigation operations [2,3,4,5].

Download Full-text

Expert programmers have fine-tuned cortical representations of source code

10.1101/2020.01.28.923953 ◽

2020 ◽

Cited By ~ 2

Author(s):

Yoshiharu Ikutani ◽

Takatomi Kubo ◽

Satoshi Nishida ◽

Hideaki Hata ◽

Kenichi Matsumoto ◽

...

Keyword(s):

Brain Activity ◽

Source Code ◽

Knowledge Structure ◽

Brain Regions ◽

Data Driven ◽

Behavioral Performance ◽

Functional Categories ◽

Domain Specific ◽

Cortical Representations ◽

The Brain

ABSTRACTExpertise enables humans to achieve outstanding performance on domain-specific tasks, and programming is no exception. Many have shown that expert programmers exhibit remarkable differences from novices in behavioral performance, knowledge structure, and selective attention. However, the underlying differences in the brain are still unclear. We here address this issue by associating the cortical representation of source code with individual programming expertise using a data-driven decoding approach. This approach enabled us to identify seven brain regions, widely distributed in the frontal, parietal, and temporal cortices, that have a tight relationship with programming expertise. In these brain regions, functional categories of source code could be decoded from brain activity and the decoding accuracies were significantly correlated with individual behavioral performances on source-code categorization. Our results suggest that programming expertise is built up on fine-tuned cortical representations specialized for the domain of programming.

Download Full-text