The analysis of a symbolic framework for static analysis of imperative programming languages

The use of pointers and indirect memory accesses in the program, as well as the complex control flow are some of the main weaknesses of the static analysis of programs. The program properties investigated by this analysis are too conservative to accurately describe program behavior and hence they prevent parallel execution of the program. The application of dynamic analysis allows us to expand the capabilities of semi-automatic parallelization. In the SAPFOR system (System FOR Automated Parallelization), a dynamic analysis tool has been implemented, based on on the instrumentation of the LLVM representation of an analyzed program, which allows the system to explore programs in both C and Fortran programming languages. The capabilities of the static analysis implemented in SAPFOR are used to reduce the overhead program execution, while maintaining the completeness of the analysis. The use of static analysis allows to reduce the number of analyzed memory accesses and to ignore scalar variables, which can be explored in a static way. The developed tool was tested on performance tests from the NAS Parallel Benchmarks package for C and Fortran languages. The implementation of dynamic analysis, in addition to traditional types of data dependencies (flow, anit, output), allows us to determine privitizable variables and a possibility of pipeline execution of loops. Together with the capabilities of DVM and OpenMP these greatly facilitates program parallelization and simplify insertion of the appropriate compiler directives.

Download Full-text

Detecting Uninitialized Variables in C++ with the Clang Static Analyzer

Acta Cybernetica ◽

10.14232/actacyb.282900 ◽

2020 ◽

Author(s):

Kristóf Umann ◽

Zoltán Porkoláb

Keyword(s):

Software Engineering ◽

Open Source ◽

Programming Languages ◽

Static Analysis ◽

Common Source ◽

Initial Value ◽

Analysis Techniques ◽

Specific Variable ◽

Prototype Tool ◽

Run Time

Uninitialized variables have been a source of errors since the beginning of software engineering. Some programming languages (e.g. Java and Python) will automatically zero-initialize such variables, but others, like C and C++, leave their state undefined. While laying aside initialization in C and C++ might be a performance advantage if an initial value can't be supplied, working with such variables is an undefined behavior, and is a common source of instabilities and crashes. To avoid such errors, whenever meaningful initialization is possible, it should be used. Tools for detecting these errors run time have existed for decades, but those require the problematic code to be executed. Since in many cases the number of possible execution paths are combinatoric, static analysis techniques emerged as an alternative. In this paper, we overview the technique for detecting uninitialized C++ variables using the Clang Static Analyzer, and describe various heuristics to guess whether a specific variable was left in an undefined state intentionally. We implemented a prototype tool based on our idea and successfully tested it on large open source projects.

Download Full-text

Static analysis of communication for asynchronous concurrent programming languages

Static Analysis - Lecture Notes in Computer Science ◽

10.1007/3-540-60360-3_42 ◽

1995 ◽

pp. 225-242 ◽

Cited By ~ 16

Author(s):

Naoki Kobayashi ◽

Motoki Nakade ◽

Akinori Yonezawa

Keyword(s):

Programming Languages ◽

Static Analysis ◽

Concurrent Programming ◽

Concurrent Programming Languages

Download Full-text

Static Analysis Usage for Customizable Semantic Checks of C and C++ Programming Languages Constraints

2014 IEEE Seventh International Conference on Software Testing, Verification and Validation Workshops ◽

10.1109/icstw.2014.59 ◽

2014 ◽

Cited By ~ 1

Author(s):

Valery Ignatyev

Keyword(s):

Programming Languages ◽

Static Analysis ◽

C Programming

Download Full-text

Static Validation of XSL Transformations

BRICS Report Series ◽

10.7146/brics.v12i32.21899 ◽

2005 ◽

Vol 12 (32) ◽

Cited By ~ 3

Author(s):

Anders Møller ◽

Mads Østerby Olesen ◽

Michael I. Schwartzbach

Keyword(s):

Web Services ◽

Programming Languages ◽

Large Class ◽

Static Analysis ◽

Programming Language ◽

Pragmatic Approach ◽

Validation Technique ◽

Summary Graph

XSL Transformations (XSLT) is a programming language for defining transformations between XML languages. The structure of these languages is formally described by schemas, for example using DTD, which allows individual documents to be validated. However, existing XSLT tools offer no static guarantees that, under the assumption that the input is valid relative to the input schema, the output of the transformation is valid relative to the output schema.<br /> <br />We present a validation technique for XSLT based on the summary graph formalism introduced in the static analysis of JWIG Web services. Being able to provide static guarantees, we can detect a large class of errors in an XSLT stylesheet at the time it is written instead of later when it has been deployed, and thereby provide benefits similar to those of static type checkers for modern programming languages.<br /> <br />Our analysis takes a pragmatic approach that focuses its precision on the essential language features but still handles the entire XSLT 1.0 language. We evaluate the analysis precision on a range of real stylesheets and demonstrate how it may be useful in practice.

Download Full-text

Systematic abstraction of abstract machines

Journal of Functional Programming ◽

10.1017/s0956796812000238 ◽

2012 ◽

Vol 22 (4-5) ◽

pp. 705-746 ◽

Cited By ~ 12

Author(s):

DAVID VAN HORN ◽

MATTHEW MIGHT

Keyword(s):

Programming Languages ◽

Static Analysis ◽

Abstract Interpretation ◽

Garbage Collection ◽

Higher Order ◽

Return Flow ◽

Initial Development ◽

Abstract Machines ◽

Temporal Ordering

AbstractWe describe a derivational approach to abstract interpretation that yields novel and transparently sound static analyses when applied to well-established abstract machines for higher-order and imperative programming languages. To demonstrate the technique and support our claim, we transform the CEK machine of Felleisen and Friedman (Proc. of the 14th ACM SIGACT-SIGPLAN Symp. Prin. Program. Langs, 1987, pp. 314–325), a lazy variant of Krivine's machine (Higher-Order Symb. Comput. Vol 20, 2007, pp. 199–207), and the stack-inspecting CM machine of Clements and Felleisen (ACM Trans. Program. Lang. Syst. Vol 26, 2004, pp. 1029–1052) into abstract interpretations of themselves. The resulting analyses bound temporal ordering of program events; predict return-flow and stack-inspection behavior; and approximate the flow and evaluation of by-need parameters. For all of these machines, we find that a series of well-known concrete machine refactorings, plus a technique of store-allocated continuations, leads to machines that abstract into static analyses simply by bounding their stores. These machines are parameterized by allocation functions that tune performance and precision and substantially expand the space of analyses that this framework can represent. We demonstrate that the technique scales up uniformly to allow static analysis of realistic language features, including tail calls, conditionals, mutation, exceptions, first-class continuations, and even garbage collection. In order to close the gap between formalism and implementation, we provide translations of the mathematics as running Haskell code for the initial development of our method.

Download Full-text

Creation of a static analysis algorithm using ad hoc programming languages

2016 19th Conference of Open Innovations Association (FRUCT) ◽

10.23919/fruct.2016.7892185 ◽

2016 ◽

Author(s):

Dmitry Khalansky ◽

Arthur Lazdin ◽

Dmitry Mouromtsev

Keyword(s):

Programming Languages ◽

Static Analysis ◽

Ad Hoc ◽

Analysis Algorithm

Download Full-text

Statically bounded-memory delayed sampling for probabilistic streams

Proceedings of the ACM on Programming Languages ◽

10.1145/3485492 ◽

2021 ◽

Vol 5 (OOPSLA) ◽

pp. 1-28

Author(s):

Eric Atkinson ◽

Guillaume Baudart ◽

Louis Mandel ◽

Charles Yuan ◽

Michael Carbin

Keyword(s):

Bayesian Inference ◽

Programming Languages ◽

Static Analysis ◽

Programming Language ◽

Inference Algorithm ◽

Prior Work ◽

Probabilistic Programming ◽

The Core ◽

Bounded Memory ◽

Automated Inference

Probabilistic programming languages aid developers performing Bayesian inference. These languages provide programming constructs and tools for probabilistic modeling and automated inference. Prior work introduced a probabilistic programming language, ProbZelus, to extend probabilistic programming functionality to unbounded streams of data. This work demonstrated that the delayed sampling inference algorithm could be extended to work in a streaming context. ProbZelus showed that while delayed sampling could be effectively deployed on some programs, depending on the probabilistic model under consideration, delayed sampling is not guaranteed to use a bounded amount of memory over the course of the execution of the program. In this paper, we the present conditions on a probabilistic program’s execution under which delayed sampling will execute in bounded memory. The two conditions are dataflow properties of the core operations of delayed sampling: the m -consumed property and the unseparated paths property . A program executes in bounded memory under delayed sampling if, and only if, it satisfies the m -consumed and unseparated paths properties. We propose a static analysis that abstracts over these properties to soundly ensure that any program that passes the analysis satisfies these properties, and thus executes in bounded memory under delayed sampling.

Download Full-text

Error location in Python: where the mutants hide

10.7287/peerj.preprints.1132v1 ◽

2015 ◽

Author(s):

Joshua C Campbell ◽

Abram Hindle ◽

José N Amaral

Keyword(s):

Programming Languages ◽

Static Analysis ◽

Language Model ◽

Unique Challenge ◽

Syntax Analysis ◽

Error Location ◽

Dynamic Languages ◽

N Gram ◽

Dynamic Scripting ◽

Python Programming

Dynamic scripting programming languages present a unique challenge to software engineering tools that depend on static analysis. Dynamic languages do not benefit from the full lexical and syntax analysis provided by compilers and static analysis tools. Prior work exploited a statically typed language (Java) and a simple \(n\)-gram language model to find syntax-error locations in programs. This work investigates whether \(n\)-gram-based error location on source code written in a dynamic language is effective without static analysis or compilation. UnnaturalCode.py is a syntax-error locator developed for the Python programming language. The UnnaturalCode.py approach is effective on Python code, but faces significantly more challenges than its Java counterpart did. UnnaturalCode.py generalizes the success of previous statically-typed approaches to a dynamically-typed language.

Download Full-text

A symbolic analysis framework for static analysis of imperative programming languages

Journal of Systems and Software ◽

10.1016/j.jss.2011.11.1039 ◽

2012 ◽

Vol 85 (6) ◽

pp. 1418-1439 ◽

Cited By ~ 5

Author(s):

Bernd Burgstaller ◽

Bernhard Scholz ◽

Johann Blieberger

Keyword(s):

Programming Languages ◽

Static Analysis ◽

Symbolic Analysis ◽

Analysis Framework

Download Full-text