code analysis Latest Research Papers

IoT firmware oftentimes incorporates third-party components, such as network-oriented middleware and media encoders/decoders. These components consist of large and mature codebases, shipping with a variety of non-critical features. Feature bloat increases code size, complicates auditing/debugging, and reduces stability. This is problematic for IoT devices, which are severely resource-constrained and must remain operational in the field for years. Unfortunately, identification and complete removal of code related to unwanted features requires familiarity with codebases of interest, cumbersome manual effort, and may introduce bugs. We address these difficulties by introducing PRAT, a system that takes as input the codebase of software of interest, identifies and maps features to code, presents this information to a human analyst, and removes all code belonging to unwanted features. PRAT solves the challenge of identifying feature-related code through a novel form of differential dynamic analysis and visualizes results as user-friendly feature graphs . Evaluation on diverse codebases shows superior code removal compared to both manual feature deactivation and state-of-art debloating tools, and generality across programming languages. Furthermore, a user study comparing PRAT to manual code analysis shows that it can significantly simplify the feature identification workflow.

Download Full-text

Testing Part 1: Static Code Analysis

10.1201/9781003265566-8 ◽

2021 ◽

pp. 97-114

Author(s):

Mark S. Merkow

Keyword(s):

Code Analysis ◽

Static Code Analysis

Download Full-text

Provincial Town Students Speech Code Analysis (Exemplified in Komsomolsk-on-Amure)

10.1007/978-3-030-77000-6_55 ◽

2021 ◽

pp. 461-467

Author(s):

Yuliya V. Markova

Keyword(s):

Speech Code ◽

Code Analysis

Download Full-text

Adversarial EXEmples

ACM Transactions on Privacy and Security ◽

10.1145/3473039 ◽

2021 ◽

Vol 24 (4) ◽

pp. 1-31

Author(s):

Luca Demetrio ◽

Scott E. Coull ◽

Battista Biggio ◽

Giovanni Lagorio ◽

Alessandro Armando ◽

...

Keyword(s):

Machine Learning ◽

Domain Knowledge ◽

Black Box ◽

Mitigation Strategies ◽

File Format ◽

Subject Matter Experts ◽

Code Analysis ◽

Static Code Analysis ◽

Executable File ◽

Functional Areas

Recent work has shown that adversarial Windows malware samples—referred to as adversarial EXE mples in this article—can bypass machine learning-based detection relying on static code analysis by perturbing relatively few input bytes. To preserve malicious functionality, previous attacks either add bytes to existing non-functional areas of the file, potentially limiting their effectiveness, or require running computationally demanding validation steps to discard malware variants that do not correctly execute in sandbox environments. In this work, we overcome these limitations by developing a unifying framework that does not only encompass and generalize previous attacks against machine-learning models, but also includes three novel attacks based on practical, functionality-preserving manipulations to the Windows Portable Executable file format. These attacks, named Full DOS , Extend , and Shift , inject the adversarial payload by respectively manipulating the DOS header, extending it, and shifting the content of the first section. Our experimental results show that these attacks outperform existing ones in both white-box and black-box scenarios, achieving a better tradeoff in terms of evasion rate and size of the injected payload, while also enabling evasion of models that have been shown to be robust to previous attacks. To facilitate reproducibility of our findings, we open source our framework and all the corresponding attack implementations as part of the secml-malware Python library. We conclude this work by discussing the limitations of current machine learning-based malware detectors, along with potential mitigation strategies based on embedding domain knowledge coming from subject-matter experts directly into the learning process.

Download Full-text

Evaluation of Compilers’ Capability of Automatic Vectorization Based on Source Code Analysis

Scientific Programming ◽

10.1155/2021/3264624 ◽

2021 ◽

Vol 2021 ◽

pp. 1-15

Author(s):

Jing Ge Feng ◽

Ye Ping He ◽

Qiu Ming Tao

Keyword(s):

Compiler Optimization ◽

Source Code ◽

Academic Research ◽

Engineering Practice ◽

Source Code Analysis ◽

Code Analysis ◽

Multiple Data ◽

Program Characteristics ◽

Transformation Methods ◽

Compiler Techniques

Automatic vectorization is an important technique for compilers to improve the parallelism of programs. With the widespread usage of SIMD (Single Instruction Multiple Data) extensions in modern processors, automatic vectorization has become a hot topic in the research of compiler techniques. Accurately evaluating the effectiveness of automatic vectorization in typical compilers is quite valuable for compiler optimization and design. This paper evaluates the effectiveness of automatic vectorization, analyzes the limitation of automatic vectorization and the main causes, and improves the automatic vectorization technology. This paper firstly classifies the programs by two main factors: program characteristics and transformation methods. Then, it evaluates the effectiveness of automatic vectorization in three well-known compilers (GCC, LLVM, and ICC, including their multiple versions in recent 5 years) through TSVC (Test Suite for Vectorizing Compilers) benchmark. Furthermore, this paper analyzes the limitation of automatic vectorization based on source code analysis, and introduces the differences between academic research and engineering practice in automatic vectorization and the main causes, Finally, it gives some suggestions as to how to improve automatic vectorization capability.

Download Full-text

Formal concept analysis model for static code analysis

Carpathian Journal of Mathematics ◽

10.37193/cjm.2022.01.13 ◽

2021 ◽

Vol 38 (1) ◽

pp. 159-168

Author(s):

SIMONA MOTOGNA ◽

◽

DIANA CRISTEA ◽

DIANA ȘOTROPA MOLNAR ◽

◽

...

Keyword(s):

Error Detection ◽

Formal Concept Analysis ◽

Applied Mathematics ◽

Concept Analysis ◽

Formal Concept ◽

Analysis Model ◽

Code Analysis ◽

Static Code Analysis ◽

Early Error ◽

Qualitative Exploration

Tools that focus on static code analysis for early error detection are of utmost importance in software development, especially since the propagation of errors is strongly related to higher costs in the development process. Formal Concept Analysis is a prominent field of applied mathematics that uses conceptual landscapes to discover and represent maximal clusters of data. Its expressive visualization method makes it suitable for exploratory analyses in different fields. In this paper we present a Formal Concept Analysis framework for static code analysis that can serve as a model for quantitative and qualitative exploration and interpretation of such results.

Download Full-text

Headlines against Democracy: Operational Code Analysis of the Serbian Daily Informer’s Headlines in Relation to the Anti-Government Protests’ First Phase (2018–2019)

Journal of Media Research ◽

10.24193/jmr.41.2 ◽

2021 ◽

Vol 14 (3 (41)) ◽

pp. 23-41

Author(s):

Srđan Mladenov JOVANOVIĆ ◽

Keyword(s):

Authoritarian Regime ◽

Whole Body ◽

Daily Newspaper ◽

Scholarly Research ◽

Code Analysis ◽

Operational Code ◽

Indirect Control ◽

The Media

Since late 2019, Serbia has been gripped in a wave of protests against, as scholarly research has dubbed it, the semi-authoritarian regime of President Aleksandar Vučić. Having in mind that the President’s regime has by known been uncovered to rule by direct and indirect control of the media, the arguably main government-supporting daily newspaper, the Informer, has been covering the protests avidly, and with significant vitriol. With the understanding a headline is seen by the reader more commonly than the whole body of the article and having in mind the Informer’s pro-clivity towards exaggeration and hyperbole, we have analyzed all of the daily’s headlines that refer to the protests, protesters, or protest/opposition leaders during the so-called ‘First phase of the protests’ via the methodo-logical position of Operational Code Analysis. The paper shows a fairly extreme OPCODE for the Informer.

Download Full-text

Experience Report: Teaching Code Analysis and Verification Using Frama-C

Electronic Proceedings in Theoretical Computer Science ◽

10.4204/eptcs.349.5 ◽

2021 ◽

Vol 349 ◽

pp. 69-75

Author(s):

Salwa Souaf ◽

Frédéric Loulergue

Keyword(s):

Experience Report ◽

Code Analysis

Download Full-text

Static Code Analysis Tool for Laravel Framework Based Web Application

10.1109/icodse53690.2021.9648519 ◽

2021 ◽

Author(s):

Ranindya Paramitha ◽

Yudistira Dwi Wardhana Asnar

Keyword(s):

Web Application ◽

Analysis Tool ◽

Code Analysis ◽

Static Code Analysis

Download Full-text

Program analysis via efficient symbolic abstraction

Proceedings of the ACM on Programming Languages ◽

10.1145/3485495 ◽

2021 ◽

Vol 5 (OOPSLA) ◽

pp. 1-32

Author(s):

Peisen Yao ◽

Qingkai Shi ◽

Heqing Huang ◽

Charles Zhang

Keyword(s):

Strong Evidence ◽

Program Analysis ◽

Abstract Interpretation ◽

State Of The Art ◽

Diverse Group ◽

Machine Code ◽

Code Analysis ◽

Polyhedral Domain ◽

Performance Issues ◽

Bit Vector

This paper concerns the scalability challenges of symbolic abstraction: given a formula ϕ in a logic L and an abstract domain A , find a most precise element in the abstract domain that over-approximates the meaning of ϕ. Symbolic abstraction is an important point in the space of abstract interpretation, as it allows for automatically synthesizing the best abstract transformers. However, current techniques for symbolic abstraction can have difficulty delivering on its practical strengths, due to performance issues. In this work, we introduce two algorithms for the symbolic abstraction of quantifier-free bit-vector formulas, which apply to the bit-vector interval domain and a certain kind of polyhedral domain, respectively. We implement and evaluate the proposed techniques on two machine code analysis clients, namely static memory corruption analysis and constrained random fuzzing. Using a suite of 57,933 queries from the clients, we compare our approach against a diverse group of state-of-the-art algorithms. The experiments show that our algorithms achieve a substantial speedup over existing techniques and illustrate significant precision advantages for the clients. Our work presents strong evidence that symbolic abstraction of numeric domains can be efficient and practical for large and realistic programs.

Download Full-text

code analysis
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Guided Feature Identification and Removal for Resource-constrained Firmware

Testing Part 1: Static Code Analysis

Provincial Town Students Speech Code Analysis (Exemplified in Komsomolsk-on-Amure)

Adversarial EXEmples

Evaluation of Compilers’ Capability of Automatic Vectorization Based on Source Code Analysis

Formal concept analysis model for static code analysis

Headlines against Democracy: Operational Code Analysis of the Serbian Daily Informer’s Headlines in Relation to the Anti-Government Protests’ First Phase (2018–2019)

Experience Report: Teaching Code Analysis and Verification Using Frama-C

Static Code Analysis Tool for Laravel Framework Based Web Application

Program analysis via efficient symbolic abstraction

Export Citation Format

code analysisRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Guided Feature Identification and Removal for Resource-constrained Firmware

Testing Part 1: Static Code Analysis

Provincial Town Students Speech Code Analysis (Exemplified in Komsomolsk-on-Amure)

Adversarial EXEmples

Evaluation of Compilers’ Capability of Automatic Vectorization Based on Source Code Analysis

Formal concept analysis model for static code analysis

Headlines against Democracy: Operational Code Analysis of the Serbian Daily Informer’s Headlines in Relation to the Anti-Government Protests’ First Phase (2018–2019)

Experience Report: Teaching Code Analysis and Verification Using Frama-C

Static Code Analysis Tool for Laravel Framework Based Web Application

Program analysis via efficient symbolic abstraction

code analysis
Recently Published Documents