scholarly journals Understanding the effects of removing common blocks on Approximate Matching scores under different scenarios for digital forensic investigations

2019 ◽  
Author(s):  
Vitor Hugo Moia ◽  
Frank Breitinger ◽  
Marco Aurélio Henriques

Finding similarity in digital forensics investigations can be assisted with the use of Approximate Matching (AM) functions. These algorithms create small and compact representations of objects (similar to hashes) which can be compared to identify similarity. However, often results are biased due to common blocks (data structures found in many different files regardless of content). In this paper, we evaluate the precision and recall metrics for AM functions when removing common blocks. In detail, we analyze how the similarity score changes and impacts different investigation scenarios. Results show that many irrelevant matches can be filtered out and that a new interpretation of the score allows a better similarity detection.

2015 ◽  
Vol 57 (6) ◽  
Author(s):  
Harald Baier

AbstractHandling bulk data (e. g. some terabytes of data) is a issue in contemporary digital forensics. Separating relevant data structures from irrelevant ones resembles finding the needle in the haystack. The article at hand presents and assesses automatic hash-based techniques to preprocess the input data with the goal to facilitate the investigator's job. We discuss concepts like blacklisting and whitelisting based on cryptographic hash functions and approximate matching, respectively. In case of two established process models for a lab and an on-site investigation, respectively, we describe how to jointly use these techniques to automatically get a pointer to the needle.


2017 ◽  
pp. 571-621
Author(s):  
Thomas J. Holt ◽  
Adam M. Bossler ◽  
Kathryn C. Seigfried-Spellar

Sign in / Sign up

Export Citation Format

Share Document