Copy-Move Forgery Detection (CMFD) Using Deep Learning for Image and Video Forensics

2021
Vol 7 (3)
pp. 59
Author(s):  
Yohanna Rodriguez-Ortega ◽  
Dora M. Ballesteros ◽  
Diego Renza

With the exponential growth of high-quality fake images on social networks and media, it is necessary to develop recognition algorithms for this type of content. One of the most common types of image and video editing is the duplication of areas within an image, known as the copy-move technique. Traditional image processing approaches search manually for patterns related to the duplicated content, limiting their use in mass data classification. In contrast, approaches based on deep learning have shown better performance and promising results, but they present generalization problems, with a high dependence on training data and the need for appropriate selection of hyperparameters. To overcome this, we propose two deep learning approaches: a model with a custom architecture and a model built by transfer learning. In each case, the impact of network depth is analyzed in terms of precision (P), recall (R) and F1 score. Additionally, the problem of generalization is addressed with images from eight different open-access datasets. Finally, the models are compared in terms of evaluation metrics and of training and inference times. The model obtained by transfer learning from VGG-16 achieves metrics about 10% higher than the custom-architecture model; however, it requires approximately twice the inference time.
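The precision, recall and F1 metrics used to compare the two models are computed from confusion-matrix counts; a minimal sketch in plain Python, with hypothetical counts for a forged/original classifier:

```python
def precision_recall_f1(tp, fp, fn):
    """Compute precision, recall and F1 from confusion-matrix counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1

# Hypothetical counts: 80 forgeries caught, 20 false alarms, 10 missed.
p, r, f1 = precision_recall_f1(tp=80, fp=20, fn=10)
print(round(p, 3), round(r, 3), round(f1, 3))  # 0.8 0.889 0.842
```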

Symmetry
2021
Vol 13 (8)
pp. 1497
Author(s):  
Harold Achicanoy ◽  
Deisy Chaves ◽  
Maria Trujillo

Deep learning applications in computer vision require large volumes of representative data to obtain state-of-the-art results, owing to the massive number of parameters to optimise in deep models. However, in industrial applications data are limited, with asymmetric distributions, due to rare cases, legal restrictions, and high image-acquisition costs. Data augmentation based on deep learning generative adversarial networks, such as StyleGAN, has arisen as a way to create training data with symmetric distributions that may improve the generalisation capability of the resulting models. StyleGAN generates highly realistic images in a variety of domains as a data augmentation strategy, but requires a large amount of data to build image generators. Thus, transfer learning in conjunction with generative models is used to build models with small datasets. However, there are no reports on the impact of the pre-trained generative model used for transfer learning. In this paper, we evaluate a StyleGAN generative model with transfer learning on different source domains (training with paintings, portraits, Pokémon, bedrooms, and cats) to generate target images with different levels of content variability: bean seeds (low variability), faces of subjects between 5 and 19 years old (medium variability), and charcoal (high variability). We used the first version of StyleGAN because of the large number of publicly available pre-trained models. The Fréchet Inception Distance was used to evaluate the quality of the synthetic images. We found that StyleGAN with transfer learning produced good-quality images, making it an alternative for generating realistic synthetic images in the evaluated domains.
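The Fréchet Inception Distance compares the Gaussian statistics of Inception-network activations for real and synthetic images: FID = ||μ1 − μ2||² + Tr(Σ1 + Σ2 − 2(Σ1Σ2)^(1/2)). A minimal NumPy sketch, simplified to diagonal covariances (real implementations use full covariance matrices with a matrix square root, and actual Inception features; the arrays below are placeholder activations):

```python
import numpy as np

def fid_diagonal(act_a, act_b):
    """Frechet distance between two sets of feature activations,
    approximating the covariances as diagonal for simplicity.
    With diagonal covariances the trace term reduces to
    sum((sigma_a - sigma_b)**2)."""
    mu1, mu2 = act_a.mean(axis=0), act_b.mean(axis=0)
    s1, s2 = act_a.std(axis=0), act_b.std(axis=0)
    return float(np.sum((mu1 - mu2) ** 2) + np.sum((s1 - s2) ** 2))

rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, size=(500, 8))  # placeholder activations
fake = rng.normal(0.5, 1.0, size=(500, 8))  # distribution with shifted mean
print(fid_diagonal(real, real))      # 0.0 for identical sets
print(fid_diagonal(real, fake) > 0)  # True: lower FID means closer distributions
```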


In the recent past, deep learning models [1] have been used predominantly in object detection algorithms due to their accurate image recognition capability. These models extract features from input images and videos [2] to identify the objects present in them. Applications of these models include image processing, video analysis, speech recognition, biomedical image analysis, biometric recognition, iris recognition, national security, cyber security, natural language processing [3], weather forecasting, renewable energy generation scheduling, etc. These models utilize the Convolutional Neural Network (CNN) [3], which consists of several layers of artificial neurons. The accuracy of deep learning models [1] depends on parameters such as the learning rate, training batch size, validation batch size, activation function and drop-out rate. These parameters are known as hyper-parameters, and their selection determines the achievable object detection accuracy. Hence, finding the best values for these parameters is a challenging task. Fine-tuning is the process of selecting suitable hyper-parameter values to improve object detection accuracy. Selecting an inappropriate hyper-parameter value leads to over-fitting or under-fitting of the data. Over-fitting occurs when the model fits the training data too closely, learning its noise and producing inaccurate object detection on unseen data. Under-fitting occurs when the model is unable to capture the trend of the data, leading to more erroneous results on testing or training data. In this paper, a balance between over-fitting and under-fitting is achieved by varying the learning rate of various deep learning models. Four models, VGG16, VGG19, InceptionV3 and Xception, are considered for analysis.
The best learning-rate zone for each model, with respect to maximum object detection accuracy, is analyzed. A dataset of 70 object classes is used, and the prediction accuracy is analyzed by changing the learning rate while keeping the rest of the hyper-parameters constant. This paper concentrates on the impact of the learning rate on accuracy and identifies an optimum accuracy zone for object detection.
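The sensitivity to the learning rate can be illustrated with plain gradient descent on a simple quadratic loss (a toy sketch, not the paper's CNN setup): a rate that is too small barely moves the parameters, a moderate rate converges, and a rate that is too large makes the updates diverge.

```python
def gradient_descent(lr, steps=50, w0=10.0):
    """Minimise f(w) = w**2 (gradient 2*w) from w0 with a fixed learning rate."""
    w = w0
    for _ in range(steps):
        w -= lr * 2 * w
    return w

# Too small: barely moves. Moderate: converges. Too large: diverges.
for lr in (0.001, 0.1, 1.5):
    print(lr, gradient_descent(lr))
```

The same qualitative behaviour, a "best zone" of learning rates bounded by slow convergence on one side and instability on the other, is what the paper maps out per model.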


1988
Vol 32 (13)
pp. 760-764
Author(s):  
Robert F. Randolph

Leaders of task-oriented production groups play an important role in their group's functioning and performance. That role also evolves as groups mature and learn to work together more smoothly. The present study uses a functional analysis of the evolving role of supervisors of underground coal mining crews to evaluate the impact of supervisors' characteristics and behaviors on their crews' efficiency and safety, and makes recommendations for improving supervisory selection and training. Data were gathered from a sample of 138 supervisors at 13 underground coal mines. Detailed structured observations of the supervisors indicated that most of their time was spent attending to hardware and paperwork, while comparatively little time was spent on person to person “leadership”. The findings point out that while group needs changed over time, the supervisors' behaviors typically did not keep pace and probably restricted group performance.


2018
Vol 30 (4)
pp. 513-522
Author(s):  
Yuichi Konishi ◽  
Kosuke Shigematsu ◽  
Takashi Tsubouchi ◽  
Akihisa Ohya

The Tsukuba Challenge is an open experimental competition held annually since 2007, in which autonomous navigation robots developed by the participants must navigate through an urban setting where pedestrians and cyclists are present. One of the required tasks in the Tsukuba Challenge from 2013 to 2017 was to search for persons wearing designated clothes within the search area. This is a very difficult task, since it is necessary to seek out these persons in an environment that includes regular pedestrians and where the lighting changes readily with the weather conditions. Moreover, the recognition system must have a low computational cost because of the limited performance of the computer mounted on the robot. In this study, we focused on a deep learning method for detecting the target persons in captured images. The developed detection system was expected to achieve high detection performance, even when small-sized input images were used for deep learning. Experiments demonstrated that the proposed system achieved better performance than an existing object detection network. However, because a vast amount of training data is necessary for deep learning, a method of generating training data for the detection of target persons is also discussed in this paper.


Forecasting
2021
Vol 3 (4)
pp. 741-762
Author(s):  
Panagiotis Stalidis ◽  
Theodoros Semertzidis ◽  
Petros Daras

In this paper, a detailed study on crime classification and prediction using deep learning architectures is presented. We examine the effectiveness of deep learning algorithms in this domain and provide recommendations for designing and training deep learning systems for predicting crime areas, using open data from police reports. With time-series of crime types per location as training data, a comparative study of 10 state-of-the-art methods against 3 different deep learning configurations is conducted. In our experiments with 5 publicly available datasets, we demonstrate that the deep learning-based methods consistently outperform the existing best-performing methods. Moreover, we evaluate the effectiveness of different parameters in the deep learning architectures and give insights into configuring them to achieve improved performance in crime classification and, ultimately, crime prediction.
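Turning per-location crime counts into supervised training data is typically done with sliding windows over the time-series; a minimal sketch in plain Python, with synthetic counts (the window length and prediction horizon are illustrative choices, not the paper's configuration):

```python
def make_windows(series, window, horizon=1):
    """Turn a time-series of counts into (input window, target) pairs:
    each sample uses `window` past steps to predict `horizon` steps ahead."""
    samples = []
    for t in range(len(series) - window - horizon + 1):
        x = series[t:t + window]
        y = series[t + window + horizon - 1]
        samples.append((x, y))
    return samples

# Synthetic weekly counts of one crime type at one location:
counts = [3, 5, 4, 6, 7, 5, 8]
pairs = make_windows(counts, window=3)
print(pairs[0])    # ([3, 5, 4], 6)
print(len(pairs))  # 4
```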


2020
Vol 9 (05)
pp. 25052-25056
Author(s):  
Abhi Kadam ◽  
Anupama Mhatre ◽  
Sayali Redasani ◽  
Amit Nerurkar

Current lighting technologies extend the options for changing the appearance of rooms and closed spaces, creating ambiences with an affective meaning. Using intelligence, these ambiences may be adapted instantly to the needs of the room’s occupant(s), possibly improving their well-being. In this paper, we actuate the lighting in our surroundings using mood detection. We analyze the mood of the person by facial emotion recognition using a deep learning model, a Convolutional Neural Network (CNN). On recognizing the emotion, we actuate the surrounding lighting in accordance with the mood. Based on the implementation results, the system needs to be developed further by adding more specific data classes and training data.
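The emotion-to-lighting step can be as simple as a lookup from the classifier's output label to an RGB setting; a minimal sketch (the labels and colour values below are illustrative assumptions, not the paper's mapping):

```python
# Hypothetical mapping from a recognised emotion label to an RGB lighting value.
MOOD_TO_RGB = {
    "happy":   (255, 220, 120),  # warm bright light
    "sad":     (80, 120, 255),   # cool blue light
    "angry":   (120, 255, 150),  # calming green light
    "neutral": (255, 255, 255),  # plain white light
}

def lighting_for(emotion):
    """Return the RGB triple for a recognised emotion, defaulting to neutral."""
    return MOOD_TO_RGB.get(emotion, MOOD_TO_RGB["neutral"])

print(lighting_for("happy"))    # (255, 220, 120)
print(lighting_for("unknown"))  # (255, 255, 255) -- fallback for unseen labels
```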


Author(s):  
Christian Clausner ◽  
Apostolos Antonacopoulos ◽  
Stefan Pletschacher

Abstract We present an efficient and effective approach to train OCR engines using the Aletheia document analysis system. All components required for training are seamlessly integrated into Aletheia: training data preparation, the OCR engine’s training processes themselves, text recognition, and quantitative evaluation of the trained engine. Such a comprehensive training and evaluation system, guided through a GUI, allows for iterative incremental training to achieve best results. The widely used Tesseract OCR engine is used as a case study to demonstrate the efficiency and effectiveness of the proposed approach. Experimental results are presented validating the training approach with two different historical datasets, representative of recent significant digitisation projects. The impact of different training strategies and training data requirements is presented in detail.


2021
Vol 2021 (29)
pp. 141-147
Author(s):  
Michael J. Vrhel ◽  
H. Joel Trussell

A database of realizable filters is created and searched to obtain the best filter that, when placed in front of an existing camera, results in improved colorimetric capabilities for the system. Image data captured with the external filter are combined with image data captured without it to provide a six-band system. The colorimetric accuracy of the system is quantified using simulations that include a realistic signal-dependent noise model. Using a training data set, we selected the optimal filter based on four criteria: Vora Value, Figure of Merit, training average ΔE, and training maximum ΔE. Each selected filter was used on testing data. The filters chosen using the training ΔE criteria consistently outperformed those chosen using the theoretical criteria.
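The ΔE criteria measure colour error as a distance in CIELAB space; a minimal sketch of the classic CIE76 form, used here to score a candidate filter by its training average and maximum error (the Lab values below are placeholders, not the paper's data):

```python
import math

def delta_e_cie76(lab1, lab2):
    """CIE76 colour difference: Euclidean distance between two CIELAB triples."""
    return math.dist(lab1, lab2)

# Placeholder reference vs. reproduced Lab values for a training patch set:
references = [(50.0, 10.0, -5.0), (70.0, -20.0, 30.0)]
reproduced = [(51.0, 12.0, -5.0), (70.0, -20.0, 27.0)]

errors = [delta_e_cie76(r, m) for r, m in zip(references, reproduced)]
avg_de, max_de = sum(errors) / len(errors), max(errors)
print(round(avg_de, 3), round(max_de, 3))  # 2.618 3.0
```

Later ΔE formulas (CIE94, CIEDE2000) weight the lightness and chroma axes differently, but the training-set average/maximum selection logic is the same.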

