Exposing Manipulated Photos and Videos in Digital Forensics Analysis

Tampered multimedia content is being increasingly used in a broad range of cybercrime activities. The spread of fake news, misinformation, digital kidnapping, and ransomware-related crimes are amongst the most recurrent crimes in which manipulated digital photos and videos are the perpetrating and disseminating medium. Criminal investigation has been challenged in applying machine learning techniques to automatically distinguish between fake and genuine seized photos and videos. Despite the pertinent need for manual validation, easy-to-use platforms for digital forensics are essential to automate and facilitate the detection of tampered content and to help criminal investigators with their work. This paper presents a machine learning Support Vector Machines (SVM) based method to distinguish between genuine and fake multimedia files, namely digital photos and videos, which may indicate the presence of deepfake content. The method was implemented in Python and integrated as new modules in the widely used digital forensics application Autopsy. The implemented approach extracts a set of simple features resulting from the application of a Discrete Fourier Transform (DFT) to digital photos and video frames. The model was evaluated with a large dataset of classified multimedia files containing both legitimate and fake photos and frames extracted from videos. Regarding deepfake detection in videos, the Celeb-DFv1 dataset was used, featuring 590 original videos collected from YouTube, and covering different subjects. The results obtained with the 5-fold cross-validation outperformed those SVM-based methods documented in the literature, by achieving an average F1-score of 99.53%, 79.55%, and 89.10%, respectively for photos, videos, and a mixture of both types of content. A benchmark with state-of-the-art methods was also done, by comparing the proposed SVM method with deep learning approaches, namely Convolutional Neural Networks (CNN). 
Despite CNN having outperformed the proposed DFT-SVM compound method, the competitiveness of the results attained by DFT-SVM and the substantially reduced processing time make it appropriate to be implemented and embedded into Autopsy modules, by predicting the level of fakeness calculated for each analyzed multimedia file.
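As an illustrative sketch only (not the authors' implementation), the idea of deriving a small feature vector from a Discrete Fourier Transform can be shown with a naive 2D DFT followed by ring-wise averaging of the magnitude spectrum. The ring-wise (azimuthal) averaging, the function names `dft2` and `radial_spectrum`, and the list-of-lists image format are assumptions made for this example; the abstract does not specify which DFT-derived features were used. In practice a library FFT such as `numpy.fft.fft2` would replace the naive loop.

```python
import cmath
import math


def dft2(image):
    """Naive 2D discrete Fourier transform of a list-of-lists grayscale image."""
    rows, cols = len(image), len(image[0])
    out = [[0j] * cols for _ in range(rows)]
    for u in range(rows):
        for v in range(cols):
            total = 0j
            for x in range(rows):
                for y in range(cols):
                    angle = -2 * math.pi * (u * x / rows + v * y / cols)
                    total += image[x][y] * cmath.exp(1j * angle)
            out[u][v] = total
    return out


def radial_spectrum(image):
    """Average the DFT magnitude over rings of (rounded) equal spatial frequency.

    Hypothetical feature extractor: the result is a short 1D vector that could
    be passed to a classifier such as an SVM.
    """
    rows, cols = len(image), len(image[0])
    spectrum = dft2(image)
    sums, counts = {}, {}
    for u in range(rows):
        for v in range(cols):
            # Map indices to signed frequencies so the radius is measured from DC.
            fu = u if u <= rows // 2 else u - rows
            fv = v if v <= cols // 2 else v - cols
            r = int(round(math.hypot(fu, fv)))
            sums[r] = sums.get(r, 0.0) + abs(spectrum[u][v])
            counts[r] = counts.get(r, 0) + 1
    return [sums[r] / counts[r] for r in sorted(sums)]
```

For a constant image all energy sits in the zero-frequency ring, so every other entry of the feature vector is numerically zero; images with fine texture or synthesis artefacts would instead put energy in the higher-frequency rings.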

Machine Learning Frameworks in Cancer Detection

E3S Web of Conferences ◽

10.1051/e3sconf/202129701073 ◽

2021 ◽

Vol 297 ◽

pp. 01073

Author(s):

Sabyasachi Pramanik ◽

K. Martin Sagayam ◽

Om Prakash Jena

Keyword(s):

Machine Learning ◽

Prediction Models ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Cancer Development ◽

Support Vector ◽

Learning Approaches ◽

Learning Techniques ◽

Fact Finding ◽

Risk Of Cancer

Cancer has been described as a diverse illness with several distinct subtypes that may occur simultaneously. As a result, early detection and forecasting of cancer types have become essential in cancer research, since they may help to improve the clinical treatment of cancer patients. The importance of categorizing cancer patients into higher- or lower-risk groups has prompted numerous research teams from the bioscience and genomics fields to investigate the use of machine learning (ML) algorithms in cancer diagnosis and treatment. These methods have therefore been used to model the development and treatment of malignant diseases in humans. Furthermore, the capacity of machine learning techniques to identify important characteristics in complicated datasets demonstrates the significance of these technologies, which include Bayesian networks and artificial neural networks, along with a number of other approaches. Decision Trees and Support Vector Machines, which have already been used extensively in cancer research to create predictive models, also lead to accurate decision making. The application of machine learning techniques may enhance our knowledge of cancer development; nevertheless, a sufficient degree of validation is required before these approaches can be considered for use in daily clinical practice. This paper presents an overview of current machine learning approaches used to model cancer development. All of the prediction models described here are built with supervised machine learning approaches, using a variety of input characteristics and data samples. In light of the increasing use of machine learning methods in biomedical research, we present the most recent papers that have used these approaches to predict the risk of cancer or patient outcomes in order to better understand cancer.

A Machine Learning View on Momentum and Reversal Trading

Algorithms ◽

10.3390/a11110170 ◽

2018 ◽

Vol 11 (11) ◽

pp. 170 ◽

Cited By ~ 2

Author(s):

Zhixi Li ◽

Vincent Tam

Keyword(s):

Neural Network ◽

Machine Learning ◽

Stock Market ◽

Short Term Memory ◽

Predictive Ability ◽

Trading Strategies ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Approaches ◽

Learning Techniques

Momentum and reversal effects are important phenomena in stock markets. In academia, relevant studies have been conducted for years. Researchers have attempted to analyze these phenomena using statistical methods and to give some plausible explanations. However, those explanations are sometimes unconvincing. Furthermore, it is very difficult to transfer the findings of these studies to real-world investment trading strategies due to the lack of predictive ability. This paper represents the first attempt to adopt machine learning techniques for investigating the momentum and reversal effects occurring in any stock market. In the study, various machine learning techniques, including the Decision Tree (DT), Support Vector Machine (SVM), Multilayer Perceptron Neural Network (MLP), and Long Short-Term Memory Neural Network (LSTM) were explored and compared carefully. Several models built on these machine learning approaches were used to predict the momentum or reversal effect on the stock market of mainland China, thus allowing investors to build corresponding trading strategies. The experimental results demonstrated that these machine learning approaches, especially the SVM, are beneficial for capturing the relevant momentum and reversal effects, and possibly building profitable trading strategies. Moreover, we propose the corresponding trading strategies in terms of market states to acquire the best investment returns.

Sentiment Analysis of Lithuanian Texts Using Traditional and Deep Learning Approaches

Computers ◽

10.3390/computers8010004 ◽

2019 ◽

Vol 8 (1) ◽

pp. 4 ◽

Cited By ~ 4

Author(s):

Jurgita Kapočiūtė-Dzikienė ◽

Robertas Damaševičius ◽

Marcin Woźniak

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Sentiment Analysis ◽

Short Term Memory ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Approaches ◽

Full Dataset ◽

Learning Techniques ◽

Long Short Term Memory

We describe the sentiment analysis experiments that were performed on the Lithuanian Internet comment dataset using traditional machine learning (Naïve Bayes Multinomial, NBM, and Support Vector Machine, SVM) and deep learning (Long Short-Term Memory, LSTM, and Convolutional Neural Network, CNN) approaches. The traditional machine learning techniques were used with features based on lexical, morphological, and character information. The deep learning approaches were applied on top of two types of word embeddings (Word2Vec continuous bag-of-words with negative sampling, and FastText). Both traditional and deep learning approaches had to solve the positive/negative/neutral sentiment classification task on the balanced and full dataset versions. The best deep learning results (an accuracy of 0.706) were achieved on the full dataset with CNN applied on top of the FastText embeddings, replaced emoticons, and eliminated diacritics. The traditional machine learning approaches demonstrated the best performance (an accuracy of 0.735) on the full dataset with the NBM method, replaced emoticons, restored diacritics, and lemma unigrams as features. Although traditional machine learning approaches were superior to the deep learning methods, deep learning demonstrated good results when applied to the small datasets.

Classification of volcanic and tectonic earthquakes in Kamchatka (Russia) with different machine learning techniques

10.5194/egusphere-egu2020-755 ◽

2020 ◽

Author(s):

Natalia Galina ◽

Nikolai Shapiro ◽

Leonard Seydoux ◽

Dmitry Droznin

Keyword(s):

Machine Learning ◽

Subduction Zone ◽

Aftershock Sequence ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Approaches ◽

Shiveluch Volcano ◽

Agglomerative Clustering ◽

Learning Techniques ◽

Single Station

Kamchatka is an active subduction zone that exhibits intense seismic and volcanic activity. As a consequence, tectonic and volcanic earthquakes are often recorded nearly simultaneously at the same station. In this work, we consider seismograms recorded between December 2018 and April 2019. During this period, the M=7.3 earthquake, followed by an aftershock sequence, occurred nearly simultaneously with a strong eruption of Shiveluch volcano. As a result, stations of the Kamchatka seismic monitoring network recorded up to several hundred earthquakes per day. In total, we detected almost 7000 events of different origin using a simple automatic detection algorithm based on signal envelope amplitudes. Then, for each detection, different features were extracted. We started from simple signal parameters (amplitude, duration, peak frequency, etc.), then used unsmoothed and smoothed spectra, and finally used a multi-dimensional signal decomposition (scattering coefficients). For event classification, both unsupervised (K-means, agglomerative clustering) and supervised (Support Vector Classification, Random Forest) classic machine learning techniques were applied to all types of extracted features. The results obtained are quite stable and do not vary significantly with the choice of features and method. The machine learning approaches thus allow us to clearly separate tectonic subduction-zone earthquakes from those associated with the Shiveluch volcano eruptions based on data from a single station.
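The detection step based on signal envelope amplitudes might look roughly like the sketch below. This is an assumption-laden illustration rather than the authors' algorithm: the envelope is approximated here by a centred moving average of the rectified signal, and the names `envelope` and `detect_events`, the window length, and the threshold are all hypothetical.

```python
def envelope(signal, window):
    """Approximate the envelope as a centred moving average of |signal|."""
    rectified = [abs(s) for s in signal]
    half = window // 2
    env = []
    for i in range(len(rectified)):
        lo, hi = max(0, i - half), min(len(rectified), i + half + 1)
        env.append(sum(rectified[lo:hi]) / (hi - lo))
    return env


def detect_events(signal, window=5, threshold=1.0):
    """Return (start, end) sample-index pairs where the envelope exceeds threshold.

    `end` is exclusive: it is the first sample at which the envelope has
    dropped back to or below the threshold.
    """
    events, start = [], None
    for i, value in enumerate(envelope(signal, window)):
        if value > threshold and start is None:
            start = i
        elif value <= threshold and start is not None:
            events.append((start, i))
            start = None
    if start is not None:
        # The signal ended while an event was still in progress.
        events.append((start, len(signal)))
    return events
```

Each detected window could then be passed to the feature extraction and clustering or classification stages that the abstract describes.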

Investigating Machine Learning Techniques for User Sentiment Analysis

International Journal of Decision Support System Technology ◽

10.4018/ijdsst.2019070101 ◽

2019 ◽

Vol 11 (3) ◽

pp. 1-12 ◽

Cited By ~ 2

Author(s):

Nimesh V Patel ◽

Hitesh Chhinkaniwala

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Social Networking Sites ◽

Machine Learning Techniques ◽

Support Vector ◽

Product Reviews ◽

Learning Approaches ◽

Current Trends ◽

Learning Techniques ◽

Benchmark Datasets

Sentiment analysis identifies the opinions that users express in textual reviews on social networking sites, tweets, blog posts, forums, and status updates, where they share their emotions or reviews; market researchers use these reviews to learn how products are received and to follow current market trends. Sentiment analysis is performed by two methods: machine learning approaches and lexicon methods, the latter also known as the knowledge-base approach. In this article, the authors evaluate the performance of several machine learning techniques (Maximum Entropy, Naïve Bayes, and Support Vector Machines) on two benchmark datasets, the positive-negative dataset and a Movie Review dataset, by measuring accuracy, precision, recall, and F-score. The authors present the performance of various sentiment analysis and classification methods by classifying reviews from different dataset domains into binary classes, positive or negative. The results indicate that sentiment analysis using the Support Vector Machine outperforms the other machine learning techniques.
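For reference, precision, recall, and F-score for a binary positive/negative task can be computed from the counts of true positives, false positives, and false negatives. The sketch below assumes labels encoded as 1 (positive) and 0 (negative); the function name `precision_recall_f1` is chosen for this example and is not taken from the article.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall and F1 for the class labelled `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    # Guard against empty denominators when a class is never predicted or present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1
```

With two true positives, one false positive, and one false negative, all three measures equal 2/3.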

Prediction of drug synergy in cancer using ensemble-based machine learning techniques

Modern Physics Letters B ◽

10.1142/s0217984918501324 ◽

2018 ◽

Vol 32 (11) ◽

pp. 1850132 ◽

Cited By ~ 9

Author(s):

Harpreet Singh ◽

Prashant Singh Rana ◽

Urvinder Singh

Keyword(s):

Machine Learning ◽

Fuzzy Inference System ◽

Fuzzy Inference ◽

Machine Learning Techniques ◽

Support Vector ◽

Prediction Errors ◽

Learning Approaches ◽

Inference System ◽

Drug Synergy ◽

Learning Techniques

Drug synergy prediction plays a significant role in the medical field for inhibiting specific cancer agents. It can be developed as a pre-processing tool for therapeutic success. Different drug–drug interactions can be examined using the drug synergy score, which requires efficient regression-based machine learning approaches to minimize the prediction errors. Numerous machine learning techniques, such as neural networks, support vector machines, random forests, LASSO, and Elastic Nets, have been used in the past to meet this requirement. However, these techniques individually do not provide significant accuracy in predicting the drug synergy score. Therefore, the primary objective of this paper is to design a neuro-fuzzy-based ensembling approach. To achieve this, nine well-known machine learning techniques have been implemented on the drug synergy data. Based on the accuracy of each model, the four most accurate techniques are selected to develop an ensemble-based machine learning model. These models are Random Forest, Fuzzy Rules Using Genetic Cooperative-Competitive Learning method (GFS.GCCL), Adaptive-Network-Based Fuzzy Inference System (ANFIS), and Dynamic Evolving Neural-Fuzzy Inference System method (DENFIS). Ensembling is achieved by evaluating the biased weighted aggregation (i.e. adding more weight to the model with a higher prediction score) of the data predicted by the selected models. The proposed and existing machine learning techniques have been evaluated on drug synergy score data. The comparative analysis reveals that the proposed method outperforms the others in terms of accuracy, root mean square error, and coefficient of correlation.
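One simple reading of "biased weighted aggregation" is a weighted average in which each model's weight is proportional to its prediction score. The sketch below illustrates that reading only; the exact weighting scheme used in the paper is not given in the abstract, and the function name `weighted_ensemble` and the normalisation by the sum of scores are assumptions.

```python
def weighted_ensemble(predictions, scores):
    """Combine per-model predictions, weighting each model by its score.

    predictions: one list of predicted values per model, all the same length.
    scores: one non-negative validation score per model (higher is better).
    """
    total = sum(scores)
    weights = [s / total for s in scores]  # weights sum to 1
    n_samples = len(predictions[0])
    return [
        sum(w * model_preds[i] for w, model_preds in zip(weights, predictions))
        for i in range(n_samples)
    ]
```

For two models with scores 1 and 3, the second model contributes three quarters of each combined prediction.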

Prediction of the Hardness of Cu-Ti-Co Alloy Using Machine Learning Techniques

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.777.372 ◽

2018 ◽

Vol 777 ◽

pp. 372-376 ◽

Cited By ~ 1

Author(s):

Shan Feng Fang

Keyword(s):

Machine Learning ◽

Copper Alloys ◽

Least Square ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Approaches ◽

Forecasting Accuracy ◽

Learning Techniques ◽

Vector Machines ◽

Forecasting Performance

Diverse machine learning approaches were employed to build regression models for predicting a mechanical property of the Cu-Ti-Co alloy. The forecasting performance of the least-square support vector machines (LSSVM) model was compared with that of other artificial intelligence methods such as GRNN, RBF-PLS, and RBFNN. The models were developed and validated using a cross-validation (CV) procedure to improve the forecasting accuracy and generalization ability. The results demonstrate that the generalization performance of the new LSSVM is slightly better than or superior to that obtained using GRNN, RBF-PLS, and RBFNN. In the future, this relatively new machine-learning-based model is expected to serve as a particularly helpful tool for accelerating the materials design of copper alloys.
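A cross-validation procedure of the kind mentioned above partitions the samples into folds, each of which serves once as the validation set while the rest are used for fitting. The sketch below shows one plain way to generate such splits; the abstract does not state the number of folds or whether the data were shuffled, so the contiguous, unshuffled folds and the name `k_fold_indices` are assumptions.

```python
def k_fold_indices(n_samples, k):
    """Yield (train, test) index lists for k contiguous folds of near-equal size."""
    # The first (n_samples % k) folds receive one extra sample.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size
```

A model would be fitted on each `train` split and scored on the matching `test` split, and the per-fold scores averaged to estimate generalization performance.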

Machine Learning Approaches for Outdoor Air Quality Modelling: A Systematic Review

Applied Sciences ◽

10.3390/app8122570 ◽

2018 ◽

Vol 8 (12) ◽

pp. 2570 ◽

Cited By ~ 23

Author(s):

Yves Rybarczyk ◽

Rasa Zalakeviciute

Keyword(s):

Machine Learning ◽

Systematic Review ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Approaches ◽

Deterministic Models ◽

Learning Techniques ◽

Vector Machines ◽

Estimation Problems ◽

Selection Of

Current studies show that traditional deterministic models tend to struggle to capture the non-linear relationship between the concentration of air pollutants and their sources of emission and dispersion. To tackle this limitation, the most promising approach is to use statistical models based on machine learning techniques. Nevertheless, it is often unclear why a certain algorithm is chosen over another for a given task. This systematic review intends to clarify this question by providing the reader with a comprehensive description of the principles underlying these algorithms and how they are applied to enhance prediction accuracy. A rigorous search that conforms to the PRISMA guideline is performed and results in the selection of the 46 most relevant journal papers in the area. Through a factorial analysis method, these studies are synthesized and linked to each other. The main findings of this literature review show that: (i) machine learning is mainly applied in the Eurasian and North American continents and (ii) estimation problems tend to implement Ensemble Learning and Regressions, whereas forecasting makes use of Neural Networks and Support Vector Machines. The next challenges of this approach are to improve the prediction of pollution peaks and of contaminants recently put in the spotlight (e.g., nanoparticles).

Performance Evaluation of Several Machine Learning Techniques Used in the Diagnosis of Mammograms

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.i7891.0881019 ◽

2019 ◽

Vol 8 (10) ◽

pp. 228-232 ◽

Cited By ~ 1

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Machine Learning Techniques ◽

Theoretical Research ◽

Support Vector ◽

Common Disease ◽

Learning Approaches ◽

Learning Techniques ◽

Analysis Society ◽

Life Threatening

Throughout the world, breast cancer has become a common and life-threatening disease among women. Machine learning (ML) approaches have been widely used for the diagnosis of benign and malignant masses in mammograms. In this manuscript, I present the theoretical research and practical advances on various machine learning techniques for the diagnosis of benign and malignant masses in mammograms. The objective of this manuscript is to analyze the performance of distinct machine learning techniques on the Digital Mammography Image Analysis Society (MIAS) database. In this work, I compare the performance of four machine learning approaches: Support Vector, Naive Bayes, K-Nearest Neighbours, and Multilayer Perceptron. These four machine learning algorithms are used to categorize mammogram images, and their performance is assessed to identify the most suitable classifier. At the end of the study, the results indicate that the support vector approach is more successful than the other approaches.

Machine Learning Techniques for Identifying Fetal Risk During Pregnancy

International Journal of Image and Graphics ◽

10.1142/s0219467822500450 ◽

2021 ◽

Author(s):

S. Ravikumar ◽

E. Kannan

Keyword(s):

Machine Learning ◽

Fetal Heart ◽

Quantitative Description ◽

Absolute Error ◽

Outcome Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Approaches ◽

Fetal Risk ◽

Learning Techniques

Cardiotocography (CTG) is a biophysical method for assessing fetal condition that primarily relies on the recording and automated analysis of fetal heart activity. The quantitative description of the CTG signals is provided by computerized fetal monitoring systems. Although effective conclusion-generation methods for decision support are still required to identify fetal risks such as premature birth, the proposed method and outcome data can confirm the assessment of the fetal state after birth. Low birth weight is quite possibly the main attribute that indicates an abnormal fetal outcome. These expectations are assessed in an experimental decision support system that uses machine learning techniques to provide additional information about the fetal state. Advances in modern obstetric practice have enabled the use of numerous reliable and robust machine learning approaches for classifying fetal heart rate signals. The Naïve Bayes (NB) classifier, support vector machine (SVM), decision trees (DT), and random forest (RF) are used in the proposed method. To assess the outcomes of the proposed method, metrics such as precision, accuracy, F1 score, recall, sensitivity, logarithmic loss, and mean absolute error are used. These metrics help in predicting fetal risk.
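Two of the less familiar metrics listed above, logarithmic loss and mean absolute error, can be written out in a few lines. The sketch assumes a binary task with labels 0 and 1 and predicted probabilities for class 1; the abstract does not say how many classes were used, and the function names and the clipping constant `eps` are choices made for this example.

```python
import math


def log_loss(y_true, probs, eps=1e-15):
    """Mean binary cross-entropy; probs are predicted probabilities of class 1."""
    total = 0.0
    for y, p in zip(y_true, probs):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(y_true)


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between targets and predictions."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)
```

A classifier that always outputs a probability of 0.5 has a logarithmic loss of ln 2 (about 0.693), which is a useful baseline when reading reported values.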
