scholarly journals Development of a Machine Learning Model for Image-based Email Spam Detection

2021 ◽  
Vol 6 (4) ◽  
Author(s):  
Christopher U. Onova ◽  
Temidayo O. Omotehinwa

Combatting email spam has remained a very daunting task. Despite the over 99% accuracy in most non-image-based spam email detection, studies on image-based spam hardly attain such a high level of accuracy as new email spamming techniques that defeat existing spam filters emerges from time to time. The number of email spams sent out daily has remained a key factor in the continued use of spam. In this paper, a simple convolutional neural network model, 123DNet was developed and trained with 28,929 images drawn from 2 public datasets and a Personally Generated dataset. The model was optimized to the least set of layers to have 1 input layer, 2 embedded Convolutional layers as a hidden layer, and 3 neural network layers. The model was tested with a total of 4,339 images of the three dataset samples and then with a separate set of 1,200 images to test performance on never-seen-before images. A Classification Performance analysis was carried out using the confusion matrix. Performance metrics including Accuracy, Precision, True Negative Accuracy, Sensitivity, Specificity, and F1 Measure were computed to ascertain the model’s performance. The Model returned an F1 Score of 97% on a public dataset’s test sample and 88% on Never-seen-before test samples outperforming some pre-existing models while performing significantly well on the newly generated image test samples. It is recommended that a model that performed so well with new never-seen-before spam images be integrated into spam filtering systems. Keywords- Convolutional Neural Network, Deep Learning,  Image-based Spam Detection

Author(s):  
Niha Kamal Basha ◽  
Aisha Banu Wahab

: Absence seizure is a type of brain disorder in which subject get into sudden lapses in attention. Which means sudden change in brain stimulation. Most of this type of disorder is widely found in children’s (5-18 years). These Electroencephalogram (EEG) signals are captured with long term monitoring system and are analyzed individually. In this paper, a Convolutional Neural Network to extract single channel EEG seizure features like Power, log sum of wavelet transform, cross correlation, and mean phase variance of each frame in a windows are extracted after pre-processing and classify them into normal or absence seizure class, is proposed as an empowerment of monitoring system by automatic detection of absence seizure. The training data is collected from the normal and absence seizure subjects in the form of Electroencephalogram. The objective is to perform automatic detection of absence seizure using single channel electroencephalogram signal as input. Here the data is used to train the proposed Convolutional Neural Network to extract and classify absence seizure. The Convolutional Neural Network consist of three layers 1] convolutional layer – which extract the features in the form of vector 2] Pooling layer – the dimensionality of output from convolutional layer is reduced and 3] Fully connected layer–the activation function called soft-max is used to find the probability distribution of output class. This paper goes through the automatic detection of absence seizure in detail and provide the comparative analysis of classification between Support Vector Machine and Convolutional Neural Network. The proposed approach outperforms the performance of Support Vector Machine by 80% in automatic detection of absence seizure and validated using confusion matrix.


2021 ◽  
Vol 11 (14) ◽  
pp. 6594
Author(s):  
Yu-Chia Hsu

The interdisciplinary nature of sports and the presence of various systemic and non-systemic factors introduce challenges in predicting sports match outcomes using a single disciplinary approach. In contrast to previous studies that use sports performance metrics and statistical models, this study is the first to apply a deep learning approach in financial time series modeling to predict sports match outcomes. The proposed approach has two main components: a convolutional neural network (CNN) classifier for implicit pattern recognition and a logistic regression model for match outcome judgment. First, the raw data used in the prediction are derived from the betting market odds and actual scores of each game, which are transformed into sports candlesticks. Second, CNN is used to classify the candlesticks time series on a graphical basis. To this end, the original 1D time series are encoded into 2D matrix images using Gramian angular field and are then fed into the CNN classifier. In this way, the winning probability of each matchup team can be derived based on historically implied behavioral patterns. Third, to further consider the differences between strong and weak teams, the CNN classifier adjusts the probability of winning the match by using the logistic regression model and then makes a final judgment regarding the match outcome. We empirically test this approach using 18,944 National Football League game data spanning 32 years and find that using the individual historical data of each team in the CNN classifier for pattern recognition is better than using the data of all teams. The CNN in conjunction with the logistic regression judgment model outperforms the CNN in conjunction with SVM, Naïve Bayes, Adaboost, J48, and random forest, and its accuracy surpasses that of betting market prediction.


2018 ◽  
Vol 4 (9) ◽  
pp. 107 ◽  
Author(s):  
Mohib Ullah ◽  
Ahmed Mohammed ◽  
Faouzi Alaya Cheikh

Articulation modeling, feature extraction, and classification are the important components of pedestrian segmentation. Usually, these components are modeled independently from each other and then combined in a sequential way. However, this approach is prone to poor segmentation if any individual component is weakly designed. To cope with this problem, we proposed a spatio-temporal convolutional neural network named PedNet which exploits temporal information for spatial segmentation. The backbone of the PedNet consists of an encoder–decoder network for downsampling and upsampling the feature maps, respectively. The input to the network is a set of three frames and the output is a binary mask of the segmented regions in the middle frame. Irrespective of classical deep models where the convolution layers are followed by a fully connected layer for classification, PedNet is a Fully Convolutional Network (FCN). It is trained end-to-end and the segmentation is achieved without the need of any pre- or post-processing. The main characteristic of PedNet is its unique design where it performs segmentation on a frame-by-frame basis but it uses the temporal information from the previous and the future frame for segmenting the pedestrian in the current frame. Moreover, to combine the low-level features with the high-level semantic information learned by the deeper layers, we used long-skip connections from the encoder to decoder network and concatenate the output of low-level layers with the higher level layers. This approach helps to get segmentation map with sharp boundaries. To show the potential benefits of temporal information, we also visualized different layers of the network. The visualization showed that the network learned different information from the consecutive frames and then combined the information optimally to segment the middle frame. We evaluated our approach on eight challenging datasets where humans are involved in different activities with severe articulation (football, road crossing, surveillance). The most common CamVid dataset which is used for calculating the performance of the segmentation algorithm is evaluated against seven state-of-the-art methods. The performance is shown on precision/recall, F 1 , F 2 , and mIoU. The qualitative and quantitative results show that PedNet achieves promising results against state-of-the-art methods with substantial improvement in terms of all the performance metrics.


Author(s):  
Gauri Jain ◽  
Manisha Sharma ◽  
Basant Agarwal

This article describes how spam detection in the social media text is becoming increasing important because of the exponential increase in the spam volume over the network. It is challenging, especially in case of text within the limited number of characters. Effective spam detection requires more number of efficient features to be learned. In the current article, the use of a deep learning technology known as a convolutional neural network (CNN) is proposed for spam detection with an added semantic layer on the top of it. The resultant model is known as a semantic convolutional neural network (SCNN). A semantic layer is composed of training the random word vectors with the help of Word2vec to get the semantically enriched word embedding. WordNet and ConceptNet are used to find the word similar to a given word, in case it is missing in the word2vec. The architecture is evaluated on two corpora: SMS Spam dataset (UCI repository) and Twitter dataset (Tweets scrapped from public live tweets). The authors' approach outperforms the-state-of-the-art results with 98.65% accuracy on SMS spam dataset and 94.40% accuracy on Twitter dataset.


Author(s):  
Fei Rong ◽  
Li Shasha ◽  
Xu Qingzheng ◽  
Liu Kun

The Station logo is a way for a TV station to claim copyright, which can realize the analysis and understanding of the video by the identification of the station logo, so as to ensure that the broadcasted TV signal will not be illegally interfered. In this paper, we design a station logo detection method based on Convolutional Neural Network by the characteristics of the station, such as small scale-to-height ratio change and relatively fixed position. Firstly, in order to realize the preprocessing and feature extraction of the station data, the video samples are collected, filtered, framed, labeled and processed. Then, the training sample data and the test sample data are divided proportionally to train the station detection model. Finally, the sample is tested to evaluate the effect of the training model in practice. The simulation experiments prove its validity.


Author(s):  
Adigun Oyeranmi ◽  
Babatunde Ronke ◽  
Rufai Mohammed ◽  
Aigbokhan Edwin

Fractured bone detection and categorization is currently receiving research attention in computer aided diagnosis system because of the ease it has brought to doctors in classification and interpretation of X-ray images.  The choice of an efficient algorithm or combination of algorithms is paramount to accurately detect and categorize fractures in X-ray images, which is the first stage of diagnosis in treatment and correction of damaged bones for patients. This is what this research seeks to address. The research design involves data collection, preprocessing, segmentation, feature extraction, classification and evaluation of the proposed method. The sample dataset were x-ray images collected from the Department of Radiology, National Orthopedic Hospital, Igbobi-Lagos, Nigeria as well as Open Access Medical Image Repositories. The image preprocessing involves the conversion of images in RGB format to grayscale, sharpening and smoothing using Unsharp Masking Tool.  The segmentation of the preprocessed image was carried out by adopting the Entropy method in the first stage and Canny edge method in the second stage while feature extraction was performed using Hough Transformation. Detection and classification of fracture image employed a combination of two algorithms;  K-Nearest Neighbor (KNN) and Support Vector Machine (SVM) for detecting fracture locations based on four classification types: (normal, comminute, oblique and transverse).Two performance assessment methods were employed to evaluate the developed system. The first evaluation was based on confusion matrix which evaluates fracture and non-fracture on the basis of TP (True Positive), TN (True negative), FP (False Positive) and FN (False Negative). The second appraisal was based on Kappa Statistics which evaluates the type of fracture by determining the accuracy of the categorized fracture bone type. The result of first assessment for fracture detection shows that 26 out of 40 preprocessed images were fractured, resulting to the following three values of performance metrics: accuracy value of 90%, sensitivity of 87% and specificity of 100%. The Kappa coefficient error assessment produced accuracy of 83% during classification. The proposed method can find suitable use in categorization of fracture types on different bone images based on the results obtained from the experiment.


2020 ◽  
Vol 5 (1) ◽  
pp. 23
Author(s):  
Daru Prasetyawan ◽  
Shofwatul 'Uyun

Emosi seseorang dapat ditunjukan melalui ekspresi wajah. Ekspresi wajah manusia dapat berubah-ubah secara dinamis tanpa disadari oleh orang tersebut. Penelitian ini melakukan penentuan emosi dengan melakukan pengenalan ekspresi wajah manusia dan melakukan perekaman untuk setiap perubahan ekspresi wajah tersebut. Metode dalam penelitian ini adalah dengan melakukan klasifikasi terhadap 6 ekspresi dasar wajah manusia ditambah ekspresi netral dengan Convolutional Neural Network (CNN). Pemerataan distribusi data dilakukan untuk meningkatkan kinerja model. Dari pemodelan tersebut, dihasilkan model klasifikasi yang dapat diterapkan pada sebuah video. Model tersebut diuji menggunakan data yang terpisah dari data latih dan dievaluasi menggunakan confusion matrix. Sebagai hasil evaluasi, diperoleh akurasi 74%, rata-rata presisi 75,05%, dan rata-rata recall 74%. Di akhir penelitian ini, peneliti melakukan percobaan dengan menerapkan model klasifikasi tersebut pada beberapa video yang mewakili ekspresi seseorang di dalam video tersebut. Setiap perubahan ekspresi akan direkam dan dianalisis sehingga ditemukan emosi yang paling dominan.


2021 ◽  
Vol 905 (1) ◽  
pp. 012059
Author(s):  
Y Hendrawan ◽  
B Rohmatulloh ◽  
F I Ilmi ◽  
M R Fauzy ◽  
R Damayanti ◽  
...  

Abstract Various types of Indonesian coffee are already popular internationally. Recently, there are still not many methods to classify the types of typical Indonesian coffee. Computer vision is a non-destructive method for classifying agricultural products. This study aimed to classify three types of Indonesian Arabica coffee beans, i.e., Gayo Aceh, Kintamani Bali, and Toraja Tongkonan, using computer vision. The classification method used was the AlexNet convolutional neural network with sensitivity analysis using several variations of the optimizer such as SGDm, Adam, and RMSProp and the learning rate of 0.00005 and 0.0001. Each type of coffee used 500 data for training and validation with the distribution of 70% training and 30% validation. The results showed that all AlexNet models achieved a perfect validation accuracy value of 100% in 1,040 iterations. This study also used 100 testing-set data on each type of coffee bean. In the testing confusion matrix, the accuracy reached 99.6%.


Sign in / Sign up

Export Citation Format

Share Document