Research on Performance of the Classifying Models Based on Chinese, Pakistani, and Other Genres

2021 ◽  
Vol 11 (4) ◽  
pp. 61-79
Author(s):  
Ejaz ud Din ◽  
Long Hua ◽  
Zhongyu Lu

In recent years, with the increase in the amount of audio on the internet, the demand for audio classification is increasing. This paper focuses on finding the performance of the classifiers, uses Python for the simulation part, compares the performance, and finds the best classifier. Two experiments are performed for this paper; for the first part of the experiment, Pakistan and Chinese music samples are considered, and classifiers are used to classify these music samples. It is found that the artificial neural network (ANN) has lowest accuracy of 81.4%; additionally, support vector machine (SVM), k-nearest neighbor (KNN), and convolutional (CNN) accuracies remain between 82% to 86% based on the dataset. Random forest model has the highest accuracy of 94.3%. It is considered to be the best classifier. For the second part of the experiment, other genres such as classical, country, and pop music were added to the previous dataset. After adding these genres, performance of the classifying models varies slightly; it fluctuates between 75% to 84%. These results can be used for music recommendation applications.

Author(s):  
Djarot Hindarto ◽  
Handri Santoso

Currently adoption of mobile phones and mobile applications based  on Android operating system is increasing rapidly. Many companies and emerging startups are carrying out digital transformation by using mobile applications to provide disruptive digital services to replace existing old styled services. This transformation prompted the attackers to create malicious software (malware) using sophisticate methods to target victims of Android mobile phone users. The purpose of this study is to identify Android APK files by classifying them using Artificial Neural Network (ANN) and Non Neural Network (NNN). The ANN is Multi-Layer Perceptron Classifier (MLPC), while  the NNN are KNN, SVM, Decision Tree, Logistic Regression and Naïve Bayes methods. The results show that the performance using NNN has decreasing accuracy when training using larger datasets. The use of the K-Nearest Neighbor algorithm with a dataset of 600 APKs achieves an accuracy of  91.2% and dataset of 14170 APKs achieves an accuracy of 88%. The using of the Support Vector Machine algorithm with the 600 APK dataset has an accuracy of 99.1% and the 14170 APK dataset has an accuracy of 90.5%. The using of the Decision Tree algorithm with the 600 APK dataset has an accuracy of  99.2%, the 14170 APK dataset has an accuracy of 90.8%. The experiment using the Multi-Layer Perceptron Classifier has increasing with the 600 APK dataset reaching 99%, the 7000 APK dataset reaching 100% and the 14170 APK dataset reaching 100%.


Author(s):  
S. Vijaya Rani ◽  
G. N. K. Suresh Babu

The illegal hackers  penetrate the servers and networks of corporate and financial institutions to gain money and extract vital information. The hacking varies from one computing system to many system. They gain access by sending malicious packets in the network through virus, worms, Trojan horses etc. The hackers scan a network through various tools and collect information of network and host. Hence it is very much essential to detect the attacks as they enter into a network. The methods  available for intrusion detection are Naive Bayes, Decision tree, Support Vector Machine, K-Nearest Neighbor, Artificial Neural Networks. A neural network consists of processing units in complex manner and able to store information and make it functional for use. It acts like human brain and takes knowledge from the environment through training and learning process. Many algorithms are available for learning process This work carry out research on analysis of malicious packets and predicting the error rate in detection of injured packets through artificial neural network algorithms.


2015 ◽  
Vol 13 (2) ◽  
pp. 50-58
Author(s):  
R. Khadim ◽  
R. El Ayachi ◽  
Mohamed Fakir

This paper focuses on the recognition of 3D objects using 2D attributes. In order to increase the recognition rate, the present an hybridization of three approaches to calculate the attributes of color image, this hybridization based on the combination of Zernike moments, Gist descriptors and color descriptor (statistical moments). In the classification phase, three methods are adopted: Neural Network (NN), Support Vector Machine (SVM), and k-nearest neighbor (KNN). The database COIL-100 is used in the experimental results.


2021 ◽  
Author(s):  
Jerome Asedegbega ◽  
Oladayo Ayinde ◽  
Alexander Nwakanma

Abstract Several computer-aided techniques have been developed in recent past to improve interpretational accuracy of subsurface geology. This paradigm shift has provided tremendous success in variety of Machine Learning Application domains and help for better feasibility study in reservoir evaluation using multiple classification techniques. Facies classification is an essential subsurface exploration task as sedimentary facies reflect associated physical, chemical, and biological conditions that formation unit experienced during sedimentation activity. This study however, employed formation samples for facies classification using Machine Learning (ML) techniques and classified different facies from well logs in seven (7) wells of the PORT Field, Offshore Niger Delta. Six wells were concatenated during data preparation and trained using supervised ML algorithms before validating the models by blind testing on one well log to predict discrete facies groups. The analysis started with data preparation and examination where various features of the available well data were conditioned. For the model building and performance, support vector machine, random forest, decision tree, extra tree, neural network (multilayer preceptor), k-nearest neighbor and logistic regression model were built after dividing the data sets into training, test, and blind test well data. Results of metric score for the blind test well estimated for the various models using Jaccard index and F1-score indicated 0.73 and 0.82 for support vector machine, 0.38 and 0.54 for random forest, 0.78 and 0.83 for extra tree, 0.91 and 0.95 for k-nearest neighbor, 0.41 and 0.56 for decision tree, 0.63 and 0.74 for logistic regression, 0.55 and 0.68 for neural network, respectively. The efficiency of ML techniques for enhancing the prediction accuracy and decreasing the procedure time and their approach toward the data, makes it importantly desirable to recommend them in subsurface facies classification analysis.


2020 ◽  
Author(s):  
Nazrul Anuar Nayan ◽  
Hafifah Ab Hamid ◽  
Mohd Zubir Suboh ◽  
Noraidatulakma Abdullah ◽  
Rosmina Jaafar ◽  
...  

Abstract Background: Cardiovascular disease (CVD) is the leading cause of deaths worldwide. In 2017, CVD contributed to 13,503 deaths in Malaysia. The current approaches for CVD prediction are usually invasive and costly. Machine learning (ML) techniques allow an accurate prediction by utilizing the complex interactions among relevant risk factors. Results: This study presents a case–control study involving 60 participants from The Malaysian Cohort, which is a prospective population-based project. Five parameters, namely, the R–R interval and root mean square of successive differences extracted from electrocardiogram (ECG), systolic and diastolic blood pressures, and total cholesterol level, were statistically significant in predicting CVD. Six ML algorithms, namely, linear discriminant analysis, linear and quadratic support vector machines, decision tree, k-nearest neighbor, and artificial neural network (ANN), were evaluated to determine the most accurate classifier in predicting CVD risk. ANN, which achieved 90% specificity, 90% sensitivity, and 90% accuracy, demonstrated the highest prediction performance among the six algorithms. Conclusions: In summary, by utilizing ML techniques, ECG data can serve as a good parameter for CVD prediction among the Malaysian multiethnic population.


Sensors ◽  
2019 ◽  
Vol 19 (12) ◽  
pp. 2814 ◽  
Author(s):  
Xiaoguang Liu ◽  
Huanliang Li ◽  
Cunguang Lou ◽  
Tie Liang ◽  
Xiuling Liu ◽  
...  

Falls are the major cause of fatal and non-fatal injury among people aged more than 65 years. Due to the grave consequences of the occurrence of falls, it is necessary to conduct thorough research on falls. This paper presents a method for the study of fall detection using surface electromyography (sEMG) based on an improved dual parallel channels convolutional neural network (IDPC-CNN). The proposed IDPC-CNN model is designed to identify falls from daily activities using the spectral features of sEMG. Firstly, the classification accuracy of time domain features and spectrograms are compared using linear discriminant analysis (LDA), k-nearest neighbor (KNN) and support vector machine (SVM). Results show that spectrograms provide a richer way to extract pattern information and better classification performance. Therefore, the spectrogram features of sEMG are selected as the input of IDPC-CNN to distinguish between daily activities and falls. Finally, The IDPC-CNN is compared with SVM and three different structure CNNs under the same conditions. Experimental results show that the proposed IDPC-CNN achieves 92.55% accuracy, 95.71% sensitivity and 91.7% specificity. Overall, The IDPC-CNN is more effective than the comparison in accuracy, efficiency, training and generalization.


2019 ◽  
Vol 2019 ◽  
pp. 1-14
Author(s):  
Renzhou Gui ◽  
Tongjie Chen ◽  
Han Nie

With the continuous development of science, more and more research results have proved that machine learning is capable of diagnosing and studying the major depressive disorder (MDD) in the brain. We propose a deep learning network with multibranch and local residual feedback, for four different types of functional magnetic resonance imaging (fMRI) data produced by depressed patients and control people under the condition of listening to positive- and negative-emotions music. We use the large convolution kernel of the same size as the correlation matrix to match the features and obtain the results of feature matching of 264 regions of interest (ROIs). Firstly, four-dimensional fMRI data are used to generate the two-dimensional correlation matrix of one person’s brain based on ROIs and then processed by the threshold value which is selected according to the characteristics of complex network and small-world network. After that, the deep learning model in this paper is compared with support vector machine (SVM), logistic regression (LR), k-nearest neighbor (kNN), a common deep neural network (DNN), and a deep convolutional neural network (CNN) for classification. Finally, we further calculate the matched ROIs from the intermediate results of our deep learning model which can help related fields further explore the pathogeny of depression patients.


2019 ◽  
Vol 9 (20) ◽  
pp. 4448 ◽  
Author(s):  
İş ◽  
Tuncer

This article considers methodological approaches to determine and prevent social media manipulation specific to Twitter. Behavioral analyses of Twitter users were performed by using their profile structures and interaction types, and Twitter users were classified according to their effect size values by determining their asset values. User profiles were classified into three different categories, namely popular-active, observer-passive, and spam-bot-malicious by using k-nearest neighbor (K-NN), support vector machine (SVM), and artificial neural network (ANN) algorithms. For classification, the study used the basic characteristics of users, such as density, centralization, and diameter, as well as suggested time series such as the simple moving average and cumulative moving average. The highest accuracy was obtained by the K-NN algorithm. The results obtained with K-NN for all classes were higher than the F1-Score values obtained for the other algorithms. According to the results obtained, classification accuracy values were found to reach a maximum of 96.81% and a minimum of 92.33%. Our classification results showed that the proposed method was satisfactory for popular-active, observer-passive, and spam-bot-malicious account separation.


Author(s):  
Nayan Nazrul Anuar ◽  
Ab Hamid Hafifah ◽  
Suboh Mohd Zubir ◽  
Abdullah Noraidatulakma ◽  
Jaafar Rosmina ◽  
...  

<p>Cardiovascular disease (CVD) is the leading cause of deaths worldwide. In 2017, CVD contributed to 13,503 deaths in Malaysia. The current approaches for CVD prediction are usually invasive and costly. Machine learning (ML) techniques allow an accurate prediction by utilizing the complex interactions among relevant risk factors. This study presents a case–control study involving 60 participants from The Malaysian Cohort, which is a prospective population-based project. Five parameters, namely, the R–R interval and root mean square of successive differences extracted from electrocardiogram (ECG), systolic and diastolic blood pressures, and total cholesterol level, were statistically significant in predicting CVD. Six ML algorithms, namely, linear discriminant analysis, linear and quadratic support vector machines, decision tree, k-nearest neighbor, and artificial neural network (ANN), were evaluated to determine the most accurate classifier in predicting CVD risk. ANN, which achieved 90% specificity, 90% sensitivity, and 90% accuracy, demonstrated the highest prediction performance among the six algorithms. In summary, by utilizing ML techniques, ECG data can serve as a good parameter for CVD prediction among the Malaysian multiethnic population.</p>


Sign in / Sign up

Export Citation Format

Share Document