Cardiac Sound Classification Using Mel-Frequency Cepstral Coefficients (MFCC) and Artificial Neural Network (ANN)

Author(s):  
Muhammad Rahmandani ◽  
Hanung Adi Nugroho ◽  
Noor Akhmad Setiawan
2020 ◽  
Vol 4 (3) ◽  
pp. 497-503
Author(s):  
Thasya Nurul Wulandari Siagian ◽  
Hilal Hudan Nuha ◽  
Rahmat Yasirandi

Footstep recognition is a relatively new biometric, based on learning from footstep signals captured as people walk across a sensing area. Footstep signal classification for security systems still has a low level of accuracy, so a classification system with high accuracy is needed. Most systems are developed using geometric and holistic features but still yield high error rates. In this research, a new system is proposed that uses Mel Frequency Cepstral Coefficients (MFCCs) for feature extraction, because their mel-scaled frequency representation mimics the human hearing system, and an Artificial Neural Network (ANN) as the classification algorithm, because it offers a good level of accuracy, with a dataset of 500 footstep recordings. The classification results show that the proposed system achieves a validation loss value of 57.3, a testing accuracy of 92.0%, a loss value of 193.8, and a training accuracy of 100%. These results are an evaluation of the system's contribution to improving footstep signal recognition for security systems in the smart home environment.
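
The MFCC feature extraction step named in this abstract can be sketched roughly as follows. This is a minimal NumPy illustration of the general technique, not the authors' implementation: the sample rate, frame length, hop size, filter count, and the function names (`mfcc`, `mel_filterbank`, `dct_ii`) are all assumptions chosen for the example, and pre-emphasis and liftering are omitted.

```python
import numpy as np

def hz_to_mel(hz):
    # Common mel-scale formula (O'Shaughnessy variant)
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters whose edges are equally spaced on the mel scale
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, centre, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, centre):
            fbank[i - 1, k] = (k - left) / (centre - left)
        for k in range(centre, right):
            fbank[i - 1, k] = (right - k) / (right - centre)
    return fbank

def dct_ii(x, n_out):
    # Unnormalised DCT-II along the last axis, keeping the first n_out coefficients
    n = x.shape[-1]
    k = np.arange(n_out)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return x @ basis.T

def mfcc(signal, sample_rate=16000, frame_len=400, hop=160,
         n_fft=512, n_filters=26, n_coeffs=13):
    # All default values here are illustrative assumptions, not taken from the paper
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)           # windowed frames
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_energies = np.log(energies + 1e-10)                # avoid log(0)
    return dct_ii(log_energies, n_coeffs)                  # (n_frames, n_coeffs)

# Usage: one second of a synthetic 440 Hz tone stands in for a recording
tone = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)
features = mfcc(tone)
```

Each row of `features` is one frame's coefficient vector; a classifier such as an ANN would typically receive these rows, or a per-recording summary of them, as input.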


2021 ◽  
Author(s):  
Yunendah Nur Fu’adah ◽  
Ki Moo Lim

Heart sound auscultation is one of the most widely used approaches for detecting cardiovascular disorders. Diagnosing heart sound abnormalities with a stethoscope depends on the physician’s skill and judgement. Several studies have shown promising results in the automatic detection of cardiovascular disorders based on heart sound signals. However, accuracy needs to be improved, as automated heart sound classification aids in the early detection and prevention of the dangerous effects of cardiovascular problems. In this study, an optimal heart sound classification method based on machine learning technologies for cardiovascular disease prediction is presented. It consists of three steps: pre-processing that sets the Physionet Challenge 2016 recordings to a 5 s duration, feature extraction using mel-frequency cepstrum coefficients (MFCC), and classification using an artificial neural network (ANN) with one hidden layer that provides low parameter consumption. Ten-fold cross-validation was used to evaluate the performance of the proposed method. The best model obtained 94% accuracy and a 93% AUC score, which were assessed using 1626 test datasets. Taken together, the results show that the proposed method obtained excellent classification results and provided low parameter consumption, thereby reducing computational time to facilitate a real-time implementation.
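
Two of the steps described in this abstract, fixing every recording to 5 s and evaluating with ten-fold cross-validation, might look roughly like the sketch below. The abstract does not say whether shorter recordings are padded or how folds are drawn, so zero-padding, shuffled folds, the 2000 Hz sample rate in the usage lines, and the function names are all assumptions made for illustration.

```python
import numpy as np

def fix_duration(signal, sample_rate, seconds=5.0):
    """Truncate or zero-pad a 1-D signal to exactly `seconds` of audio (assumed policy)."""
    target = int(round(sample_rate * seconds))
    if len(signal) >= target:
        return signal[:target]
    return np.pad(signal, (0, target - len(signal)))

def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs for shuffled k-fold cross-validation."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_samples)
    folds = np.array_split(order, k)
    for i in range(k):
        test_idx = folds[i]
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train_idx, test_idx

# Usage with hypothetical recordings sampled at an assumed 2000 Hz
short_clip = fix_duration(np.ones(3000), sample_rate=2000)    # padded to 10000 samples
long_clip = fix_duration(np.ones(12000), sample_rate=2000)    # truncated to 10000 samples
splits = list(k_fold_indices(n_samples=100, k=10))
```

Every sample appears in exactly one test fold, so averaging a metric over the ten splits uses each recording for testing once.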


Author(s):  
Gizachew Belayneh Gebre et al.

In this era of artificial intelligence, speaker recognition is among the most useful biometric recognition techniques. Security is a major issue that needs careful attention because everyday activities are becoming automated and internet-based. For security purposes, unique features of authorized users are needed, and voice is one such unique biometric feature, so developing speaker recognition grounded in scientific research is a pressing concern. Criminal activities are increasing day by day and taking increasingly sophisticated forms, so every country should strengthen forensic investigation using such technologies. This study was motivated by the aim of contextualizing this concept for our country. A text-independent Amharic language speaker recognition model was developed using Mel-Frequency Cepstral Coefficients to extract features from preprocessed speech signals and an Artificial Neural Network to model the resulting feature vectors and to classify inputs during testing. The researcher used 20 speech samples from each of 10 speakers (200 speech samples in total) for training and testing separately. By setting the number of hidden neurons to 15, 20, and 25, three different models were developed and evaluated for accuracy. MATLAB, a fourth-generation high-level programming language and interactive environment, was used for all implementation work. Very promising findings were obtained. The study achieved better performance than related studies that used Vector Quantization and Gaussian Mixture Model modelling techniques. More implementable results could be obtained in the future by increasing the number of speakers and speech samples and by including the four Amharic accents.
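
To give a sense of how the three hidden-layer sizes compare, the sketch below counts the trainable weights and biases of a fully connected network with one hidden layer. The abstract reports only the hidden-neuron counts (15, 20, 25); the single-hidden-layer layout, the 13-dimensional MFCC input, and the 10-way output (one unit per speaker) are assumptions for illustration, and the function name is hypothetical.

```python
def one_hidden_layer_params(n_inputs, n_hidden, n_outputs):
    """Count weights and biases of a fully connected one-hidden-layer network."""
    hidden_layer = n_inputs * n_hidden + n_hidden      # weights + biases
    output_layer = n_hidden * n_outputs + n_outputs    # weights + biases
    return hidden_layer + output_layer

# Assumed sizes: 13 input features, 10 output classes
counts = {h: one_hidden_layer_params(13, h, 10) for h in (15, 20, 25)}
print(counts)  # {15: 370, 20: 490, 25: 610}
```

Under these assumptions the parameter count grows linearly with the number of hidden neurons, so the three models differ only modestly in size.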


2019 ◽  
Vol 12 (3) ◽  
pp. 145 ◽  
Author(s):  
Epyk Sunarno ◽  
Ramadhan Bilal Assidiq ◽  
Syechu Dwitya Nugraha ◽  
Indhana Sudiharto ◽  
Ony Asrarul Qudsi ◽  
...  

2020 ◽  
Vol 38 (4A) ◽  
pp. 510-514
Author(s):  
Tay H. Shihab ◽  
Amjed N. Al-Hameedawi ◽  
Ammar M. Hamza

In this paper, to make use of the complementary potential in land use/land cover (LULC) mapping, spatial data were acquired from Landsat 8 OLI sensor images taken in 2019. The images were rectified, enhanced, and then classified using the random forest (RF) and artificial neural network (ANN) methods. Optical remote sensing images were used to obtain information on the status of LULC classification and extraction details. The classification of both satellite image types is used to extract features and to analyse the LULC of the study area. The image processing required for the optical remote sensing data to be used in LULC mapping included geometric correction and image enhancement. The classification results showed that the ANN method outperforms the RF method. With the ANN method, the overall accuracy was 0.91 and the kappa coefficient was 0.89 for the training dataset, while the overall accuracy and kappa coefficient for the test dataset were 0.89 and 0.87, respectively.
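
The two metrics reported in this abstract, overall accuracy and kappa, are conventionally derived from a confusion matrix. The sketch below shows the standard Cohen's kappa calculation; the function name and the small two-class matrix are made up for illustration and are not data from the paper.

```python
def overall_accuracy_and_kappa(confusion):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix.

    confusion[i][j] is the number of samples of true class i predicted as class j.
    """
    total = sum(sum(row) for row in confusion)
    observed = sum(confusion[i][i] for i in range(len(confusion))) / total
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(col) for col in zip(*confusion)]
    # Agreement expected by chance, from the marginal totals
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa

# Hypothetical two-class confusion matrix
accuracy, kappa = overall_accuracy_and_kappa([[45, 5], [5, 45]])
```

Kappa discounts the agreement that would occur by chance, which is why it is typically lower than the overall accuracy, as in the values the abstract reports.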


2020 ◽  
Vol 38 (2A) ◽  
pp. 255-264
Author(s):  
Hanan A. R. Akkar ◽  
Sameem A. Salman

Computer vision and image processing are essential for medical image analysis. In this paper, a bio-inspired artificial intelligence (AI) optimization method based on an artificial neural network (ANN) is used to detect images of skin carcinoma. Moth Flame Optimization (MFO) is used to train the ANN, and different features are extracted to train the classifier. The proposed model is compared with two other AI-optimization-based classifiers: ANN-ACO (ANN trained with Ant Colony Optimization (ACO)) and ANN-PSO (ANN trained with Particle Swarm Optimization (PSO)). The results were assessed using several performance measures: the Average Rate of Detection (ARD), the Average Mean Square Error obtained from training (AMSTR), the Average Mean Square Error obtained from testing the trained network (AMSTE), the Average Effective Processing Time (AEPT) in seconds, and the Average Effective Iteration Number (AEIN). Experimental results clearly show the superiority of the proposed ANN-MFO model with different features.

