Artificial intelligence-based classification with classical Turkish music makams: Possibilities to Turkish music education

2021 ◽  
Vol 9 (2) ◽  
pp. 570-580
Author(s):  
Mert Kayış ◽  

Makams of Classical Turkish Music have been tried to be classified through various studies for the past years. Significant differences of opinion have emerged in the classification process of the makams in Music Education and Literacy from past to present. This situation creates problems in learning the makams related to music education and recognizing the makams heard. Additionally, there are uncertainties in the classification of the makam genre of the song, as individual mistakes were made while notating the musical notes. Apart from that, this situation constitutes a problem not only for the ones studying Turkish Classical Music but also for the ones interested in this certain type of Music. Therefore, the objective of the research is to contribute to the makam classification in Classical Turkish Music Education by developing an MIR system that determines the makam of the songs. Theoretically, we can extract the properties of sound signals with Time Wavelet Scattering Feature Extraction, classify them with SVM and distinguish between types of makams. In this study, upon eight different Makams, a Musical Information Retrieval system has been created via the Artificial Intelligence (AI) method of Support Vector Machines (SVM) and Time Wavelet Scattering Feature Extraction and through using a Graphics Processing Unit (GPU) accelerator for the sake of feature extraction. We performed the classification process by modeling it on the MATLAB program. The study's success rate was identified as 98.21% and it acquired a higher success rate compared to the other studies in the literature. After completing the classification procedure, the Makams were identified by sending samples belonging to different sound files from the system consisting of a database belonging to eight different Makams. In our study, the classification and detection processes were realized with nearly a hundred percent success. The difficulties encountered in classifying the makams in Classical Turkish Music mentioned above, with the application of artificial intelligence, the classification difficulty of individuals who have received this type of training or are interested in this subject has been overcome.

Author(s):  
D. A. Kalina ◽  
R. V. Golovanov ◽  
D. V. Vorotnev

We present the monocamera approach of static hand gestures recognition based on skeletonization. The problem of creating skeleton of the human’s hand, as well as body, became solvable a few years ago after inventing so called convolutional pose machines – the novel architecture of artificial neural network. Our solution uses such kind of pretrained convolutional artificial network for extracting hand joints keypoints with further skeleton reconstruction. In this work we also propose special skeleton descriptor with proving its stability and distinguishability in terms of classification. We considered a few widespread machine learning algorithms to build and verify different classifiers. The quality of the classifier’s recognition is estimated using the wellknown Accuracy metric, which identified that classical SVM (Support Vector Machines) with radial basis kernel gives the best results. The testing of the whole system was conducted using public databases containing about 3000 of test images for more than 10 types of gestures. The results of a comparative analysis of the proposed system with existing approaches are demonstrated. It is shown that our gesture recognition system provides better quality in comparison with existing solutions. The performance of the proposed system was estimated for two configurations of standard personal computer: with CPU (Central Processing Unit) only and with GPU (Graphics Processing Unit) in addition where the latest one provides realtime processing with up to 60 frames per second. Thus we demonstrate that the proposed approach can find an application in the practice.


Micromachines ◽  
2021 ◽  
Vol 12 (7) ◽  
pp. 838
Author(s):  
Dong Hyun Hwang ◽  
Chang Yeop Han ◽  
Hyun Woo Oh ◽  
Seung Eun Lee

Artificial intelligence algorithms need an external computing device such as a graphics processing unit (GPU) due to computational complexity. For running artificial intelligence algorithms in an embedded device, many studies proposed light-weighted artificial intelligence algorithms and artificial intelligence accelerators. In this paper, we propose the ASimOV framework, which optimizes artificial intelligence algorithms and generates Verilog hardware description language (HDL) code for executing intelligence algorithms in field programmable gate array (FPGA). To verify ASimOV, we explore the performance space of k-NN algorithms and generate Verilog HDL code to demonstrate the k-NN accelerator in FPGA. Our contribution is to provide the artificial intelligence algorithm as an end-to-end pipeline and ensure that it is optimized to a specific dataset through simulation, and an artificial intelligence accelerator is generated in the end.


Complexity ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Tao Fan

This paper studies the traditional target classification and recognition algorithm based on Histogram of Oriented Gradients (HOG) feature extraction and Support Vector Machine (SVM) classification and applies this algorithm to distributed artificial intelligence image recognition. Due to the huge number of images, the general detection speed cannot meet the requirements. We have improved the HOG feature extraction algorithm. Using principal component analysis (PCA) to perform dimensionality reduction operations on HOG features and doing distributed artificial intelligence image recognition experiments, the results show that the image detection efficiency is slightly improved, and the detection speed is also improved. This article analyzes the reason for these changes because PCA mainly uses the useful feature information in HOG features. The parallelization processing of HOG features on graphics processing unit (GPU) is studied. GPU is used for high parallel and high-density calculations, and the calculation of HOG features is very complicated. Using GPU for parallelization of HOG features can make the calculation speed of HOG features improved. We use image experiments for the parallelized HOG feature algorithm. Experimental simulations show that the speed of distributed artificial intelligence image recognition is greatly improved. By analyzing the existing digital image recognition methods, an improved BP neural network algorithm is proposed. Under the premise of ensuring accuracy, the recognition speed of digital images is accelerated, the time required for recognition is reduced, real-time performance is guaranteed, and the effectiveness of the algorithm is verified.


2020 ◽  
Vol 20 (1) ◽  
pp. 67-76
Author(s):  
Rahmadya Trias Handayanto ◽  
Herlawati Herlawati

For the first time, machine learning did the classical classification process using two classes (bi-class) such as class -1 and class +1, 0 and 1, or the form of categories such as true and false. Famous methods used are Artificial Neural Networks (ANN) and Support Vector Machine (SVM). The current development was a problem with more than two classes, known as multi-class classes. For SVM sometimes the plural classes are overcome by doing a gradual process like a decision tree (DT) method. Meanwhile, ANN has experienced rapid development and is currently being developed with a large number of layers with the new activation functions, i.e. the rectified linear units (ReLu), and the probabilistic-based activation, i.e. softmax, including its optimizer methods (adam, sgd, and others). Then the term changed to Deep Learning (DL). This study aimed to compare two well-known methods (DL and SVM) in classifying multiple classes. The number of DL layers was six with the neuron composition are 128, 64, 32, 8, 4, and 3, while SVM uses a radial kernel base function with gamma and c respectively 0.7 and 5. Besides, this study intends to compare the use of the Graphics Processing Unit (GPU) available on Google Interactive Notebook (Google Colab), an online Python language programming application. The results showed that DL accuracy outperformed SVM but required large computational resources, with the accuracy for DL and SVM are 99% and 98%, respectively. However, the use of the GPU can overcome these problems and is proven to increase the speed of the process as much as 47 times. Keywords: Artificial Neural Networks, Graphics Processing Unit, Google Interactive Notebook, Rectified Linear units, Support Vector Machine. Abstrak Di awal perkembangannya mesin pembelajaran melakukan proses klasikfikasi menggunakan dua kelas (bi-class) misalnya kelas -1 dan kelas +1, 0 dan 1, atau bentuk kategori seperti benar dan salah. Metode terkenal yang digunakan adalah Jaringan Syaraf Tiruan (JST) dan Support Vector Machine (SVM). Perkembangan selanjutnya adalah problem dengan kelas yang lebih dari dua kelas, dikenal dengan istilah kelas jamak (multi-class). Untuk SVM terkadang kelas jamak diatasi dengan melakukan proses berjenjang mirip pohon keputusan (decision tree). Sementara itu JST telah mengalami perkembangan yang pesat dan saat ini sudah dikembangkan dengan jumlah layer yang banyak disertai dengan fungsi-fungsi aktivasi terkini seperti rectified linear unit (ReLu), dan softmax yang berbasis probabilistik, termasuk juga metode-metode optimizernya (adam, sgd, dan lain-lain). Kemudian istilahnya berubah menjadi Deep Learning (DL). Penelitian ini mencoba membandingkan dua metode terkenal (DL dan SVM) dalam melakukan klasifikasi kelas jamak. Jumlah layer DL sebanyak enam dengan masing-masing neuron sebesar 128, 64, 32, 8, 4, dan 3, sementara SVM menggunakan kernel radial basis function dengan gamma dan c berturut-turut 0.7 dan 5. Selain itu penelitian ini bermaksud membandingkan penggunaan Graphics Processing Unit (GPU) yang tersedia di Google Interactive Notebook (Google Colab), sebuah aplikasi online pemrograman bahasa Python. Hasil penelitian menunjukan akurasi DL unggul tipis dibanding SVM namun memerlukan sumber daya komputasi yang besar masing-masing dengan akurasi 99% dan 98%. Namun penggunaan GPU mampu mengatasi permasalahan tersebut dan terbukti meningkatkan kecepatan proses sebanyak 47 kali. Kata kunci: Jaringan Syaraf Tiruan, Graphics Processing Unit, Google Interactive Notebook, Rectified Linear units, Support Vector Machine.


Sensors ◽  
2021 ◽  
Vol 21 (16) ◽  
pp. 5330
Author(s):  
Marcin Łukasz Kowalski ◽  
Norbert Pałka ◽  
Jarosław Młyńczak ◽  
Mateusz Karol ◽  
Elżbieta Czerwińska ◽  
...  

Smuggling of drugs and cigarettes in small inflatable boats across border rivers is a serious threat to the EU’s financial interests. Early detection of such threats is challenging due to difficult and changing environmental conditions. This study reports on the automatic detection of small inflatable boats and people in a rough wild terrain in the infrared thermal domain. Three acquisition campaigns were carried out during spring, summer, and fall under various weather conditions. Three deep learning algorithms, namely, YOLOv2, YOLOv3, and Faster R-CNN working with six different feature extraction neural networks were trained and evaluated in terms of performance and processing time. The best performance was achieved with Faster R-CNN with ResNet101, however, processing requires a long time and a powerful graphics processing unit.


2007 ◽  
Author(s):  
Fredrick H. Rothganger ◽  
Kurt W. Larson ◽  
Antonio Ignacio Gonzales ◽  
Daniel S. Myers

Sign in / Sign up

Export Citation Format

Share Document