Convolutional Neural Network untuk Pengenalan Citra Notasi Musik

Optical Music Recognition (OMR) adalah suatu cara untuk melakukan pengenalan pada notasi musik secara otomatis. Masalah utama dalam pendeteksian notasi musik adalah bagaimana sistem dapat mendeteksi sebuah notasi musik dan kemudian mengenali notasi musik tersebut. Notasi musik yang telah dikenali oleh mesin dapat dimanfaatkan untuk diproses kembali menjadi suara. Pada penelitian ini, proses segmentasi dilakukan untuk memotong setiap notasi. Untuk pengenalan notasi musik digunakan Convolutional Neural Network (CNN). Arsitektur CNN yang dipakai adalah kernel 3x3, jumlah layer pada feature learning sebanyak 3 convolutional layer dan 3 pooling layer, filter pada convolutional layer 64,128, 256 dan jumlah neuron pada hidden layer sebanyak 7168. Pengujian dilakukan dengan dua cara, yang pertama menguji performasi CNN menggunakan data notasi musik yang telah dipotong dan yang kedua adalah melakukan pengujian menggunakan sebaris notasi musik. Nilai akurasi yang didapatkan untuk pengenalan sebaris notasi musik tidak terlalu besar, yaitu 26,19%. Walaupun untuk proses segmentasi masih belum maksimal dalam memotong setiap notasi, namun metode CNN bekerja sangat baik untuk mengenali setiap notasi musik yang telah dipotong dengan benar. Hal ini ditunjukkan dari nilai akurasi yang mencapai 95,56%.

Download Full-text

Music Note Position Recognition in Optical Music Recognition using Convolutional Neural Network

International Journal of Arts and Technology ◽

10.1504/ijart.2021.10035633 ◽

2021 ◽

Vol 13 (1) ◽

pp. 1

Author(s):

Andrea Andrea ◽

Paoline Paoline ◽

Amalia Zahra

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Optical Music Recognition ◽

Music Recognition

Download Full-text

Optical Music Recognition Method Combining Multi-Scale Residual Convolutional Neural Network and Bi-Directional Simple Recurrent Units

Laser & Optoelectronics Progress ◽

10.3788/lop57.081006 ◽

2020 ◽

Vol 57 (8) ◽

pp. 081006

Author(s):

吴琼 Wu Qiong ◽

李锵 Li Qiang ◽

关欣 Guan Xin

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Recognition Method ◽

Optical Music Recognition ◽

Multi Scale ◽

Music Recognition

Download Full-text

Camera-Based Optical Music Recognition Using a Convolutional Neural Network

2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) ◽

10.1109/icdar.2017.261 ◽

2017 ◽

Cited By ~ 2

Author(s):

Adria Rico Blanes ◽

Alicia Fornes Bisquerra

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Optical Music Recognition ◽

Music Recognition

Download Full-text

Music note position recognition in optical music recognition using convolutional neural network

International Journal of Arts and Technology ◽

10.1504/ijart.2021.115764 ◽

2021 ◽

Vol 13 (1) ◽

pp. 45

Author(s):

N.A. Andrea ◽

N.A. Paoline ◽

Amalia Zahra

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Optical Music Recognition ◽

Music Recognition

Download Full-text

Mixup of Feature Maps in a Hidden Layer for Training of Convolutional Neural Network

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-030-04179-3_56 ◽

2018 ◽

pp. 635-644

Author(s):

Hideki Oki ◽

Takio Kurita

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Feature Maps ◽

Hidden Layer

Download Full-text

Deep Feature Learning for Disease Risk Assessment Based on Convolutional Neural Network With Intra-Layer Recurrent Connection by Using Hospital Big Data

IEEE Access ◽

10.1109/access.2018.2879158 ◽

2018 ◽

Vol 6 ◽

pp. 67927-67939 ◽

Cited By ~ 12

Author(s):

Mohd Usama ◽

Belal Ahmad ◽

Jiafu Wan ◽

M. Shamim Hossain ◽

Mohammed F. Alhamid ◽

...

Keyword(s):

Neural Network ◽

Risk Assessment ◽

Big Data ◽

Convolutional Neural Network ◽

Disease Risk ◽

Feature Learning ◽

Recurrent Connection ◽

Deep Feature ◽

Deep Feature Learning

Download Full-text

Fpga Implementation of Precise Convolutional Neural Network for Extreme Learning Machine

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.h6501.069820 ◽

2020 ◽

Vol 9 (8) ◽

pp. 470-480

Keyword(s):

Neural Network ◽

Receptive Field ◽

Convolutional Neural Network ◽

Computation Time ◽

Field Approach ◽

Pixel Array ◽

Gradient Based ◽

Feed Forward Neural Networks ◽

Learning Machine ◽

Hidden Layer

Feed-forward neural networks can be trained based on a gradient-descent based backpropagation algorithm. But, these algorithms require more computation time. Extreme Learning Machines (ELM’s) are time-efficient, and they are less complicated than the conventional gradient-based algorithm. In previous years, an SRAM based convolutional neural network using a receptive – field Approach was proposed. This neural network was used as an encoder for the ELM algorithm and was implemented on FPGA. But, this neural network used an inaccurate 3-stage pipelined parallel adder. Hence, this neural network generates imprecise stimuli to the hidden layer neurons. This paper presents an implementation of precise convolutional neural network for encoding in the ELM algorithm based on the receptive - field approach at the hardware level. In the third stage of the pipelined parallel adder, instead of approximating the output by using one 2-input 15-bit adder, one 4-input 14-bit adder is used. Also, an additional weighted pixel array block is used. This weighted pixel array improves the accuracy of generating 128 weighted pixels. This neural network was simulated using ModelSim-Altera 10.1d and synthesized using Quartus II 13.0 sp1. This neural network is implemented on Cyclone V FPGA and used for pattern recognition applications. Although this design consumes slightly more hardware resources, this design is more accurate compared to previously existing encoders

Download Full-text

Applying of machine learning in the construction of a voice-controlled interface on the example of a music player

Journal of Computer Sciences Institute ◽

10.35784/jcsi.1324 ◽

2019 ◽

Vol 13 ◽

pp. 302-309

Author(s):

Jakub Basiakowski

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Feedforward Neural Network ◽

Hidden Layer ◽

Music Player ◽

The Impact

The following paper presents the results of research on the impact of machine learning in the construction of a voice-controlled interface. Two different models were used for the analysys: a feedforward neural network containing one hidden layer and a more complicated convolutional neural network. What is more, a comparison of the applied models was presented. This comparison was performed in terms of quality and the course of training.

Download Full-text