Optical Music Recognition Method Combining Multi-Scale Residual Convolutional Neural Network and Bi-Directional Simple Recurrent Units

Optical Music Recognition (OMR) adalah suatu cara untuk melakukan pengenalan pada notasi musik secara otomatis. Masalah utama dalam pendeteksian notasi musik adalah bagaimana sistem dapat mendeteksi sebuah notasi musik dan kemudian mengenali notasi musik tersebut. Notasi musik yang telah dikenali oleh mesin dapat dimanfaatkan untuk diproses kembali menjadi suara. Pada penelitian ini, proses segmentasi dilakukan untuk memotong setiap notasi. Untuk pengenalan notasi musik digunakan Convolutional Neural Network (CNN). Arsitektur CNN yang dipakai adalah kernel 3x3, jumlah layer pada feature learning sebanyak 3 convolutional layer dan 3 pooling layer, filter pada convolutional layer 64,128, 256 dan jumlah neuron pada hidden layer sebanyak 7168. Pengujian dilakukan dengan dua cara, yang pertama menguji performasi CNN menggunakan data notasi musik yang telah dipotong dan yang kedua adalah melakukan pengujian menggunakan sebaris notasi musik. Nilai akurasi yang didapatkan untuk pengenalan sebaris notasi musik tidak terlalu besar, yaitu 26,19%. Walaupun untuk proses segmentasi masih belum maksimal dalam memotong setiap notasi, namun metode CNN bekerja sangat baik untuk mengenali setiap notasi musik yang telah dipotong dengan benar. Hal ini ditunjukkan dari nilai akurasi yang mencapai 95,56%.

Download Full-text

Music note position recognition in optical music recognition using convolutional neural network

International Journal of Arts and Technology ◽

10.1504/ijart.2021.115764 ◽

2021 ◽

Vol 13 (1) ◽

pp. 45

Author(s):

N.A. Andrea ◽

N.A. Paoline ◽

Amalia Zahra

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Optical Music Recognition ◽

Music Recognition

Download Full-text

A novel multi-scale convolutional neural network for motor imagery classification

Biomedical Signal Processing and Control ◽

10.1016/j.bspc.2021.102747 ◽

2021 ◽

Vol 68 ◽

pp. 102747

Author(s):

Mouad Riyad ◽

Mohammed Khalil ◽

Abdellah Adib

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Motor Imagery ◽

Multi Scale

Download Full-text

Pixel-level Diabetic Retinopathy Lesion Detection Using Multi-scale Convolutional Neural Network

2021 IEEE 3rd Global Conference on Life Sciences and Technologies (LifeTech) ◽

10.1109/lifetech52111.2021.9391891 ◽

2021 ◽

Author(s):

Qi Li ◽

Chenglei Peng ◽

Yazhen Ma ◽

Sidan Du ◽

Bin Guo ◽

...

Keyword(s):

Neural Network ◽

Diabetic Retinopathy ◽

Convolutional Neural Network ◽

Lesion Detection ◽

Multi Scale

Download Full-text

Chicken Image Segmentation via Multi-Scale Attention-Based Deep Convolutional Neural Network

IEEE Access ◽

10.1109/access.2021.3074297 ◽

2021 ◽

pp. 1-1

Author(s):

Wei Li ◽

Yang Xiao ◽

Xibin Song ◽

Na Lv ◽

Xinbo Jiang ◽

...

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Deep Convolutional Neural Network ◽

Multi Scale

Download Full-text

Bayesian Multi-scale Convolutional Neural Network for Motif Occupancy Identification

2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) ◽

10.1109/bibm49941.2020.9313556 ◽

2020 ◽

Author(s):

Wei Li ◽

Qingqing Zhao ◽

Han Zhang ◽

Xiongwen Quan ◽

Jing Xu ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Multi Scale

Download Full-text

Multi-Scale Feature-Guided Stereoscopic Video Quality Assessment Based on 3d Convolutional Neural Network

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414231 ◽

2021 ◽

Author(s):

Yingjie Feng ◽

Sumei Li ◽

Yongli Chang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Quality Assessment ◽

Video Quality ◽

Video Quality Assessment ◽

Stereoscopic Video ◽

Scale Feature ◽

Multi Scale

Download Full-text

Attention-Based Multi-Scale Convolutional Neural Network (A+MCNN) for Multi-Class Classification in Road Images

Sensors ◽

10.3390/s21155137 ◽

2021 ◽

Vol 21 (15) ◽

pp. 5137

Author(s):

Elham Eslami ◽

Hae-Bum Yun

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Contextual Information ◽

Automated Classification ◽

Automated Recognition ◽

Pavement Distress ◽

Multi Scale ◽

Pavement Distresses ◽

Multi Class Classification ◽

Transportation Applications

Automated pavement distress recognition is a key step in smart infrastructure assessment. Advances in deep learning and computer vision have improved the automated recognition of pavement distresses in road surface images. This task remains challenging due to the high variation of defects in shapes and sizes, demanding a better incorporation of contextual information into deep networks. In this paper, we show that an attention-based multi-scale convolutional neural network (A+MCNN) improves the automated classification of common distress and non-distress objects in pavement images by (i) encoding contextual information through multi-scale input tiles and (ii) employing a mid-fusion approach with an attention module for heterogeneous image contexts from different input scales. A+MCNN is trained and tested with four distress classes (crack, crack seal, patch, pothole), five non-distress classes (joint, marker, manhole cover, curbing, shoulder), and two pavement classes (asphalt, concrete). A+MCNN is compared with four deep classifiers that are widely used in transportation applications and a generic CNN classifier (as the control model). The results show that A+MCNN consistently outperforms the baselines by 1∼26% on average in terms of the F-score. A comprehensive discussion is also presented regarding how these classifiers perform differently on different road objects, which has been rarely addressed in the existing literature.

Download Full-text