Isolated instrument transcription using a deep belief network

Learning Techniques ◽

Single Instrument

Automatic music transcription is a difficult task that has provoked extensive research on transcription systems that are predominantly general purpose, processing any number or type of instruments sounding simultaneously. This paper presents a polyphonic transcription system that is constrained to processing the output of a single instrument with an upper bound on polyphony. For example, a guitar has six strings and is limited to producing six notes simultaneously. The transcription system consists of a novel pitch estimation algorithm that uses a deep belief network and multi-label learning techniques to generate multiple pitch estimates for each audio analysis frame, such that the polyphony does not exceed that of the instrument. The implemented transcription system is evaluated on a compiled dataset of synthesized guitar recordings. Comparing these results to a prior single-instrument polyphonic transcription system that received exceptional results, this paper demonstrates the effectiveness of deep, multi-label learning for the task of polyphonic transcription.

Isolated guitar transcription using a deep belief network

PeerJ Computer Science ◽

10.7717/peerj-cs.109 ◽

2017 ◽

Vol 3 ◽

pp. e109 ◽

Cited By ~ 1

Author(s):

Gregory Burlet ◽

Abram Hindle

Keyword(s):

Audio Signal ◽

Estimation Algorithm ◽

General Purpose ◽

Deep Belief Network ◽

Music Notation ◽

Audio Recording ◽

Belief Network ◽

Music Transcription ◽

Transcription System ◽

Audio Output

Music transcription involves the transformation of an audio recording to common music notation, colloquially referred to as sheet music. Manually transcribing audio recordings is a difficult and time-consuming process, even for experienced musicians. In response, several algorithms have been proposed to automatically analyze and transcribe the notes sounding in an audio recording; however, these algorithms are often general-purpose, attempting to process any number of instruments producing any number of notes sounding simultaneously. This paper presents a polyphonic transcription algorithm that is constrained to processing the audio output of a single instrument, specifically an acoustic guitar. The transcription system consists of a novel note pitch estimation algorithm that uses a deep belief network and multi-label learning techniques to generate multiple pitch estimates for each analysis frame of the input audio signal. Using a compiled dataset of synthesized guitar recordings for evaluation, the algorithm described in this work results in an 11% increase in the f-measure of note transcriptions relative to Zhou et al.’s (2009) transcription algorithm in the literature. This paper demonstrates the effectiveness of deep, multi-label learning for the task of polyphonic transcription.

A Shift-Invariant Latent Variable Model for Automatic Music Transcription

Computer Music Journal ◽

10.1162/comj_a_00146 ◽

2012 ◽

Vol 36 (4) ◽

pp. 81-94 ◽

Cited By ~ 36

Author(s):

Emmanouil Benetos ◽

Simon Dixon

Keyword(s):

Latent Variable ◽

Markov Models ◽

Variable Model ◽

Data Set ◽

Music Transcription ◽

Transcription System ◽

Error Metrics ◽

Frequency Modulations ◽

Time Varying Pitch

In this work, a probabilistic model for multiple-instrument automatic music transcription is proposed. The model extends the shift-invariant probabilistic latent component analysis method, which is used for spectrogram factorization. Proposed extensions support the use of multiple spectral templates per pitch and per instrument source, as well as a time-varying pitch contribution for each source. Thus, this method can effectively be used for multiple-instrument automatic transcription. In addition, the shift-invariant aspect of the method can be exploited for detecting tuning changes and frequency modulations, as well as for visualizing pitch content. For note tracking and smoothing, pitch-wise hidden Markov models are used. For training, pitch templates from eight orchestral instruments were extracted, covering their complete note range. The transcription system was tested on multiple-instrument polyphonic recordings from the RWC database, a Disklavier data set, and the MIREX 2007 multi-F0 data set. Results demonstrate that the proposed method outperforms leading approaches from the transcription literature, using several error metrics.

Deep Learning Innovations and Their Convergence With Big Data - Advances in Data Mining and Database Management ◽

Classifying Images of Drought-Affected Area Using Deep Belief Network, kNN, and Random Forest Learning Techniques

10.4018/978-1-5225-3015-2.ch006 ◽

2018 ◽

pp. 102-119 ◽

Cited By ~ 1

Author(s):

Sanjiban Sekhar Roy ◽

Pulkit Kulshrestha ◽

Pijush Samui

Keyword(s):

Random Forest ◽

Performance Metrics ◽

Deep Belief Network ◽

Machine Learning Algorithms ◽

Nearest Neighbour ◽

Data Set ◽

Belief Network ◽

Learning Techniques ◽

Severe Shortage

Drought is a condition of land in which the ground water faces a severe shortage. This condition affects the survival of plants and animals. Drought can impact ecosystem and agricultural productivity, severely. Hence, the economy also gets affected by this situation. This paper proposes Deep Belief Network (DBN) learning technique, which is one of the state of the art machine learning algorithms. This proposed work uses DBN, for classification of drought and non-drought images. Also, k nearest neighbour (kNN) and random forest learning methods have been proposed for the classification of the same drought images. The performance of the Deep Belief Network(DBN) has been compared with k nearest neighbour (kNN) and random forest. The data set has been split into 80:20, 70:30 and 60:40 as train and test. Finally, the effectiveness of the three proposed models have been measured by various performance metrics.

Development of improved automatic music transcription system for the arabian flute (NAY)

Eighth International Multi-Conference on Systems, Signals & Devices ◽

10.1109/ssd.2011.5993561 ◽

2011 ◽

Cited By ~ 3

Author(s):

F. Al-ghawanmeh ◽

I. Jafar ◽

M Al-taee ◽

M Al-ghawanmeh ◽

Z. Muhsin

Keyword(s):

Music Transcription ◽

Transcription System ◽

A Compositional Automatic Music Transcription System for Computersynthesized Music

International Journal of Advancements in Computing Technology ◽

10.4156/ijact.vol4.issue6.20 ◽

2012 ◽

Vol 4 (6) ◽

pp. 165-173

Author(s):

Guo Yi

Keyword(s):

Music Transcription ◽

Transcription System ◽

A Combined Mathematical Treatment for a Special Automatic Music Transcription System

Abstract and Applied Analysis ◽

10.1155/2012/302958 ◽

2012 ◽

Vol 2012 ◽

pp. 1-13

Author(s):

Yi Guo ◽

Jiyong Tang

Keyword(s):

Mathematical Problem ◽

Experimental Results ◽

Probability Analysis ◽

Matrix Analysis ◽

Mathematical Treatment ◽

Analysis Method ◽

Auditory Model ◽

Music Transcription ◽

Transcription System ◽

This paper presents a combined mathematical treatment for a special automatic music transcription system. This system is specially made for computer-synthesized music. The combined mathematical treatment includes harmonic selection, matrix analysis, and probability analysis method. The algorithm reduces dimension by PCA and selects candidates first by human auditory model and harmonic structures of notes. It changes the multiple-F0 estimation question into a mathematical problem and solves it in a mathematical way. It can be shown in this paper that the experimental results indicate that this method has very good recognition results.

Advances in Multimedia and Interactive Technologies - Trends in Music Information Seeking, Behavior, and Retrieval for Creativity ◽

Design of an Automatic Music Transcription System for the Traditional Repertoire of the Marovany Zither from Madagascar

10.4018/978-1-5225-0270-8.ch010 ◽

2016 ◽

pp. 205-227

Author(s):

Dorian Cazau ◽

Marc Chemillier ◽

Olivier Adam

Keyword(s):

Sensory System ◽

Audio Signal ◽

Music Transcription ◽

Transcription System ◽

String Instrument ◽

Sensor Signals ◽

Machine Improvisation ◽

Original Approach ◽

Very High

This chapter presents an original approach for the development of an automatic music transcription system of a Malagasy traditional plucked string instrument, called marovany zither. Our approach is based on a technology of multichannel capturing sensory system, which allows breaking down a complex polyphonic audio signal into a sum of monophonic sensor signals. A very high precision in transcription is obtained, i.e. & gt; 95% on the average note-based F-measure metric. A second part of this chapter consists in using these transcripts in the human-machine improvisation system ImproteK. Details of an exploratory working session with a local Malagasy musician are reported and discussed.

Automatic Music Transcription System Using SIDE

The KIPS Transactions PartB ◽

10.3745/kipstb.2009.16-b.2.141 ◽

2009 ◽

Vol 16B (2) ◽

pp. 141-150 ◽

Cited By ~ 1

Author(s):

A-Young Hyoung ◽

Joon-Whoan Lee

Keyword(s):

Music Transcription ◽

Transcription System ◽

Timbre Comparison in Note Tracking from Onset, Frames and Pitch Estimation

Jornada de Jóvenes Investigadores del I3A ◽

10.26754/jjii3a.4872 ◽

2020 ◽

Vol 8 ◽

Author(s):

Carlos Hernández Oliván ◽

Ignacio Zay Pinilla ◽

José Ramón Beltrán Blázquez

Keyword(s):

Information Retrieval ◽

Music Information Retrieval ◽

Tracking Algorithm ◽

Pitch Detection ◽

Critical Problem ◽

Pitch Estimation ◽

Music Transcription ◽

Music Information

Note Tracking (NT) is a subtask of Automatic Music Transcription (AMT) which is a critical problem in the field of Music Information Retrieval (MIR). The aim of this work is to compare the performance of two models, one for onsets and frames prediction and another one with pitch detection and a note tracking algorithm in order to study the behaviour of different timbres and families of instruments in note tracking subtasks.