Automatic Music Transcription System Using SIDE

In this work, a probabilistic model for multiple-instrument automatic music transcription is proposed. The model extends the shift-invariant probabilistic latent component analysis method, which is used for spectrogram factorization. Proposed extensions support the use of multiple spectral templates per pitch and per instrument source, as well as a time-varying pitch contribution for each source. Thus, this method can effectively be used for multiple-instrument automatic transcription. In addition, the shift-invariant aspect of the method can be exploited for detecting tuning changes and frequency modulations, as well as for visualizing pitch content. For note tracking and smoothing, pitch-wise hidden Markov models are used. For training, pitch templates from eight orchestral instruments were extracted, covering their complete note range. The transcription system was tested on multiple-instrument polyphonic recordings from the RWC database, a Disklavier data set, and the MIREX 2007 multi-F0 data set. Results demonstrate that the proposed method outperforms leading approaches from the transcription literature, using several error metrics.

Download Full-text

Development of improved automatic music transcription system for the arabian flute (NAY)

Eighth International Multi-Conference on Systems, Signals & Devices ◽

10.1109/ssd.2011.5993561 ◽

2011 ◽

Cited By ~ 3

Author(s):

F. Al-ghawanmeh ◽

I. Jafar ◽

M Al-taee ◽

M Al-ghawanmeh ◽

Z. Muhsin

Keyword(s):

Music Transcription ◽

Transcription System ◽

Automatic Music Transcription

Download Full-text

A Compositional Automatic Music Transcription System for Computersynthesized Music

International Journal of Advancements in Computing Technology ◽

10.4156/ijact.vol4.issue6.20 ◽

2012 ◽

Vol 4 (6) ◽

pp. 165-173

Author(s):

Guo Yi

Keyword(s):

Music Transcription ◽

Transcription System ◽

Automatic Music Transcription

Download Full-text

A Combined Mathematical Treatment for a Special Automatic Music Transcription System

Abstract and Applied Analysis ◽

10.1155/2012/302958 ◽

2012 ◽

Vol 2012 ◽

pp. 1-13

Author(s):

Yi Guo ◽

Jiyong Tang

Keyword(s):

Mathematical Problem ◽

Experimental Results ◽

Probability Analysis ◽

Matrix Analysis ◽

Mathematical Treatment ◽

Analysis Method ◽

Auditory Model ◽

Music Transcription ◽

Transcription System ◽

Automatic Music Transcription

This paper presents a combined mathematical treatment for a special automatic music transcription system. This system is specially made for computer-synthesized music. The combined mathematical treatment includes harmonic selection, matrix analysis, and probability analysis method. The algorithm reduces dimension by PCA and selects candidates first by human auditory model and harmonic structures of notes. It changes the multiple-F0 estimation question into a mathematical problem and solves it in a mathematical way. It can be shown in this paper that the experimental results indicate that this method has very good recognition results.

Download Full-text

Isolated instrument transcription using a deep belief network

10.7287/peerj.preprints.1193 ◽

2015 ◽

Author(s):

Gregory Burlet ◽

Abram Hindle

Keyword(s):

Estimation Algorithm ◽

General Purpose ◽

Deep Belief Network ◽

Pitch Estimation ◽

Belief Network ◽

Music Transcription ◽

Transcription System ◽

Automatic Music Transcription ◽

Learning Techniques ◽

Single Instrument

Automatic music transcription is a difficult task that has provoked extensive research on transcription systems that are predominantly general purpose, processing any number or type of instruments sounding simultaneously. This paper presents a polyphonic transcription system that is constrained to processing the output of a single instrument with an upper bound on polyphony. For example, a guitar has six strings and is limited to producing six notes simultaneously. The transcription system consists of a novel pitch estimation algorithm that uses a deep belief network and multi-label learning techniques to generate multiple pitch estimates for each audio analysis frame, such that the polyphony does not exceed that of the instrument. The implemented transcription system is evaluated on a compiled dataset of synthesized guitar recordings. Comparing these results to a prior single-instrument polyphonic transcription system that received exceptional results, this paper demonstrates the effectiveness of deep, multi-label learning for the task of polyphonic transcription.

Download Full-text

Isolated instrument transcription using a deep belief network

10.7287/peerj.preprints.1193v1 ◽

2015 ◽

Author(s):

Gregory Burlet ◽

Abram Hindle

Keyword(s):

Estimation Algorithm ◽

General Purpose ◽

Deep Belief Network ◽

Pitch Estimation ◽

Belief Network ◽

Music Transcription ◽

Transcription System ◽

Automatic Music Transcription ◽

Learning Techniques ◽

Single Instrument

Automatic music transcription is a difficult task that has provoked extensive research on transcription systems that are predominantly general purpose, processing any number or type of instruments sounding simultaneously. This paper presents a polyphonic transcription system that is constrained to processing the output of a single instrument with an upper bound on polyphony. For example, a guitar has six strings and is limited to producing six notes simultaneously. The transcription system consists of a novel pitch estimation algorithm that uses a deep belief network and multi-label learning techniques to generate multiple pitch estimates for each audio analysis frame, such that the polyphony does not exceed that of the instrument. The implemented transcription system is evaluated on a compiled dataset of synthesized guitar recordings. Comparing these results to a prior single-instrument polyphonic transcription system that received exceptional results, this paper demonstrates the effectiveness of deep, multi-label learning for the task of polyphonic transcription.

Download Full-text

Design of an Automatic Music Transcription System for the Traditional Repertoire of the Marovany Zither from Madagascar

Advances in Multimedia and Interactive Technologies - Trends in Music Information Seeking, Behavior, and Retrieval for Creativity ◽

10.4018/978-1-5225-0270-8.ch010 ◽

2016 ◽

pp. 205-227

Author(s):

Dorian Cazau ◽

Marc Chemillier ◽

Olivier Adam

Keyword(s):

Sensory System ◽

Audio Signal ◽

Music Transcription ◽

Transcription System ◽

Automatic Music Transcription ◽

String Instrument ◽

Sensor Signals ◽

Machine Improvisation ◽

Original Approach ◽

Very High

This chapter presents an original approach for the development of an automatic music transcription system of a Malagasy traditional plucked string instrument, called marovany zither. Our approach is based on a technology of multichannel capturing sensory system, which allows breaking down a complex polyphonic audio signal into a sum of monophonic sensor signals. A very high precision in transcription is obtained, i.e. & gt; 95% on the average note-based F-measure metric. A second part of this chapter consists in using these transcripts in the human-machine improvisation system ImproteK. Details of an exploratory working session with a local Malagasy musician are reported and discussed.

Download Full-text