Audio Analysis Applications for Music

Author(s):  
Simon Dixon

The last decade has seen a revolution in the use of digital audio: The CD, which one decade earlier had taken over the home audio market, is starting to be replaced by electronic media which are distributed over the Internet and stored on computers or portable devices in compressed formats. The need has arisen for software to manage and manipulate the gigabytes of data in these music collections, and with the continual increase in computer speed, memory and disk storage capacity, the development of many previously infeasible applications has become possible. This article provides a brief review of automatic analysis of digital audio recordings with musical content, a rapidly expanding research area which finds numerous applications. One application area is the field of music information retrieval, where content-based indexing, classification and retrieval of audio data are needed in order to manage multimedia databases and libraries, as well as being useful in music retailing and commercial information services. Another application area is music software for the home and studio, where automatic beat tracking and transcription of music are much desired goals. In systematic musicology, audio analysis algorithms are being used in the study of expressive interpretation of music. Other emerging applications which make use of audio analysis are music recommender systems, playlist generators, visualisation systems, and software for automatic synchronisation of audio with other media and/or devices. We illustrate recent developments with three case studies of systems which analyse specific aspects of music (Dixon, 2004). The first system is BeatRoot (Dixon, 2001a, 2001c), a beat tracking system that finds the temporal location of musical beats in an audio recording, analogous to the way that people tap their feet in time to music. The second system is JTranscriber, an interactive automatic transcription system based on (Dixon, 2000a, 2000b), which recognizes musical notes and converts them into MIDI format, displaying the audio data as a spectrogram with the MIDI data overlaid in piano roll notation, and allowing interactive monitoring and correction of the extracted MIDI data. The third system is the Performance Worm (Dixon, Goebl, & Widmer, 2002), a real-time system for visualisation of musical expression, which presents in real time a two dimensional animation of variations in tempo and loudness (Langner & Goebl, 2002, 2003). Space does not permit the description of the many other music content analysis applications, such as: audio fingerprinting, where recordings can be uniquely identified with a high degree of accuracy, even with poor sound quality and in noisy environments (Wang, 2003); music summarisation, where important parts of songs such as choruses are identified automatically; instrument identification, using machine learning techniques to classify sounds by their source instruments; and melody and bass line extraction, essential components of query-by-example systems, where music databases can be searched by singing or whistling a small part of the desired piece. At the end of the article, we discuss emerging and future trends and research opportunities in audio content analysis.

Author(s):  
Simon Dixon

Automatic analysis of digital audio with musical content is a difficult — but important — task for various applications in computer music, audio compression, and music information retrieval. This chapter contains a brief review of audio analysis as it relates to music, followed by three case studies of recently developed systems which analyse specific aspects of music. The first system is BeatRoot, a beat tracking system that finds the temporal location of musical beats in an audio recording, analogous to the way that people tap their feet in time to music. The second system is JTranscriber, an interactive automatic transcription system, which recognises musical notes and converts them into MIDI format allowing interactive monitoring and correction of the extracted MIDI data via a multimedia interface. The third system is the Performance Worm, a real time system for visualisation of musical expression, which presents in real time a two-dimensional animation of variations in tempo and loudness.


2010 ◽  
Vol 19 (07) ◽  
pp. 1399-1421
Author(s):  
MOJTABA MAHDAVI ◽  
SHADROKH SAMAVI ◽  
SORINA DUMITRESCU ◽  
FERESHTEH AALAMIFAR ◽  
PARISA ABEDIKHOOZANI

Data hiding in the LSB of audio signals is an appealing steganographic method. This is due to the large volume of real-time production and transmission of audio data which makes it difficult to store and analyze these signals. Hence, steganalysis of audio signals requires online operations. Most of the existing steganalysis methods work on stored media files. In this paper, we present a steganalysis technique that can detect the existence of embedded data in the least significant bits of natural audio samples. The algorithm is designed to be simple, accurate, and to be hardware implementable. Hence, hardware implementation is presented for the proposed algorithm. The proposed hardware analyzes the histogram of an incoming stream of audio signals by using a sliding window strategy without needing the storage of the signals. The algorithm is mathematically modeled to show its capability to accurately predict the amount of embedding in an incoming stream of audio signals. Audio files with different amounts of embedded data were used to test the algorithm and its hardware implementation. The experimental results prove the functionality and high accuracy of the proposed method.


2014 ◽  
Vol 25 (4) ◽  
pp. 279-287 ◽  
Author(s):  
Stefan Hey ◽  
Panagiota Anastasopoulou ◽  
André Bideaux ◽  
Wilhelm Stork

Ambulatory assessment of emotional states as well as psychophysiological, cognitive and behavioral reactions constitutes an approach, which is increasingly being used in psychological research. Due to new developments in the field of information and communication technologies and an improved application of mobile physiological sensors, various new systems have been introduced. Methods of experience sampling allow to assess dynamic changes of subjective evaluations in real time and new sensor technologies permit a measurement of physiological responses. In addition, new technologies facilitate the interactive assessment of subjective, physiological, and behavioral data in real-time. Here, we describe these recent developments from the perspective of engineering science and discuss potential applications in the field of neuropsychology.


1989 ◽  
Vol 32 (7) ◽  
pp. 862-871 ◽  
Author(s):  
Clement Yu ◽  
Wei Sun ◽  
Dina Bitton ◽  
Qi Yang ◽  
Richard Bruno ◽  
...  

Sensors ◽  
2021 ◽  
Vol 21 (2) ◽  
pp. 408
Author(s):  
Elicia L. S. Wong ◽  
Khuong Q. Vuong ◽  
Edith Chow

Nanozymes are advanced nanomaterials which mimic natural enzymes by exhibiting enzyme-like properties. As nanozymes offer better structural stability over their respective natural enzymes, they are ideal candidates for real-time and/or remote environmental pollutant monitoring and remediation. In this review, we classify nanozymes into four types depending on their enzyme-mimicking behaviour (active metal centre mimic, functional mimic, nanocomposite or 3D structural mimic) and offer mechanistic insights into the nature of their catalytic activity. Following this, we discuss the current environmental translation of nanozymes into a powerful sensing or remediation tool through inventive nano-architectural design of nanozymes and their transduction methodologies. Here, we focus on recent developments in nanozymes for the detection of heavy metal ions, pesticides and other organic pollutants, emphasising optical methods and a few electrochemical techniques. Strategies to remediate persistent organic pollutants such as pesticides, phenols, antibiotics and textile dyes are included. We conclude with a discussion on the practical deployment of these nanozymes in terms of their effectiveness, reusability, real-time in-field application, commercial production and regulatory considerations.


Author(s):  
Rhiannon Edge ◽  
Carolyn Mazariego ◽  
Zhicheng Li ◽  
Karen Canfell ◽  
Annie Miller ◽  
...  

Abstract Purpose This study aimed to explore the psychosocial impacts of the coronavirus disease (COVID-19) pandemic on cancer patients, survivors, and carers in Australia. Methods Using real-time insights from two Cancer Council NSW services—131120 Information and Support Line and Online Community (CCOC) forums—we assessed service demand trends, distress levels (using the distress thermometer), and content from 131120 calls and online posts between 01 December 2019 and 31 May 2020. Emergent themes were identified through an inductive conventional content analysis with 131120 call notes, followed by a deductive directed content analysis on CCOC posts. Results In total, 688 COVID-19-related 131120 calls (n = 496) and online posts (n = 192) were analysed. Service demand peaked in March 2020 and self-reported distress peaked in May 2020 at an average of 8/10 [Mean = 7.5; SD = 0.9]. Five themes emerged from the qualitative analysis: psychological distress and fear of virus susceptibility, practical issues, cancer service disruptions, information needs, and carer Issues. Conclusions The psychosocial impacts of COVID-19 on people affected by cancer are multifaceted and likely to have long-lasting consequences. Our findings drove the development of six recommendations across three domains of support, information, and access. Cancer patients, survivors, and carers already face stressful challenges dealing with a cancer diagnosis or survivorship. The added complexity of restrictions and uncertainty associated with the pandemic may compound this. It is important that healthcare providers are equipped to provide patient-centred care during and after this crisis. Our recommendations provide points of consideration to ensure care is tailored and patient oriented.


2006 ◽  
Vol 11 (2) ◽  
pp. 101-112 ◽  
Author(s):  
BRUNO BOSSIS

The musicologist is confronted with many situations during the analysis of electroacoustic music, whether on support media, mixed, or real-time. Musical genres and styles vary greatly, and the collection of electronic musical instruments has also proven to be very heterogeneous. The intrinsic characteristics of the electroacoustic parts and their scoring create serious limitations. Furthermore, many sources remain inaccessible or are already lost. Thus the preoccupation with documentary sources related to the acts of creation, interpretation, and technological context becomes more and more pressing. It is now essential to formulate a synthetic vision of this music, which has existed for half a century, and to pursue the search for invariants. This work must be based on a rigorous methodology that has yet to be developed. More generally speaking, the goal is to establish the terms and conditions of a systematic musicology of electroacoustics.


2018 ◽  
Vol 25 (4) ◽  
pp. 1135-1143 ◽  
Author(s):  
Faisal Khan ◽  
Suresh Narayanan ◽  
Roger Sersted ◽  
Nicholas Schwarz ◽  
Alec Sandy

Multi-speckle X-ray photon correlation spectroscopy (XPCS) is a powerful technique for characterizing the dynamic nature of complex materials over a range of time scales. XPCS has been successfully applied to study a wide range of systems. Recent developments in higher-frame-rate detectors, while aiding in the study of faster dynamical processes, creates large amounts of data that require parallel computational techniques to process in near real-time. Here, an implementation of the multi-tau and two-time autocorrelation algorithms using the Hadoop MapReduce framework for distributed computing is presented. The system scales well with regard to the increase in the data size, and has been serving the users of beamline 8-ID-I at the Advanced Photon Source for near real-time autocorrelations for the past five years.


Sign in / Sign up

Export Citation Format

Share Document