Research on Environmental Sound Classification Algorithm Based on Multi-feature Fusion

Author(s):  
Ruixue Li ◽  
Bo Yin ◽  
Yongchao Cui ◽  
Zehua Du ◽  
Kexin Li
2018 ◽  
Vol 173 ◽  
pp. 03059
Author(s):  
Huimin Zhao ◽  
Xianglin Huang ◽  
Wei Liu ◽  
Lifang Yang

With deep great breakthroughs of deep learning in the field of computer vision, the field of audio recognition has gradually introduced deep learning methods and achieved excellent results. These results are mainly for speech and music recognition research, and there is very little research on environmental sound classification. In recent years, people have begun to expand the research object of deep learning to the environmental sound, and achieved certain results. In this paper, we use ESC-50 as our test set, based on the SoundNet network and EnvNet network to propose a feature fusion method[1]. After the features extracted by SoundNet and EnvNet were merged, they were classified using fusion features. Experimental results show that this method has better classification accuracy for the recognition of environmental sounds than using either of the two networks separately for classification.


2021 ◽  
Vol 263 (5) ◽  
pp. 1130-1141
Author(s):  
Kivanc Kitapci ◽  
Dogukan Ozdemir

One of the objectives of architectural design is to create multi-sensory environments. The users are under the influence of a wide variety and intense perceptual data flow when users experience a designed space. Architects and environmental designers should not ignore the sense of hearing, one of the most important of the five primitive senses that allow us to experience the physical environment within the framework of creative thinking from the first stage of the design process. Today, auditory analysis of spaces has been studied under architectural acoustics, soundscapes, multi-sensory interactions, and sense of place. However, the current sound design methods implemented in the film and video game industries and industrial design have not been used in architectural design practices. Sound design is the art and application of making soundtracks in various disciplines and it involves recognizing, acquiring, or developing of auditory components. This research aims to establish a holistic architectural sound design framework based on the previous sound classification and taxonomic models found in the literature. The proposed sound design framework will help the architects and environmental designers classify the sound elements in the built environment and provide holistic environmental sound design guidelines depending on the spaces' functions and context.


2010 ◽  
Author(s):  
Stavros Ntalampiras ◽  
Ilyas Potamitis ◽  
Nikos Fakotakis

Author(s):  
Jinfang Zeng ◽  
Youming Li ◽  
Yu Zhang ◽  
Da Chen

Environmental sound classification (ESC) is a challenging problem due to the complexity of sounds. To date, a variety of signal processing and machine learning techniques have been applied to ESC task, including matrix factorization, dictionary learning, wavelet filterbanks and deep neural networks. It is observed that features extracted from deeper networks tend to achieve higher performance than those extracted from shallow networks. However, in ESC task, only the deep convolutional neural networks (CNNs) which contain several layers are used and the residual networks are ignored, which lead to degradation in the performance. Meanwhile, a possible explanation for the limited exploration of CNNs and the difficulty to improve on simpler models is the relative scarcity of labeled data for ESC. In this paper, a residual network called EnvResNet for the ESC task is proposed. In addition, we propose to use audio data augmentation to overcome the problem of data scarcity. The experiments will be performed on the ESC-50 database. Combined with data augmentation, the proposed model outperforms baseline implementations relying on mel-frequency cepstral coefficients and achieves results comparable to other state-of-the-art approaches in terms of classification accuracy.


Sign in / Sign up

Export Citation Format

Share Document