scholarly journals Feature extraction with mel scale separation method on noise audio recordings

Author(s):  
Roy Rudolf Huizen ◽  
Florentina Tatrin Kurniati

This paper focuses on improving the accuracy of noise audio recordings. High-quality audio recording, extraction using the mel frequency cepstral coefficients (MFCC) method produces high accuracy. While the low-quality is because of noise, the accuracy is low. Improved accuracy by investigating the effect of bandwidth on the mel scale. The proposed improvement uses the mel scale separation methods into two frequency channels (MFCC dual-channel). For the comparison method using the mel scale bandwidth without separation (MFCC single-channel). Feature analysis using k-mean clustering. The data uses a noise variance of up to -16 dB. Testing on the MFCC single-channel method for -16 dB noise has an accuracy of 47.5%, while the MFCC dual-channel method has an accuracy better of 76.25%. The next test used adaptive noise-canceling (ANC) to reduce noise before extraction. The result is that the MFCC single-channel method has an accuracy of 82.5% and the MFCC dual-channel method has an accuracy better of 83.75%. High-quality audio recording testing for the MFCC single-channel method has an accuracy of 92.5% and the MFCC dual-channel method has an accuracy better of 97.5%. The test results show the effect of mel scale bandwidth to increase accuracy. The MFCC dual-channel method has higher accuracy.

2021 ◽  
Author(s):  
Peng Du ◽  
Laihui Luo ◽  
Yafei Hou ◽  
Weiping Li

Rational control of thermal quenching performance is crucial for achieving high quality luminescent thermometer. Herein, we report the Tb3+/Yb3+-coactivated Y2Mo3O12 (YMO) negative thermal expansion (NTE) microparticles with color-tunable emissions. Irradiated...


1963 ◽  
Vol 6 (3) ◽  
pp. 263-269 ◽  
Author(s):  
Richard G. Chappell ◽  
James F. Kavanagh ◽  
Stanley Zerlin

Normal hearing adults demonstrated approximately 20% better intelligibility scores for monosyllabic words presented binaurally (with a background of conversation) than to these words presented monaurally. The test materials were recorded on dual-channel tape through two head-mounted microphones. These microphones were directed toward each of three speakers who in turn produced the monosyllabic words while two simultaneous conversations were carried on by four other participants. Throughout the recording session the experimenters attempted to preserve as naturalistic a situation as possible. The 18 subjects with normal hearing listened through earphones to a single channel of this tape presented monaurally and to both channels delivered binaurally. The difference between the monaural and binaural intelligibility scores is discussed in terms of image-separation in space.


Author(s):  
Victor O. Adegboye ◽  
Jason H. Rife

Abstract Whilst extensive work has been done on fault detection in bearings using sound, very little has been accomplished with other machine components and machinery partly due to the scarcity of datasets. The recent release of the Malfunctioning Industrial Machine Investigation and Inspection (MIMII) dataset opens the opportunity for research into malfunctioning machines like pumps, fans, slide rails, and valves. In this paper, we compare common features from audio recordings to investigate which best support the classification of malfunctioning pumps. We evaluate our results using the Area Under the Curve (AUC) as a performance metric and determine that the log mel spectrum is a very useful feature, at least for this dataset, but that other features can enhance detection performance when ambient noise is present (improving AUC from 0.88 to 0.94 in one case). Also, we find that mel Frequency Cepstral Coefficients (MFCC) perform substantially poorer as features than a sampled mel spectrogram.


Author(s):  
L. N. Arbachakova ◽  

The paper compares the variants of the Shor heroic epic “Künnü körgen Kün Köök” (“Kün Köök that saw the Sun”) recorded in 1999 with an interval of two months in the narrator’s self-recording (written in January 1999) and in audio recording (recorded in March 1999 by L. Arbachakova from V. E. Tannagashev (1932–2007). The version in the audio recording was performed by the Kai narrator accompanied by komus in the performer’s apartment in Myski city. V. E. Tannagashev learned this epic from his teacher P. N. Amzorov. The small period between the recordings resulted in insignificant discrepancies in the versions that were complementary and hardly influenced the qualitative content of the legend. The Kai narrator’s memory did not let him down, with the plots almost coinciding and different epic formulas used only in some fragments of typical places, or there were some permutations or omissions of lines. Sometimes the narrator uses synonymous words, or there are repetitions and reservations. However, there are practically no such flaws in the self-recordings. The typical points used by the kaichi, sometimes expanded and colorful, sometimes compressed, probably depended on his mood, as well as on different ways of fixing the epic (in the kaichi’s selfrecording and audio recordings). Live performance is influenced by the mood, health of the narrator, and other factors. Self-recordings made by hand are the most time-consuming since they require physical effort, perseverance, attention. It is perhaps for this reason that the recording turned out to be more shortened.


Sensors ◽  
2020 ◽  
Vol 20 (4) ◽  
pp. 1230
Author(s):  
Lei Du ◽  
Qiao Sun ◽  
Jie Bai ◽  
Xiaolei Wang ◽  
Tianqi Xu

The 24 GHz continuous-wave (CW) Doppler radar sensor (DRS) is widely used for measuring the instantaneous speed of moving objects by using a non-contact approach, and has begun to be used in train-borne movable speed measurements in recent years in China because of its advanced performance. The architecture and working principle of train-borne DRSs with different structures including single-channel DRSs used for freight train speed measurements in railway freight dedicated lines and dual-channel DRSs used for speed measurements of high-speed and urban rail trains in railway passenger dedicated lines, are first introduced. Then, the disadvantages of two traditional speed calibration methods for train-borne DRS are described, and a new speed calibration method based on the Doppler shift signal simulation by imposing a signal modulation on the incident CW microwave signal is proposed. A 24 GHz CW radar target simulation system for a train-borne DRS was specifically realized to verify the proposed speed calibration method for a train-borne DRS, and traceability and performance evaluation on simulated speed were taken into account. The simulated speed range of the simulation system was up to (5~500) km/h when the simulated incident angle range was within the range of (45 ± 8)°, and the maximum permissible error (MPE) of the simulated speed was ±0.05 km/h. Finally, the calibration and uncertainty evaluation results of two typical train-borne dual-channel DRS samples validated the effectiveness and feasibility of the proposed speed calibration approach for a train-borne DRS with full range in the laboratory as well as in the field.


Energies ◽  
2019 ◽  
Vol 12 (13) ◽  
pp. 2489
Author(s):  
M. Korkosz ◽  
P. Bogusz ◽  
J. Prokop ◽  
B. Pakla ◽  
G. Podskarbi

This article presents the results of a comparative analysis of two electronically commutated brushless direct current machines intended for fault-tolerant drives. Two machines designed by the authors were compared: a 12/14 dual-channel brushless direct current motor (DCBLDCM) with permanent magnets and a 12/8 dual-channel switched reluctance motor (DCSRM). Information is provided here on the winding configuration, the parameters, and the power converters of both machines. We developed mathematical models of the DCBLDCM and DCSRM which accounted for the nonlinearity of their magnetization characteristics in dual-channel operation (DCO) and single-channel operation (SCO) modes. The static torque characteristics and flux characteristics of both machines were compared for operation in DCO and SCO modes. The waveforms of the current and the electromagnetic torque are presented for DCO and SCO operating conditions. For DCO mode, an analysis of the behavior of both machines under fault conditions (i.e., asymmetrical control, shorted coil, and open phase) was performed. The two designs were compared, and their strengths and weaknesses were indicated.


Sign in / Sign up

Export Citation Format

Share Document