A Novel Approach to Detect Note Onset Using Musical Wavelet

2014 ◽  
Vol 651-653 ◽  
pp. 2094-2098
Author(s):  
Wen Ming Gui ◽  
Yu Feng Chen ◽  
Rui Fan Liu

In this paper, a novel approach to detect note onset was proposed. At the signal transform stage of the note onset detection, a new musical signal decomposition method based on musical wavelet was bring forward according to the frequency structure of the musical note. At the feature extraction stage, the partial flux feature was proposed in light of the point that some of the partials must change at the time of the onsets. The experimental results indicted that the proposed approach was theoretically feasible and practically effective.

2021 ◽  
Vol 40 (1) ◽  
pp. 551-563
Author(s):  
Liqiong Lu ◽  
Dong Wu ◽  
Ziwei Tang ◽  
Yaohua Yi ◽  
Faliang Huang

This paper focuses on script identification in natural scene images. Traditional CNNs (Convolution Neural Networks) cannot solve this problem perfectly for two reasons: one is the arbitrary aspect ratios of scene images which bring much difficulty to traditional CNNs with a fixed size image as the input. And the other is that some scripts with minor differences are easily confused because they share a subset of characters with the same shapes. We propose a novel approach combing Score CNN, Attention CNN and patches. Attention CNN is utilized to determine whether a patch is a discriminative patch and calculate the contribution weight of the discriminative patch to script identification of the whole image. Score CNN uses a discriminative patch as input and predict the score of each script type. Firstly patches with the same size are extracted from the scene images. Secondly these patches are used as inputs to Score CNN and Attention CNN to train two patch-level classifiers. Finally, the results of multiple discriminative patches extracted from the same image via the above two classifiers are fused to obtain the script type of this image. Using patches with the same size as inputs to CNN can avoid the problems caused by arbitrary aspect ratios of scene images. The trained classifiers can mine discriminative patches to accurately identify some confusing scripts. The experimental results show the good performance of our approach on four public datasets.


2020 ◽  
Vol 13 (1) ◽  
pp. 71
Author(s):  
Zhiyong Xu ◽  
Weicun Zhang ◽  
Tianxiang Zhang ◽  
Jiangyun Li

Semantic segmentation is a significant method in remote sensing image (RSIs) processing and has been widely used in various applications. Conventional convolutional neural network (CNN)-based semantic segmentation methods are likely to lose the spatial information in the feature extraction stage and usually pay little attention to global context information. Moreover, the imbalance of category scale and uncertain boundary information meanwhile exists in RSIs, which also brings a challenging problem to the semantic segmentation task. To overcome these problems, a high-resolution context extraction network (HRCNet) based on a high-resolution network (HRNet) is proposed in this paper. In this approach, the HRNet structure is adopted to keep the spatial information. Moreover, the light-weight dual attention (LDA) module is designed to obtain global context information in the feature extraction stage and the feature enhancement feature pyramid (FEFP) structure is promoted and employed to fuse the contextual information of different scales. In addition, to achieve the boundary information, we design the boundary aware (BA) module combined with the boundary aware loss (BAloss) function. The experimental results evaluated on Potsdam and Vaihingen datasets show that the proposed approach can significantly improve the boundary and segmentation performance up to 92.0% and 92.3% on overall accuracy scores, respectively. As a consequence, it is envisaged that the proposed HRCNet model will be an advantage in remote sensing images segmentation.


Aerospace ◽  
2021 ◽  
Vol 8 (3) ◽  
pp. 79
Author(s):  
Carolyn J. Swinney ◽  
John C. Woods

Unmanned Aerial Vehicles (UAVs) undoubtedly pose many security challenges. We need only look to the December 2018 Gatwick Airport incident for an example of the disruption UAVs can cause. In total, 1000 flights were grounded for 36 h over the Christmas period which was estimated to cost over 50 million pounds. In this paper, we introduce a novel approach which considers UAV detection as an imagery classification problem. We consider signal representations Power Spectral Density (PSD); Spectrogram, Histogram and raw IQ constellation as graphical images presented to a deep Convolution Neural Network (CNN) ResNet50 for feature extraction. Pre-trained on ImageNet, transfer learning is utilised to mitigate the requirement for a large signal dataset. We evaluate performance through machine learning classifier Logistic Regression. Three popular UAVs are classified in different modes; switched on; hovering; flying; flying with video; and no UAV present, creating a total of 10 classes. Our results, validated with 5-fold cross validation and an independent dataset, show PSD representation to produce over 91% accuracy for 10 classifications. Our paper treats UAV detection as an imagery classification problem by presenting signal representations as images to a ResNet50, utilising the benefits of transfer learning and outperforming previous work in the field.


2016 ◽  
Vol 2016 ◽  
pp. 1-14 ◽  
Author(s):  
Jie Zhang ◽  
Xiaolong Zheng ◽  
Zhanyong Tang ◽  
Tianzhang Xing ◽  
Xiaojiang Chen ◽  
...  

Mobile sensing has become a new style of applications and most of the smart devices are equipped with varieties of sensors or functionalities to enhance sensing capabilities. Current sensing systems concentrate on how to enhance sensing capabilities; however, the sensors or functionalities may lead to the leakage of users’ privacy. In this paper, we present WiPass, a way to leverage the wireless hotspot functionality on the smart devices to snoop the unlock passwords/patterns without the support of additional hardware. The attacker can “see” your unlock passwords/patterns even one meter away. WiPass leverages the impacts of finger motions on the wireless signals during the unlocking period to analyze the passwords/patterns. To practically implement WiPass, we are facing the difficult feature extraction and complex unlock passwords matching, making the analysis of the finger motions challenging. To conquer the challenges, we use DCASW to extract feature and hierarchical DTW to do unlock passwords matching. Besides, the combination of amplitude and phase information is used to accurately recognize the passwords/patterns. We implement a prototype of WiPass and evaluate its performance under various environments. The experimental results show that WiPass achieves the detection accuracy of 85.6% and 74.7% for passwords/patterns detection in LOS and in NLOS scenarios, respectively.


2013 ◽  
Vol 2013 ◽  
pp. 1-14 ◽  
Author(s):  
Sambit Bakshi ◽  
Pankaj K. Sa ◽  
Banshidhar Majhi

A novel approach for selecting a rectangular template around periocular region optimally potential for human recognition is proposed. A comparatively larger template of periocular image than the optimal one can be slightly more potent for recognition, but the larger template heavily slows down the biometric system by making feature extraction computationally intensive and increasing the database size. A smaller template, on the contrary, cannot yield desirable recognition though the smaller template performs faster due to low computation for feature extraction. These two contradictory objectives (namely, (a) to minimize the size of periocular template and (b) to maximize the recognition through the template) are aimed to be optimized through the proposed research. This paper proposes four different approaches for dynamic optimal template selection from periocular region. The proposed methods are tested on publicly available unconstrained UBIRISv2 and FERET databases and satisfactory results have been achieved. Thus obtained template can be used for recognition of individuals in an organization and can be generalized to recognize every citizen of a nation.


2016 ◽  
Vol 09 (03) ◽  
pp. 1650043 ◽  
Author(s):  
Haolin Wu ◽  
Jie Yang ◽  
Haibiao Chen ◽  
Feng Pan

Preferentially etching either carbon or silica from silicon oxycarbide (SiOC) created a porous network as an inverse image of the removed phase. The porous structure was analyzed by gas adsorption, and the experimental results verified the nanodomain structure of SiOC. This work demonstrated a novel approach for analyzing materials containing nanocomposite structures.


A comb shaped microstrip antenna is designed by loading rectangular slots on the patch of the antenna. The antenna resonating at three different frequencies f1 = 5.35 GHz, f2 = 6.19 GHz and f3= 8.15 GHz. The designed antenna is simulated on High Frequency Structure Simulator software [HFSS] and the antenna is fabricated using substrate glass epoxy with dielectric constant 4.4 having dimension of 8x4x0.16 cms. The antenna shows good return loss, bandwidth and VSWR. Experimental results are observed using Vector Analyzer MS2037C/2.


2014 ◽  
Vol 2014 ◽  
pp. 1-8 ◽  
Author(s):  
Chun-Hui Wu ◽  
Chia-Wei Chen ◽  
Long-Sheng Kuo ◽  
Ping-Hei Chen

A novel approach was proposed to measure the hydraulic capacitance of a microfluidic membrane pump. Membrane deflection equations were modified from various studies to propose six theoretical equations to estimate the hydraulic capacitance of a microfluidic membrane pump. Thus, measuring the center deflection of the membrane allows the corresponding pressure and hydraulic capacitance of the pump to be determined. This study also investigated how membrane thickness affected the Young’s modulus of a polydimethylsiloxane (PDMS) membrane. Based on the experimental results, a linear correlation was proposed to estimate the hydraulic capacitance. The measured hydraulic capacitance data and the proposed equations in the linear and nonlinear regions qualitatively exhibited good agreement.


Sign in / Sign up

Export Citation Format

Share Document