Video Moment Retrieval with Cross-Modal Neural Architecture Search

Author(s):  
Xun Yang ◽  
Shanshan Wang ◽  
Jian Dong ◽  
Jianfeng Dong ◽  
Meng Wang ◽  
...  
Keyword(s):  
1992 ◽  
Author(s):  
William Ross ◽  
Ennio Mingolla

Author(s):  
Hanna Mazzawi ◽  
Xavi Gonzalvo ◽  
Aleks Kracun ◽  
Prashant Sridhar ◽  
Niranjan Subrahmanya ◽  
...  

Author(s):  
Wei Jia ◽  
Wei Xia ◽  
Yang Zhao ◽  
Hai Min ◽  
Yan-Xiang Chen

AbstractPalmprint recognition and palm vein recognition are two emerging biometrics technologies. In the past two decades, many traditional methods have been proposed for palmprint recognition and palm vein recognition and have achieved impressive results. In recent years, in the field of artificial intelligence, deep learning has gradually become the mainstream recognition technology because of its excellent recognition performance. Some researchers have tried to use convolutional neural networks (CNNs) for palmprint recognition and palm vein recognition. However, the architectures of these CNNs have mostly been developed manually by human experts, which is a time-consuming and error-prone process. In order to overcome some shortcomings of manually designed CNN, neural architecture search (NAS) technology has become an important research direction of deep learning. The significance of NAS is to solve the deep learning model’s parameter adjustment problem, which is a cross-study combining optimization and machine learning. NAS technology represents the future development direction of deep learning. However, up to now, NAS technology has not been well studied for palmprint recognition and palm vein recognition. In this paper, in order to investigate the problem of NAS-based 2D and 3D palmprint recognition and palm vein recognition in-depth, we conduct a performance evaluation of twenty representative NAS methods on five 2D palmprint databases, two palm vein databases, and one 3D palmprint database. Experimental results show that some NAS methods can achieve promising recognition results. Remarkably, among different evaluated NAS methods, ProxylessNAS achieves the best recognition performance.


2021 ◽  
pp. 1-11
Author(s):  
Yaran Chen ◽  
Ruiyuan Gao ◽  
Fenggang Liu ◽  
Dongbin Zhao
Keyword(s):  

2021 ◽  
Vol 2 (1) ◽  
pp. 1-25
Author(s):  
Yongsen Ma ◽  
Sheheryar Arshad ◽  
Swetha Muniraju ◽  
Eric Torkildson ◽  
Enrico Rantala ◽  
...  

In recent years, Channel State Information (CSI) measured by WiFi is widely used for human activity recognition. In this article, we propose a deep learning design for location- and person-independent activity recognition with WiFi. The proposed design consists of three Deep Neural Networks (DNNs): a 2D Convolutional Neural Network (CNN) as the recognition algorithm, a 1D CNN as the state machine, and a reinforcement learning agent for neural architecture search. The recognition algorithm learns location- and person-independent features from different perspectives of CSI data. The state machine learns temporal dependency information from history classification results. The reinforcement learning agent optimizes the neural architecture of the recognition algorithm using a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). The proposed design is evaluated in a lab environment with different WiFi device locations, antenna orientations, sitting/standing/walking locations/orientations, and multiple persons. The proposed design has 97% average accuracy when testing devices and persons are not seen during training. The proposed design is also evaluated by two public datasets with accuracy of 80% and 83%. The proposed design needs very little human efforts for ground truth labeling, feature engineering, signal processing, and tuning of learning parameters and hyperparameters.


Author(s):  
Yuhui Xu ◽  
Lingxi Xie ◽  
Wenrui Dai ◽  
Xiaopeng Zhang ◽  
Xin Chen ◽  
...  
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document