Effective multi-shot person re-identification through representative frames selection and temporal feature pooling

2019 ◽  
Vol 78 (23) ◽  
pp. 33939-33967 ◽  
Author(s):  
Thuy-Binh Nguyen ◽  
Thi-Lan Le ◽  
Louis Devillaine ◽  
Thi Thanh Thuy Pham ◽  
Nam Pham Ngoc
2011 ◽  
Vol 38 (9) ◽  
pp. 866-871 ◽  
Author(s):  
Zhi-Hua HUANG ◽  
Ming-Hong LI ◽  
Yuan-Ye MA ◽  
Chang-Le ZHOU

Author(s):  
Zhiwen Xiao ◽  
Xin Xu ◽  
Huanlai Xing ◽  
Shouxi Luo ◽  
Penglin Dai ◽  
...  

Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1579 ◽  
Author(s):  
Kyoung Ju Noh ◽  
Chi Yoon Jeong ◽  
Jiyoun Lim ◽  
Seungeun Chung ◽  
Gague Kim ◽  
...  

Speech emotion recognition (SER) is a natural method of recognizing individual emotions in everyday life. To distribute SER models to real-world applications, some key challenges must be overcome, such as the lack of datasets tagged with emotion labels and the weak generalization of the SER model for an unseen target domain. This study proposes a multi-path and group-loss-based network (MPGLN) for SER to support multi-domain adaptation. The proposed model includes a bidirectional long short-term memory-based temporal feature generator and a transferred feature extractor from the pre-trained VGG-like audio classification model (VGGish), and it learns simultaneously based on multiple losses according to the association of emotion labels in the discrete and dimensional models. For the evaluation of the MPGLN SER as applied to multi-cultural domain datasets, the Korean Emotional Speech Database (KESD), including KESDy18 and KESDy19, is constructed, and the English-speaking Interactive Emotional Dyadic Motion Capture database (IEMOCAP) is used. The evaluation of multi-domain adaptation and domain generalization showed 3.7% and 3.5% improvements, respectively, of the F1 score when comparing the performance of MPGLN SER with a baseline SER model that uses a temporal feature generator. We show that the MPGLN SER efficiently supports multi-domain adaptation and reinforces model generalization.


2021 ◽  
Vol 14 (2) ◽  
pp. 239-251
Author(s):  
Hualei Zhang ◽  
Mohammad Asif Ikbal

PurposeIn response to these shortcomings, this paper proposes a dynamic obstacle detection and tracking method based on multi-feature fusion and a dynamic obstacle recognition method based on spatio-temporal feature vectors.Design/methodology/approachThe existing dynamic obstacle detection and tracking methods based on geometric features have a high false detection rate. The recognition methods based on the geometric features and motion status of dynamic obstacles are greatly affected by distance and scanning angle, and cannot meet the requirements of real traffic scene applications.FindingsFirst, based on the geometric features of dynamic obstacles, the obstacles are considered The echo pulse width feature is used to improve the accuracy of obstacle detection and tracking; second, the space-time feature vector is constructed based on the time dimension and space dimension information of the obstacle, and then the support vector machine method is used to realize the recognition of dynamic obstacles to improve the obstacle The accuracy of object recognition. Finally, the accuracy and effectiveness of the proposed method are verified by real vehicle tests.Originality/valueThe paper proposes a dynamic obstacle detection and tracking method based on multi-feature fusion and a dynamic obstacle recognition method based on spatio-temporal feature vectors. The accuracy and effectiveness of the proposed method are verified by real vehicle tests.


Sign in / Sign up

Export Citation Format

Share Document