Rescaling Egocentric Vision: Collection, Pipeline and Challenges for EPIC-KITCHENS-100

International Journal of Computer Vision ◽

10.1007/s11263-021-01531-2 ◽

2021 ◽

Author(s):

Dima Damen ◽

Hazel Doughty ◽

Giovanni Maria Farinella ◽

Antonino Furnari ◽

Evangelos Kazakos ◽

...

Keyword(s):

Action Recognition ◽

Domain Adaptation ◽

Weak Supervision ◽

Action Detection ◽

Previous Version ◽

Fine Grained ◽

Unsupervised Domain Adaptation ◽

Egocentric Vision ◽

New Challenges

AbstractThis paper introduces the pipeline to extend the largest dataset in egocentric vision, EPIC-KITCHENS. The effort culminates in EPIC-KITCHENS-100, a collection of 100 hours, 20M frames, 90K actions in 700 variable-length videos, capturing long-term unscripted activities in 45 environments, using head-mounted cameras. Compared to its previous version (Damen in Scaling egocentric vision: ECCV, 2018), EPIC-KITCHENS-100 has been annotated using a novel pipeline that allows denser (54% more actions per minute) and more complete annotations of fine-grained actions (+128% more action segments). This collection enables new challenges such as action detection and evaluating the “test of time”—i.e. whether models trained on data collected in 2018 can generalise to new footage collected two years later. The dataset is aligned with 6 challenges: action recognition (full and weak supervision), action detection, action anticipation, cross-modal retrieval (from captions), as well as unsupervised domain adaptation for action recognition. For each challenge, we define the task, provide baselines and evaluation metrics.

Download Full-text

Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr42600.2020.00020 ◽

2020 ◽

Cited By ~ 3

Author(s):

Jonathan Munro ◽

Dima Damen

Keyword(s):

Action Recognition ◽

Domain Adaptation ◽

Fine Grained

Download Full-text

Exploiting Local Feature Patterns for Unsupervised Domain Adaptation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33015401 ◽

2019 ◽

Vol 33 ◽

pp. 5401-5408 ◽

Cited By ~ 6

Author(s):

Jun Wen ◽

Risheng Liu ◽

Nenggan Zheng ◽

Qian Zheng ◽

Zhefeng Gong ◽

...

Keyword(s):

Negative Transfer ◽

Domain Adaptation ◽

Local Feature ◽

Fine Grained ◽

Unsupervised Domain Adaptation ◽

Distribution Matching ◽

Benchmark Datasets ◽

Invariant Representations ◽

Multi Mode ◽

Feature Alignment

Unsupervised domain adaptation methods aim to alleviate performance degradation caused by domain-shift by learning domain-invariant representations. Existing deep domain adaptation methods focus on holistic feature alignment by matching source and target holistic feature distributions, without considering local features and their multi-mode statistics. We show that the learned local feature patterns are more generic and transferable and a further local feature distribution matching enables fine-grained feature alignment. In this paper, we present a method for learning domain-invariant local feature patterns and jointly aligning holistic and local feature statistics. Comparisons to the state-of-the-art unsupervised domain adaptation methods on two popular benchmark datasets demonstrate the superiority of our approach and its effectiveness on alleviating negative transfer.

Download Full-text

Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW) ◽

10.1109/iccvw.2019.00461 ◽

2019 ◽

Author(s):

Jonathan Munro ◽

Dima Damen

Keyword(s):

Action Recognition ◽

Domain Adaptation ◽

Fine Grained

Download Full-text

A Novel Multiple-View Adversarial Learning Network for Unsupervised Domain Adaptation Action Recognition

IEEE Transactions on Cybernetics ◽

10.1109/tcyb.2021.3105637 ◽

2021 ◽

pp. 1-15

Author(s):

Zan Gao ◽

Yibo Zhao ◽

Hua Zhang ◽

Da Chen ◽

An-An Liu ◽

...

Keyword(s):

Action Recognition ◽

Domain Adaptation ◽

Adversarial Learning ◽

Learning Network ◽

Unsupervised Domain Adaptation ◽

Multiple View ◽

Adaptation Action

Download Full-text

Structure-Aware Feature Fusion for Unsupervised Domain Adaptation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6629 ◽

2020 ◽

Vol 34 (07) ◽

pp. 10567-10574

Author(s):

Qingchao Chen ◽

Yang Liu

Keyword(s):

Spatial Structure ◽

Feature Fusion ◽

Domain Adaptation ◽

Structural Information ◽

Feature Representation ◽

Local Feature ◽

Global Feature ◽

Feature Maps ◽

Fine Grained ◽

Unsupervised Domain Adaptation

Unsupervised domain Adaptation (UDA) aims to learn and transfer generalized features from a labelled source domain to a target domain without any annotations. Existing methods only aligning high-level representation but without exploiting the complex multi-class structure and local spatial structure. This is problematic as 1) the model is prone to negative transfer when the features from different classes are misaligned; 2) missing the local spatial structure poses a major obstacle in performing the fine-grained feature alignment. In this paper, we integrate the valuable information conveyed in classifier prediction and local feature maps into global feature representation and then perform a single mini-max game to make it domain invariant. In this way, the domain-invariant feature not only describes the holistic representation of the original image but also preserves mode-structure and fine-grained spatial structural information. The feature integration is achieved by estimating and maximizing the mutual information (MI) among the global feature, local feature and classifier prediction simultaneously. As the MI is hard to measure directly in high-dimension spaces, we adopt a new objective function that implicitly maximizes the MI via an effective sampling strategy and a discriminator design. Our STructure-Aware Feature Fusion (STAFF) network achieves the state-of-the-art performances in various UDA datasets.

Download Full-text

Importance-weighted conditional adversarial network for unsupervised domain adaptation

Expert Systems with Applications ◽

10.1016/j.eswa.2020.113404 ◽

2020 ◽

Vol 155 ◽

pp. 113404 ◽

Cited By ~ 1

Author(s):

Peng Liu ◽

Ting Xiao ◽

Cangning Fan ◽

Wei Zhao ◽

Xianglong Tang ◽

...

Keyword(s):

Domain Adaptation ◽

Unsupervised Domain Adaptation ◽

Adversarial Network

Download Full-text

Domain randomization-enhanced deep learning models for bird detection

Scientific Reports ◽

10.1038/s41598-020-80101-x ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Xin Mao ◽

Jun Kang Chow ◽

Pin Siang Tan ◽

Kuan-fu Liu ◽

Jimmy Wu ◽

...

Keyword(s):

Deep Learning ◽

Continuous Monitoring ◽

Bird Species ◽

Training Data ◽

Learning Models ◽

Fine Grained ◽

Bird Detection ◽

Relationship Of ◽

The Relationship

AbstractAutomatic bird detection in ornithological analyses is limited by the accuracy of existing models, due to the lack of training data and the difficulties in extracting the fine-grained features required to distinguish bird species. Here we apply the domain randomization strategy to enhance the accuracy of the deep learning models in bird detection. Trained with virtual birds of sufficient variations in different environments, the model tends to focus on the fine-grained features of birds and achieves higher accuracies. Based on the 100 terabytes of 2-month continuous monitoring data of egrets, our results cover the findings using conventional manual observations, e.g., vertical stratification of egrets according to body size, and also open up opportunities of long-term bird surveys requiring intensive monitoring that is impractical using conventional methods, e.g., the weather influences on egrets, and the relationship of the migration schedules between the great egrets and little egrets.

Download Full-text