Classes Matter: A Fine-Grained Adversarial Approach to Cross-Domain Semantic Segmentation

Semantic segmentation from very fine resolution (VFR) urban scene images plays a significant role in several application scenarios including autonomous driving, land cover classification, urban planning, etc. However, the tremendous details contained in the VFR image, especially the considerable variations in scale and appearance of objects, severely limit the potential of the existing deep learning approaches. Addressing such issues represents a promising research field in the remote sensing community, which paves the way for scene-level landscape pattern analysis and decision making. In this paper, we propose a Bilateral Awareness Network which contains a dependency path and a texture path to fully capture the long-range relationships and fine-grained details in VFR images. Specifically, the dependency path is conducted based on the ResT, a novel Transformer backbone with memory-efficient multi-head self-attention, while the texture path is built on the stacked convolution operation. In addition, using the linear attention mechanism, a feature aggregation module is designed to effectively fuse the dependency features and texture features. Extensive experiments conducted on the three large-scale urban scene image segmentation datasets, i.e., ISPRS Vaihingen dataset, ISPRS Potsdam dataset, and UAVid dataset, demonstrate the effectiveness of our BANet. Specifically, a 64.6% mIoU is achieved on the UAVid dataset.

Download Full-text

A New Algorithm for Sketch-Based Fashion Image Retrieval Based on Cross-Domain Transformation

Wireless Communications and Mobile Computing ◽

10.1155/2021/5577735 ◽

2021 ◽

Vol 2021 ◽

pp. 1-14

Author(s):

Haopeng Lei ◽

Simin Chen ◽

Mingwen Wang ◽

Xiangjian He ◽

Wenjing Jia ◽

...

Keyword(s):

Image Retrieval ◽

Online Shopping ◽

Unsolved Problem ◽

Correct Match ◽

Retrieval Accuracy ◽

Fine Grained ◽

Cross Domain ◽

Domain Transformation ◽

Domain Similarity ◽

Natural Way

Due to the rise of e-commerce platforms, online shopping has become a trend. However, the current mainstream retrieval methods are still limited to using text or exemplar images as input. For huge commodity databases, it remains a long-standing unsolved problem for users to find the interested products quickly. Different from the traditional text-based and exemplar-based image retrieval techniques, sketch-based image retrieval (SBIR) provides a more intuitive and natural way for users to specify their search need. Due to the large cross-domain discrepancy between the free-hand sketch and fashion images, retrieving fashion images by sketches is a significantly challenging task. In this work, we propose a new algorithm for sketch-based fashion image retrieval based on cross-domain transformation. In our approach, the sketch and photo are first transformed into the same domain. Then, the sketch domain similarity and the photo domain similarity are calculated, respectively, and fused to improve the retrieval accuracy of fashion images. Moreover, the existing fashion image datasets mostly contain photos only and rarely contain the sketch-photo pairs. Thus, we contribute a fine-grained sketch-based fashion image retrieval dataset, which includes 36,074 sketch-photo pairs. Specifically, when retrieving on our Fashion Image dataset, the accuracy of our model ranks the correct match at the top-1 which is 96.6%, 92.1%, 91.0%, and 90.5% for clothes, pants, skirts, and shoes, respectively. Extensive experiments conducted on our dataset and two fine-grained instance-level datasets, i.e., QMUL-shoes and QMUL-chairs, show that our model has achieved a better performance than other existing methods.

Download Full-text

Rotation Consistency-Preserved Generative Adversarial Networks for Cross-Domain Aerial Image Semantic Segmentation

10.1109/igarss47720.2021.9554606 ◽

2021 ◽

Author(s):

Te Shi ◽

Yansheng Li ◽

Yongjun Zhang

Keyword(s):

Semantic Segmentation ◽

Aerial Image ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Cross Domain

Download Full-text

Constructing Self-Motivated Pyramid Curriculums for Cross-Domain Semantic Segmentation: A Non-Adversarial Approach

2019 IEEE/CVF International Conference on Computer Vision (ICCV) ◽

10.1109/iccv.2019.00686 ◽

2019 ◽

Cited By ~ 13

Author(s):

Qing Lian ◽

Lixin Duan ◽

Fengmao Lv ◽

Boqing Gong

Keyword(s):

Semantic Segmentation ◽

Cross Domain

Download Full-text

A Fine-Grained Adversarial Network Method for Cross-Domain Industrial Fault Diagnosis

IEEE Transactions on Automation Science and Engineering ◽

10.1109/tase.2019.2957232 ◽

2020 ◽

Vol 17 (3) ◽

pp. 1432-1442 ◽

Cited By ~ 6

Author(s):

Zheng Chai ◽

Chunhui Zhao

Keyword(s):

Fault Diagnosis ◽

Fine Grained ◽

Adversarial Network ◽

Cross Domain ◽

Network Method

Download Full-text

A Fine-Grained Cross-Domain Access Control Mechanism for Social Internet of Things

2014 IEEE 11th Intl Conf on Ubiquitous Intelligence and Computing and 2014 IEEE 11th Intl Conf on Autonomic and Trusted Computing and 2014 IEEE 14th Intl Conf on Scalable Computing and Communications and Its Associated Workshops ◽

10.1109/uic-atc-scalcom.2014.140 ◽

2014 ◽

Cited By ~ 2

Author(s):

Jun Wu ◽

Mianxiong Dong ◽

Kaoru Ota ◽

Jianhua Li ◽

Bei Pei

Keyword(s):

Internet Of Things ◽

Access Control ◽

Control Mechanism ◽

Fine Grained ◽

Cross Domain ◽

Social Internet Of Things ◽

Access Control Mechanism

Download Full-text

Semantic Locality-Aware Deformable Network for Clothing Segmentation

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/106 ◽

2018 ◽

Cited By ~ 4

Author(s):

Wei Ji ◽

Xi Li ◽

Yueting Zhuang ◽

Omar El Farouk Bourahla ◽

Yixin Ji ◽

...

Keyword(s):

Learning Process ◽

State Of The Art ◽

Semantic Segmentation ◽

Small Sample ◽

Sample Problem ◽

Fine Grained ◽

Domain Specific ◽

Proposed Model ◽

Segmentation Framework ◽

Small Sample Problem

Clothing segmentation is a challenging vision problem typically implemented within a fine-grained semantic segmentation framework. Different from conventional segmentation, clothing segmentation has some domain-specific properties such as texture richness, diverse appearance variations, non-rigid geometry deformations, and small sample learning. To deal with these points, we propose a semantic locality-aware segmentation model, which adaptively attaches an original clothing image with a semantically similar (e.g., appearance or pose) auxiliary exemplar by search. Through considering the interactions of the clothing image and its exemplar, more intrinsic knowledge about the locality manifold structures of clothing images is discovered to make the learning process of small sample problem more stable and tractable. Furthermore, we present a CNN model based on the deformable convolutions to extract the non-rigid geometry-aware features for clothing images. Experimental results demonstrate the effectiveness of the proposed model against the state-of-the-art approaches.

Download Full-text

CAN SPOT-6/7 CNN SEMANTIC SEGMENTATION IMPROVE SENTINEL-2 BASED LAND COVER PRODUCTS? SENSOR ASSESSMENT AND FUSION

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-v-2-2020-557-2020 ◽

2020 ◽

Vol V-2-2020 ◽

pp. 557-564

Author(s):

O. Stocker ◽

A. Le Bris

Keyword(s):

Deep Learning ◽

Land Cover ◽

Data Fusion ◽

Semantic Segmentation ◽

Fine Grained ◽

Satellite Sensors ◽

Fusion Framework ◽

Sentinel 2

Abstract. Needs for fine-grained, accurate and up-to-date land cover (LC) data are important to answer both societal and scientific purposes. Several automatic products have already been proposed, but are mostly generated out of satellite sensors like Sentinel-2 (S2) or Landsat. Metric sensors, e.g. SPOT-6/7, have been less considered, while they enable (at least annual) acquisitions at country scale and can now be efficiently processed thanks to deep learning (DL) approaches. This study thus aimed at assessing whether such sensor can improve such land cover products. A custom simple yet effective U-net - Deconv-Net inspired DL architecture is developed and applied to SPOT-6/7 and S2 for different LC nomenclatures, aiming at comparing the relevance of their spatial/spectral configurations and investigating their complementarity. The proposed DL architecture is then extended to data fusion and applied to previous sensors. At the end, the proposed fusion framework is used to enrich an existing S2 based LC product, as it is generic enough to cope with fusion at distinct levels.

Download Full-text

Classes Matter: A Fine-Grained Adversarial Approach to Cross-Domain Semantic Segmentation

Cross-Domain Semantic Segmentation of Urban Scenes via Multi-Level Feature Alignment

Deep transfer learning mechanism for fine-grained cross-domain sentiment classification

Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene Images

A New Algorithm for Sketch-Based Fashion Image Retrieval Based on Cross-Domain Transformation

Rotation Consistency-Preserved Generative Adversarial Networks for Cross-Domain Aerial Image Semantic Segmentation

Constructing Self-Motivated Pyramid Curriculums for Cross-Domain Semantic Segmentation: A Non-Adversarial Approach

A Fine-Grained Adversarial Network Method for Cross-Domain Industrial Fault Diagnosis

A Fine-Grained Cross-Domain Access Control Mechanism for Social Internet of Things

Semantic Locality-Aware Deformable Network for Clothing Segmentation

CAN SPOT-6/7 CNN SEMANTIC SEGMENTATION IMPROVE SENTINEL-2 BASED LAND COVER PRODUCTS? SENSOR ASSESSMENT AND FUSION

Export Citation Format