Improving Street Object Detection Using Transfer Learning: From Generic Model to Specific Model

Abstract Introduction Classifying whether concepts in an unstructured clinical text are negated is an important unsolved task. New domain adaptation and transfer learning methods can potentially address this issue. Objective We examine neural unsupervised domain adaptation methods, introducing a novel combination of domain adaptation with transformer-based transfer learning methods to improve negation detection. We also want to better understand the interaction between the widely used bidirectional encoder representations from transformers (BERT) system and domain adaptation methods. Materials and Methods We use 4 clinical text datasets that are annotated with negation status. We evaluate a neural unsupervised domain adaptation algorithm and BERT, a transformer-based model that is pretrained on massive general text datasets. We develop an extension to BERT that uses domain adversarial training, a neural domain adaptation method that adds an objective to the negation task, that the classifier should not be able to distinguish between instances from 2 different domains. Results The domain adaptation methods we describe show positive results, but, on average, the best performance is obtained by plain BERT (without the extension). We provide evidence that the gains from BERT are likely not additive with the gains from domain adaptation. Discussion Our results suggest that, at least for the task of clinical negation detection, BERT subsumes domain adaptation, implying that BERT is already learning very general representations of negation phenomena such that fine-tuning even on a specific corpus does not lead to much overfitting. Conclusion Despite being trained on nonclinical text, the large training sets of models like BERT lead to large gains in performance for the clinical negation detection task.

Download Full-text

Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit Localization Inference

10.36227/techrxiv.16864711.v1 ◽

2021 ◽

Author(s):

Junying Huang ◽

Fan Chen ◽

Liang Lin ◽

dongyu zhang

Keyword(s):

Object Detection ◽

Fine Tuning ◽

The Novel ◽

Plug And Play ◽

Tuning Method ◽

One Step ◽

Tuning Process ◽

General Object ◽

The One ◽

Parallel Techniques

Aiming at recognizing and localizing the object of novel categories by a few reference samples, few-shot object detection is a quite challenging task. Previous works often depend on the fine-tuning process to transfer their model to the novel category and rarely consider the defect of fine-tuning, resulting in many drawbacks. For example, these methods are far from satisfying in the low-shot or episode-based scenarios since the fine-tuning process in object detection requires much time and high-shot support data. To this end, this paper proposes a plug-and-play few-shot object detection (PnP-FSOD) framework that can accurately and directly detect the objects of novel categories without the fine-tuning process. To accomplish the objective, the PnP-FSOD framework contains two parallel techniques to address the core challenges in the few-shot learning, i.e., across-category task and few-annotation support. Concretely, we first propose two simple but effective meta strategies for the box classifier and RPN module to enable the across-category object detection without fine-tuning. Then, we introduce two explicit inferences into the localization process to reduce its dependence on the annotated data, including explicit localization score and semi-explicit box regression. In addition to the PnP-FSOD framework, we propose a novel one-step tuning method that can avoid the defects in fine-tuning. It is noteworthy that the proposed techniques and tuning method are based on the general object detector without other prior methods, so they are easily compatible with the existing FSOD methods. Extensive experiments show that the PnP-FSOD framework has achieved the state-of-the-art few-shot object detection performance without any tuning method. After applying the one-step tuning method, it further shows a significant lead in both efficiency, precision, and recall, under varied few-shot evaluation protocols.

Download Full-text

Hyperspectral Imaging Combined With Deep Transfer Learning for Rice Disease Detection

Frontiers in Plant Science ◽

10.3389/fpls.2021.693521 ◽

2021 ◽

Vol 12 ◽

Author(s):

Lei Feng ◽

Baohua Wu ◽

Yong He ◽

Chu Zhang

Keyword(s):

Hyperspectral Imaging ◽

Transfer Learning ◽

Wavelength Range ◽

Fine Tuning ◽

Disease Detection ◽

Target Domain ◽

Learning Methods ◽

Rice Varieties ◽

Rice Disease ◽

Transfer Tasks

Various rice diseases threaten the growth of rice. It is of great importance to achieve the rapid and accurate detection of rice diseases for precise disease prevention and control. Hyperspectral imaging (HSI) was performed to detect rice leaf diseases in four different varieties of rice. Considering that it costs much time and energy to develop a classifier for each variety of rice, deep transfer learning was firstly introduced to rice disease detection across different rice varieties. Three deep transfer learning methods were adapted for 12 transfer tasks, namely, fine-tuning, deep CORrelation ALignment (CORAL), and deep domain confusion (DDC). A self-designed convolutional neural network (CNN) was set as the basic network of the deep transfer learning methods. Fine-tuning achieved the best transferable performance with an accuracy of over 88% for the test set of the target domain in the majority of transfer tasks. Deep CORAL obtained an accuracy of over 80% in four of all the transfer tasks, which was superior to that of DDC. A multi-task transfer strategy has been explored with good results, indicating the potential of both pair-wise, and multi-task transfers. A saliency map was used for the visualization of the key wavelength range captured by CNN with and without transfer learning. The results indicated that the wavelength range with and without transfer learning was overlapped to some extent. Overall, the results suggested that deep transfer learning methods could perform rice disease detection across different rice varieties. Hyperspectral imaging, in combination with the deep transfer learning method, is a promising possibility for the efficient and cost-saving field detection of rice diseases among different rice varieties.

Download Full-text

Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit Localization Inference

10.36227/techrxiv.16864711 ◽

2021 ◽

Author(s):

Junying Huang ◽

Fan Chen ◽

Liang Lin ◽

dongyu zhang

Keyword(s):

Object Detection ◽

Fine Tuning ◽

The Novel ◽

Plug And Play ◽

Tuning Method ◽

One Step ◽

Tuning Process ◽

General Object ◽

The One ◽

Parallel Techniques

Aiming at recognizing and localizing the object of novel categories by a few reference samples, few-shot object detection is a quite challenging task. Previous works often depend on the fine-tuning process to transfer their model to the novel category and rarely consider the defect of fine-tuning, resulting in many drawbacks. For example, these methods are far from satisfying in the low-shot or episode-based scenarios since the fine-tuning process in object detection requires much time and high-shot support data. To this end, this paper proposes a plug-and-play few-shot object detection (PnP-FSOD) framework that can accurately and directly detect the objects of novel categories without the fine-tuning process. To accomplish the objective, the PnP-FSOD framework contains two parallel techniques to address the core challenges in the few-shot learning, i.e., across-category task and few-annotation support. Concretely, we first propose two simple but effective meta strategies for the box classifier and RPN module to enable the across-category object detection without fine-tuning. Then, we introduce two explicit inferences into the localization process to reduce its dependence on the annotated data, including explicit localization score and semi-explicit box regression. In addition to the PnP-FSOD framework, we propose a novel one-step tuning method that can avoid the defects in fine-tuning. It is noteworthy that the proposed techniques and tuning method are based on the general object detector without other prior methods, so they are easily compatible with the existing FSOD methods. Extensive experiments show that the PnP-FSOD framework has achieved the state-of-the-art few-shot object detection performance without any tuning method. After applying the one-step tuning method, it further shows a significant lead in both efficiency, precision, and recall, under varied few-shot evaluation protocols.

Download Full-text

COMPARATIVE ANALYSIS OF DEEP LEARNING METHODS FOR OBJECT DETECTION

Advances in Mathematics: Scientific Journal ◽

10.37418/amsj.9.6.54 ◽

2020 ◽

Vol 9 (6) ◽

pp. 3759-3775

Author(s):

K. Gill ◽

V. Mangat

Keyword(s):

Deep Learning ◽

Comparative Analysis ◽

Object Detection ◽

Learning Methods

Download Full-text

Deep Learning Techniques for Grape Plant Species Identification in Natural Images

Sensors ◽

10.3390/s19224850 ◽

2019 ◽

Vol 19 (22) ◽

pp. 4850 ◽

Cited By ~ 6

Author(s):

Carlos S. Pereira ◽

Raul Morais ◽

Manuel J. C. S. Reis

Keyword(s):

Transfer Learning ◽

Climatic Conditions ◽

Fine Tuning ◽

Variety Identification ◽

Test Accuracy ◽

Accuracy Score ◽

Learning Techniques ◽

Four Corners ◽

Integrated Software ◽

Grape Varieties

Frequently, the vineyards in the Douro Region present multiple grape varieties per parcel and even per row. An automatic algorithm for grape variety identification as an integrated software component was proposed that can be applied, for example, to a robotic harvesting system. However, some issues and constraints in its development were highlighted, namely, the images captured in natural environment, low volume of images, high similarity of the images among different grape varieties, leaf senescence, and significant changes on the grapevine leaf and bunch images in the harvest seasons, mainly due to adverse climatic conditions, diseases, and the presence of pesticides. In this paper, the performance of the transfer learning and fine-tuning techniques based on AlexNet architecture were evaluated when applied to the identification of grape varieties. Two natural vineyard image datasets were captured in different geographical locations and harvest seasons. To generate different datasets for training and classification, some image processing methods, including a proposed four-corners-in-one image warping algorithm, were used. The experimental results, obtained from the application of an AlexNet-based transfer learning scheme and trained on the image dataset pre-processed through the four-corners-in-one method, achieved a test accuracy score of 77.30%. Applying this classifier model, an accuracy of 89.75% on the popular Flavia leaf dataset was reached. The results obtained by the proposed approach are promising and encouraging in helping Douro wine growers in the automatic task of identifying grape varieties.

Download Full-text

Transfer Learning Methods for Using Textural Features in Histopathological Image Classification

2020 Medical Technologies Congress (TIPTEKNO) ◽

10.1109/tiptekno50054.2020.9299220 ◽

2020 ◽

Author(s):

Sabri Can Cetindag ◽

Kubilay Guran ◽

Gokhan Bilgin

Keyword(s):

Image Classification ◽

Transfer Learning ◽

Textural Features ◽

Learning Methods ◽

Histopathological Image ◽

Histopathological Image Classification

Download Full-text

A Two-Phase Fashion Apparel Detection Method Based on YOLOv4

Applied Sciences ◽

10.3390/app11093782 ◽

2021 ◽

Vol 11 (9) ◽

pp. 3782

Author(s):

Chu-Hui Lee ◽

Chen-Wei Lin

Keyword(s):

Object Detection ◽

Transfer Learning ◽

Detection Method ◽

Phase Transfer ◽

Recognition Task ◽

Phase Detection ◽

Target Domain ◽

Two Phase ◽

Detection Technology ◽

Fashion Apparel

Object detection is one of the important technologies in the field of computer vision. In the area of fashion apparel, object detection technology has various applications, such as apparel recognition, apparel detection, fashion recommendation, and online search. The recognition task is difficult for a computer because fashion apparel images have different characteristics of clothing appearance and material. Currently, fast and accurate object detection is the most important goal in this field. In this study, we proposed a two-phase fashion apparel detection method named YOLOv4-TPD (YOLOv4 Two-Phase Detection), based on the YOLOv4 algorithm, to address this challenge. The target categories for model detection were divided into the jacket, top, pants, skirt, and bag. According to the definition of inductive transfer learning, the purpose was to transfer the knowledge from the source domain to the target domain that could improve the effect of tasks in the target domain. Therefore, we used the two-phase training method to implement the transfer learning. Finally, the experimental results showed that the mAP of our model was better than the original YOLOv4 model through the two-phase transfer learning. The proposed model has multiple potential applications, such as an automatic labeling system, style retrieval, and similarity detection.

Download Full-text

Multiscale Object Detection from Drone Imagery Using Ensemble Transfer Learning

Drones ◽

10.3390/drones5030066 ◽

2021 ◽

Vol 5 (3) ◽

pp. 66

Author(s):

Rahee Walambe ◽

Aboli Marathe ◽

Ketan Kotecha

Keyword(s):

Object Detection ◽

Transfer Learning ◽

Data Augmentation ◽

Test Time ◽

Complex Task ◽

Open Domain ◽

End User ◽

Aerial Vehicle ◽

Uav Images ◽

Voting Strategy

Object detection in uncrewed aerial vehicle (UAV) images has been a longstanding challenge in the field of computer vision. Specifically, object detection in drone images is a complex task due to objects of various scales such as humans, buildings, water bodies, and hills. In this paper, we present an implementation of ensemble transfer learning to enhance the performance of the base models for multiscale object detection in drone imagery. Combined with a test-time augmentation pipeline, the algorithm combines different models and applies voting strategies to detect objects of various scales in UAV images. The data augmentation also presents a solution to the deficiency of drone image datasets. We experimented with two specific datasets in the open domain: the VisDrone dataset and the AU-AIR Dataset. Our approach is more practical and efficient due to the use of transfer learning and two-level voting strategy ensemble instead of training custom models on entire datasets. The experimentation shows significant improvement in the mAP for both VisDrone and AU-AIR datasets by employing the ensemble transfer learning method. Furthermore, the utilization of voting strategies further increases the 3reliability of the ensemble as the end-user can select and trace the effects of the mechanism for bounding box predictions.

Download Full-text