Mask R-CNN Object Detection Method Based on Improved Feature Pyramid

2019 ◽  
Vol 56 (4) ◽  
pp. 041502 ◽  
Author(s):  
任之俊 Ren Zhijun ◽  
蔺素珍 Lin Suzhen ◽  
李大威 Li Dawei ◽  
王丽芳 Wang Lifang ◽  
左健宏 Zuo Jianhong
2019 ◽  
Vol 11 (16) ◽  
pp. 1921 ◽  
Author(s):  
Zijun Duo ◽  
Wenke Wang ◽  
Huizan Wang

Oceanic mesoscale eddies greatly influence energy and matter transport and acoustic propagation. However, the traditional detection method for oceanic mesoscale eddies relies too much on the threshold value and has significant subjectivity. The existing machine learning methods are not mature or purposeful enough, as their train set lacks authority. In view of the above problems, this paper constructs a mesoscale eddy automatic identification and positioning network—OEDNet—based on an object detection network. Firstly, 2D image processing technology is used to enhance the data of a small number of accurate eddy samples annotated by marine experts to generate the train set. Then, the object detection model with a deep residual network, and a feature pyramid network as the main structure, is designed and optimized for small samples and complex regions in the mesoscale eddies of the ocean. Experimental results show that the model achieves better recognition compared to the traditional detection method and exhibits a good generalization ability in different sea areas.


2021 ◽  
Vol 11 (9) ◽  
pp. 3782
Author(s):  
Chu-Hui Lee ◽  
Chen-Wei Lin

Object detection is one of the important technologies in the field of computer vision. In the area of fashion apparel, object detection technology has various applications, such as apparel recognition, apparel detection, fashion recommendation, and online search. The recognition task is difficult for a computer because fashion apparel images have different characteristics of clothing appearance and material. Currently, fast and accurate object detection is the most important goal in this field. In this study, we proposed a two-phase fashion apparel detection method named YOLOv4-TPD (YOLOv4 Two-Phase Detection), based on the YOLOv4 algorithm, to address this challenge. The target categories for model detection were divided into the jacket, top, pants, skirt, and bag. According to the definition of inductive transfer learning, the purpose was to transfer the knowledge from the source domain to the target domain that could improve the effect of tasks in the target domain. Therefore, we used the two-phase training method to implement the transfer learning. Finally, the experimental results showed that the mAP of our model was better than the original YOLOv4 model through the two-phase transfer learning. The proposed model has multiple potential applications, such as an automatic labeling system, style retrieval, and similarity detection.


2021 ◽  
Vol 1880 (1) ◽  
pp. 012018
Author(s):  
Shaobo Wang ◽  
Cheng Zhang ◽  
Di Su ◽  
Tianqi Sun

2021 ◽  
Author(s):  
Shuqi Xiong ◽  
Xiaohong Wu ◽  
Honggang Chen ◽  
Linbo Qing ◽  
Tong Chen ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document