Benchmarking Object Detection Networks for Image Based Reference Detection in Document Images

In any document, graphical elements like tables, figures, and formulas contain essential information. The processing and interpretation of such information require specialized algorithms. Off-the-shelf OCR components cannot process this information reliably. Therefore, an essential step in document analysis pipelines is to detect these graphical components. It leads to a high-level conceptual understanding of the documents that makes digitization of documents viable. Since the advent of deep learning, the performance of deep learning-based object detection has improved many folds. In this work, we outline and summarize the deep learning approaches for detecting graphical page objects in the document images. Therefore, we discuss the most relevant deep learning-based approaches and state-of-the-art graphical page object detection in document images. This work provides a comprehensive understanding of the current state-of-the-art and related challenges. Furthermore, we discuss leading datasets along with the quantitative evaluation. Moreover, it discusses briefly the promising directions that can be utilized for further improvements.

Download Full-text

Seal object detection in document images using GHT of local component shapes

Proceedings of the 2010 ACM Symposium on Applied Computing - SAC '10 ◽

10.1145/1774088.1774094 ◽

2010 ◽

Cited By ~ 8

Author(s):

Partha Pratim Roy ◽

Umapada Pal ◽

Josep Lladós

Keyword(s):

Object Detection ◽

Document Images ◽

Local Component

Download Full-text

A transfer learning approach to improve object detection (on document-images) performance in presence of poor quality datasets

Developments of Artificial Intelligence Technologies in Computation and Robotics ◽

10.1142/9789811223334_0126 ◽

2020 ◽

Author(s):

Perumadura De Silva ◽

Kolli Abhiram ◽

Al-Sayeed Mohamad ◽

Vahid Tavakkoli ◽

Kabeh Mohsenzadegan ◽

...

Keyword(s):

Object Detection ◽

Transfer Learning ◽

Poor Quality ◽

Learning Approach ◽

Document Images

Download Full-text

Page Object Detection from PDF Document Images by Deep Structured Prediction and Supervised Clustering

2018 24th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr.2018.8546073 ◽

2018 ◽

Cited By ~ 6

Author(s):

Xiao-Hui Li ◽

Fei Yin ◽

Cheng-Lin Liu

Keyword(s):

Object Detection ◽

Structured Prediction ◽

Document Images ◽

Supervised Clustering

Download Full-text

Graphical Object Detection in Document Images

2019 International Conference on Document Analysis and Recognition (ICDAR) ◽

10.1109/icdar.2019.00018 ◽

2019 ◽

Cited By ~ 3

Author(s):

Ranajit Saha ◽

Ajoy Mondal ◽

C V Jawahar

Keyword(s):

Object Detection ◽

Document Images ◽

Graphical Object

Download Full-text

A Survey of Graphical Page Object Detection with Deep Neural Networks

Applied Sciences ◽

10.3390/app11125344 ◽

2021 ◽

Vol 11 (12) ◽

pp. 5344

Author(s):

Jwalin Bhatt ◽

Khurram Azeem Hashmi ◽

Muhammad Zeshan Afzal ◽

Didier Stricker

Keyword(s):

Deep Learning ◽

Object Detection ◽

Conceptual Understanding ◽

Deep Neural Networks ◽

State Of The Art ◽

Learning Approaches ◽

Document Images ◽

Essential Information ◽

Current State ◽

High Level

In any document, graphical elements like tables, figures, and formulas contain essential information. The processing and interpretation of such information require specialized algorithms. Off-the-shelf OCR components cannot process this information reliably. Therefore, an essential step in document analysis pipelines is to detect these graphical components. It leads to a high-level conceptual understanding of the documents that make the digitization of documents viable. Since the advent of deep learning, deep learning-based object detection performance has improved many folds. This work outlines and summarizes the deep learning approaches for detecting graphical page objects in document images. Therefore, we discuss the most relevant deep learning-based approaches and state-of-the-art graphical page object detection in document images. This work provides a comprehensive understanding of the current state-of-the-art and related challenges. Furthermore, we discuss leading datasets along with the quantitative evaluation. Moreover, it discusses briefly the promising directions that can be utilized for further improvements.

Download Full-text