Dataset Augmentation with Synthetic Images Improves Semantic Segmentation

Author(s):  
Manik Goyal ◽  
Param Rajpura ◽  
Hristo Bojinov ◽  
Ravi Hegde
2019 ◽  
Vol 18 (6) ◽  
pp. 1381-1406 ◽  
Author(s):  
Lukáš Bureš ◽  
Ivan Gruber ◽  
Petr Neduchal ◽  
Miroslav Hlaváč ◽  
Marek Hrúz

An algorithm, divided into multiple modules, for generating images of full-text documents is presented. These images can be used to train, test, and evaluate models for Optical Character Recognition (OCR). Because the algorithm is modular, individual parts can be changed and tuned to generate the desired images. A method for obtaining background images of paper from already digitized documents is described. For this purpose, a novel approach based on a Variational AutoEncoder (VAE) was used to train a generative model; these backgrounds make it possible to generate, on the fly, background images similar to those in the training set. The module for printing the text uses large text corpora, a font, and suitable positional and brightness character noise to obtain believable results for natural-looking aged documents. Several types of page layouts are supported, and the system generates a detailed, structured annotation of each synthesized image. Tesseract OCR is used to compare real-world images with the generated ones. The recognition rates are very similar, indicating that the synthetic images have a realistic appearance; moreover, the errors made by the OCR system in both cases are very similar. Finally, a fully-convolutional encoder-decoder neural network architecture for semantic segmentation of individual characters was trained on the generated images, reaching a recognition accuracy of 99.28% on a test set of synthetic documents.
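As an illustration of the two learnable components the abstract describes, the sketch below shows a small VAE for paper-background patches and a fully-convolutional encoder-decoder that assigns a character class to every pixel. PyTorch, the layer sizes, the 64x64 patch size and the class count are assumptions for illustration, not the authors' configuration.

```python
# Minimal sketch (PyTorch assumed) of the two learnable pieces described above:
# a VAE that models paper-background patches, and a fully-convolutional
# encoder-decoder that labels each pixel with a character class.
import torch
import torch.nn as nn
import torch.nn.functional as F

class BackgroundVAE(nn.Module):
    """Generative model of 64x64 paper-texture patches; sampling the latent
    space yields new background images on the fly."""
    def __init__(self, latent_dim=32):
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv2d(1, 32, 4, 2, 1), nn.ReLU(),    # 64 -> 32
            nn.Conv2d(32, 64, 4, 2, 1), nn.ReLU(),   # 32 -> 16
            nn.Flatten())
        self.fc_mu = nn.Linear(64 * 16 * 16, latent_dim)
        self.fc_logvar = nn.Linear(64 * 16 * 16, latent_dim)
        self.fc_dec = nn.Linear(latent_dim, 64 * 16 * 16)
        self.dec = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, 2, 1), nn.ReLU(),    # 16 -> 32
            nn.ConvTranspose2d(32, 1, 4, 2, 1), nn.Sigmoid())  # 32 -> 64

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.fc_mu(h), self.fc_logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        recon = self.dec(self.fc_dec(z).view(-1, 64, 16, 16))
        return recon, mu, logvar

def vae_loss(recon, x, mu, logvar):
    # Standard VAE objective: reconstruction term + KL divergence.
    bce = F.binary_cross_entropy(recon, x, reduction="sum")
    kld = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return bce + kld

class CharSegNet(nn.Module):
    """Fully-convolutional encoder-decoder: page crop in, per-pixel logits
    over character classes (background + alphabet) out."""
    def __init__(self, num_classes=80):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 2, stride=2), nn.ReLU(),
            nn.ConvTranspose2d(32, num_classes, 2, stride=2))

    def forward(self, x):
        return self.decoder(self.encoder(x))

if __name__ == "__main__":
    patches = torch.rand(4, 1, 64, 64)           # dummy background patches
    vae = BackgroundVAE()
    recon, mu, logvar = vae(patches)
    print("VAE loss:", vae_loss(recon, patches, mu, logvar).item())

    pages = torch.rand(2, 1, 128, 128)           # dummy synthetic page crops
    seg = CharSegNet()
    print("segmentation logits:", seg(pages).shape)  # (2, 80, 128, 128)
```

In such a pipeline the segmentation labels come for free from the generator's structured annotation, which is what makes training on purely synthetic pages feasible.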


2018 ◽  
Vol 11 (6) ◽  
pp. 304
Author(s):  
Javier Pinzon-Arenas ◽  
Robinson Jimenez-Moreno ◽  
Ruben Hernandez-Beleno

Author(s):  
R. Malfara ◽  
Y. Bailly ◽  
J. P. Prenel ◽  
C. Cudel

Impact ◽  
2020 ◽  
Vol 2020 (2) ◽  
pp. 9-11
Author(s):  
Tomohiro Fukuda

Mixed reality (MR) is rapidly becoming a vital tool, not just in gaming, but also in education, medicine, construction and environmental management. The term refers to systems in which computer-generated content is superimposed on objects in a real-world environment across one or more sensory modalities. Although most of us have heard of MR in computer games, it also has applications in military and aviation training, as well as in tourism, healthcare and more. In addition, it has potential in architecture and design, where virtual buildings can be superimposed on existing locations to render 3D visualisations of plans. However, one major challenge that remains in MR development is real-time occlusion: hiding 3D virtual objects behind real objects that stand in front of them. Dr Tomohiro Fukuda, based at the Division of Sustainable Energy and Environmental Engineering, Graduate School of Engineering at Osaka University in Japan, is an expert in this field. Researchers led by Dr Fukuda are tackling the issue of occlusion in MR. They are currently developing an MR system that realises real-time occlusion by harnessing deep learning, using semantic segmentation to drive an outdoor landscape design simulation. This methodology can be used to automatically estimate the visual environment before and after construction projects.
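The occlusion idea can be illustrated with a small compositing sketch: the virtual geometry is drawn only over real-world pixels whose segmented class is assumed to lie behind it, so real foreground objects (trees, people) continue to hide it. NumPy, the class labels and the "behind the model" set below are hypothetical choices for illustration, not the researchers' system.

```python
# Minimal occlusion-by-segmentation sketch (NumPy, illustrative assumptions).
import numpy as np

SKY, GROUND, TREE, PERSON = 0, 1, 2, 3
CLASSES_BEHIND_MODEL = {SKY, GROUND}   # real pixels the virtual model may cover

def composite(camera_frame, virtual_rgba, seg_mask):
    """camera_frame: HxWx3 uint8; virtual_rgba: HxWx4 float in [0,1]
    (alpha=0 where there is no virtual geometry); seg_mask: HxW int class
    label per real-world pixel from a semantic segmentation network."""
    out = camera_frame.astype(np.float32) / 255.0
    alpha = virtual_rgba[..., 3:]                                   # HxWx1
    occludable = np.isin(seg_mask, list(CLASSES_BEHIND_MODEL))[..., None]
    blend = alpha * occludable            # draw the model only where allowed
    out = blend * virtual_rgba[..., :3] + (1.0 - blend) * out
    return (out * 255).astype(np.uint8)

if __name__ == "__main__":
    h, w = 4, 6
    frame = np.full((h, w, 3), 120, dtype=np.uint8)     # dummy camera frame
    virtual = np.zeros((h, w, 4), dtype=np.float32)
    virtual[:, 2:, :] = [0.8, 0.2, 0.2, 1.0]            # opaque "building"
    mask = np.full((h, w), SKY, dtype=np.int64)
    mask[:, 4:] = TREE                  # a tree occludes the right-hand side
    print(composite(frame, virtual, mask)[0])
```

Running this per frame is what makes the approach real-time friendly: a single segmentation pass yields the occlusion mask, with no explicit 3D reconstruction of the scene.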

