3. Deep learning techniques for image processing

2019 ◽  
Vol 63 (11) ◽  
pp. 1658-1667
Author(s):  
M J Castro-Bleda ◽  
S España-Boquera ◽  
J Pastor-Pellicer ◽  
F Zamora-Martínez

Abstract This paper presents the ‘NoisyOffice’ database. It consists of images of printed text documents with noise mainly caused by uncleanliness from a generic office, such as coffee stains and footprints on documents or folded and wrinkled sheets with degraded printed text. This corpus is intended to train and evaluate supervised learning methods for cleaning, binarization and enhancement of noisy images of grayscale text documents. As an example, several experiments of image enhancement and binarization are presented by using deep learning techniques. Also, double-resolution images are also provided for testing super-resolution methods. The corpus is freely available at UCI Machine Learning Repository. Finally, a challenge organized by Kaggle Inc. to denoise images, using the database, is described in order to show its suitability for benchmarking of image processing systems.


2020 ◽  
Author(s):  
Jordan Reece ◽  
Margaret Couvillon ◽  
Christoph Grüter ◽  
Francis Ratnieks ◽  
Constantino Carlos Reyes-Aldasoro

AbstractThis work describe an algorithm for the automatic analysis of the waggle dance of honeybees. The algorithm analyses a video of a beehive with 13,624 frames, acquired at 25 frames/second. The algorithm employs the following traditional image processing steps: conversion to grayscale, low pass filtering, background subtraction, thresholding, tracking and clustering to detect run of bees that perform waggle dances. The algorithm detected 44,530 waggle events, i.e. one bee waggling in one time frame, which were then clustered into 511 waggle runs. Most of these were concentrated in one section of the hive. The accuracy of the tracking was 90% and a series of metrics like intra-dance variation in angle and duration were found to be consistent with literature. Whilst this algorithm was tested on a single video, the ideas and steps, which are simple as compared with Machine and Deep Learning techniques, should be attractive for researchers in this field who are not specialists in more complex techniques.


Author(s):  
Meng Xiao ◽  
Haibo Yi

According to the survey, off-line examination is still the main examination method in universities, primary and secondary schools. The grading processing of off-line examination is time-consuming. Besides, since the off-line grading is subjective, it is error-prone. In order to address the challenges in off-line examinations of universities, primary and secondary schools, it is very urgent to improve the efficiency of off-line grading. In order to realize intelligent grading for off-line examinations, we exploit deep learning techniques to off-line grading. First, we propose an image processing method for English letters. Second, we propose a image recognition method based on deep learning for English letters. Third, we propose a lightweight framework for grading. Based on the above designs, we design an intelligent grading system based on deep learning. We implement the system and the result shows that the intelligent grading system can batch grading efficiently. Besides, compared with related designs, the proposed system is more flexible and intelligent.


Author(s):  
Ozge Oztimur Karadag ◽  
Ozlem Erdas

In the traditional image processing approaches, first low-level image features are extracted and then they are sent to a classifier or a recognizer for further processing. While the traditional image processing techniques employ this step-by-step approach, majority of the recent studies prefer layered architectures which both extract features and do the classification or recognition tasks. These architectures are referred as deep learning techniques and they are applicable if sufficient amount of labeled data is available and the minimum system requirements are met. Nevertheless, most of the time either the data is insufficient or the system sources are not enough. In this study, we experimented how it is still possible to obtain an effective visual representation by combining low-level visual features with features from a simple deep learning model. As a result, combinational features gave rise to 0.80 accuracy on the image data set while the performance of low-level features and deep learning features were 0.70 and 0.74 respectively.


2020 ◽  
Vol 2 (2) ◽  
pp. 112-119
Author(s):  
Kawal Arora ◽  
Ankur Singh Bist ◽  
Roshan Prakash ◽  
Saksham Chaurasia

Recent advancements in the area of Optical Character Recognition (OCR) using deep learning techniques made it possible to use for real world applications with good accuracy. In this paper we present a system named as OCRXNet. OCRXNetv1, OCRXNetv2 and OCRXNetv3 are proposed and compared on different identity documents. Image processing methods and various text detectors have been used to identify best fitted process for custom ocr of identity documents. We also introduced the end to end pipeline to implement OCR for various use cases.


2021 ◽  
Vol 11 (2) ◽  
pp. 2124-2131
Author(s):  
Dr.N. Kanya ◽  
Dr. Pacha Shobha Rani ◽  
Dr.S. Geetha ◽  
Dr.M. Rajkumar ◽  
G. Sandhiya

The Unmanned Aerial Vehicle (UAV) has been around for a long time but has been widely used recently by humans. Their acceptance of various communications-based applications is expected to improve coverage, compared to traditional ground-based solutions. In this paper, the Deep-learning and Image Processing Process framework is expected to provide solutions to the various problems already identified when UAVs are used for communication purposes. UAVs are used in disaster relief because of their accessibility even in inaccessible places. In this paper, we propose research into Deep learning and Image Processing strategies for UAVs. In deep learning is a form of machine learning that teaches computers to do what comes naturally to people: learn by example and get a lot of attention recently and for a good reason. It achieves previously impossible results. Image processing is the process of performing a specific task on an image, finding an enhanced image or extracting useful information from it. So our paper has the idea of using in depth face recognition and photo processing a digital photo taken by the UAV to identify victims of rescue, overcoming back to the latest UAV technology some of which include blurry images, unable to identify the victim when there are too many objects and much more. The solution includes a variety of features that allow for the distribution of images. It includes features and presentation of image detection and demonstrates the effectiveness of drone use in damage applications.


Sign in / Sign up

Export Citation Format

Share Document