A Holistic Approach to Urdu Language Word Recognition using Deep Neural Networks

Urdu is one of the most popular languages in the world. It is a Persianized standard register of the Hindi language with considerable and valuable literature. While digital libraries are constantly replacing conventional libraries, a vast amount of Urdu literature is still handwritten. Digitizing this handwritten literature is essential to preserve it and make it more accessible. Nevertheless, the scarcity of Urdu Optical Character Recognition (OCR) research limits a digital library's scope to a manual document search. The limited research work in this area is mainly due to the complexity of Urdu Script. Unlike the English language, the Urdu writing style is cursive, bidirectional, and character shapes and sizes highly vary depending on their position. Holistic word recognition is found to be a better solution among many other text segmentation techniques as it takes the complete word into account instead of segmenting it explicitly or implicitly. For this project, the data of five different Urdu words were collected for training and testing a convolutional neural network and 96% recognition accuracy was achieved.

Download Full-text

MELHISSA: a multilingual entity linking architecture for historical press articles

International Journal on Digital Libraries ◽

10.1007/s00799-021-00319-6 ◽

2021 ◽

Author(s):

Elvys Linhares Pontes ◽

Luis Adrián Cabrera-Diego ◽

Jose G. Moreno ◽

Emanuela Boros ◽

Ahmed Hamdi ◽

...

Keyword(s):

Language Processing ◽

Digital Libraries ◽

Character Recognition ◽

Optical Character Recognition ◽

Historical Documents ◽

Entity Linking ◽

Named Entities ◽

European Languages ◽

Meta Information ◽

The Impact

AbstractDigital libraries have a key role in cultural heritage as they provide access to our culture and history by indexing books and historical documents (newspapers and letters). Digital libraries use natural language processing (NLP) tools to process these documents and enrich them with meta-information, such as named entities. Despite recent advances in these NLP models, most of them are built for specific languages and contemporary documents that are not optimized for handling historical material that may for instance contain language variations and optical character recognition (OCR) errors. In this work, we focused on the entity linking (EL) task that is fundamental to the indexation of documents in digital libraries. We developed a Multilingual Entity Linking architecture for HIstorical preSS Articles that is composed of multilingual analysis, OCR correction, and filter analysis to alleviate the impact of historical documents in the EL task. The source code is publicly available. Experimentation has been done over two historical documents covering five European languages (English, Finnish, French, German, and Swedish). Results have shown that our system improved the global performance for all languages and datasets by achieving an F-score@1 of up to 0.681 and an F-score@5 of up to 0.787.

Download Full-text

Optical character recognition (OCR) system for Roman script & English language using Artificial Neural Network (ANN) classifier

2016 International Conference on Research Advances in Integrated Navigation Systems (RAINS) ◽

10.1109/rains.2016.7764379 ◽

2016 ◽

Cited By ~ 4

Author(s):

Honey Mehta ◽

Sanjay Singla ◽

Aarti Mahajan

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Character Recognition ◽

Optical Character Recognition ◽

English Language ◽

Optical Character ◽

Artificial Neural ◽

Artificial Neural Network Ann ◽

Roman Script ◽

Ann Classifier

Download Full-text

Handwritten Bangla Numerical Digit Recognition Using Fine Regulated Deep Neural Network

Engineering International ◽

10.18034/ei.v9i2.551 ◽

2021 ◽

Vol 9 (2) ◽

pp. 73-84

Author(s):

Md. Shahadat Hossain ◽

Md. Anwar Hossain ◽

AFM Zainul Abadin ◽

Md. Manik Ahmed

Keyword(s):

Neural Network ◽

Character Recognition ◽

Optical Character Recognition ◽

Network Performance ◽

Deep Neural Network ◽

Recognition Accuracy ◽

Digit Recognition ◽

Optical Character ◽

Regularization Parameters ◽

Handwritten Recognition

The recognition of handwritten Bangla digit is providing significant progress on optical character recognition (OCR). It is a very critical task due to the similar pattern and alignment of handwriting digits. With the progress of modern research on optical character recognition, it is reducing the complexity of the classification task by several methods, a few problems encounter during recognition and wait to be solved with simpler methods. The modern emerging field of artificial intelligence is the Deep Neural Network, which promises a solid solution to these few handwritten recognition problems. This paper proposed a fine regulated deep neural network (FRDNN) for the handwritten numeric character recognition problem that uses convolutional neural network (CNN) models with regularization parameters which makes the model generalized by preventing the overfitting. This paper applied Traditional Deep Neural Network (TDNN) and Fine regulated deep neural network (FRDNN) models with a similar layer experienced on BanglaLekha-Isolated databases and the classification accuracies for the two models were 96.25% and 96.99%, respectively over 100 epochs. The network performance of the FRDNN model on the BanglaLekha-Isolated digit dataset was more robust and accurate than the TDNN model and depend on experimentation. Our proposed method is obtained a good recognition accuracy compared with other existing available methods.

Download Full-text

Ransomware Automatic Data Recognition Tool using SRANSAC

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.e5778.018520 ◽

2020 ◽

Vol 8 (5) ◽

pp. 2079-2083

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Research Work ◽

Bundle Adjustment ◽

Online Data ◽

Automatic Data ◽

Proposed Model ◽

Malicious Insiders ◽

Surrounding Areas ◽

Online Storage

In recent times, Ransomware is the most common form of malware seen which are achieved through ransomware attacks. The most common attacks are DDoS, Malicious Insiders, and Phishing. In this research work, information related to the ransomware attacks on windows and Linux are extracted, the detection of OCR(Optical character recognition) is improved to generate the screenshot of the infected machine and corresponding information are added to the database so that patterns are enhanced. The Hybrid Speeded Up Robust Feature (SURF) algorithm and image matching using Random Sample Consensus (SRANSAC) algorithm, bundle adjustment and image blending algorithms are used to develop the proposed model. An additional step is taken to crop the dark surrounding areas in the stitched image. Frequently used ransomware are crysis,gandcrab, crypto jacking and Notpetya. If the ransomware attack is detected in online data then the stored results is implemented so that USB dependence is avoided and to safeguard from the Ransomware like Crysis or GandCrab. Research work also focuses in developing online storage process.

Download Full-text

Pashtu Numerals Recognition through Convolutional Neural Networks

Journal of Applied and Emerging Sciences ◽

10.36785/buitems.jaes.338 ◽

2019 ◽

pp. 91-96

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition Accuracy ◽

Research Use ◽

Optical Character ◽

Classification Tasks ◽

Scanned Images

In the proposed paper we introduce a new Pashtu numerals dataset having handwritten scanned images. We make the dataset publically available for scientific and research use. Pashtu language is used by more than fifty million people both for oral and written communication, but still no efforts are devoted to the Optical Character Recognition (OCR) system for Pashtu language. We introduce a new method for handwritten numerals recognition of Pashtu language through the deep learning based models. We use convolutional neural networks (CNNs) both for features extraction and classification tasks. We assess the performance of the proposed CNNs based model and obtained recognition accuracy of 91.45%.

Download Full-text

Expiry date recognition using deep neural networks

International Joural of User-System Interaction ◽

10.37789/ijusi.2020.13.1.1 ◽

2020 ◽

Vol 13 (1) ◽

pp. 1-17

Author(s):

Traian Rebedea ◽

Vlad Florea

Keyword(s):

Neural Networks ◽

Character Recognition ◽

Optical Character Recognition ◽

Speech Synthesis ◽

Efficient Solution ◽

Deep Neural Networks ◽

Accuracy Improvement ◽

Expiry Date ◽

Food Items ◽

Optical Character

This paper proposes a deep learning solution for optical character recognition, specifically tuned to detect expiration dates that are printed on the packaging of food items. This method can be used to reduce food waste, having a significant impact on the design of smart refrigerators and can prove especially useful for persons with vision difficulties, by combining it with a speech synthesis engine. The main problem in designing an efficient solution for expiry date recognition is the lack of a large enough dataset to train deep neural networks. To tackle this issue, we propose to use an additional dataset composed of synthetically generated images. Both the synthetic and real image datasets are detailed in the paper and we show that the proposed method offers a 9.4% accuracy improvement over using real images alone.

Download Full-text

Robust Combined Binarization Method of Non-Uniformly Illuminated Document Images for Alphanumerical Character Recognition

Sensors ◽

10.3390/s20102914 ◽

2020 ◽

Vol 20 (10) ◽

pp. 2914

Author(s):

Hubert Michalak ◽

Krzysztof Okarma

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition Accuracy ◽

Image Data ◽

Document Images ◽

Historical Document ◽

Image Binarization ◽

Optical Character ◽

Binarization Method ◽

Camera Sensors

Image binarization is one of the key operations decreasing the amount of information used in further analysis of image data, significantly influencing the final results. Although in some applications, where well illuminated images may be easily captured, ensuring a high contrast, even a simple global thresholding may be sufficient, there are some more challenging solutions, e.g., based on the analysis of natural images or assuming the presence of some quality degradations, such as in historical document images. Considering the variety of image binarization methods, as well as their different applications and types of images, one cannot expect a single universal thresholding method that would be the best solution for all images. Nevertheless, since one of the most common operations preceded by the binarization is the Optical Character Recognition (OCR), which may also be applied for non-uniformly illuminated images captured by camera sensors mounted in mobile phones, the development of even better binarization methods in view of the maximization of the OCR accuracy is still expected. Therefore, in this paper, the idea of the use of robust combined measures is presented, making it possible to bring together the advantages of various methods, including some recently proposed approaches based on entropy filtering and a multi-layered stack of regions. The experimental results, obtained for a dataset of 176 non-uniformly illuminated document images, referred to as the WEZUT OCR Dataset, confirm the validity and usefulness of the proposed approach, leading to a significant increase of the recognition accuracy.

Download Full-text

RECOGNITION OF HANDWRITTEN SIMILAR CHINESE CHARACTERS BY SELF-GROWING PROBABILISTIC DECISION-BASED NEURAL NETWORK

International Journal of Neural Systems ◽

10.1142/s0129065799000575 ◽

1999 ◽

Vol 09 (06) ◽

pp. 545-561 ◽

Cited By ~ 3

Author(s):

HSIN-CHIA FU ◽

Y. Y. XU ◽

H. Y. CHANG

Keyword(s):

Neural Network ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition Accuracy ◽

Recognition System ◽

Difficult Problem ◽

Input Image ◽

Prototype System ◽

Credit Assignment ◽

Similar Character

Recognition of similar (confusion) characters is a difficult problem in optical character recognition (OCR). In this paper, we introduce a neural network solution that is capable of modeling minor differences among similar characters, and is robust to various personal handwriting styles. The Self-growing Probabilistic Decision-based Neural Network (SPDNN) is a probabilistic type neural network, which adopts a hierarchical network structure with nonlinear basis functions and a competitive credit-assignment scheme. Based on the SPDNN model, we have constructed a three-stage recognition system. First, a coarse classifier determines a character to be input to one of the pre-defined subclasses partitioned from a large character set, such as Chinese mixed with alphanumerics. Then a character recognizer determines the input image which best matches the reference character in the subclass. Lastly, the third module is a similar character recognizer, which can further enhance the recognition accuracy among similar or confusing characters. The prototype system has demonstrated a successful application of SPDNN to similar handwritten Chinese recognition for the public database CCL/HCCR1 (5401 characters × 200 samples). Regarding performance, experiments on the CCL/HCCR1 database produced 90.12% recognition accuracy with no rejection, and 94.11% accuracy with 6.7% rejection, respectively. This recognition accuracy represents about 4% improvement on the previously announced performance.5,11 As to processing speed, processing before recognition (including image preprocessing, segmentation, and feature extraction) requires about one second for an A4 size character image, and recognition consumes approximately 0.27 second per character on a Pentium-100 based personal computer, without use of any hardware accelerator or co-processor.

Download Full-text

Improving optical character recognition accuracy using adaptive image restoration

Journal of Electronic Imaging ◽

10.1117/12.240710 ◽

1996 ◽

Vol 5 (3) ◽

pp. 379 ◽

Cited By ~ 4

Author(s):

Junichi Kanai

Keyword(s):

Image Restoration ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition Accuracy ◽

Optical Character

Download Full-text

SLOPE AND CURVE CODING TO RECOGNIZE ENGLISH NUMERALS

International Journal of Students Research in Technology & Management ◽

10.18510/ijsrtm.2020.823 ◽

2020 ◽

Vol 8 (2) ◽

pp. 15-20

Author(s):

Binod Kumar Prasad

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition Accuracy ◽

Recognition System ◽

The Novel ◽

Nearest Neighbour ◽

Global Database ◽

Optical Character ◽

Individual Character ◽

Nearest Neighbour Classifier

Purpose: Lines and Curves are important parts of characters in any script. Features based on lines and curves go a long way to characterize an individual character as well as differentiate similar-looking characters. The present paper proposes an English numerals recognition system using feature elements obtained from the novel and efficient coding of the curves and local slopes. The purpose of this paper is to recognize English numerals efficiently to develop a reliable Optical Character recognition system. Methodology: K-Nearest Neighbour classification technique has been implemented on a global database MNIST to get an overall recognition accuracy rate of 96.7 %, which is competitive to other reported works in literature. Distance features and slope features are extracted from pre-processed images. The feature elements from training images are used to train K-Nearest-Neighbour classifier and those from test images have been used to classify them. Main Findings: The findings of the current paper can be used in Optical Character Recognition (OCR) of alphanumeric characters of any language, automatic reading of amount on bank cheque, address written on envelops, etc. Implications: Due to the similarity in structures of some numerals like 2, 3, and 8, the system produces respectively lower recognition accuracy rates for them. Novelty: The ways of finding distance and slope features to differentiate the curves in the structure of English Numerals is the novelty of this work.

Download Full-text