BPTI: Bilingual (Arabic/English) Printed Text Images Dataset for Recognition Research

Urdu Optical Character Recognition (OCR) based on character-level recognition (the analytical approach) is less popular than ligature-level recognition (the holistic approach) because of its added complexity and the overlapping of characters and strokes. This paper presents a holistic Urdu ligature extraction technique. The proposed Photometric Ligature Extraction (PLE) technique is independent of font size and column layout and can handle non-overlapping ligatures as well as all inter- and intra-overlapping ligatures. It uses a customized photometric filter together with X-shearing, padding, and connected component analysis to extract complete ligatures instead of extracting primary and secondary ligatures separately. A total of ~2,67,800 ligatures were extracted from scanned Urdu Nastaliq printed text images with an accuracy of 99.4%. Thus, the proposed framework outperforms the existing Urdu Nastaliq text extraction and segmentation algorithms. The PLE framework can also be applied to other languages that use the Nastaliq script style, such as Arabic, Persian, Pashto, and Sindhi.
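The connected component analysis step named in the abstract can be illustrated with a minimal sketch. This is not the authors' PLE implementation: the photometric filter, X-shearing, and padding are not reproduced, and the function name `connected_components`, the list-of-lists image format, and the choice of 8-connectivity are assumptions made for illustration only.

```python
from collections import deque


def connected_components(grid):
    """Return bounding boxes (top, left, bottom, right) of 8-connected ink regions.

    `grid` is a binary image as a list of rows, with 1 for ink and 0 for background.
    """
    rows = len(grid)
    cols = len(grid[0]) if grid else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited ink pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        inside = 0 <= ny < rows and 0 <= nx < cols
                        if inside and grid[ny][nx] == 1 and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


image = [
    [1, 1, 0, 0, 0],
    [0, 1, 0, 0, 1],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
]
boxes = connected_components(image)
```

On its own, this labelling treats every disconnected mark as a separate component, so a diacritic would be reported apart from its base stroke. Joining such pieces into one complete ligature is the part the abstract attributes to the photometric filter, shearing, and padding steps.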

Computational modelling of an optical character recognition system for Yorùbá printed text images

Scientific African ◽

10.1016/j.sciaf.2020.e00415 ◽

2020 ◽

Vol 9 ◽

pp. e00415

Author(s):

Olalekan Joseph ONI ◽

Franklin Oladiipo ASAHIAH

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Computational Modelling ◽

Recognition System ◽

Optical Character ◽

Printed Text ◽

Text Images

Hardware and Software Co-Design of Arabic Alphabets Recognition Platform for Blind and Visually Impaired Persons

The Open Electrical & Electronic Engineering Journal ◽

10.2174/1874129001711010193 ◽

2017 ◽

Vol 11 (1) ◽

pp. 193-200

Author(s):

Brahim Sabir ◽

Yassine Khazri ◽

Mohamed Moussetad ◽

Bouzekri Touri

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Visually Impaired ◽

Recognition Algorithm ◽

Hardware Platform ◽

Blind And Visually Impaired ◽

Optical Character ◽

Printed Text ◽

Visually Impaired Persons ◽

Text Images

Background: Optical character recognition (OCR) is a technique that converts scanned or printed text images into editable text. Many OCR solutions have been proposed and used for Latin and Chinese alphabets. However, little has been published on OCR for handwritten Arabic alphabets, especially for use by blind and visually impaired persons. This paper is an attempt to develop an OCR for Arabic alphabets dedicated to blind and visually impaired persons. Method: The proposed Optical Arabic Alphabets Recognition algorithm includes binarization of the input image, segmentation, feature extraction, and classification based on neural networks to match the read Arabic alphabets against trained patterns. The algorithm was developed in MATLAB, and the solution was designed to be implemented on a hardware platform and can be customized for mobile phones. Conclusion: The presented method achieves recognition accuracy comparable to other OCR algorithms.
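The first two stages of the pipeline described above (binarization, then segmentation) can be sketched as follows. The abstract does not say which thresholding or segmentation method the authors used, so the fixed global threshold and the empty-column split shown here are assumptions, and the names `binarize` and `segment_columns` are hypothetical. Feature extraction and the neural network classifier are not covered.

```python
def binarize(gray, threshold=128):
    """Map a grayscale image (rows of 0-255 ints) to 1 for dark ink, 0 for background."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def segment_columns(binary):
    """Split a binary image on empty columns.

    Returns (start, end) column ranges with `end` exclusive, one per ink run.
    """
    if not binary:
        return []
    width = len(binary[0])
    # A column "has ink" if any row contains a 1 at that position.
    has_ink = [any(row[x] for row in binary) for x in range(width)]
    segments, start = [], None
    for x, ink in enumerate(has_ink):
        if ink and start is None:
            start = x
        elif not ink and start is not None:
            segments.append((start, x))
            start = None
    if start is not None:
        segments.append((start, width))
    return segments


gray = [
    [255, 0, 0, 255, 255, 10, 255],
    [255, 0, 255, 255, 255, 10, 10],
]
binary = binarize(gray)
segments = segment_columns(binary)
```

Splitting on empty columns only separates glyphs that do not touch, so it suits isolated letters rather than connected Arabic script.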

An Account of the Printed Text of the Greek New Testament

10.1017/cbo9781107326293 ◽

2009 ◽

Author(s):

Samuel Prideaux Tregelles

Keyword(s):

New Testament ◽

Greek New Testament ◽

Printed Text

The Miller’s Tale: a study of an unrecorded fragment of a manuscript in the John Rylands Library in relation to the first printed text

Bulletin of the John Rylands Library ◽

10.7227/bjrl.17.2.8 ◽

1933 ◽

Vol 17 (2) ◽

pp. 333-347

Author(s):

Guthrie Vine

Keyword(s):

Printed Text

Experiments in the recognition of hand-printed text, part II

10.1145/1476706.1476736 ◽

1968 ◽

Cited By ~ 5

Author(s):

Richard O. Duda ◽

Peter E. Hart

Keyword(s):

Printed Text

Gender Differences in Teens’ Digital Propensity and Perceptions and Preferences With Regard to Digital and Printed Text

TechTrends ◽

10.1007/s11528-016-0134-4 ◽

2016 ◽

Vol 61 (2) ◽

pp. 171-178 ◽

Cited By ~ 2

Author(s):

Soonhwa Seok ◽

Boaventura DaCosta

Keyword(s):

Gender Differences ◽

Printed Text

A Robot Object Recognition Method Based on Scene Text Reading in Home Environments

Sensors ◽

10.3390/s21051919 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1919

Author(s):

Shuhua Liu ◽

Huixin Xu ◽

Qi Li ◽

Fei Zhang ◽

Kun Hou

Keyword(s):

Object Recognition ◽

Recognition Accuracy ◽

Multiple Objects ◽

Recognition Method ◽

Home Environments ◽

Complex Scenes ◽

Scene Text ◽

Text Reading ◽

Text Images ◽

Chinese And English

To address robot object recognition in complex scenes, this paper proposes an object recognition method based on scene text reading. The proposed method simulates human-like behavior and accurately identifies objects bearing text by reading them carefully. First, high-accuracy deep learning models are adopted to detect and recognize text from multiple views. Second, datasets comprising 102,000 Chinese and English scene text images and their inverses are generated. Training the model on these two datasets improves the F-measure of text detection by 0.4% and the recognition accuracy by 1.26%. Finally, a robot object recognition method is proposed based on scene text reading. The robot detects and recognizes text in the image and then stores the recognition results in a text file. When the user gives the robot a fetching instruction, the robot searches the text files for the corresponding keywords and obtains a confidence for each of the multiple objects in the scene image. The object with the maximum confidence is then selected as the target. The results show that the robot can accurately distinguish objects of arbitrary shape and category, and that the method effectively solves the problem of object recognition in home environments.
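The final selection step (search the stored recognition results for the instruction's keywords, then pick the object with the highest confidence) can be sketched as below. The abstract does not specify how confidence is computed or how the text files are laid out, so this sketch assumes an in-memory mapping from a hypothetical object id to `(text, confidence)` pairs and a simple whole-word match; `select_target` is an illustrative name, not the authors' API.

```python
def select_target(instruction, recognized):
    """Pick the object whose recognized text best matches the instruction.

    `recognized` maps an object id to a list of (text, confidence) pairs read
    from that object. Returns (object_id, confidence), or (None, 0.0) when no
    recognized text shares a word with the instruction.
    """
    keywords = {word.lower() for word in instruction.split()}
    best_id, best_conf = None, 0.0
    for obj_id, readings in recognized.items():
        for text, conf in readings:
            words = {word.lower() for word in text.split()}
            # Keep the match only if it beats the best confidence so far.
            if keywords & words and conf > best_conf:
                best_id, best_conf = obj_id, conf
    return best_id, best_conf


recognized = {
    "box_1": [("Green Tea", 0.91)],
    "box_2": [("Milk", 0.88), ("green label", 0.62)],
    "box_3": [("Coffee", 0.95)],
}
target = select_target("fetch the green tea", recognized)
missing = select_target("fetch juice", recognized)
```

Here `box_3` has the highest recognition confidence overall but is never chosen, because none of its text matches the instruction.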

Unsupervised learning technique for binarization of gray scale text images

2014 Annual IEEE India Conference (INDICON) ◽

10.1109/indicon.2014.7030453 ◽

2014 ◽

Author(s):

Saumya Srivastava ◽

Sudip Sanyal

Keyword(s):

Unsupervised Learning ◽

Gray Scale ◽

Learning Technique ◽

Text Images

Skew correction and line extraction in binarized printed text images

Photometric Ligature Extraction Technique for Urdu Optical Character Recognition
