Artificial Intelligence in Metrology Data Collection

Mapping Intimacies ◽

10.51843/wsproceedings.2021.03 ◽

2021 ◽

Author(s):

Michael Schwartz ◽

Keyword(s):

Artificial Intelligence ◽

Data Collection ◽

Character Recognition ◽

Optical Character Recognition ◽

Lessons Learned ◽

Continuous Learning ◽

Automate Data ◽

Optical Character ◽

Automate Data Collection ◽

Continual Learning

Many companies have tried to automate data collection for handheld Digital Multimeters (DMM) using Optical Character Recognition (OCR). Only recently have companies tried to perform this task using Artificial Intelligence (AI) technology, Cal Lab Solutions being one of them in 2020. But when we developed our first prototype application, we discovered the difficulties of getting a good value with every measurement and test point.A year later, lessons learned and equipped with better software, this paper is a continuation of that AI project. In Beta-,1 we learned the difficulties of AI reading segmented displays. There are no pre-trained models for this type of display, so we needed to train a model. This required the testing of thousands of images, so we changed the scope of the project to a continual learning AI project. This paper will cover how we built our continuous learning AI model to show how any lab with a webcam can start automating those handheld DMMS with software that gets smarter over time.

Download Full-text

Evaluation of Text Legibility in Alternative Imaging Approaches to Microfiche Digitization

Archiving Conference ◽

10.2352/issn.2168-3204.2021.1.0.22 ◽

2021 ◽

Vol 2021 (1) ◽

pp. 96-101

Author(s):

Hilda Deborah ◽

Dipendra J. Mandal

Keyword(s):

Data Collection ◽

Character Recognition ◽

Optical Character Recognition ◽

Optical Character ◽

Archival Storage ◽

Digital Formats ◽

Historical Periods ◽

Text Legibility

Microfiche was a common format used in microforms reproductions of documents, extensively used for archival storage before the move to digital formats. While contemporary documents are still available for digitization, others from older historical periods are no longer physically accessible for various reasons. In some cases, their microfiche copies are available, making microfiche digitization a must. However, a microfiche reader is not always available and, even then, it is a machine made for the purpose of reading and not for data collection. In this work, the performance two imaging devices are evaluated as alternatives to the traditional microfiche reader, by means of optical character recognition (OCR). Results show that this alternative surpasses the performance of a microfiche reader in terms of text legibility.

Download Full-text

Improve OCR Accuracy with Advanced Image Preprocessing using Machine Learning with Python

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.g5745.059720 ◽

2020 ◽

Vol 9 (7) ◽

pp. 1026-1030

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Neural Networks ◽

Character Recognition ◽

Optical Character Recognition ◽

Image Preprocessing ◽

Optical Character ◽

Handwritten Text ◽

Printed Text ◽

Learning Machine

Optical Character Recognition or Optical Character Reader (OCR) is a pattern-based method consciousness that transforms the concept of electronic conversion of images of handwritten text or printed text in a text compiled. Equipment or tools used for that purpose are cameras and apartment scanners. Handwritten text is scanned using a scanner. The image of the scrutinized document is processed using the program. Identification of manuscripts is difficult compared to other western language texts. In our proposed work we will accept the challenge of identifying letters and letters and working to achieve the same. Image Preprocessing techniques can effectively improve the accuracy of an OCR engine. The goal is to design and implement a machine with a learning machine and Python that is best to work with more accurate than OCR's pre-built machines with unique technologies such as MatLab, Artificial Intelligence, Neural networks, etc.

Download Full-text

Optical character recognition (OCR) using partial least square (PLS) based feature reduction: an application to artificial intelligence for biometric identification

Journal of Enterprise Information Management ◽

10.1108/jeim-02-2020-0076 ◽

2020 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Zainab Akhtar ◽

Jong Weon Lee ◽

Muhammad Attique Khan ◽

Muhammad Sharif ◽

Sajid Ali Khan ◽

...

Keyword(s):

Artificial Intelligence ◽

Character Recognition ◽

Optical Character Recognition ◽

Feature Reduction ◽

Partial Least Square ◽

Least Square ◽

License Plate ◽

Content Type ◽

Optical Character ◽

Machine Readable

PurposeIn artificial intelligence, the optical character recognition (OCR) is an active research area based on famous applications such as automation and transformation of printed documents into machine-readable text document. The major purpose of OCR in academia and banks is to achieve a significant performance to save storage space.Design/methodology/approachA novel technique is proposed for automated OCR based on multi-properties features fusion and selection. The features are fused using serially formulation and output passed to partial least square (PLS) based selection method. The selection is done based on the entropy fitness function. The final features are classified by an ensemble classifier.FindingsThe presented method was extensively tested on two datasets such as the authors proposed and Chars74k benchmark and achieved an accuracy of 91.2 and 99.9%. Comparing the results with existing techniques, it is found that the proposed method gives improved performance.Originality/valueThe technique presented in this work will help for license plate recognition and text conversion from a printed document to machine-readable.

Download Full-text

Segmentation of Handwritten Text Document Written in Devanagri Script for Simple character, skewed character and broken character

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽

10.24297/ijct.v8i1.3427 ◽

2013 ◽

Vol 8 (1) ◽

pp. 686-691

Author(s):

Vneeta Rani ◽

Dr.Vijay Laxmi

Keyword(s):

Artificial Intelligence ◽

Character Recognition ◽

Optical Character Recognition ◽

Character Segmentation ◽

Research Areas ◽

Text Document ◽

Optical Character ◽

Handwritten Text ◽

Recognition Phase ◽

Simple Character

OCR (optical character recognition) is a technology that is commonly used for recognizing patterns artificial intelligence & computer machine. With the help of OCR we can convert scanned document into editable documents which can be further used in various research areas. In this paper, we are presenting a character segmentation technique that can segment simple characters, skewed characters as well as broken characters. Character segmentation is very important phase in any OCR process because output of this phase will be served as input to various other phase like character recognition phase etc. If there is some problem in character segmentation phase then recognition of the corresponding character is very difficult or nearly impossible.

Download Full-text

PENDETEKSIAN PLAT NOMOR KENDARAAN MENGGUNAKAN ALGORITMA YOU ONLY LOOK ONCE V3 DAN TESSERACT

Jurnal Ilmiah Teknologi Infomasi Terapan ◽

10.33197/jitter.vol8.iss1.2021.718 ◽

2021 ◽

Vol 8 (1) ◽

pp. 57-62

Author(s):

Muhamad Rizky Fauzan ◽

Ari Purno Wahyu Wibowo

Keyword(s):

Artificial Intelligence ◽

Character Recognition ◽

Optical Character Recognition ◽

Optical Character

Perkembangan teknologi saat ini sangat berkembang pesat. Teknologi yang saat ini sedang dilakukan pengembangan secara besar-besaran yaitu Artificial Intelligence. Artificial Intelligence atau AI memiliki berbagai macam fungsi dan tujuan tergantung dari sistem yang akan dibuat. Salah satunya yaitu pendekteksian objek dan teks dari gambar atau video. Contoh dari pemanfaatan teknologi ini yaitu pada pendeteksian objek dan teks pada plat nomor kendaraan. Pada penelitian ini dilakukan perancangan sistem dengan menggunakan algoritma You Only Look Once V3 sebagai algoritma pendeteksi objek dan Tesseract Optical Character Recognition sebagai pendeteksi teks dalam gambar. Perancangan ini akan dibantu dengan library OpenCV pada bahasa pemrogramanan python dan menggunakan dataset gambar yang sudah tersedia. Penelitian ini bertujuan untuk mengetahui tingkat keakurasian algoritma You Only Look Once V3 yang dikombinasikan dengan Tesseract Optical Character Recognition.

Download Full-text

Optical Character Recognition using Artificial Intelligence

International Journal of Computer Applications ◽

10.5120/ijca2018916390 ◽

2018 ◽

Vol 179 (31) ◽

pp. 14-20 ◽

Cited By ~ 1

Author(s):

Shreshtha Garg ◽

Kapil Kumar ◽

Nikhil Prabhakar ◽

Amulya Ratan ◽

Aayush Trivedi

Keyword(s):

Artificial Intelligence ◽

Character Recognition ◽

Optical Character Recognition ◽

Optical Character

Download Full-text

An efficient extraction of information from Indian Government issued documents Aadhar and Pan Card

10.54216/fpa.040201 ◽

2021 ◽

pp. 56-61

Author(s):

Rachna Tewani ◽

◽

...

Keyword(s):

Natural Language Processing ◽

Data Collection ◽

Language Processing ◽

Character Recognition ◽

Optical Character Recognition ◽

Image Data ◽

Indian Government ◽

Use Of Data ◽

Optical Character ◽

Efficient Extraction

In today's world, everything is getting digitized, and widespread use of data scanning tools and photography. When we have a lot of image data, it becomes important to accumulate data in a form that is useful for the company/organization. Doing it manually is a tedious task and takes an ample amount of time. Hence to simplify the job, we have developed a FLASK API that takes an image folder as an object and returns an excel sheet of relevant data from the image data. We have used optical character recognition and software like pytesseract to extract data from images. Further in the process, we have used natural language processing, and finally, we have found relevant data using the globe and regex module. This model is helpful in data collection from Registration certificates which helps us store data like chassis number, owner name, car number, etc., easily and can be applied to Aadhaar cards and pan cards.

Download Full-text

OPTICAL CHARACTER RECOGNITION USING ARTIFICIAL INTELLIGENCE TECHNOLOGIES

Informatyka Automatyka Pomiary w Gospodarce i Ochronie Środowiska ◽

10.5604/20830157.1109372 ◽

2014 ◽

Vol 4 (2) ◽

pp. 41-44 ◽

Cited By ~ 2

Author(s):

Adam Musiał ◽

Piotr Szczepaniak

Keyword(s):

Artificial Intelligence ◽

Character Recognition ◽

Optical Character Recognition ◽

Optical Character

Download Full-text

SELECTION TECHNIQUE FOR MULTIPLE OUTPUTS OF OPTICAL CHARACTER RECOGNITION

Eurasian Journal of Mathematical and Computer Applications ◽

10.32523/2306-6172-2020-8-2-41-51 ◽

2020 ◽

Vol 8 (2) ◽

pp. 41-51

Author(s):

I.Q. Habeeb ◽

Z.Q. Al-Zaydi ◽

H.N. Abdulkhudhur

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Selection Technique ◽

Multiple Outputs ◽

Optical Character

Download Full-text

A Structured Method for the Recognition of Complex Historical Tables

History and Computing ◽

10.3366/hac.1997.9.1-3.58 ◽

1997 ◽

Vol 9 (1-3) ◽

pp. 58-77

Author(s):

Vitaly Kliatskine ◽

Eugene Shchepin ◽

Gunnar Thorvaldsen ◽

Konstantin Zingerman ◽

Valery Lazarev

Keyword(s):

Nineteenth Century ◽

Character Recognition ◽

Optical Character Recognition ◽

Complex Structure ◽

Source Material ◽

Historical Sources ◽

Tax Assessment ◽

Optical Character ◽

Algorithmic Model ◽

Machine Readable

In principle, printed source material should be made machine-readable with systems for Optical Character Recognition, rather than being typed once more. Offthe-shelf commercial OCR programs tend, however, to be inadequate for lists with a complex layout. The tax assessment lists that assess most nineteenth century farms in Norway, constitute one example among a series of valuable sources which can only be interpreted successfully with specially designed OCR software. This paper considers the problems involved in the recognition of material with a complex table structure, outlining a new algorithmic model based on ‘linked hierarchies’. Within the scope of this model, a variety of tables and layouts can be described and recognized. The ‘linked hierarchies’ model has been implemented in the ‘CRIPT’ OCR software system, which successfully reads tables with a complex structure from several different historical sources.

Download Full-text