scholarly journals Method of the optical recognition of technical documentation and the transformation of graphic information into machine-readable form for cognitive analysis

2021 ◽  
Vol 2094 (3) ◽  
pp. 032056
Author(s):  
A A Dzyubanenko ◽  
A V Rabin

Abstract The paper proposes the implementation of the method of optical recognition of technical documentation and the transformation of graphic information into a machine-readable form available for cognitive analysis, which is based on the methods of binarization and alignment of images, text segmentation and recognition. The use of the proposed method will provide a dramatic reduction in the costs of cataloging, checking the completeness and inventory of documentation, as well as an increase in design quality due to the semantic analysis of documentation using a knowledge base that is updated automatically. The article presents the development of the algorithm for optical recognition of a document, preparation of an image for optical recognition of a document, an example of the application of the Sauvola method for binarization of an image, and an analysis of the research results. The proposed implementation allows the text recognition on scanned/photographed documents.

1974 ◽  
Vol 7 (4) ◽  
pp. 267
Author(s):  
Alice S. Clark

As more academic and public libraries have some form of bibliographic description of their complete collection available in machine-readable form, public service librarians are devising ways to use the information for better retrieval. Research at the Ohio State University tested user response to paper and COM output from selected areas of the shelflist. Results indicated users at remote locations found such lists helpful, with some indication that paper printout was more popular than microfiche.


1982 ◽  
Vol 4 (4) ◽  
pp. 161-165
Author(s):  
Wolfgang Nedobity

The increased production and publication of professional and scientific literature makes it necessary that abstracts are pro duced in a quick, efficient and economical way. This can be achieved by the mechanization of abstracting. With the aid of computers, extracts can be produced of all kinds of texts which are available in machine-readable form. The main problem of this procedure is how to determine the key sentences of a text, i.e., the passages that contain the most relevant information. Various methods have been developed for this purpose; the one presented here is based on the fact that in order to convey relevant information, subject terminology is used. In many cases subject terminologies are now available in machine-reada ble form too and thus can be easily applied to the automatic production of abstracts.


Sign in / Sign up

Export Citation Format

Share Document