Reading Systems: An Introduction to Digital Document Processing

Author(s):  
Lambert Schomaker
10.1142/8280
2011
Author(s):
Bidyut Baran Chaudhuri
Swapan Kumar Parui

Author(s):  
MATTHEW Y. MA
JINHONG K. GUO
PATRICK S. P. WANG

XML has been widely used as a metadata format for image retrieval. As a standard, it makes information easier to index and retrieve across different platforms. However, automatically converting an image into XML format remains a challenge. This paper presents a system for generating structured XML documents from digitally captured document images. The system aims to provide an easy-to-use tool for average users that does not require in-depth knowledge of document processing. Further, an XML/XSL generator is developed to represent a document accurately in an XML structure while preserving a representation that reflects its original layout.
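To make the idea of a layout-preserving XML representation concrete, the following is a minimal sketch and not the authors' generator. It assumes that an earlier analysis stage has already segmented the page into blocks; the block fields (`type`, `x`, `y`, `width`, `height`, `text`), the element names, and the function `blocks_to_xml` are all hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as ET


def blocks_to_xml(blocks):
    """Serialize layout blocks into an XML string.

    Each block is a dict with hypothetical keys: type, x, y, width,
    height, text. The position and size are stored as attributes so
    that a stylesheet could later reproduce the original layout.
    """
    root = ET.Element("document")
    for block in blocks:
        elem = ET.SubElement(
            root,
            block["type"],
            {
                "x": str(block["x"]),
                "y": str(block["y"]),
                "width": str(block["width"]),
                "height": str(block["height"]),
            },
        )
        elem.text = block["text"]
    return ET.tostring(root, encoding="unicode")


# Illustrative input: two blocks as a page-segmentation step might emit them.
blocks = [
    {"type": "title", "x": 40, "y": 30, "width": 520, "height": 48,
     "text": "Quarterly Report"},
    {"type": "paragraph", "x": 40, "y": 100, "width": 520, "height": 200,
     "text": "Revenue grew in the third quarter."},
]

xml_text = blocks_to_xml(blocks)
print(xml_text)
```

Keeping the geometry in attributes rather than in the text content separates what a block says from where it sits, which is one plausible way an accompanying XSL stylesheet could render the document in its original layout.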

