document understanding Latest Research Papers

MC-OCR Challenge 2021: Towards Document Understanding for Unconstrained Mobile-Captured Vietnamese Receipts

10.1109/rivf51545.2021.9642126 ◽

2021 ◽

Author(s):

Hoai Viet Nguyen ◽

Linh Bao Doan ◽

Hoang Viet Trinh ◽

Hoang Huy Phan ◽

Ta Minh Thanh

Keyword(s):

Document Understanding

Que2Search: Fast and Accurate Query and Document Understanding for Search at Facebook

Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining ◽

10.1145/3447548.3467127 ◽

2021 ◽

Author(s):

Yiqun Liu ◽

Kaushik Rangadurai ◽

Yunzhong He ◽

Siddarth Malreddy ◽

Xunlong Gui ◽

...

Keyword(s):

Document Understanding

Deep Understanding of Technical Documents: An Enhancement on Diagrams Understanding

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213021500275 ◽

2021 ◽

Vol 30 (05) ◽

pp. 2150027

Author(s):

Michail S. Alexiou ◽

Nikolaos Gkorgkolis ◽

Sukarno Mertoguno ◽

Nikolaos G. Bourbakis

Keyword(s):

Petri Net ◽

Deep Understanding ◽

Complex Structures ◽

Technical Document ◽

Stochastic Petri Net ◽

Document Understanding ◽

Processing Step ◽

Holistic Understanding ◽

Mathematical Formulas ◽

Technical Documents

Humans are capable of understanding the knowledge that is included in technical documents automatically by consciously combining the information that is presented in the document’s individual modalities. These modalities are mathematical formulas, charts, tables, diagram images and etc. In this paper, we significantly enhance a previously presented technical document understanding methodology3 that emulates the way that humans also perceive information. More specifically, we make the original diagram understanding methodology adaptive to larger architectures with more complex structures and modules. The overall understanding methodology results in the generation of a Stochastic Petri-net (SPN) graph that describes the system’s functionality. Finally, we conclude with the introduction of the hierarchical association of different diagram images from the same technical document. This processing step aims to provide a holistic understanding of all illustrated diagram information.

Deep learning for graphics recognition: document understanding and beyond

International Journal on Document Analysis and Recognition (IJDAR) ◽

10.1007/s10032-021-00372-6 ◽

2021 ◽

Author(s):

Jean-Christophe Burie ◽

Alicia Fornés ◽

K. C. Santosh ◽

Muhammad Muzzamil Luqman

Keyword(s):

Deep Learning ◽

Graphics Recognition ◽

Document Understanding

Research on the Importance of Data Enhancement Technology in Power Document Understanding

Journal of Physics Conference Series ◽

10.1088/1742-6596/1827/1/012041 ◽

2021 ◽

Vol 1827 (1) ◽

pp. 012041

Author(s):

Ming Gao ◽

Jiayan Wang ◽

Wenfei Zhang ◽

Dehui Wang ◽

Zheng Peng ◽

...

Keyword(s):

Document Understanding ◽

Enhancement Technology

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding

10.18653/v1/2021.acl-long.201 ◽

2021 ◽

Author(s):

Yang Xu ◽

Yiheng Xu ◽

Tengchao Lv ◽

Lei Cui ◽

Furu Wei ◽

...

Keyword(s):

Document Understanding

Decontextualization: Making Sentences Stand-Alone

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00377 ◽

2021 ◽

Vol 9 ◽

pp. 447-461

Author(s):

Eunsol Choi ◽

Jennimaria Palomaki ◽

Matthew Lamm ◽

Tom Kwiatkowski ◽

Dipanjan Das ◽

...

Keyword(s):

Question Answering ◽

Document Understanding ◽

Local Window

Abstract Models for question answering, dialogue agents, and summarization often interpret the meaning of a sentence in a rich context and use that meaning in a new context. Taking excerpts of text can be problematic, as key pieces may not be explicit in a local window. We isolate and define the problem of sentence decontextualization: taking a sentence together with its context and rewriting it to be interpretable out of context, while preserving its meaning. We describe an annotation procedure, collect data on the Wikipedia corpus, and use the data to train models to automatically decontextualize sentences. We present preliminary studies that show the value of sentence decontextualization in a user-facing task, and as preprocessing for systems that perform document understanding. We argue that decontextualization is an important subtask in many downstream applications, and that the definitions and resources provided can benefit tasks that operate on sentences that occur in a richer context.

Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

10.1007/978-3-030-86331-9_47 ◽

2021 ◽

pp. 732-747

Author(s):

Rafał Powalski ◽

Łukasz Borchmann ◽

Dawid Jurkiewicz ◽

Tomasz Dwojak ◽

Michał Pietruszka ◽

...

Keyword(s):

Document Understanding

CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding

10.18653/v1/2021.findings-acl.157 ◽

2021 ◽

Author(s):

Dustin Wright ◽

Isabelle Augenstein

Keyword(s):

Document Understanding

TRIE: End-to-End Text Reading and Information Extraction for Document Understanding

Proceedings of the 28th ACM International Conference on Multimedia ◽

10.1145/3394171.3413900 ◽

2020 ◽

Author(s):

Peng Zhang ◽

Yunlu Xu ◽

Zhanzhan Cheng ◽

Shiliang Pu ◽

Jing Lu ◽

...

Keyword(s):

Information Extraction ◽

Document Understanding ◽

Text Reading ◽

End To End

document understanding
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

MC-OCR Challenge 2021: Towards Document Understanding for Unconstrained Mobile-Captured Vietnamese Receipts

Que2Search: Fast and Accurate Query and Document Understanding for Search at Facebook

Deep Understanding of Technical Documents: An Enhancement on Diagrams Understanding

Deep learning for graphics recognition: document understanding and beyond

Research on the Importance of Data Enhancement Technology in Power Document Understanding

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding

Decontextualization: Making Sentences Stand-Alone

Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding

TRIE: End-to-End Text Reading and Information Extraction for Document Understanding

Export Citation Format

document understandingRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

MC-OCR Challenge 2021: Towards Document Understanding for Unconstrained Mobile-Captured Vietnamese Receipts

Que2Search: Fast and Accurate Query and Document Understanding for Search at Facebook

Deep Understanding of Technical Documents: An Enhancement on Diagrams Understanding

Deep learning for graphics recognition: document understanding and beyond

Research on the Importance of Data Enhancement Technology in Power Document Understanding

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding

Decontextualization: Making Sentences Stand-Alone

Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding

TRIE: End-to-End Text Reading and Information Extraction for Document Understanding

document understanding
Recently Published Documents