scholarly journals DWIE: An entity-centric dataset for multi-task document-level information extraction

2021 ◽  
Vol 58 (4) ◽  
pp. 102563
Author(s):  
Klim Zaporojets ◽  
Johannes Deleu ◽  
Chris Develder ◽  
Thomas Demeester
2020 ◽  
Author(s):  
Sarthak Jain ◽  
Madeleine van Zuylen ◽  
Hannaneh Hajishirzi ◽  
Iz Beltagy

2003 ◽  
Author(s):  
Rohini K. Srihari ◽  
Wei Li ◽  
Cheng Niu ◽  
Thomas Cornell

2006 ◽  
Vol 14 (01) ◽  
Author(s):  
ROHINI K. SRIHARI ◽  
WEI LI ◽  
THOMAS CORNELL ◽  
CHENG NIU

Electronics ◽  
2021 ◽  
Vol 10 (7) ◽  
pp. 824
Author(s):  
Peng Wang ◽  
Zhenkai Deng ◽  
Ruilong Cui

Extracting financial events from numerous financial announcements is very important for investors to make right decisions. However, it is still challenging that event arguments always scatter in multiple sentences in a financial announcement, while most existing event extraction models only work in sentence-level scenarios. To address this problem, this paper proposes a relation-aware Transformer-based Document-level Joint Event Extraction model (TDJEE), which encodes relations between words into the context and leverages modified Transformer to capture document-level information to fill event arguments. Meanwhile, the absence of labeled data in financial domain could lead models be unstable in extraction results, which is known as the cold start problem. Furthermore, a Fonduer-based knowledge base combined with the distant supervision method is proposed to simplify the event labeling and provide high quality labeled training corpus for model training and evaluating. Experimental results on real-world Chinese financial announcement show that, compared with other models, TDJEE achieves competitive results and can effectively extract event arguments across multiple sentences.


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 29677-29689 ◽  
Author(s):  
Lin Qiu ◽  
Dongyu Ru ◽  
Quanyu Long ◽  
Weinan Zhang ◽  
Yong Yu

Sign in / Sign up

Export Citation Format

Share Document