Chinese Character Relationship Extraction Method Based on BERT

Author(s):  
Dengtao Liu ◽  
Qianchao Wang
2021 ◽  
Author(s):  
Baosheng Yin ◽  
Yifei Sun

Abstract As an important part of information extraction, relationship extraction aims to extract the relationships between given entities from natural language text. On the basis of the pre-training model R-BERT, this paper proposes an entity relationship extraction method that integrates entity dependency path and pre-training model, which generates a dependency parse tree by dependency parsing, obtains the dependency path of entity pair via a given entity, and uses entity dependency path to exclude such information as modifier chunks and useless entities in sentences. This model has achieved good F1 value performance on the SemEval2010 Task 8 dataset. Experiments on dataset show that dependency parsing can provide context information for models and improve performance.


Author(s):  
XINGMING SUN ◽  
LIHUA YANG ◽  
Y. Y. TANG ◽  
YUNFA HU

Stroke extraction of Chinese characters plays an important role in Chinese character information processing such as character recognition, document analysis, document compression and storage, font automation and so on. By analyzing the structure of Chinese characters deeply, this paper developed a novel method to extract strokes of Chinese characters directly from the original character pattern image. Two theorems, eight rules and an algorithm for stroke extraction of Chinese characters are presented. This method can overcome the difficulties encountered in disposing the intersection or connection of different strokes, and can eliminate noises successfully. Our experiments have shown that this method can extract strokes both accurately and efficiently.


2013 ◽  
Vol 774-776 ◽  
pp. 1636-1641
Author(s):  
Ze Min Liu ◽  
Zhi Guo He ◽  
Yu Dong Cao

Feature extraction is very difficult for handwritten Chinese character because of large Chinese characters set, complex structure and very large shape variations. The recognition rate by currently used feature extraction methods is far from the requirements of the people. For this problem, a new supervised independent component analysis (SICA) algorithm based on J-divergence entropy is proposed for feature extraction, which can measure the difference between different categories. The scheme takes full advantage of good extraction local features capability and powerful capability to handle data with non-Gaussian distribution by ICA, and the extracted feature component and classification can be tightly combined. The experiments show that the feature extraction method based on SICA is superior to that of gradient-based and that of ICA.


Sign in / Sign up

Export Citation Format

Share Document