Rewiev of current text representation technics for semantic relationship extraction

Michał Gałusza

doi:10.5604/01.3001.0015.2733

Rewiev of current text representation technics for semantic relationship extraction

Computer Science and Mathematical Modelling ◽

10.5604/01.3001.0015.2733 ◽

2021 ◽

Vol 0 (11-12/2020) ◽

pp. 13-22

Author(s):

Michał Gałusza

Keyword(s):

Text Processing ◽

Text Representation ◽

Semantic Relationship ◽

Relationship Extraction

Article provides review on current most popular text processing technics; sketches their evolution and compares sequence and dependency models in detecting semantic relationship between words.

Download Full-text

Cognitive psychology and text processing: From text representation to text-world

Semiotica ◽

10.1515/semi.1989.77.1-3.271 ◽

1989 ◽

Vol 77 (1-3) ◽

Cited By ~ 1

Author(s):

GUY DENHlÈRE ◽

SERGE BAUDET

Keyword(s):

Cognitive Psychology ◽

Text Processing ◽

Text Representation

Download Full-text

Research on Text Representation Model Integrated Semantic Relationship

2015 IEEE International Conference on Systems, Man, and Cybernetics ◽

10.1109/smc.2015.478 ◽

2015 ◽

Cited By ~ 1

Author(s):

Jianlin Zhu ◽

You Fang ◽

Xiaoping Yang ◽

Qian Wang

Keyword(s):

Text Representation ◽

Semantic Relationship ◽

Representation Model

Download Full-text

Learning Conceptual-Contextual Embeddings for Medical Text

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6504 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9579-9586

Author(s):

Xiao Zhang ◽

Dejing Dou ◽

Ji Wu

Keyword(s):

Text Processing ◽

Language Models ◽

Text Representation ◽

Context Model ◽

Language Understanding ◽

Health Records ◽

Medical Text ◽

Wide Range ◽

Structured Knowledge ◽

Embedding Methods

External knowledge is often useful for natural language understanding tasks. We introduce a contextual text representation model called Conceptual-Contextual (CC) embeddings, which incorporates structured knowledge into text representations. Unlike entity embedding methods, our approach encodes a knowledge graph into a context model. CC embeddings can be easily reused for a wide range of tasks in a similar fashion to pre-trained language models. Our model effectively encodes the huge UMLS database by leveraging semantic generalizability. Experiments on electronic health records (EHRs) and medical text processing benchmarks showed our model gives a major boost to the performance of supervised medical NLP tasks.

Download Full-text

SemSeq4FD: Integrating global semantic relationship and local sequential order to enhance text representation for fake news detection

Expert Systems with Applications ◽

10.1016/j.eswa.2020.114090 ◽

2021 ◽

Vol 166 ◽

pp. 114090

Author(s):

Yuhang Wang ◽

Li Wang ◽

Yanjie Yang ◽

Tao Lian

Keyword(s):

Text Representation ◽

Semantic Relationship ◽

Fake News ◽

Sequential Order

Download Full-text

Automation of solving planimetry problems written in Ukrainian

PROBLEMS IN PROGRAMMING ◽

10.15407/pp2020.04.071 ◽

2020 ◽

pp. 071-080

Author(s):

O.P. Zhezherun ◽

◽

O.R. Smysh ◽

◽

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Text Processing ◽

Comprehensive Analysis ◽

Text Representation ◽

Natural Languages ◽

Different Types ◽

Mathematical Problems ◽

Further Development

The article focuses on developing a software solution for solving planimetry problems that are written in Ukrainian. We discuss tendencies and available abilities in Ukrainian natural language processing. Presenting a comprehensive analysis of different types of describing a problem, which shows regularities in the formulation and structure of the text representation of problems. Also, we demonstrate the similarities of writing a problem not only in Ukrainian but also in Belarusian, English, and Russian languages. The final result of the paper is a system that uses the morphosyntactic analyzer to process a problem’s text and provide the answer to it. Ukrainian natural language processing is growing rapidly and showing impressive results. Huge possibilities appear as the Gold standard annotated corpus for Ukrainian language was recently developed. The created architecture is flexible, which indicates the possibility of adding both new geometry figures and their properties, as well as the additional logic to the program. The developed system with a little reformatting can be used with other natural languages, such as English, Belarusian or Russian, as the algorithm for text processing is universal due to the globally accepted representations for presenting such types of mathematical problems. Therefore, the further development of the system is possible.

Download Full-text

ORTHOGRAPHIC CASE RESTORATION USING SUPERVISED LEARNING WITHOUT MANUAL ANNOTATION

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213004001454 ◽

2004 ◽

Vol 13 (01) ◽

pp. 141-156 ◽

Cited By ~ 1

Author(s):

CHENG NIU ◽

WEI LI ◽

JIHONG DING ◽

ROHINI K. SRIHARI

Keyword(s):

Question Answering ◽

Hidden Markov ◽

Text Processing ◽

Traditional Approach ◽

Language Model ◽

Original System ◽

System Complexity ◽

Rule Based ◽

Named Entity ◽

Relationship Extraction

One challenge in text processing is the treatment of case insensitive documents such as speech recognition results. The traditional approach is to re-train a language model excluding case-related features. This paper presents an alternative two-step approach whereby a preprocessing module (Step 1) is designed to restore case-sensitive form which is subsequently processed by the original system (Step 2). Step 1 is mainly implemented as a Hidden Markov Model trained on a large raw corpus of case sensitive documents. It is demonstrated that this approach (i) outperforms the feature exclusion approach for named entity tagging, (ii) leads to limited degradation for parsing, relationship extraction and case insensitive question answering, (iii) reduces system complexity, and (iv) has wide applicability: the restored text can be used in both statistical model and rule-based systems.

Download Full-text