Improving Text Similarity Measurement by Critical Sentence Vector Model

Information Retrieval Technology - Lecture Notes in Computer Science ◽

10.1007/11562382_44 ◽

2005 ◽

pp. 522-527 ◽

Author(s):

Wei Li ◽

Kam-Fai Wong ◽

Chunfa Yuan ◽

Wenjie Li ◽

Yunqing Xia

Keyword(s):

Vector Model ◽

Similarity Measurement ◽

Text Similarity ◽

Critical Sentence

Short Answer Scoring in English Grammar Using Text Similarity Measurement

2018 International Conference on Computing, Engineering, and Design (ICCED) ◽

10.1109/icced.2018.00034 ◽

2018 ◽

Author(s):

Akeem Olowolayemo ◽

Santhy David Nawi ◽

Teddy Mantoro

Keyword(s):

Similarity Measurement ◽

Text Similarity ◽

Short Answer ◽

English Grammar

Measurement of Text Similarity: A Survey

Information ◽

10.3390/info11090421 ◽

2020 ◽

Vol 11 (9) ◽

pp. 421

Author(s):

Jiapeng Wang ◽

Yihong Dong

Keyword(s):

Language Processing ◽

Question Answering ◽

Semantic Distance ◽

Similarity Measurement ◽

Text Representation ◽

Text Similarity ◽

Discussion Section ◽

Advantages And Disadvantages ◽

Comprehensive Classification ◽

Distribution Distance

Text similarity measurement underlies many natural language processing tasks and plays an important role in information retrieval, automatic question answering, machine translation, dialogue systems, and document matching. This paper systematically reviews the state of research on similarity measurement, analyzes the advantages and disadvantages of current methods, develops a more comprehensive classification system for text similarity measurement algorithms, and summarizes future development directions. To provide a reference for related research and applications, text similarity measurement methods are described from two aspects: text distance and text representation. Text distance is divided into length distance, distribution distance, and semantic distance; text representation is divided into string-based, corpus-based, single-semantic text, multi-semantic text, and graph-structure-based representation. Finally, the development of text similarity is summarized in the discussion section.
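
As a rough illustration of what a text distance computed over a simple text representation can look like, the sketch below scores two strings with cosine similarity over word-count vectors and with Jaccard similarity over word sets. Both measures are common choices picked here for illustration only; they are not taken from the survey, and the helper names (`tokenize`, `cosine_similarity`, `jaccard_similarity`) and the whitespace tokenizer are assumptions.

```python
import math
from collections import Counter


def tokenize(text: str) -> list[str]:
    # Assumption: lowercase whitespace tokenization; real systems
    # typically use a proper tokenizer or word segmenter.
    return text.lower().split()


def cosine_similarity(a: str, b: str) -> float:
    # Represent each text as a bag-of-words count vector.
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    # Dot product over the words the two texts share.
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no direction to compare
    return dot / (norm_a * norm_b)


def jaccard_similarity(a: str, b: str) -> float:
    # Represent each text as a set of distinct words.
    sa, sb = set(tokenize(a)), set(tokenize(b))
    if not sa and not sb:
        return 1.0  # convention chosen here: two empty texts are identical
    return len(sa & sb) / len(sa | sb)
```

Both functions return a score between 0 and 1, and neither accounts for word order or meaning, which is the kind of limitation that semantic distances and richer representations aim to address.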

Sequential Pattern Mining Algorithm Based on Text Data: Taking the Fault Text Records as an Example

Sustainability ◽

10.3390/su10114330 ◽

2018 ◽

Vol 10 (11) ◽

pp. 4330

Author(s):

Xinglong Yuan ◽

Wenbing Chang ◽

Shenghan Zhou ◽

Yang Cheng

Keyword(s):

Time Series ◽

Pattern Mining ◽

Sequential Pattern Mining ◽

Sequential Pattern ◽

Fault Classification ◽

Sequential Patterns ◽

Series Data ◽

Similarity Measurement ◽

Text Similarity ◽

Sequential pattern mining (SPM) is an effective and important method for analyzing time series. This paper proposes an SPM algorithm to mine fault sequential patterns in text data. Because text data is poorly structured and the same concept can be expressed in many different textual forms, traditional SPM algorithms cannot be applied to text data directly. The proposed algorithm is designed to solve this problem. First, this study measures the similarity of fault text data and groups similar faults into one class. To do so, the paper proposes a new text similarity measurement model based on word embedding distance. Compared with the classic text similarity measurement method, this model achieves good results in short text classification. Then, on the basis of the fault classification, the paper proposes an SPM algorithm with an event window, a soft time constraint that makes it possible to obtain a desired number of sequential patterns. Finally, the study uses the fault text records of a certain aircraft as experimental data for mining fault sequential patterns. Experiments show that the algorithm can effectively mine sequential patterns in text data. The proposed algorithm can be widely applied to text time series data in many fields, such as industry, business, and finance.

Research on text similarity computing based on word vector model of neural networks

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) ◽

10.1109/icsess.2015.7339221 ◽

2015 ◽

Author(s):

Yuan Sun ◽

Weikang Li ◽

Peilei Dong

Keyword(s):

Neural Networks ◽

Vector Model ◽

Text Similarity

A Text Similarity Measurement Method Based on Singular Value Decomposition and Semantic Relevance

Journal of Information Processing Systems ◽

10.3745/jips.02.0067 ◽

2017 ◽

Author(s):

Xu Li ◽

Chunlong Yao ◽

Fenglong Fan ◽

Xiaoqiang Yu

Keyword(s):

Singular Value Decomposition ◽

Measurement Method ◽

Singular Value ◽

Similarity Measurement ◽

Text Similarity ◽

Semantic Relevance ◽

Value Decomposition

A Text Similarity Measurement Combining Word Semantic Information with TF-IDF Method

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2011.00856 ◽

2011 ◽

Vol 34 (5) ◽

pp. 856-864 ◽

Author(s):

Cheng-Hui HUANG ◽

Jian YIN ◽

Fang HOU

Keyword(s):

Semantic Information ◽

Similarity Measurement ◽

Text Similarity

Short Text Similarity Measurement Using Context from Bag of Word Pairs and Word Co-occurrence

Communications in Computer and Information Science - Data Science ◽

10.1007/978-981-15-2810-1_22 ◽

2020 ◽

pp. 221-231

Author(s):

Shuiqiao Yang ◽

Guangyan Huang ◽

Bahadorreza Ofoghi

Keyword(s):

Similarity Measurement ◽

Text Similarity ◽

Short Text Similarity

Chinese Text Similarity Algorithm Based on Part-of-Speech Tagging and Word Vector Model

Journal of Computers ◽

10.17706/jcp.14.4.311-317 ◽

2019 ◽

Vol 14 (4) ◽

pp. 311-317

Author(s):

Zhixin Ma

Keyword(s):

Chinese Text ◽

Vector Model ◽

Text Similarity ◽

Part Of Speech Tagging ◽

Part Of Speech ◽

Similarity Algorithm

Research on the Text Length’s Effect of the Text Similarity Measurement

Communications in Computer and Information Science - Information and Automation ◽

10.1007/978-3-642-19853-3_16 ◽

2011 ◽

pp. 112-117

Author(s):

Yan Niu ◽

Yongchao Chen

Keyword(s):

Similarity Measurement ◽

Text Similarity

Text Similarity Measurement Method Based on BiLSTM-SECapsNet Model

10.1109/icivc52351.2021.9527010 ◽

2021 ◽

Author(s):

Shanping Zhang ◽

Xiaowei Xu ◽

Ye Tao ◽

Xiaodong Wang ◽

Qiuchen Wang ◽

...

Keyword(s):

Measurement Method ◽

Similarity Measurement ◽

Text Similarity
