Short Text Matching Algorithm Based on BERT

2021 ◽  
2015 ◽  
Vol 24 (6) ◽  
pp. 849-866 ◽  
Author(s):  
Fuat Basık ◽  
Buğra Gedik ◽  
Hakan Ferhatosmanoğlu ◽  
Mert Emin Kalender

Complexity ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Yin Xu ◽  
Hong Ma

Machine learning enables machines to learn rules from a large amount of data input from the outside world through algorithms, so as to identify and judge. It is the main task of the government to further emphasize the importance of improving the housing security mechanism, expand the proportion of affordable housing, increase financial investment, improve the construction quality of affordable housing, and ensure fair distribution. It can be seen that the legal system of housing security is essentially a system to solve the social problems brought by housing marketization, and it is an important part of the whole national housing system. More and more attention has been paid to solving the housing difficulties of low- and middle-income people and establishing a housing security legal system suitable for China’s national conditions and development stage. Aiming at the deep learning problem, a text matching algorithm suitable for the field of housing law and policy is proposed. Classifier based on matching algorithm is a promising classification technology. The research on the legal system of housing security is in the exploratory stage, involving various theoretical and practical research studies. Compare the improved depth learning algorithm with the general algorithm, so as to clearly understand the advantages and disadvantages of the improved depth learning algorithm and depth learning algorithm. This paper introduces the practical application of the deep learning model and fast learning algorithm in detail. Creatively put forward to transform it into an independent public law basis or into an independent savings system.


2021 ◽  
pp. 1-13
Author(s):  
Jiawen Shi ◽  
Hong Li ◽  
Chiyu Wang ◽  
Zhicheng Pang ◽  
Jiale Zhou

Short text matching is one of the fundamental technologies in natural language processing. In previous studies, most of the text matching networks are initially designed for English text. The common approach to applying them to Chinese is segmenting each sentence into words, and then taking these words as input. However, this method often results in word segmentation errors. Chinese short text matching faces the challenges of constructing effective features and understanding the semantic relationship between two sentences. In this work, we propose a novel lexicon-based pseudo-siamese model (CL2 N), which can fully mine the information expressed in Chinese text. Instead of utilizing a character-sequence or a single word-sequence, CL2 N augments the text representation with multi-granularity information in characters and lexicons. Additionally, it integrates sentence-level features through single-sentence features as well as interactive features. Experimental studies on two Chinese text matching datasets show that our model has better performance than the state-of-the-art short text matching models, and the proposed method can solve the error propagation problem of Chinese word segmentation. Particularly, the incorporation of single-sentence features and interactive features allows the network to capture the contextual semantics and co-attentive lexical information, which contributes to our best result.


Mathematics ◽  
2021 ◽  
Vol 9 (10) ◽  
pp. 1129
Author(s):  
Shihong Chen ◽  
Tianjiao Xu

QA matching is a very important task in natural language processing, but current research on text matching focuses more on short text matching rather than long text matching. Compared with short text matching, long text matching is rich in information, but distracting information is frequent. This paper extracted question-and-answer pairs about psychological counseling to research long text QA-matching technology based on deep learning. We adjusted DSSM (Deep Structured Semantic Model) to make it suitable for the QA-matching task. Moreover, for better extraction of long text features, we also improved DSSM by enriching the text representation layer, using a bidirectional neural network and attention mechanism. The experimental results show that BiGRU–Dattention–DSSM performs better at matching questions and answers.


Author(s):  
Pooja Kudi ◽  
Amitkumar Manekar ◽  
Kavita Daware ◽  
Tejaswini Dhatrak

2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Li Xiang ◽  
Li ZongXun

The majority of the traditional methods deal with text matching at the word level which remains uncertain as the text semantic features are ignored. This also leads to the problems of low recall and high space utilization of text matching while the comprehensiveness of matching results is poor. The resultant method, thus, cannot process long text and short text simultaneously. The current study proposes a text matching algorithm for Korean Peninsula language knowledge base based on density clustering. Using the deep multiview semantic document representation model, the semantic vector of the text to be matched is captured for semantic dependency which is utilized to extract the text semantic features. As per the feature extraction outcomes, the text similarity is calculated by subtree matching method, and a semantic classification model based on SWEM and pseudo-twin network is designed for semantic text classification. Finally, the text matching of Korean Peninsula language knowledge base is carried out by applying density clustering algorithm. Experimental results show that the proposed method has high matching recall rate with low space requirements and can effectively match long and short texts concurrently.


2020 ◽  
Author(s):  
Zhe Hu ◽  
Zuohui Fu ◽  
Cheng Peng ◽  
Weiwei Wang
Keyword(s):  

2020 ◽  
Author(s):  
Lu Chen ◽  
Yanbin Zhao ◽  
Boer Lyu ◽  
Lesheng Jin ◽  
Zhi Chen ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document