Construction of Word Segmentation Model Based on HMM + BI-LSTM

Author(s):  
Hang Zhang ◽  
Bin Wen
2021 ◽  
Vol 94 ◽  
pp. 107354
Author(s):  
Xingyu Yan ◽  
Xiaofan Xiong ◽  
Xiufeng Cheng ◽  
Yujing Huang ◽  
Haitao Zhu ◽  
...  

Author(s):  
Young-Suk Lee ◽  
Kishore Papineni ◽  
Salim Roukos ◽  
Ossama Emam ◽  
Hany Hassan

2018 ◽  
Vol 5 (19) ◽  
pp. 155444
Author(s):  
Sadiq Khan ◽  
Khairullah Khan ◽  
Wahab Khan

2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Jigen Luo ◽  
Wangping Xiong ◽  
Jianqiang Du ◽  
Yingfeng Liu ◽  
Jianwen Li ◽  
...  

The text similarity calculation plays a crucial role as the core work of artificial intelligence commercial applications such as traditional Chinese medicine (TCM) auxiliary diagnosis, intelligent question and answer, and prescription recommendation. However, TCM texts have problems such as short sentence expression, inaccurate word segmentation, strong semantic relevance, high feature dimension, and sparseness. This study comprehensively considers the temporal information of sentence context and proposes a TCM text similarity calculation model based on the bidirectional temporal Siamese network (BTSN). We used the enhanced representation through knowledge integration (ERNIE) pretrained language model to train character vectors instead of word vectors and solved the problem of inaccurate word segmentation in TCM. In the Siamese network, the traditional fully connected neural network was replaced by a deep bidirectional long short-term memory (BLSTM) to capture the contextual semantics of the current word information. The improved similarity BLSTM was used to map the sentence that is to be tested into two sets of low-dimensional numerical vectors. Then, we performed similarity calculation training. Experiments on the two datasets of financial and TCM show that the performance of the BTSN model in this study was better than that of other similarity calculation models. When the number of layers of the BLSTM reached 6 layers, the accuracy of the model was the highest. This verifies that the text similarity calculation model proposed in this study has high engineering value.


2020 ◽  
Vol 43 ◽  
Author(s):  
Peter Dayan

Abstract Bayesian decision theory provides a simple formal elucidation of some of the ways that representation and representational abstraction are involved with, and exploit, both prediction and its rather distant cousin, predictive coding. Both model-free and model-based methods are involved.


2001 ◽  
Vol 7 (S2) ◽  
pp. 578-579
Author(s):  
David W. Knowles ◽  
Sophie A. Lelièvre ◽  
Carlos Ortiz de Solόrzano ◽  
Stephen J. Lockett ◽  
Mina J. Bissell ◽  
...  

The extracellular matrix (ECM) plays a critical role in directing cell behaviour and morphogenesis by regulating gene expression and nuclear organization. Using non-malignant (S1) human mammary epithelial cells (HMECs), it was previously shown that ECM-induced morphogenesis is accompanied by the redistribution of nuclear mitotic apparatus (NuMA) protein from a diffuse pattern in proliferating cells, to a multi-focal pattern as HMECs growth arrested and completed morphogenesis . A process taking 10 to 14 days.To further investigate the link between NuMA distribution and the growth stage of HMECs, we have investigated the distribution of NuMA in non-malignant S1 cells and their malignant, T4, counter-part using a novel model-based image analysis technique. This technique, based on a multi-scale Gaussian blur analysis (Figure 1), quantifies the size of punctate features in an image. Cells were cultured in the presence and absence of a reconstituted basement membrane (rBM) and imaged in 3D using confocal microscopy, for fluorescently labeled monoclonal antibodies to NuMA (fαNuMA) and fluorescently labeled total DNA.


Sign in / Sign up

Export Citation Format

Share Document