Chinese Word Segmentation Based on the First Kind of Spline Weight Function Neural Networks

2014 ◽  
Vol 513-517 ◽  
pp. 683-686 ◽  
Author(s):  
Dai Yuan Zhang ◽  
Yan Xu

With the continuous development of information technology, word segmentation technology becomes an important link in dealing with the increasing amount of information conveniently. Different from English word segmented by spaces, Chinese writing is continuous, and there is no space between words, this brings a lot of trouble to word segmentation. In this article, through the analysis of different property of words to get a code that can be used for training and combined it with the first kind of spline weight function neural network, then by training a large number of existing rules encoding to generate a study method that can divide the statement correctly.

2019 ◽  
Vol 15 (3) ◽  
pp. 47-62 ◽  
Author(s):  
Chenghai Yu ◽  
Shupei Wang ◽  
Jiajun Guo

Chinese word segmentation is the basis of the Chinese natural language processing (NLP). With the development of the deep learning, various neural network models are applied to the Chinese word segmentation. However, current neural network models have the characteristics of artificial feature extraction, nonstandard word-weight, inability to effectively use long-distance information and long training time of models in Chinese word segmentation. To solve a series of problems, this article presents a CNN-Bidirectional GRU-CRF neural network model (CNN Bidirectional GRU CRF Network, CBiGCN), which breaks through the limit of conventional method window, truly realizes end-to-end processing and applies to the neural network model by the five-Tag set method, bias-variable-weight greedy strategy and supplements by Goldstein-Armijo guidelines. Besides, this model, with simple structure, is easy to be operated. And it can automatically learn features, reduces large amounts of tasks on specific knowledge in the form of handcrafted features and data pre-processing, makes use of context information effectively. The authors set an experiment with two data corpuses for Chinese word segmentation to evaluate their system. The experiment verified their new model can obtain better Chinese word segmentation results and greatly reduce training time.


2014 ◽  
Vol 687-691 ◽  
pp. 1540-1543 ◽  
Author(s):  
Ke Zhu

Under the influence of network information resources in exponentially growing, we are entering the information society, and all aspects of our lives has permeated with information technology. But the Chinese word segmentation technology for processing Chinese information becomes more and more important. Therefore, this paper sets out a series of Chinese word segmentation techniques, which mainly consists of Chinese word segmentation technology based on statistic, Chinese word segmentation techniques based on dictionary and hybrid techniques of Chinese word segmentation and segmentation technology based on knowledge and understanding.


Sign in / Sign up

Export Citation Format

Share Document