Improving Natural Language Parser Accuracy by Unknown Word Replacement

Author(s):  
Raihan Kibria ◽  
Khandaker Tabin Hasan
2011 ◽  
Vol 474-476 ◽  
pp. 460-465
Author(s):  
Bo Sun ◽  
Sheng Hui Huang ◽  
Xiao Hua Liu

Unknown word is a kind of word that is not included in the sub_word vocabulary, but must be cut out by the word segmentation program. Peoples’ names, place names and translated names are the major unknown words.Unknown Chinese words is a difficult problem in natural language processing, and also contributed to the low rate of correct segmention. This paper introduces the finite multi-list method that using the word fragments’ capability to composite a word and the location in the word tree to process the unknown Chinese words.The experiment recall is 70.67% ,the correct rate is 43.65% .The result of the experiment shows that unknown Chinese word identification based on the finite multi-list method is feasible.


1987 ◽  
Vol 32 (1) ◽  
pp. 33-34
Author(s):  
Greg N. Carlson
Keyword(s):  

2012 ◽  
Author(s):  
Loes Stukken ◽  
Wouter Voorspoels ◽  
Gert Storms ◽  
Wolf Vanpaemel
Keyword(s):  

2004 ◽  
Author(s):  
Harry E. Blanchard ◽  
Osamuyimen T. Stewart
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document