A New Similarity Measure for Automatic Construction of the Unknown Word Lexical Dictionary

Author(s):  
Myunggwon Hwang ◽  
Pankoo Kim

This paper addresses the automatic construction of a lexical dictionary of unknown words as a form of automatic lexical dictionary expansion. Lexical dictionaries have been applied usefully in various fields for semantic information processing, but they can only process terms that are defined in the dictionary. To address this limitation, the concept of an “Unknown Word (UW)” is defined: a UW is a word that is not defined in WordNet, a representative existing lexical dictionary. The paper proposes a new method that constructs a UW lexical dictionary from various document collections scattered across the Web. The authors identify terms related to a UW and measure the semantic relatedness (similarity) between the UW and each related term. The relatedness is obtained by calculating both a probabilistic relationship and a semantic relationship. This research can extend the UW lexical dictionary with a large number of UWs. It also lays a foundation for semantic retrieval that uses the UW lexical dictionary and WordNet together.
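The abstract does not give the formula for the probabilistic relationship, so the following is only a minimal sketch of one plausible reading: the share of documents containing the UW that also contain a candidate related term. The function name `cooccurrence_probability` and the tokenized-document input are assumptions made for illustration, and the WordNet-based semantic component is deliberately left out.

```python
from typing import Iterable, List


def cooccurrence_probability(
    documents: Iterable[List[str]], unknown_word: str, term: str
) -> float:
    """Estimate P(term appears | unknown_word appears) over a document collection.

    This is an illustrative stand-in for the "probabilistic relationship"
    mentioned in the abstract, not the authors' actual measure.
    """
    # Keep only the documents that mention the unknown word.
    docs_with_uw = [set(doc) for doc in documents if unknown_word in doc]
    if not docs_with_uw:
        return 0.0
    # Count how many of those documents also mention the candidate term.
    hits = sum(1 for doc in docs_with_uw if term in doc)
    return hits / len(docs_with_uw)


if __name__ == "__main__":
    # Hypothetical toy corpus, already tokenized.
    corpus = [
        ["blog", "post", "web"],
        ["blog", "web"],
        ["post", "mail"],
    ]
    print(cooccurrence_probability(corpus, "blog", "web"))
    print(cooccurrence_probability(corpus, "blog", "post"))
```

A full relatedness score in the spirit of the paper would combine this value with a semantic score for the related term; how the two are combined is not stated in the abstract.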

2014 ◽  
Vol 701-702 ◽  
pp. 413-417
Author(s):  
Jie Ran ◽  
Ji Ya Huang ◽  
Zu Xiao

Word similarity computation is a crucial problem in information processing technology. This paper proposes an integrated word similarity computing method that analyzes morpheme similarity, word order similarity, and word length similarity; the parameters of the method are determined experimentally. The experiments show that the method is highly efficient.
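As a rough sketch of how three such components might be integrated, the example below combines them with a weighted sum. Everything specific here is an assumption: characters stand in for morphemes, word order similarity is approximated with the longest common subsequence, and the weights `(0.6, 0.2, 0.2)` are placeholders rather than the experimentally determined parameters from the paper.

```python
from typing import Tuple


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of two strings."""
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def morpheme_similarity(a: str, b: str) -> float:
    """Jaccard overlap of characters, used here as a stand-in for morphemes."""
    set_a, set_b = set(a), set(b)
    if not set_a or not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)


def order_similarity(a: str, b: str) -> float:
    """Share of the longer word covered by an order-preserving match."""
    longest = max(len(a), len(b))
    return lcs_length(a, b) / longest if longest else 0.0


def length_similarity(a: str, b: str) -> float:
    """Ratio of the shorter word length to the longer one."""
    longest = max(len(a), len(b))
    return min(len(a), len(b)) / longest if longest else 0.0


def integrated_similarity(
    a: str, b: str, weights: Tuple[float, float, float] = (0.6, 0.2, 0.2)
) -> float:
    """Weighted sum of the three component similarities (weights are assumed)."""
    w_morph, w_order, w_len = weights
    return (
        w_morph * morpheme_similarity(a, b)
        + w_order * order_similarity(a, b)
        + w_len * length_similarity(a, b)
    )


if __name__ == "__main__":
    print(integrated_similarity("计算机", "计算器"))
```

Keeping the weights as a parameter mirrors the abstract's point that they are tuned by experiment rather than fixed in advance.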


2021 ◽  
Author(s):  
Feng Ma ◽  
Ying Yang ◽  
Tilei Gao

1990 ◽  
Vol 37 ◽  
pp. 51-58
Author(s):  
Carolien Schouten-van Parreren

Within the larger framework of a project on Mixed Ability Teaching, a qualitative experiment was carried out on the individual differences between pupils of very different ability ranges when learning French. The experiment was meant to give insight into the nature of the differences in vocabulary learning and reading strategies. 69 pupils (12-15 years old) of very different ability ranges (but educated together) were presented with a variety of vocabulary learning and reading tasks. They worked individually or in pairs and were asked to think aloud. The following tasks were used: 1) while reading a story, guessing the meaning of unknown words from the context, 2) after having read a story, memorizing the meaning of unknown words by means of vocabulary cards, 3) intensive reading of a relatively difficult illustrated story, 4) recalling the meaning of new words incidentally acquired (or not) while reading a story, 5) doing an exercise involving different reading strategies. The analysis of the protocol records focused on the causes of the differences between weak and strong pupils. The differences found could be related to two relevant general strategies: guessing the meaning of an unknown word from the context and analyzing the word form of an unknown word. The main results were the following: 1) the attention of weak pupils tends to be drawn exclusively by one source of information; weak pupils are not able to integrate information from different sources (advance knowledge, text, word forms, context, illustrations, cues), 2) weak pupils take no account whatsoever of the sentence structure, 3) weak pupils have difficulties in generalizing from a new word to an already known word (in the target language or in the mother tongue). The article concludes with some implications for foreign language teaching.


2011 ◽  
Vol 109 ◽  
pp. 612-616 ◽  
Author(s):  
Dun Li ◽  
Wei Tu ◽  
Lei Shi

New word identification is one of the difficult problems in Chinese information processing. This paper presents a new method to identify new words. First, the text is segmented using N-Gram; then PPM is used to identify the new words in the text; finally, the newly identified words are added to the dictionary, which is updated using LRU. Compared with three well-known word segmentation systems, the experimental results show that this method can improve the precision and recall of new word identification to a certain extent.
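The last step of the pipeline above, updating the dictionary with LRU, can be sketched as a bounded word store that evicts the least recently used entry when full. The class name `LRUDictionary`, its `touch` method, and the fixed `capacity` are assumptions for illustration; the N-Gram segmentation and PPM identification steps are not reproduced here.

```python
from collections import OrderedDict
from typing import List


class LRUDictionary:
    """Bounded word dictionary that evicts the least recently used word."""

    def __init__(self, capacity: int) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        # Insertion order doubles as recency order: oldest first, newest last.
        self._words: "OrderedDict[str, bool]" = OrderedDict()

    def touch(self, word: str) -> None:
        """Record a use of `word`, adding it if it is new."""
        if word in self._words:
            # Known word: mark it as most recently used.
            self._words.move_to_end(word)
            return
        self._words[word] = True
        if len(self._words) > self.capacity:
            # Drop the entry that has gone unused the longest.
            self._words.popitem(last=False)

    def __contains__(self, word: str) -> bool:
        return word in self._words

    def words(self) -> List[str]:
        """Words ordered from least to most recently used."""
        return list(self._words)


if __name__ == "__main__":
    dictionary = LRUDictionary(capacity=2)
    for candidate in ["网购", "点赞", "网购", "刷屏"]:
        dictionary.touch(candidate)
    print(dictionary.words())
```

An eviction policy like this keeps the dictionary from growing without bound as candidate words accumulate, at the cost of forgetting words that stop appearing.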


Author(s):  
Fatima Zahrae El Malaki

Do Moroccan EFL learners depend on the context to infer the meaning of unknown words occurring in sentences? This study investigates the way intermediate and advanced learners infer the meaning of fake words. To this end, the subjects took a test consisting of 60 items with three multiple choices. For each item, subjects chose, without using dictionaries, among an appropriate meaning of the unknown word, an inappropriate meaning, or none of the choices. Chi-square tests were used to determine whether there is a) a statistically significant difference between the three categories and b) a statistically significant difference between intermediate and advanced learners’ inferencing results. The findings demonstrate that the context, along with the lexical knowledge of the L2 learners, plays the most important role in understanding vocabulary.


Author(s):  
Suwan Tongphu

Similarity measurement is a classical problem in Description Logic that aims at identifying the similarity between concepts in an ontology. Computing a hierarchy distance among concepts in an ontology is one popular technique. However, a major drawback of this technique is that it usually ignores an analysis of the concept definitions. This work introduces a new method for similarity measurement. The proposed system semantically analyzes the structures of two concept descriptions and then computes the similarity score based on the number of shared features. The efficiency of the proposed algorithm is measured by means of the satisfaction of desirable properties and intensive experiments on the SNOMED CT ontology.
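To make the idea of scoring by shared features concrete, here is a deliberately simplified sketch that flattens each concept description into a set of feature strings and takes the Jaccard ratio. The flattening, the feature names, and the function `shared_feature_similarity` are all assumptions for illustration; the paper's structural analysis of Description Logic concept descriptions is richer than a flat set comparison.

```python
from typing import AbstractSet


def shared_feature_similarity(
    features_a: AbstractSet[str], features_b: AbstractSet[str]
) -> float:
    """Jaccard ratio: shared features divided by all distinct features."""
    if not features_a and not features_b:
        # Two empty descriptions are treated as identical by convention here.
        return 1.0
    shared = features_a & features_b
    combined = features_a | features_b
    return len(shared) / len(combined)


if __name__ == "__main__":
    # Hypothetical, hand-written feature sets, not taken from any real ontology.
    concept_a = {"is-a:Disease", "finding-site:Lung", "agent:Bacteria"}
    concept_b = {"is-a:Disease", "finding-site:Lung", "agent:Virus"}
    print(shared_feature_similarity(concept_a, concept_b))
```

A measure of this shape satisfies some commonly desired properties, such as symmetry and a score of 1 for identical descriptions, which is the kind of property check the abstract refers to.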


2021 ◽  
Vol 2140 (1) ◽  
pp. 012017
Author(s):  
V Y Zhukov ◽  
I N Rostokin ◽  
G G Shchukin

The article considers the issues of improving the quality of vertical wind profile measurements. In order to make these measurements under any meteorological conditions, data from profilometers operating in different wave ranges, from optical to centimetre, are combined. At the same time, the resolving power of the latter is considerably inferior to that of the former. We propose a way to improve the quality of measurements in the radio band by applying a new method of information processing, using estimates of the width of the spectrum and the peculiarities of their spatial distribution.

