Indonesian Shallow Stemmer for Text Reading Support System

Author(s):  
Hajime Mochizuki ◽  
Yuhei Nakamura ◽  
Kohji Shibano
2012 ◽  
Vol 2 (3) ◽  
pp. 5-6
Author(s):  
Kristina HMELJAK SANGAWA

Having received a lively response to our call for papers on the lexicography of Japanese as a second language, the editorial board decided to dedicate two issues of this year's ALA to this theme, and I am happy to introduce the second round of papers, after the first thematic issue published in October this year.This issue is again divided into two parts. The first two papers offer analyses of two aspects of existing dictionaries from the point of view of Japanese language learners, while the following four papers present particular lexicographic projects for learners of Japanese as a foreign language.The first paper, by Kanako Maebo, entitled A survey of register labelling in Japanese dictionaries - Towards the labelling of words in dictionaries for learners of Japanese, analyses register labelling in existing dictionaries of Japanese, both in those expressly intended for learners of Japanese as a second language and those intended for native speakers, pointing out how register information provided by such dictionaries is not sufficient for L2 language production. After stressing the usefulness of usage examples for learners trying to write in Japanese, she offers an example of a corpus-based register analysis and proposes a typology of labels to be assigned to dictionary entries, calling for the development of corpora of different genres to be used for lexical analysis.In the second paper, An analysis of the efficiency of existing kanji indexes and development of a coding-based index, Galina N. Vorobеva and Victor M. Vorobеv tackle one of the most time-consuming tasks learners of Japanese are confronted with: looking up unknown Chinese characters. After a comprehensive description of existing indexes, including less known indexing systems developed by Japanese, Chinese, Russian and German researchers, they compare the efficiency of these systems using the concept of selectivity, and propose their own coding-based system. Although searching for unknown characters is becoming increasingly easy with the use of optical character recognition included in portable electronic dictionaries, tablets and smart-phones, not all learners have yet access to such devices. Efficient indexes for accessing information on Chinese characters are therefore still a valuable tool to support language learners in this most tedious task, while the ability to decompose a character into component parts remains an important basis for character memorisation.The second part of this issue presents four projects aimed at supporting particular lexical needs of learners of Japanese as a second language.In the first paper, Development of a learners' dictionary of polysemous Japanese words and some proposals for learners’ lexicography, Shingo Imai presents a new lexicographic approach to the description of polysemous words. As Imai rightfully stresses, the most basic and common words learned by beginning language learners are actually often very polysemous; being deceivingly simple at first glance, they are often introduced with simple glosses or basic prototypical examples at the first stages of learning, and later treated as known words in intermediate or advanced textbooks, even if used for less common senses which are still unknown to the learners, causing much confusion. In the dictionary series presented here, polysemous headwords are thoroughly and systematically described within their semantic networks, where the connections between core and derived meanings are schematically visualised and exemplified.The following two papers present two of the first and most popular web-based systems for Japanese language learning support, both of which have been developing for more than a decade, supporting Japanese language learners all over the world.Reading Tutor, a reading support system for Japanese language learners, presented by Yoshiko Kawamura, is a widely known and used system based at Tokyo International University, which offers automatic glossing of Japanese text with Japanese definitions and examples, and translations into 28 languages. After introducing the system, its development, functionalities and its tools for signalling the level of difficulty of single words, characters, or whole Japanese texts, the author describes its possible uses in language instruction and autonomous learning, and one concrete example of its application to the development of learning material for a specific segment of learners, foreign candidates to the Japanese national examination for certified care workers, mostly Filipino and Indonesian nurses working in Japan. The author concludes with suggestions for fostering autonomous vocabulary learning.The other Japanese language learning support system with an equally long and successful tradition, developed at Tokyo Institute of Technology, is presented by its initiator, Kikuko Nishina, and one of its younger developers, Bor Hodošček, in Japanese Learning Support Systems: Hinoki Project Report. The article presents the many components of this successful system, including Asunaro, a reading support system aimed especially at science and engineering students and speakers of underrepresented Asian languages, Natsume, a writing assistance system using large-scale corpora to support collocation search, Natane, a learner corpus, and Nutmeg, an automatic error correction system for learners' writing.The last project report, by Tomaž Erjavec and myself, introduces resources and tools being developed at the University of Ljubljana and at Jožef Stefan Institute: JaSlo: Integration of a Japanese-Slovene Bilingual Dictionary with a Corpus Search System. The dictionary, corpora and search tools are being developed primarily for Slovene speaking learners of Japanese, but part of the tools, particularly the corpus of sentences from the web-harvested texts, divided into five difficulty levels, can be used by any learner or teacher of Japanese.I hope you will enjoy reading these articles as much as I did, and wish you a peaceful New Year.  


2015 ◽  
Vol 27 (4) ◽  
pp. 148-151
Author(s):  
Nobuaki Nakazawa ◽  
Yuki Ueno ◽  
Chihiro Numata ◽  
Toshikazu Matsui

Author(s):  
Etsuko Toyoda

<p>[This section does not allow copy &amp; paste. I have copied the abstract of my paper manually].</p><p> </p><p>Websites written in the target language abound for L2 learners to read. Reading unmodified texts in a L2, however, requires a great level of reading ability, especially in the case where L1 is orthographically and syntactically distant from the target language. Thus, it is essential to have an effective computerised reading support system that gives L2 readers sufficient information for comprehension, and that, at the same time, assists in the development of reading ability for future reading. This study analysed four support systems for reading Japanese texts based on criteria supported by previous research findings, in order to determine whether they cover a wide range of information useful for reading comprehension and for the development of reading ability. The results revealed the strengths and weaknesses of the four although none of the four satisfied all the criteria. The paper discusses possible improvements and any potential difficulties that may hinder them.</p>


2012 ◽  
Vol 71 (3) ◽  
pp. 141-148 ◽  
Author(s):  
Doriane Gras ◽  
Hubert Tardieu ◽  
Serge Nicolas

Predictive inferences are anticipations of what could happen next in the text we are reading. These inferences seem to be activated during reading, but a delay is necessary for their construction. To determine the length of this delay, we first used a classical word-naming task. In the second experiment, we used a Stroop-like task to verify that inference activation was not due to strategies applied during the naming task. The results show that predictive inferences are naturally activated during text reading, after approximately 1 s.


Sign in / Sign up

Export Citation Format

Share Document