Design and Evaluation of a Versatile Architecture for a Multilingual Word Prediction System

Author(s):  
Sira E. Palazuelos-Cagigas ◽  
José L. Martín-Sánchez ◽  
Lisset Hierrezuelo Sabatela ◽  
Javier Macías Guarasa
2014 ◽  
Vol 3 (2) ◽  
pp. 137-154 ◽  
Author(s):  
Wessel Stoop ◽  
Antal van den Bosch

Word prediction, or predictive editing, has a long history as a tool for augmentative and assistive communication. Improvements in the state-of-the-art can still be achieved, for instance by training personalized statistical language models. We developed the word prediction system Soothsayer. The main innovation of Soothsayer is that it not only uses idiolects, the language of one individual person, as training data, but also sociolects, the language of the social circle around that person. We use Twitter for data collection and experimentation. The idiolect models are based on individual Twitter feeds, the sociolect models are based on the tweets of a particular person and the tweets of the people he often communicates with. The sociolect approach achieved the best results. For a number of users, more than 50% of the keystrokes could have been saved if they had used Soothsayer.


1992 ◽  
Vol 8 (4) ◽  
pp. 304-311 ◽  
Author(s):  
Alan Newell ◽  
John Arnott ◽  
Lynda Booth ◽  
William Beattie ◽  
Bernadette Brophy ◽  
...  

2007 ◽  
Vol 21 (3) ◽  
pp. 479-491
Author(s):  
Pertti Alvar Väyrynen ◽  
Kai Noponen ◽  
Tapio Seppänen

2014 ◽  
Vol 13 (2) ◽  
pp. 1-29 ◽  
Author(s):  
Manoj Kumar Sharma ◽  
Debasis Samanta

2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Khrystyna Shakhovska ◽  
Iryna Dumyn ◽  
Natalia Kryvinska ◽  
Mohan Krishna Kagita

Text generation, in particular, next-word prediction, is convenient for users because it helps to type without errors and faster. Therefore, a personalized text prediction system is a vital analysis topic for all languages, primarily for Ukrainian, because of limited support for the Ukrainian language tools. LSTM and Markov chains and their hybrid were chosen for next-word prediction. Their sequential nature (current output depends on previous) helps to successfully cope with the next-word prediction task. The Markov chains presented the fastest and adequate results. The hybrid model presents adequate results but it works slowly. Using the model, user can generate not only one word but also a few or a sentence or several sentences, unlike T9.


2020 ◽  
Vol 81 (1-4) ◽  
pp. 49-54
Author(s):  
M. Norré

This article investigates the evaluation of a word prediction system in an Augmentative and Alternative Communication (AAC) software for disabled people. In addition to having a reduced mobility, these users have an altered use of speech that must be compensated by a technological aid offering input methods adapted to their capabilities. To improve their communication speed, different prediction and language modeling techniques are used. We present the parameterization of statistical predictors. Their configuration in French is evaluated by a simulator and tested by a disabled person. The results show that a language model built from a large literary corpus saves more than one keystroke out of two, the performance of these systems varying according to several parameters.


Sign in / Sign up

Export Citation Format

Share Document