Quantifying graphemic variation via large text corpora

2021 ◽  
Vol 40 (3) ◽  
pp. 421-440
Author(s):  
Hanna Lüschow

Abstract: Some basic computer science concepts could expand the possibilities of (manual) graphematic text corpus analysis. With them, it can be shown that graphematic variation decreases steadily in printed German texts from 1600 to 1900. While variability is consistently lower on a text-internal level, it decreases faster for the whole available writing system of individual decades. But which changes took place exactly? Which types of variation disappeared more quickly, and which ones persisted? How do we deal with amounts of data too large to be processed manually? Which aspects are of special importance, or go missing, when working with a large textual base? A measure called entropy quantifies the variability of the spellings of a given word form, lemma, text or subcorpus, with few restrictions but also less detail in the results. The difference between two spellings can be measured via Damerau-Levenshtein distance. To a certain degree, automated data handling can also determine the exact changes that took place. These differences can then be counted and ranked. The data source is the German Text Archive of the Berlin-Brandenburg Academy of Sciences and Humanities. It offers, for example, orthographic normalization (which is extremely useful), part-of-speech preprocessing and lemmatization. In contrast to many other approaches, the establishment of today's normed spellings is not seen as the aim of the developments and is therefore not the focus of the research. Instead, the differences between individual spellings are of interest. Afterwards, the intra- and extralinguistic factors that caused these developments should be determined. These methodological findings could subsequently be used to improve research methods in other graphematic fields of interest, e.g. computer-mediated communication.
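The two measures named in the abstract can be illustrated with a minimal Python sketch. This is not the author's implementation. The function names `spelling_entropy` and `osa_distance` and the toy spelling lists are assumptions made for illustration. The entropy here is plain Shannon entropy over the relative frequencies of spelling variants, and the distance is the restricted "optimal string alignment" variant of Damerau-Levenshtein, which counts an adjacent transposition as one edit but never edits the same substring twice. The abstract does not say which variant was used.

```python
from collections import Counter
from math import log2


def spelling_entropy(spellings):
    """Shannon entropy (in bits) of the observed spelling variants.

    0 means a single spelling; higher values mean more variation.
    """
    counts = Counter(spellings)
    total = sum(counts.values())
    return -sum((n / total) * log2(n / total) for n in counts.values())


def osa_distance(a, b):
    """Restricted Damerau-Levenshtein (optimal string alignment) distance.

    Counts insertions, deletions, substitutions and adjacent
    transpositions, each with cost 1.
    """
    # d[i][j] = distance between the first i chars of a and first j chars of b
    d = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i in range(len(a) + 1):
        d[i][0] = i
    for j in range(len(b) + 1):
        d[0][j] = j
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
            # adjacent transposition, e.g. "hu" <-> "uh"
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[len(a)][len(b)]


# Toy data (hypothetical counts, not taken from the German Text Archive):
# spellings attested for one normalized word form in one subcorpus.
variants = ["vnd", "und", "und", "vnd"]
print(spelling_entropy(variants))    # two equally frequent variants: 1 bit
print(osa_distance("vnd", "und"))    # one substitution
print(osa_distance("thun", "tuhn"))  # one adjacent transposition
```

Grouping spellings by the normalized form supplied by the corpus and computing the entropy per group and per decade would be one way to trace the decrease in variation the abstract describes.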

2021 ◽  
Vol PCP2020 (1) ◽  
pp. 128
Author(s):  
Hayato Tokutake ◽  
James York ◽  
Hiroshi Nakayama

Synchronous computer-mediated communication (SCMC) is a topic of great interest in the CALL literature, where research has investigated the effectiveness of SCMC compared to traditional face-to-face instruction. However, few studies investigate the intrinsic differences between SCMC modes, particularly in terms of their effect on oral communication. At the JALTCALL 2019 conference, we introduced research which assessed the anxiety-reducing affordances of VR. This year we presented results of a follow-up study which focused on the effect of SCMC modality on learners' speaking performance. Thirty participants (15 pairs) completed a spot-the-difference task within three different SCMC modes: voice, video, and virtual reality (VR). Participants' oral task performances were analysed using the complexity, accuracy, and fluency (CAF) model. Results suggest that the voice mode promoted the highest structural complexity, whereas the VR mode promoted the highest lexical complexity. The findings therefore suggest that different modes of communication may be used to target the development of different skills. Additionally, practitioners should consider how modality affects learner anxiety and choose the most appropriate system for their students and needs. This paper introduces the VR system, a detailed analysis of results, pedagogical implications, and future research directions for the use of VR in language teaching contexts.


ReCALL ◽  
2014 ◽  
Vol 26 (3) ◽  
pp. 333-354 ◽  
Author(s):  
Dogan Yuksel ◽  
Banu Inan

Abstract: This study examined the effects of communication mode (i.e., face-to-face versus computer-mediated communication) on the instances of negotiation of meaning (NofM) and the level at which learners noticed them. Sixty-four participants (32 dyads) completed two jigsaw tasks in two different media (one task in each). Four days after the tasks, they were asked to identify the instances where they had communication breakdowns in a stimulated recall protocol. The findings revealed that the average number of NofM exchanges and the durations of the tasks were higher in the face-to-face (F2F) mode, but the participants in the synchronous computer-mediated communication (SCMC) group noticed a higher average number of NofM instances (M=10.72) than the F2F group (M=9.13), and the difference was significant. Based on these results, we can argue that F2F provides a better context for the production of NofM, but the SCMC environment leads to more instances of noticing.


2020 ◽  
Vol 3 (10) ◽  
pp. 100-109
Author(s):  
Bushra W. Ali

Recent research on computer-mediated communication (CMC) has focused mainly on linguistic analysis of the different types of internet chat. This study observes the differences between informal social chat and formal chat among Iraqi university informants from non-capital areas. The difference lies in their use of capital and non-capital expressions. The experiment was located at the University of Sumer. The research focuses on the capital and non-capital expressions used in the university community, based on 9 variables for 97 samples. The data reveal a distinction between university and non-university chat in the use of capital expressions (Laad, Aady, Hello, Dez, Mnower, Shakbark, Wyaya, Ok and Bye) in the non-capital area. Moreover, the data show a distinction between informants' chat within the university community and their chat outside it. The study observes the written chats of Iraqi university informants: students, teaching staff, and employees. The study sample was drawn at random from social media such as Viber, Facebook and WhatsApp. The aim is to observe how public and private atmospheres contribute to language change toward unification in the era of social networking. Although the informants did not use certain expressions in real life or in informal chat, the study explores how much they used capital expressions in the non-capital areas. The study carries out a quantitative and statistical analysis and investigates the use of capital expressions by users from a non-capital background. The results revealed that the university informants avoid using capital expressions in the university atmosphere in the non-capital areas. They also reflect the fact that the use of capital expressions in the non-capital areas is a clear sign of the language unification concept.

