Analysis of the Production of Pronominal Constructions in Spanish in a Learner Corpus

A Latvian learner corpus “LaVA” is being built in the Institute of Mathematics and Computer Science, University of Latvia. The corpus includes texts written by beginner learners in the first two semesters of learning Latvian as a foreign language. The texts are written by hand and digitized afterwards in order to reduce the issues that could be caused by the necessity to learn not only writing itself but also using a foreign keyboard. One of the features that cannot be digitized is the new letters created by adding diacritical marks which are not used that way in the standard Latvian alphabet. Since one of the essential steps in learning to write in a language is learning the letters and diacritical marks of that language, this study aims to find instances of such newly made letters and to discuss the basic quantitative measures in order to define hypotheses and areas of interest for further research of such usage. Altogether 322 texts were searched, and 175 examples were found. The amount of examples found in 2nd semester texts was less than half the amount of examples found in the 1st semester texts, but the percentage of texts containing examples was higher than expected – more than 33 % in the 1st semester and almost 20 % in the 2nd semester. It leads to a conclusion that this is quite a common occurrence but also prone to reduction in the second semester. The corpus does not provide any data on later semesters so it cannot be predicted when such instances should become a rare, individual feature rather than a common one. The average amount of examples in a text is not high, though. Counting only the texts where at least one example was found, the average amount of examples per text is 2.136 in the 1st semester and 1.690 in the 2nd semester. Considering that the absolute lowest possible value here is 1, it should not be considered as a high value. Therefore, using diacritical marks to make new letters, while a common feature of the Latvian interlanguage, could be characterized as casual rather than systemic. However, that does not exclude the possibility of certain patterns in usage. The currently collected data already shows that there are some words – such as garšo, viņš, ļoti, četri – where examples were found in more than one author’s text. Examples of using unsuitable diacritical marks are also sometimes found next to letters for which said diacritical marks would be suitable. This should be explored more thoroughly using qualitative methods. The size of the corpus keeps growing; the expected size upon completion is 1000 texts. When it is reached, it would be useful to repeat the study and check whether the larger amount of data still confirms the same assumptions. The larger sample size would also allow for more detailed quantitative analysis discussing each letter, diacritical mark, placement of the diacritical mark, and metadata collected for the corpus, such as gender, native language and other spoken languages by the authors of the texts.

Download Full-text

An Analysis of Chinese learners' 'Verb-noun’ Collocation Errors on the Base of Korean Learner Corpus

Korean Association For Learner-Centered Curriculum And Instruction ◽

10.22251/jlcci.2020.20.3.335 ◽

2020 ◽

Vol 20 (3) ◽

pp. 335-356

Author(s):

TINGTING ZHOU

Keyword(s):

Learner Corpus ◽

Chinese Learners

Download Full-text

Commentary: Have Learner Corpus Research and Second Language Acquisition Finally Met?

Learner Corpus Research Meets Second Language Acquisition ◽

10.1017/9781108674577.012 ◽

2020 ◽

pp. 243-257

Author(s):

Sylviane Granger

Keyword(s):

Second Language ◽

Second Language Acquisition ◽

Language Acquisition ◽

Learner Corpus

Download Full-text

Building an Oral and Written Learner Corpus of a School Programme: Methodological Issues

Learner Corpus Research Meets Second Language Acquisition ◽

10.1017/9781108674577.011 ◽

2020 ◽

pp. 214-242

Author(s):

Philippa Bell ◽

Laura Collins ◽

Emma Marsden

Keyword(s):

Methodological Issues ◽

Learner Corpus

Download Full-text

The Multimedia Adult ESL Learner Corpus

TESOL Quarterly ◽

10.2307/3588405 ◽

2003 ◽

Vol 37 (3) ◽

pp. 546 ◽

Cited By ~ 38

Author(s):

Stephen Reder ◽

Kathryn Harris ◽

Kristen Setzler

Keyword(s):

Learner Corpus ◽

Adult Esl

Download Full-text

Multi‐Word Expressions in Second Language Writing: A Large‐Scale Longitudinal Learner Corpus Study

Language Learning ◽

10.1111/lang.12383 ◽

2019 ◽

Vol 70 (2) ◽

pp. 420-463 ◽

Cited By ~ 1

Author(s):

Anna Siyanova‐Chanturia ◽

Stefania Spina

Keyword(s):

Second Language ◽

Large Scale ◽

Second Language Writing ◽

Corpus Study ◽

Learner Corpus ◽

Language Writing

Download Full-text

‘Almost people’: A Learner Corpus Account of L2 Use and Misuse of Non-numerical Quantification

Open Linguistics ◽

10.1515/opli-2016-0015 ◽

2016 ◽

Vol 2 (1) ◽

Author(s):

Peter Crosthwaite ◽

Lavigne L.Y. Choy ◽

Yeonsuk Bae

Keyword(s):

English Learners ◽

English Speakers ◽

L2 Proficiency ◽

L1 Transfer ◽

Learner Corpus ◽

Proficiency Level ◽

Closed Class ◽

Corpus Data ◽

Noun Number ◽

L1 English

AbstractWe present an Integrated Contrastive Model of non-numerical quantificational NPs (NNQs, i.e. ‘some people’) produced by L1 English speakers and Mandarin and Korean L2 English learners. Learner corpus data was sourced from the ICNALE (Ishikawa, 2011, 2013) across four L2 proficiency levels. An average 10% of L2 NNQs were specific to L2 varieties, including noun number mismatches (*‘many child’), omitting obligatory quantifiers after adverbs (*‘almost people’), adding unnecessary particles (*‘all of people’) and non-L1 English-like quantifier/noun agreement (*‘many water’). Significantly fewer ‘openclass’ NNQs (e.g a number of people) are produced by L2 learners, preferring ‘closed-class’ single lexical quantifiers (following L1-like use). While such production is predictable via L1 transfer, Korean L2 English learners produced significantly more L2-like NNQs at each proficiency level, which was not entirely predictable under a transfer account. We thus consider whether positive transfer of other linguistic forms (i.e. definiteness marking) aids the learnability of other L2 forms (i.e. expression of quantification).

Download Full-text