scholarly journals Phraseographie

Author(s):  
Erla Hallsteinsdóttir

Multiword expressions – i.e. phraseological units – like idioms and collocations are one of the most interesting part of every language. In this article, I investigate phraseological units from a lexicographical point of view. I discuss the theoretical and methodological basis of phraseography as a discipline that includes aspects of lexicography, phraseology, corpus linguistics and theories of language learning. I demonstrate the importance of corpora as a source for the lexicographer and the use of corpus data. I also discuss the requirements for the lexicographical treatment of phraseological units by the compilation of a phraseological database for language learners in relation to their assumed needs that have already been described in detail.

2021 ◽  
Vol 14 (11) ◽  
pp. 108
Author(s):  
Sultan H. Alharbi

The term ‘struggling language learner’ is one that is usually ascribed to students who are trying, without much success to master the English language in an academic setting. As a case study, this study was carried out to gain insights into the ‘struggles’ of the struggling English as a foreign language (EFL) learners. Ten students were selected from those with the 20th least percentile in their English language courses. Observation of and discussion with these categories of learners revealed that many language learners had a point at which they began to take learning English language more seriously. In such positive or negative situations, their language learning journey improved therefrom. The findings showed that 80% of the participants believed that their English language proficiency was ‘very good’, while 20% of these participants believed that their English language proficiency was ‘average’. Also, the findings indicated that there was a statistically significant association (i.e., p < .05) between English language proficiency of the learners and the following observable attributes: willingness to learn for educational purposes; willingness to learn for career development; and students’ continuation without losing focus. The study proposes a fresh evaluation of the problems faced by EFL struggling learners by bringing to light a multifaceted, meaningful consideration of their learning attitudes from socio-psychological point of view, offering a comprehensive account of these learners and their learning difficulties as well as their attitudes and outlook while taking lessons as freshmen at the university.  


Author(s):  
Angela Chambers ◽  
Martin Wynne

Since the early 1990s, researchers have been investigating the effectiveness of corpora as a resource in language learning, mostly creating their own small corpora. As it is neither feasible nor desirable to envisage a future in which all teachers create their own corpora, and as the content of language courses is similar in many universities throughout the world, the sharing of resources is clearly necessary if corpus data are to be made available to language teachers and learners on a large scale. Taking one small corpus as an example, this chapter aims to investigate the issues arising if corpus consultation is to become an integral part of the language-learning environment. The chapter firstly deals with fundamental questions concerning the creation and reusability of corpora, namely planning, construction, documentation, and also legal, moral and technical issues. It then explores the issues arising from the use of a corpus of familiar texts, in this case a French journalistic corpus, with advanced learners. In conclusion we propose a framework for the optimal use of corpora with language learners in the context of higher education.


2021 ◽  
pp. 162-177
Author(s):  
Antra Kļavinska ◽  

Several text corpora have been created in Latvia, including learner corpora. One of the latest projects is the Latvian Language Learner Corpus (LaVA), which contains the works of international students studying in Latvian higher education institutions who are learning Latvian as a foreign language. The texts are morphologically tagged automatically, and learner errors are tagged manually. A sufficient scope of publications is available, which provides the theoretical basis for the creation of Latvian language learner corpora; however, there is a lack of studies or practical methodological guidelines concerning the opportunities for their application, and there is little data about the use of text corpora in language acquisition. The aim of this study is to explain from the theoretical perspective for what purposes learner corpus data may be used, as well as to illustrate the methodological groundwork with examples from the LaVA corpus. Analysis of theoretical literature has demonstrated the functions and meaning of learner corpora in research, and experience with the use of corpora in acquiring a foreign language has been analysed. Examples of the use of the LaVA corpus as a didactic resource have been prepared using Corpus Linguistics methods. The study was conducted within the state research programme project “The Latvian Language”. After studying the functions of learner corpora from the theoretical perspective, it was concluded that the target audience of the LaVA corpus mainly includes teachers of Latvian as a foreign language (LATS), authors of teaching materials, as well as Latvian language learners. To facilitate the use of the LaVA corpus, it is important to have basic knowledge of Corpus Linguistics, an understanding of the theory of language, as well as an understanding of foreign language teaching methodology. LATS teachers can use the LaVA corpus data in the creation of curricula and teaching materials, in the preparation of language proficiency tests, etc. Using the inductive approach in language acquisition, language learners can also become language researchers, can analyse the errors of other learners, etc. Undeniably, the LaVA corpus can be used in broader linguistic research, for example, in contrastive interlanguage analysis, comparing the data of language learners with the data of native speakers or the data of different groups of language learners.


Author(s):  
Stephen Jeaco

While corpus tools provide several different ways to display relationships between words within texts and across texts, the main format for viewing concordance data is Key Word in Context (KWIC). In Computer Aided Language Learning, concordance lines in KWIC format may be accessed inside a concordancer or within other software through links to corpus data. Language learners can and do gain useful insights from exploring concordance data in KWIC format, but some kinds of information may be harder to see, some patterning of use may not be so obvious, and reading of complete examples may not be very easy. The Prime Machine was developed for language learners and aims to make corpus data easier to access and interpret. This paper introduces the design of the Cards Tab, which provides an additional way of viewing concordance data. Results from three evaluations with language learners and teachers show positive attitudes towards this display.


1999 ◽  
Vol 4 (1) ◽  
pp. 77-111 ◽  
Author(s):  
Inge de Monnink

In this article I argue that, from a methodological point of view, descriptive studies improve considerably if they use a multi-method approach to the data, more specifically, if they use a combination of corpus data and experimental data. In the modern conception of corpus linguistics, intuitive data play an important role. The linguist formulates research hypotheses based on his or her intuitive knowledge. These hypotheses are then tested on the corpus data. I argue that a sound descriptive study should not end with simply stating the results from the corpus study. Instead, the corpus data have to be supplemented. An appropriate way to supplement corpus data is through the use of elicitation techniques. I illustrate the multi-method approach on a case study of floating postmodification in the English noun phrase.


2016 ◽  
Vol 7 (3) ◽  
pp. 305-326 ◽  
Author(s):  
Ursula Ritzau ◽  
Lian Malai Madsen

AbstractSociolinguists have recently suggested a range of new terms to re-conceptualise language and language use. Most of these are based on the empirical documentation of speakers using linguistically hybrid constructions which are understood as reflecting speakers’ orientation to norms of linguistic hybridity. In this article we bring data typical of SLA research and sociolinguistic theorisation together by discussing data collected among Swiss German university students learning Danish in the light of such sociolinguistic concepts. We show how in some cases, the students signal investment in and alignment with hybrid language use, but in others the students “polylanguage” from a form-analytic point of view, while the co- and context suggest they orient strongly to an idea of “pure” Danish. In these cases their hybrid linguistic productions are more likely to be explained by their status as language learners. We argue that these observations point to the need for a closer consideration of speaker stances towards language forms as well as a need for considering repertoire restrictions and learner ambitions in current sociolinguistic conceptualisations of linguistic hybridity.


ExELL ◽  
2020 ◽  
Vol 8 (2) ◽  
pp. 144-184
Author(s):  
Maja Séguin

Abstract The use of synonymy and near-synonymy allows us to differently express similar ideas and meanings, as well as perspectives. However, their use and nuances may be unclear to language learners, such as the selected case of the verbs explain and clarify. This paper examines the usage of the two verbs by looking into corpus data and uses Sinclair's methodological procedure as an alternative to dictionary references. Also included is a discussion of aspects and criticisms of corpus linguistics, mentions (and uses of) computer technologies for the analysis of language by discovering usage patterns, significant exceptions and semantic prosody, and exploring whether using corpora in the classroom would be beneficial to language learners.


Author(s):  
Catherine Caws ◽  
Stewart Arneil

In corpus linguistics, texts are typically annotated in order to focus the attention on: (a) the form of the text or words, and (b) the structure of sentences (that is, morphological and syntactic tagging). Yet, when dealing with language learning and the development of skills other than just linguistic ones, other types of annotations are needed. Annotating with either a specific learner or pedagogy in mind often engages the researcher in more complex issues than the ones just related to corpus linguistics. In this article, we report on the methods used to create a digital library of videos and annotated transcripts called FrancoToile (http://francotoile.uvic.ca). As a needs-driven corpus, FrancoToile includes annotations within the video transcripts in order to help users develop their cultural and linguistic literacies in French. These annotations must relate directly to the purpose of the system (the development of cultural and linguistic literacies) and to the specific skill or competency that we hope language learners will gain. We analyze learning needs, modify the software, and observe and engage with users on an ongoing basis to create a language tool that will better address users’ needs. This approach of incorporating user feedback increases the usefulness of the annotated videos. We continue to seek means to encourage the involvement of users, both teachers and learners, in the process of corpora editing and content building.


Author(s):  
Stephen Jeaco

While corpus tools provide several different ways to display relationships between words within texts and across texts, the main format for viewing concordance data is Key Word in Context (KWIC). In Computer Aided Language Learning, concordance lines in KWIC format may be accessed inside a concordancer or within other software through links to corpus data. Language learners can and do gain useful insights from exploring concordance data in KWIC format, but some kinds of information may be harder to see, some patterning of use may not be so obvious, and reading of complete examples may not be very easy. The Prime Machine was developed for language learners and aims to make corpus data easier to access and interpret. This paper introduces the design of the Cards Tab, which provides an additional way of viewing concordance data. Results from three evaluations with language learners and teachers show positive attitudes towards this display.


Sign in / Sign up

Export Citation Format

Share Document