Korpusbasert undervisning i fremmedspråkene: La elevens nysgjerrighet sette dagsorden

Margrete Dyvik Cardona; Anders Alvsåker Didriksen; Anje Müller Gjesdal

doi:10.5617/adno.1127

Acta Didactica Norge ◽

10.5617/adno.1127 ◽

2014 ◽

Vol 8 (2) ◽

pp. 4

Author(s):

Margrete Dyvik Cardona ◽

Anders Alvsåker Didriksen ◽

Anje Müller Gjesdal

Keyword(s):

Open Access ◽

Parallel Corpus ◽

Multilingual Corpus

Corpora are a type of digital language resource (machine-readable text collections) widely used in modern linguistics as empirical support for the study of various linguistic phenomena. Corpora can be used to good effect in foreign language teaching, but have so far not been adopted to a sufficient degree in Norwegian schools and higher education. In this article we review the existing research literature in the field, with the main emphasis on contributions from the Anglo-American language area. We then go through two corpora that are available to Norwegian users, the Norwegian-Spanish Parallel Corpus and the Oslo Multilingual Corpus. We show how these corpora can be used in teaching various linguistic phenomena, from vocabulary to sociolinguistic phenomena. On the basis of existing research on pedagogical applications of corpora, together with our own proposals for teaching plans, there is reason to believe that the use of corpora could make a valuable contribution to foreign language teaching in Norwegian classrooms. Nevertheless, certain challenges prevent corpora from being adopted. Lack of time, group sizes, and technological obstacles can all stand in the way. The use of corpora in the classroom of the future is also closely tied to various trends in both technology and research. First, computer technology is becoming ever more accessible, in parallel with the growing digital competence of pupils and students. Both of these developments should point towards increased use of corpora. Furthermore, there is a strong movement to open up access to research data and resources (Open Access), and this wave can also be used to open up research resources, including corpora, for learning purposes. Ultimately, however, the question of using corpora in the classroom depends on closer dialogue between teachers and corpus linguists, both in teacher education and through professional development during their working lives.


COVID-19 recommender system based on an annotated multilingual corpus

Genomics & Informatics ◽

10.5808/gi.21008 ◽

2021 ◽

Vol 19 (3) ◽

pp. e24

Author(s):

Márcia Barros ◽

Pedro Ruas ◽

Diana Sousa ◽

Ali Haider Bangash ◽

Francisco M. Couto

Keyword(s):

Text Mining ◽

Recommender System ◽

Related Literature ◽

Related Research ◽

Parallel Corpus ◽

Recent Advances ◽

Incoming Information ◽

Multilingual Corpus

Tracking the most recent advances in Coronavirus disease 2019 (COVID-19)-related research is essential, given the disease's novelty and its impact on society. However, with the publication pace speeding up, researchers and clinicians require automatic approaches to keep up with the incoming information regarding this disease. A solution to this problem requires the development of text mining pipelines, the efficiency of which strongly depends on the availability of curated corpora. However, there is a lack of COVID-19-related corpora, even more so when considering languages other than English. This project's main contribution was the annotation of a multilingual parallel corpus and the generation of a recommendation dataset (EN-PT and EN-ES) regarding relevant entities, their relations, and recommendations, providing this resource to the community to improve text mining research on COVID-19-related literature. This work was developed during the 7th Biomedical Linked Annotation Hackathon (BLAH7).


Swedish gärna and German gern(e) and their English correspondences

Languages in Contrast ◽

10.1075/lic.13.2.06aij ◽

2013 ◽

Vol 13 (2) ◽

pp. 238-259

Author(s):

Karin Aijmer ◽

Bengt Altenberg

Keyword(s):

Speech Acts ◽

Parallel Corpora ◽

Parallel Corpus ◽

Contrastive Study ◽

Wide Range ◽

Multilingual Corpus ◽

The Subject ◽

The Rich ◽

Linguistic Phenomenon ◽

Common Function

The Swedish adverb gärna, related to German gern(e), has no obvious equivalent in English. To explore this cross-linguistic phenomenon the English correspondences of gärna are examined on the basis of the English-Swedish Parallel Corpus, a bidirectional translation corpus. The study shows that gärna has a wide range of English correspondences (translations as well as sources), representing a variety of grammatical categories (verb, adjective, adverb, noun, etc). In addition, the English texts contain a large number of omissions and unidentifiable sources (zero). The most common function of gärna is to express willingness or readiness on the part of the subject, but in the absence of a volitional controller it can also indicate a habitual tendency and even convey implications such as reluctance. It is also used in speech acts expressing offers, promises and requests and in responses to such speech acts. To compare the Swedish adverb with its German cognate gern(e) a similar contrastive study of the English correspondences of this adverb was made on the basis of the Oslo Multilingual Corpus. The studies clearly demonstrate the rich multifunctionality of the two adverbs and the advantages of using bidirectional parallel corpora in contrastive research.


TELLING in English, Norwegian and French: A three-way contrast

Bergen Language and Linguistics Studies ◽

10.15845/bells.v11i1.3438 ◽

2021 ◽

Vol 11 (1) ◽

pp. 47-68

Author(s):

Thomas Egan

Keyword(s):

Double Object ◽

Direct Speech ◽

Parallel Corpus ◽

Multilingual Corpus ◽

Direct Objects

This paper presents the results of a study of double object constructions containing the cognate verbs English tell and Norwegian fortelle, based on data from the English–Norwegian Parallel Corpus. The results show that there is a certain degree of correspondence between the two verbs in constructions with nominal direct objects, with less mutual correspondence in constructions with finite clausal objects, very little correspondence in constructions with objects in the form of direct speech, and none whatsoever in the case of non-finite clausal objects, which only occur with tell. The paper then expands the topic to include tell predications in French. The data were retrieved from the Oslo Multilingual Corpus. It transpires that the form of French translations of Norwegian expressions is, at least for some constructions, more similar to the Norwegian originals than that of their English counterparts.


ParCourE: A Parallel Corpus Explorer for a Massively Multilingual Corpus

10.18653/v1/2021.acl-demo.8 ◽

2021 ◽

Author(s):

Ayyoob ImaniGooghari ◽

Masoud Jalili Sabet ◽

Philipp Dufter ◽

Michael Cysouw ◽

Hinrich Schütze

Keyword(s):

Parallel Corpus ◽

Multilingual Corpus


Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus)

Cognitive Studies | Études cognitives ◽

10.11649/cs.2010.013 ◽

2015 ◽

pp. 217-239

Author(s):

Ludmila Dimitrova ◽

Violetta Koseska-Toszewa ◽

Danuta Roszko ◽

Roman Roszko

Keyword(s):

Point Of View ◽

Corpus Annotation ◽

Parallel Corpus ◽

Slavic Languages ◽

Part Of Speech ◽

Language Research ◽

Multilingual Corpus ◽

The One ◽

The Eu

In this paper we present applications of a trilingual corpus in language research. Comparative and contrastive studies of Polish and Bulgarian as well as Polish and Lithuanian have already been conducted, but to the best of our knowledge no such studies exist for Bulgarian and Lithuanian. On the one hand, it is interesting to note that two Slavic languages are compared to a Baltic language (Lithuanian). On the other hand, the three languages are marginally present in the EU because of the later accession of the three countries to the EU. The paper briefly describes the first electronic Bulgarian–Polish–Lithuanian experimental corpus, currently under development for research purposes only. We also focus our attention on the morphosyntactic annotation of the parallel trilingual corpus according to the Corpus Encoding Standard: we present a review of the Part-of-Speech (POS) classification of the participle in the three languages (Bulgarian, Polish, and Lithuanian) in comparison to another POS, the adjective. We briefly discuss, with some examples, tagsets for corpus annotation from the point of view of possible future unification.


A multilingual corpus study of the competition between past and perfect in narrative discourse

Journal of Linguistics ◽

10.1017/s0022226721000244 ◽

2021 ◽

pp. 1-35

Author(s):

Martijn van der Klis ◽

Bert Le Bruyn ◽

Henriëtte de Swart

Keyword(s):

Narrative Discourse ◽

Present Perfect ◽

Corpus Study ◽

British English ◽

Parallel Corpus ◽

Fine Grained ◽

Multilingual Corpus ◽

French Novel ◽

Crosslinguistic Variation ◽

Dimensions Of Variation

The western European present perfect is subject to substantial crosslinguistic variation. The literature, however, focuses on individual languages or on comparisons of a restricted number of languages. We piece together the puzzle and do so in a data-driven way by comparing the use of the present perfect through a parallel corpus based on the French novel L’Étranger and its translations in Italian, German, Dutch, European Spanish, British English, and Modern Greek. We introduce and showcase Translation Mining, a software suite combining a parallel corpus database with annotation and analysis tools. Translation Mining allows us to generate descriptive statistics of tense use across languages but also to visualize variation through its multidimensional scaling component and to link the variation we find to the underlying data through its integrated setup. We confirm that the present perfect competes with the past and we reveal the fine-grained scalar nature of the variation. To complete the puzzle, we ascertain the dimensions of variation, ranging from lexical and compositional semantics to dynamic semantics and pragmatics.


Open Access – Sie haben die Wahl mit OpenMind

Musik-, Tanz- und Kunsttherapie ◽

10.1026/0933-6885.20.2.47 ◽

2009 ◽

Vol 20 (2) ◽

pp. 47-48

Author(s):

G.-Jürgen Hogrefe

Keyword(s):

Open Access


Strukturierte klinische Interviews zur Erfassung psychischer Störungen über die Lebensspanne

Zeitschrift für Klinische Psychologie und Psychotherapie ◽

10.1026/1616-3443/a000430 ◽

2017 ◽

Vol 46 (3) ◽

pp. 176-186 ◽

Cited By ~ 36

Author(s):

Jürgen Margraf ◽

Jan Christopher Cwik ◽

Verena Pflug ◽

Silvia Schneider

Keyword(s):

Open Access ◽

Psychische Störungen ◽

DSM-5

Abstract. Mental disorders can occur across the entire lifespan. Structured clinical interviews are central tools for their rapid, reliable, and comprehensive diagnosis. In the German-speaking countries, the instruments of the DIPS family provide interviews for diagnosing mental disorders across the entire lifespan, and they have been regularly updated since the 1990s. Their reliability, validity, and acceptance have been tested repeatedly in large samples from outpatient, inpatient, and research settings. The introduction of the DSM-5 required a comprehensive revision of the DIPS interviews, the main features of which are presented. To promote the wider use of structured clinical interviews, the instruments of the DIPS family are now made available as "open access documents". Finally, further developments concerning training, dissemination, and computerization are addressed in the outlook.
