What do post-editors correct? A fine-grained analysis of SMT and NMT errors

Tradumàtica tecnologies de la traducció ◽

10.5565/rev/tradumatica.286 ◽

2021 ◽

pp. 131-147

Author(s):

Sergi Alvarez-Vidal ◽

Antoni Oliver ◽

Toni Badia

Keyword(s):

Pilot Study ◽

Fine Grained ◽

Medical Text ◽

Statistical Mt

The recent improvements in neural MT (NMT) have driven a shift from statistical MT (SMT) to NMT. However, to assess the usefulness of MT models for post-editing (PE) and have a detailed insight of the output they produce, we need to analyse the most frequent errors and how they affect the task. We present a pilot study of a fine-grained analysis of MT errors based on post-editors corrections for an English to Spanish medical text translated with SMT and NMT. We use the MQM taxonomy to compare the two MT models and have a categorized classification of the errors produced. Even though results show a great variation among post-editors’ corrections, for this language combination fewer errors are corrected by post-editors in the NMT output. NMT also produces fewer accuracy errors and errors that are less critical.

Download Full-text

Classifying Non-Sentential Utterances in Dialogue: A Machine Learning Approach

Computational Linguistics ◽

10.1162/coli.2007.33.3.397 ◽

2007 ◽

Vol 33 (3) ◽

pp. 397-427 ◽

Cited By ~ 17

Author(s):

Raquel Fernández ◽

Jonathan Ginzburg ◽

Shalom Lappin

Keyword(s):

Machine Learning ◽

Pilot Study ◽

Full Range ◽

Learning Approach ◽

Learning Methods ◽

Fine Grained ◽

Machine Learning Methods ◽

Machine Learning Approach ◽

The Right

In this article we use well-known machine learning methods to tackle a novel task, namely the classification of non-sentential utterances (NSUs) in dialogue. We introduce a fine-grained taxonomy of NSU classes based on corpus work, and then report on the results of several machine learning experiments. First, we present a pilot study focused on one of the NSU classes in the taxonomy—bare wh-phrases or “sluices”—and explore the task of disambiguating between the different readings that sluices can convey. We then extend the approach to classify the full range of NSU classes, obtaining results of around an 87% weighted F-score. Thus our experiments show that, for the taxonomy adopted, the task of identifying the right NSU class can be successfully learned, and hence provide a very encouraging basis for the more general enterprise of fully processing NSUs.

Download Full-text

A Machine Vision Approach for Bioreactor Foam Sensing

SLAS TECHNOLOGY Translating Life Sciences Innovation ◽

10.1177/24726303211008861 ◽

2021 ◽

pp. 247263032110088

Author(s):

Jonas Austerjost ◽

Robert Söldner ◽

Christoffer Edlund ◽

Johan Trygg ◽

David Pollard ◽

...

Keyword(s):

Machine Learning ◽

Machine Vision ◽

State Of The Art ◽

Low Cost ◽

High Accuracy ◽

Consumer Electronics ◽

Learning System ◽

Automotive Applications ◽

Fine Grained

Machine vision is a powerful technology that has become increasingly popular and accurate during the last decade due to rapid advances in the field of machine learning. The majority of machine vision applications are currently found in consumer electronics, automotive applications, and quality control, yet the potential for bioprocessing applications is tremendous. For instance, detecting and controlling foam emergence is important for all upstream bioprocesses, but the lack of robust foam sensing often leads to batch failures from foam-outs or overaddition of antifoam agents. Here, we report a new low-cost, flexible, and reliable foam sensor concept for bioreactor applications. The concept applies convolutional neural networks (CNNs), a state-of-the-art machine learning system for image processing. The implemented method shows high accuracy for both binary foam detection (foam/no foam) and fine-grained classification of foam levels.

Download Full-text

Datives with psych nouns and adjectives in Basque

Folia Linguistica ◽

10.1515/flin-2020-2050 ◽

2020 ◽

Vol 54 (3) ◽

pp. 647-696

Author(s):

Beatriz Fernández ◽

Fernando Zúñiga ◽

Ane Berro

Keyword(s):

Natural Language ◽

Linguistic Theory ◽

Psychological State ◽

Formal Expression ◽

Fine Grained ◽

Psych Verbs ◽

Other Regarding ◽

Psychological Verbs

Abstract This paper explores the formal expression of two Basque dative argument types in combination with psych nouns and adjectives, in intransitive and transitive clauses: (i) those that express the experiencer, and (ii) those that express the stimulus of the psychological state denoted by the psych noun and adjective. In the intransitive structure involving a dative experiencer (DatExpIS), the stimulus is in the absolutive case, and the intransitive copula izan ‘be’ shows both dative and absolutive agreement. This construction basically corresponds to those built upon the piacere type of psychological verbs typified in (Belletti, Adriana & Luigi Rizzi. 1988. Psych-verbs and θ-theory. Natural Language and Linguistic Theory 6. 291–352) three-way classification of Italian psych verbs. In the intransitive structure involving a dative stimulus (DatStimIS), the experiencer is marked by absolutive case, and the same intransitive copula shows both absolutive and dative agreement (with the latter corresponding to the dative stimulus and not to the experiencer). We show that the behavior of the dative argument in the two constructions is just the opposite of each other regarding a number of morphosyntactic tests, including agreement, constituency, hierarchy and selection. Additionally, we explore two parallel transitive constructions that involve either a dative experiencer and an ergative stimulus (DatExpTS) or a dative stimulus and an ergative experiencer (DatStimTS), which employ the transitive copula *edun ‘have’. Considering these configurations, we propose an extended and more fine-grained typology of psych predicates.

Download Full-text

Inter-observer variability in the classification of ovarian cancer cell type using microscopy: a pilot study

10.1117/12.2082264 ◽

2015 ◽

Cited By ~ 1

Author(s):

Marios A. Gavrielides ◽

Brigitte M. Ronnett ◽

Russell Vang ◽

Jeffrey D. Seidman

Keyword(s):

Ovarian Cancer ◽

Pilot Study ◽

Cancer Cell ◽

Ovarian Cancer Cell ◽

Observer Variability ◽

Cell Type ◽

Inter Observer Variability ◽

Cancer Cell Type

Download Full-text

Fine-grained Classification of Malicious Code Based on CNN and Multi-resolution Feature Fusion

10.1109/iccia52886.2021.00031 ◽

2021 ◽

Author(s):

Junmiao Liang ◽

Zhenhu Ning ◽

Yihua Zhou ◽

Dongzhi Cao

Keyword(s):

Feature Fusion ◽

Malicious Code ◽

Fine Grained

Download Full-text

Change Taxonomy: A Fine-Grained Classification of Software Change

IT Professional ◽

10.1109/mitp.2018.043141666 ◽

2018 ◽

Vol 20 (4) ◽

pp. 28-36 ◽

Cited By ~ 1

Author(s):

Mohamed Elkholy ◽

Ahmed Elfatatry

Keyword(s):

Fine Grained ◽

Software Change

Download Full-text

Mechanomyography Spasticity Assessment of Flexor and Extensor Wrist Muscles for the Classification of Boccia Athletes in Para Sports: A Pilot Study

IFMBE Proceedings - VIII Latin American Conference on Biomedical Engineering and XLII National Conference on Biomedical Engineering ◽

10.1007/978-3-030-30648-9_154 ◽

2019 ◽

pp. 1184-1190

Author(s):

Elgison da Luz dos Santos ◽

Maria de Fátima Fernandes Vara ◽

Maira Ranciaro ◽

Gustavo Tanaka Zelaga ◽

Amanda Mayara Pereira Gomes ◽

...

Keyword(s):

Pilot Study

Download Full-text

Experimental analysis on the optimal excitation wavelength for fine-grained identification of refined oil pollutants on water surface based on laser-induced fluorescence

10.21203/rs.3.rs-756586/v2 ◽

2021 ◽

Author(s):

Ming Xie ◽

Yunpeng Jia ◽

Ying Li ◽

Xiaohua Cai ◽

Kai Cao

Keyword(s):

Oil Spill ◽

Theoretical Basis ◽

Water Surface ◽

Laser Induced Fluorescence ◽

Excitation Wavelength ◽

Refined Oil ◽

Fine Grained ◽

Optimal Excitation ◽

Oil Spill Identification

Abstract Laser-induced fluorescence (LIF) is an effective, all-weather oil spill identification method that has been widely applied for oil spill monitoring. However, the distinguishability on oil types is seldom considered while selecting excitation wavelength. This study is intended to find the optimal excitation wavelength for fine-grained classification of refined oil pollutants using LIF by comparing the distinguishability of fluorometric spectra under various excitation wavelengths on some typical types of refined-oil samples. The results show that the fluorometric spectra of oil samples significantly vary under different excitation wavelengths, and the four types of oil applied in this study are most likely to be distinguished under the excitation wavelengths of 395 nm and 420 nm. This study is expected to improve the ability of oil types identification using LIF method without increasing time or other cost, and also provides theoretical basis for the development of portable LIF devices for oil spill identification.

Download Full-text