scholarly journals What do post-editors correct? A fine-grained analysis of SMT and NMT errors

Author(s):  
Sergi Alvarez-Vidal ◽  
Antoni Oliver ◽  
Toni Badia

The recent improvements in neural MT (NMT) have driven a shift from statistical MT (SMT) to NMT. However, to assess the usefulness of MT models for post-editing (PE) and have a detailed insight of the output they produce, we need to analyse the most frequent errors and how they affect the task. We present a pilot study of a fine-grained analysis of MT errors based on post-editors corrections for an English to Spanish medical text translated with SMT and NMT. We use the MQM taxonomy to compare the two MT models and have a categorized classification of the errors produced. Even though results show a great variation among post-editors’ corrections, for this language combination fewer errors are corrected by post-editors in the NMT output. NMT also produces fewer accuracy errors and errors that are less critical.

2007 ◽  
Vol 33 (3) ◽  
pp. 397-427 ◽  
Author(s):  
Raquel Fernández ◽  
Jonathan Ginzburg ◽  
Shalom Lappin

In this article we use well-known machine learning methods to tackle a novel task, namely the classification of non-sentential utterances (NSUs) in dialogue. We introduce a fine-grained taxonomy of NSU classes based on corpus work, and then report on the results of several machine learning experiments. First, we present a pilot study focused on one of the NSU classes in the taxonomy—bare wh-phrases or “sluices”—and explore the task of disambiguating between the different readings that sluices can convey. We then extend the approach to classify the full range of NSU classes, obtaining results of around an 87% weighted F-score. Thus our experiments show that, for the taxonomy adopted, the task of identifying the right NSU class can be successfully learned, and hence provide a very encouraging basis for the more general enterprise of fully processing NSUs.


Author(s):  
Jonas Austerjost ◽  
Robert Söldner ◽  
Christoffer Edlund ◽  
Johan Trygg ◽  
David Pollard ◽  
...  

Machine vision is a powerful technology that has become increasingly popular and accurate during the last decade due to rapid advances in the field of machine learning. The majority of machine vision applications are currently found in consumer electronics, automotive applications, and quality control, yet the potential for bioprocessing applications is tremendous. For instance, detecting and controlling foam emergence is important for all upstream bioprocesses, but the lack of robust foam sensing often leads to batch failures from foam-outs or overaddition of antifoam agents. Here, we report a new low-cost, flexible, and reliable foam sensor concept for bioreactor applications. The concept applies convolutional neural networks (CNNs), a state-of-the-art machine learning system for image processing. The implemented method shows high accuracy for both binary foam detection (foam/no foam) and fine-grained classification of foam levels.


2020 ◽  
Vol 54 (3) ◽  
pp. 647-696
Author(s):  
Beatriz Fernández ◽  
Fernando Zúñiga ◽  
Ane Berro

Abstract This paper explores the formal expression of two Basque dative argument types in combination with psych nouns and adjectives, in intransitive and transitive clauses: (i) those that express the experiencer, and (ii) those that express the stimulus of the psychological state denoted by the psych noun and adjective. In the intransitive structure involving a dative experiencer (DatExpIS), the stimulus is in the absolutive case, and the intransitive copula izan ‘be’ shows both dative and absolutive agreement. This construction basically corresponds to those built upon the piacere type of psychological verbs typified in (Belletti, Adriana & Luigi Rizzi. 1988. Psych-verbs and θ-theory. Natural Language and Linguistic Theory 6. 291–352) three-way classification of Italian psych verbs. In the intransitive structure involving a dative stimulus (DatStimIS), the experiencer is marked by absolutive case, and the same intransitive copula shows both absolutive and dative agreement (with the latter corresponding to the dative stimulus and not to the experiencer). We show that the behavior of the dative argument in the two constructions is just the opposite of each other regarding a number of morphosyntactic tests, including agreement, constituency, hierarchy and selection. Additionally, we explore two parallel transitive constructions that involve either a dative experiencer and an ergative stimulus (DatExpTS) or a dative stimulus and an ergative experiencer (DatStimTS), which employ the transitive copula *edun ‘have’. Considering these configurations, we propose an extended and more fine-grained typology of psych predicates.


2018 ◽  
Vol 20 (4) ◽  
pp. 28-36 ◽  
Author(s):  
Mohamed Elkholy ◽  
Ahmed Elfatatry
Keyword(s):  

Author(s):  
Elgison da Luz dos Santos ◽  
Maria de Fátima Fernandes Vara ◽  
Maira Ranciaro ◽  
Gustavo Tanaka Zelaga ◽  
Amanda Mayara Pereira Gomes ◽  
...  
Keyword(s):  

2021 ◽  
Author(s):  
Ming Xie ◽  
Yunpeng Jia ◽  
Ying Li ◽  
Xiaohua Cai ◽  
Kai Cao

Abstract Laser-induced fluorescence (LIF) is an effective, all-weather oil spill identification method that has been widely applied for oil spill monitoring. However, the distinguishability on oil types is seldom considered while selecting excitation wavelength. This study is intended to find the optimal excitation wavelength for fine-grained classification of refined oil pollutants using LIF by comparing the distinguishability of fluorometric spectra under various excitation wavelengths on some typical types of refined-oil samples. The results show that the fluorometric spectra of oil samples significantly vary under different excitation wavelengths, and the four types of oil applied in this study are most likely to be distinguished under the excitation wavelengths of 395 nm and 420 nm. This study is expected to improve the ability of oil types identification using LIF method without increasing time or other cost, and also provides theoretical basis for the development of portable LIF devices for oil spill identification.


Sign in / Sign up

Export Citation Format

Share Document