A Neural Network Architecture for Detecting Grammatical Errors in Statistical Machine Translation

Abstract In this paper we present a Neural Network (NN) architecture for detecting grammatical errors in Statistical Machine Translation (SMT) using monolingual morpho-syntactic word representations in combination with surface and syntactic context windows. We test our approach on two language pairs and two tasks, namely detecting grammatical errors and predicting overall post-editing effort. Our results show that this approach is not only able to accurately detect grammatical errors but it also performs well as a quality estimation system for predicting overall post-editing effort, which is characterised by all types of MT errors. Furthermore, we show that this approach is portable to other languages.

Download Full-text

Estimating word-level quality of statistical machine translation output using monolingual information alone

Natural Language Engineering ◽

10.1017/s1351324919000111 ◽

2019 ◽

Vol 26 (1) ◽

pp. 73-94

Author(s):

Arda Tezcan ◽

Véronique Hoste ◽

Lieve Macken

Keyword(s):

Machine Translation ◽

Network Architecture ◽

State Of The Art ◽

Statistical Machine Translation ◽

Neural Network Architecture ◽

Quality Estimation ◽

Word Level ◽

Syntactic Features ◽

Grammatical Errors

AbstractVarious studies show that statistical machine translation (SMT) systems suffer from fluency errors, especially in the form of grammatical errors and errors related to idiomatic word choices. In this study, we investigate the effectiveness of using monolingual information contained in the machine-translated text to estimate word-level quality of SMT output. We propose a recurrent neural network architecture which uses morpho-syntactic features and word embeddings as word representations within surface and syntactic n-grams. We test the proposed method on two language pairs and for two tasks, namely detecting fluency errors and predicting overall post-editing effort. Our results show that this method is effective for capturing all types of fluency errors at once. Moreover, on the task of predicting post-editing effort, while solely relying on monolingual information, it achieves on-par results with the state-of-the-art quality estimation systems which use both bilingual and monolingual information.

Download Full-text

A Resting State fMRI Study on The Functional Connectivity, Neural Network Architecture and Neural Network Properties of PTSD

PsycEXTRA Dataset ◽

10.1037/e533652013-471 ◽

2012 ◽

Author(s):

Xiaodan Yan ◽

Charles Marmar

Keyword(s):

Neural Network ◽

Functional Connectivity ◽

Resting State ◽

Network Architecture ◽

Resting State Fmri ◽

Neural Network Architecture ◽

Fmri Study ◽

Network Properties

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text

Evolving Neural Network Architecture

10.21236/ada264802 ◽

1993 ◽

Author(s):

John R. McDonnell ◽

Don Waagen

Keyword(s):

Neural Network ◽

Network Architecture ◽

Neural Network Architecture

Download Full-text

Analysis of Using Regularization Technique in The Convolutional Neural Network Architecture to Detect Paddy Disease for Small Dataset

Journal of Physics Conference Series ◽

10.1088/1742-6596/1726/1/012010 ◽

2021 ◽

Vol 1726 ◽

pp. 012010

Author(s):

S Mujahidin ◽

N F Azhar ◽

B Prihasto

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Network Architecture ◽

Neural Network Architecture ◽

Regularization Technique ◽

Small Dataset

Download Full-text

Exploring a Siamese Neural Network Architecture for One-Shot Drug Discovery

2020 IEEE 20th International Conference on Bioinformatics and Bioengineering (BIBE) ◽

10.1109/bibe50027.2020.00035 ◽

2020 ◽

Author(s):

Luis Torres ◽

Nelson Monteiro ◽

Jose Oliveira ◽

Joel Arrais ◽

Bernardete Ribeiro

Keyword(s):

Neural Network ◽

Drug Discovery ◽

Network Architecture ◽

Neural Network Architecture

Download Full-text

optNet-50: An Optimized Residual Neural Network Architecture of Deep Learning for Driver's Distraction

2020 IEEE 23rd International Multitopic Conference (INMIC) ◽

10.1109/inmic50486.2020.9318087 ◽

2020 ◽

Author(s):

Tahir Abbas ◽

Syed Farooq Ali ◽

Aadil Zia Khan ◽

Irfan Kareem

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Neural Network Architecture

Download Full-text

Deepening the IDA* algorithm for knowledge graph reasoning through neural network architecture

Neurocomputing ◽

10.1016/j.neucom.2020.12.040 ◽

2021 ◽

Vol 429 ◽

pp. 101-109

Author(s):

Qi Wang ◽

Yongsheng Hao ◽

Feng Chen

Keyword(s):

Neural Network ◽

Network Architecture ◽

Knowledge Graph ◽

Neural Network Architecture

Download Full-text

Learning Subject-Generalized Topographical EEG Embeddings Using Deep Variational Autoencoders and Domain-Adversarial Regularization

Sensors ◽

10.3390/s21051792 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1792

Author(s):

Juan Hagad ◽

Tsukasa Kimura ◽

Ken-ichi Fukui ◽

Masayuki Numao

Keyword(s):

Neural Network ◽

Network Architecture ◽

Emotion Classification ◽

Limited Data ◽

Neural Network Architecture ◽

Building Models ◽

The Subject ◽

Data Constraints ◽

Input Level ◽

Normally Distributed

Two of the biggest challenges in building models for detecting emotions from electroencephalography (EEG) devices are the relatively small amount of labeled samples and the strong variability of signal feature distributions between different subjects. In this study, we propose a context-generalized model that tackles the data constraints and subject variability simultaneously using a deep neural network architecture optimized for normally distributed subject-independent feature embeddings. Variational autoencoders (VAEs) at the input level allow the lower feature layers of the model to be trained on both labeled and unlabeled samples, maximizing the use of the limited data resources. Meanwhile, variational regularization encourages the model to learn Gaussian-distributed feature embeddings, resulting in robustness to small dataset imbalances. Subject-adversarial regularization applied to the bi-lateral features further enforces subject-independence on the final feature embedding used for emotion classification. The results from subject-independent performance experiments on the SEED and DEAP EEG-emotion datasets show that our model generalizes better across subjects than other state-of-the-art feature embeddings when paired with deep learning classifiers. Furthermore, qualitative analysis of the embedding space reveals that our proposed subject-invariant bi-lateral variational domain adversarial neural network (BiVDANN) architecture may improve the subject-independent performance by discovering normally distributed features.

Download Full-text