neural models Latest Research Papers

Joined Type Length Encoding for Nested Named Entity Recognition

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3487057 ◽

2022 ◽

Vol 21 (3) ◽

pp. 1-23

Author(s):

Mohammad Sadegh Sheikhaei ◽

Hasan Zafari ◽

Yuan Tian

Keyword(s):

State Of The Art ◽

Named Entity Recognition ◽

Ensemble Method ◽

Entity Recognition ◽

Neural Models ◽

Named Entities ◽

Layer Sequence ◽

Named Entity ◽

The Ensemble Method ◽

Single Sequence

In this article, we propose a new encoding scheme for named entity recognition (NER) called Joined Type-Length encoding (JoinedTL). Unlike most existing named entity encoding schemes, which focus on flat entities, JoinedTL can label nested named entities in a single sequence. JoinedTL uses a packed encoding to represent both type and span of a named entity, which not only results in less tagged tokens compared to existing encoding schemes, but also enables it to support nested NER. We evaluate the effectiveness of JoinedTL for nested NER on three nested NER datasets: GENIA in English, GermEval in German, and PerNest, our newly created nested NER dataset in Persian. We apply CharLSTM+WordLSTM+CRF, a three-layer sequence tagging model on three datasets encoded using JoinedTL and two existing nested NE encoding schemes, i.e., JoinedBIO and JoinedBILOU. Our experiment results show that CharLSTM+WordLSTM+CRF trained with JoinedTL encoded datasets can achieve competitive F1 scores as the ones trained with datasets encoded by two other encodings, but with 27%–48% less tagged tokens. To leverage the power of three different encodings, i.e., JoinedTL, JoinedBIO, and JoinedBILOU, we propose an encoding-based ensemble method for nested NER. Evaluation results show that the ensemble method achieves higher F1 scores on all datasets than the three models each trained using one of the three encodings. By using nested NE encodings including JoinedTL with CharLSTM+WordLSTM+CRF, we establish new state-of-the-art performance with an F1 score of 83.7 on PerNest, 74.9 on GENIA, and 70.5 on GermEval, surpassing two recent neural models specially designed for nested NER.

A Self-Supervised Representation Learning of Sentence Structure for Authorship Attribution

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3491203 ◽

2022 ◽

Vol 16 (4) ◽

pp. 1-16

Author(s):

Fereshteh Jafariakinabad ◽

Kien A. Hua

Keyword(s):

Structural Information ◽

Syntactic Structure ◽

Representation Learning ◽

Authorship Attribution ◽

Sentence Structure ◽

Vector Representation ◽

Writing Style ◽

Neural Models ◽

Syntactic Information ◽

Classification Tasks

The syntactic structure of sentences in a document substantially informs about its authorial writing style. Sentence representation learning has been widely explored in recent years and it has been shown that it improves the generalization of different downstream tasks across many domains. Even though utilizing probing methods in several studies suggests that these learned contextual representations implicitly encode some amount of syntax, explicit syntactic information further improves the performance of deep neural models in the domain of authorship attribution. These observations have motivated us to investigate the explicit representation learning of syntactic structure of sentences. In this article, we propose a self-supervised framework for learning structural representations of sentences. The self-supervised network contains two components; a lexical sub-network and a syntactic sub-network which take the sequence of words and their corresponding structural labels as the input, respectively. Due to the n -to-1 mapping of words to their structural labels, each word will be embedded into a vector representation which mainly carries structural information. We evaluate the learned structural representations of sentences using different probing tasks, and subsequently utilize them in the authorship attribution task. Our experimental results indicate that the structural embeddings significantly improve the classification tasks when concatenated with the existing pre-trained word embeddings.

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

Sensors ◽

10.3390/s22010374 ◽

2022 ◽

Vol 22 (1) ◽

pp. 374

Author(s):

Mohamed Nabih Ali ◽

Daniele Falavigna ◽

Alessio Brutti

Keyword(s):

Speech Enhancement ◽

Time Domain ◽

Background Noise ◽

State Of The Art ◽

Noisy Environments ◽

Neural Models ◽

Convolutional Network ◽

Environmental Perturbations ◽

Front End ◽

The Time Domain

Robustness against background noise and reverberation is essential for many real-world speech-based applications. One way to achieve this robustness is to employ a speech enhancement front-end that, independently of the back-end, removes the environmental perturbations from the target speech signal. However, although the enhancement front-end typically increases the speech quality from an intelligibility perspective, it tends to introduce distortions which deteriorate the performance of subsequent processing modules. In this paper, we investigate strategies for jointly training neural models for both speech enhancement and the back-end, which optimize a combined loss function. In this way, the enhancement front-end is guided by the back-end to provide more effective enhancement. Differently from typical state-of-the-art approaches employing on spectral features or neural embeddings, we operate in the time domain, processing raw waveforms in both components. As application scenario we consider intent classification in noisy environments. In particular, the front-end speech enhancement module is based on Wave-U-Net while the intent classifier is implemented as a temporal convolutional network. Exhaustive experiments are reported on versions of the Fluent Speech Commands corpus contaminated with noises from the Microsoft Scalable Noisy Speech Dataset, shedding light and providing insight about the most promising training approaches.

Biomaterial-based strategies for in vitro neural models

Biomaterials Science ◽

10.1039/d1bm01361k ◽

2022 ◽

Author(s):

Alp Ozgun ◽

David Lomboni ◽

Hallie Arnott ◽

William A. Staines ◽

John Woulfe ◽

...

Keyword(s):

Neural Tissue ◽

Neural Models ◽

Tissue Models

This review provides a comprehensive compendium of commonly used biomaterials as well as the different fabrication techniques employed for the design of 3D neural tissue models.

Humidity Forecasting in a Potato Plantation using Time-Series Neural Models

Journal of Computational Science ◽

10.1016/j.jocs.2021.101547 ◽

2022 ◽

pp. 101547

Author(s):

Mercedes Yartu ◽

Carlos Cambra ◽

Milagros Navarro ◽

Carlos Rad ◽

Ángel Arroyo ◽

...

Keyword(s):

Time Series ◽

Neural Models

Deep Neural Models for Key-Phrase Indexing

Pervasive Computing and Social Networking - Lecture Notes in Networks and Systems ◽

10.1007/978-981-16-5640-8_37 ◽

2022 ◽

pp. 483-497

Author(s):

Saurabh Sharma ◽

Vishal Gupta ◽

Mamta Juneja

Keyword(s):

Neural Models

Morphological Analysis of Japanese Hiragana Sentences using the BI-LSTM CRF Model

10.5121/csit.2021.112310 ◽

2021 ◽

Author(s):

Jun Izutsu ◽

Kanako Komiya

Keyword(s):

Natural Language Processing ◽

Language Processing ◽

Essential Role ◽

Morphological Analysis ◽

Training Data ◽

Fine Tuning ◽

Chinese Characters ◽

Neural Models ◽

Text Data ◽

Parts Of Speech

This study proposes a method to develop neural models of the morphological analyzer for Japanese Hiragana sentences using the Bi-LSTM CRF model. Morphological analysis is a technique that divides text data into words and assigns information such as parts of speech. This technique plays an essential role in downstream applications in Japanese natural language processing systems because the Japanese language does not have word delimiters between words. Hiragana is a type of Japanese phonogramic characters, which is used for texts for children or people who cannot read Chinese characters. Morphological analysis of Hiragana sentences is more difficult than that of ordinary Japanese sentences because there is less information for dividing. For morphological analysis of Hiragana sentences, we demonstrated the effectiveness of fine-tuning using a model based on ordinary Japanese text and examined the influence of training data on texts of various genres.

Deep Learning for Text Style Transfer: A Survey

Computational Linguistics ◽

10.1162/coli_a_00426 ◽

2021 ◽

pp. 1-51

Author(s):

Di Jin ◽

Zhijing Jin ◽

Zhiting Hu ◽

Olga Vechtomova ◽

Rada Mihalcea

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Neural Models ◽

Style Transfer ◽

Language Generation ◽

Parallel Data ◽

The Rich ◽

Significant Attention

Abstract Text style transfer is an important task in natural language generation, which aims to control certain attributes in the generated text, such as politeness, emotion, humor, and many others. It has a long history in the field of natural language processing, and recently has re-gained significant attention thanks to the promising performance brought by deep neural models. In this paper, we present a systematic survey of the research on neural text style transfer, spanning over 100 representative articles since the first neural text style transfer work in 2017. We discuss the task formulation, existing datasets and subtasks, evaluation, as well as the rich methodologies in the presence of parallel and non-parallel data. We also provide discussions on a variety of important topics regarding the future development of this task.

A Probabilistic Re-Intepretation of Confidence Scores in Multi-Exit Models

Entropy ◽

10.3390/e24010001 ◽

2021 ◽

Vol 24 (1) ◽

pp. 1

Author(s):

Jary Pomponi ◽

Simone Scardapane ◽

Aurelio Uncini

Keyword(s):

Neural Network ◽

Image Classification ◽

Deep Neural Network ◽

Confidence Score ◽

Good Prediction ◽

Neural Models ◽

Stopping Criteria ◽

New Approach ◽

The Real ◽

Early Exit

In this paper, we propose a new approach to train a deep neural network with multiple intermediate auxiliary classifiers, branching from it. These ‘multi-exits’ models can be used to reduce the inference time by performing early exit on the intermediate branches, if the confidence of the prediction is higher than a threshold. They rely on the assumption that not all the samples require the same amount of processing to yield a good prediction. In this paper, we propose a way to train jointly all the branches of a multi-exit model without hyper-parameters, by weighting the predictions from each branch with a trained confidence score. Each confidence score is an approximation of the real one produced by the branch, and it is calculated and regularized while training the rest of the model. We evaluate our proposal on a set of image classification benchmarks, using different neural models and early-exit stopping criteria.

Neural models for monitoring the transmembrane flux in the vinasse clarification process by crossflow microfiltration

C Q D - Revista Eletrônica Paulista de Matemática ◽

10.21167/cqdvol21ic202123169664aabsjmserf3849 ◽

2021 ◽

Vol 21 (IC) ◽

pp. 38-49

Author(s):

André Arthur Bueno da Silva ◽

Juliana Maria da Silva ◽

Érica Regina Filletti

Keyword(s):

Neural Models ◽

Transmembrane Flux ◽

Crossflow Microfiltration

neural models
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Joined Type Length Encoding for Nested Named Entity Recognition

A Self-Supervised Representation Learning of Sentence Structure for Authorship Attribution

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

Biomaterial-based strategies for in vitro neural models

Humidity Forecasting in a Potato Plantation using Time-Series Neural Models

Deep Neural Models for Key-Phrase Indexing

Morphological Analysis of Japanese Hiragana Sentences using the BI-LSTM CRF Model

Deep Learning for Text Style Transfer: A Survey

A Probabilistic Re-Intepretation of Confidence Scores in Multi-Exit Models

Neural models for monitoring the transmembrane flux in the vinasse clarification process by crossflow microfiltration

Export Citation Format

neural modelsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Joined Type Length Encoding for Nested Named Entity Recognition

A Self-Supervised Representation Learning of Sentence Structure for Authorship Attribution

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

Biomaterial-based strategies for in vitro neural models

Humidity Forecasting in a Potato Plantation using Time-Series Neural Models

Deep Neural Models for Key-Phrase Indexing

Morphological Analysis of Japanese Hiragana Sentences using the BI-LSTM CRF Model

Deep Learning for Text Style Transfer: A Survey

A Probabilistic Re-Intepretation of Confidence Scores in Multi-Exit Models

Neural models for monitoring the transmembrane flux in the vinasse clarification process by crossflow microfiltration

neural models
Recently Published Documents