Natural language modeling with syntactic structure dependency

2020 ◽  
Vol 523 ◽  
pp. 220-233
Author(s):  
Kai Shuang ◽  
Yijia Tan ◽  
Zhun Cai ◽  
Yue Sun

2021 ◽  
Author(s):  
Abdul Wahab ◽  
Rafet Sifa

In this paper, we propose a new model named DIBERT, which stands for Dependency Injected Bidirectional Encoder Representations from Transformers. DIBERT is a variation of BERT that adds a third pre-training objective, Parent Prediction (PP), alongside Masked Language Modeling (MLM) and Next Sentence Prediction (NSP). PP injects the syntactic structure of a dependency tree during pre-training, so that DIBERT generates syntax-aware generic representations. We use the WikiText-103 benchmark dataset to pre-train both BERT-Base and DIBERT. After fine-tuning, we observe that DIBERT performs better than BERT-Base on various downstream tasks, including Semantic Similarity, Natural Language Inference, and Sentiment Analysis.
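The Parent Prediction objective can be pictured as a per-token classification over candidate head positions in the dependency tree. Below is a minimal sketch, assuming a simple bilinear scorer on top of the encoder's hidden states; the module name, the scoring function, and the way the PP loss would be summed with MLM and NSP are illustrative choices, not the authors' exact implementation.

```python
# Minimal sketch of a parent-prediction (PP) auxiliary head on top of
# transformer hidden states. The bilinear scorer and the way the PP loss is
# combined with MLM/NSP are illustrative assumptions, not DIBERT's exact code.
import torch
import torch.nn as nn


class ParentPredictionHead(nn.Module):
    """Scores, for every token, which other token is its dependency parent."""

    def __init__(self, hidden_size: int):
        super().__init__()
        self.child_proj = nn.Linear(hidden_size, hidden_size)
        self.parent_proj = nn.Linear(hidden_size, hidden_size)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: [batch, seq_len, hidden]
        child = self.child_proj(hidden_states)      # [B, T, H]
        parent = self.parent_proj(hidden_states)    # [B, T, H]
        # Score every (child, candidate-parent) pair: [B, T, T]
        return torch.matmul(child, parent.transpose(1, 2))


def parent_prediction_loss(scores, parent_ids, ignore_index=-100):
    # scores: [B, T, T]; parent_ids: [B, T] holding the gold parent position
    # (use ignore_index for the root / padding tokens).
    B, T, _ = scores.shape
    return nn.functional.cross_entropy(
        scores.reshape(B * T, T), parent_ids.reshape(B * T),
        ignore_index=ignore_index)


# Usage with random tensors standing in for encoder output and a gold parse.
hidden = torch.randn(2, 8, 768)
gold_parents = torch.randint(0, 8, (2, 8))
head = ParentPredictionHead(768)
pp_loss = parent_prediction_loss(head(hidden), gold_parents)
# The total pre-training loss would then be e.g. mlm_loss + nsp_loss + pp_loss.
```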


2021 ◽  
Vol 11 (7) ◽  
pp. 3095
Author(s):  
Suhyune Son ◽  
Seonjeong Hwang ◽  
Sohyeun Bae ◽  
Soo Jun Park ◽  
Jang-Hwan Choi

Multi-task learning (MTL) approaches are actively used for various natural language processing (NLP) tasks. The Multi-Task Deep Neural Network (MT-DNN) has contributed significantly to improving the performance of natural language understanding (NLU) tasks. However, one drawback is that confusion about the language representation of various tasks arises during the training of the MT-DNN model. Inspired by the internal-transfer weighting of MTL in medical imaging, we introduce a Sequential and Intensive Weighted Language Modeling (SIWLM) scheme. The SIWLM consists of two stages: (1) Sequential weighted learning (SWL), which trains a model to learn entire tasks sequentially and concentrically, and (2) Intensive weighted learning (IWL), which enables the model to focus on the central task. We apply this scheme to the MT-DNN model and call the resulting model MTDNN-SIWLM. Our model achieves higher performance than the existing reference algorithms on six out of the eight GLUE benchmark tasks. Moreover, our model outperforms MT-DNN by 0.77 on average across all tasks. Finally, we conduct a thorough empirical investigation to determine the optimal weight for each GLUE task.
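To make the weighting idea concrete, here is a rough sketch of a per-task weight schedule applied to a summed multi-task loss; the function names, weight values, and the rotation/focus schedule are placeholders that only loosely mirror the SWL/IWL stages described in the abstract, not the paper's reported settings.

```python
# Illustrative sketch of weighting per-task losses during multi-task training.
# Stage 1 rotates emphasis over tasks (sequential weighting); stage 2 keeps the
# emphasis on a "central" task (intensive weighting). All constants below are
# placeholders.
from typing import Dict, List


def combine_task_losses(losses: Dict[str, float],
                        weights: Dict[str, float]) -> float:
    """Weighted sum of per-task losses."""
    return sum(weights[task] * loss for task, loss in losses.items())


def siwlm_weights(tasks: List[str], step: int, swl_steps: int,
                  central_task: str, base: float = 1.0,
                  emphasis: float = 2.0) -> Dict[str, float]:
    """Return per-task weights for the current training step."""
    weights = {task: base for task in tasks}
    if step < swl_steps:
        focus = tasks[step % len(tasks)]   # SWL: sequential emphasis
    else:
        focus = central_task               # IWL: intensive focus
    weights[focus] = emphasis
    return weights


# Usage with dummy per-task losses.
tasks = ["MNLI", "QQP", "SST-2", "CoLA"]
weights = siwlm_weights(tasks, step=5, swl_steps=100, central_task="MNLI")
total_loss = combine_task_losses({task: 1.0 for task in tasks}, weights)
```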


2022 ◽  
pp. 1-13
Author(s):  
Denis Paperno

Can recurrent neural nets, inspired by human sequential data processing, learn to understand language? We construct simplified datasets reflecting core properties of natural language as modeled in formal syntax and semantics: recursive syntactic structure and compositionality. We find LSTM and GRU networks to generalise to compositional interpretation well, but only in the most favorable learning settings, with a well-paced curriculum, extensive training data, and left-to-right (but not right-to-left) composition.
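For reference, the kind of recurrent learner evaluated here can be as small as the sketch below: an LSTM that reads a symbol sequence left to right and emits an interpretation label. The vocabulary size, dimensions, and classification framing are placeholders rather than the paper's actual experimental setup.

```python
# Minimal sketch of an LSTM sequence interpreter: embed a symbol sequence,
# encode it left to right, and predict an interpretation label from the final
# hidden state. Sizes and the label set are illustrative placeholders.
import torch
import torch.nn as nn


class SequenceInterpreter(nn.Module):
    def __init__(self, vocab_size=50, embed_dim=32, hidden_dim=64, n_labels=10):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, n_labels)

    def forward(self, tokens):              # tokens: [batch, seq_len]
        embedded = self.embed(tokens)
        _, (h_n, _) = self.lstm(embedded)    # final hidden state summarizes input
        return self.out(h_n[-1])             # logits over interpretations


model = SequenceInterpreter()
logits = model(torch.randint(0, 50, (4, 12)))  # 4 toy sequences of length 12
```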


Author(s):  
Nia Shafira ◽  
Etin Martiana ◽  
Rengga Asmara

As the main train service provider in Indonesia, PT Kereta Api Indonesia (PT KAI) has many customers who need information. To maintain customer loyalty, PT KAI must respond quickly and adopt technology that provides the best possible service. Limited human resources mean PT KAI cannot serve all customers simultaneously, so customers often have to wait for a response. Automatic messages are therefore needed to help customer service respond quickly and simultaneously, at no cost and accessible anytime and anywhere. This study proposes a new approach that uses a chatbot as a medium for conveying information automatically, quickly, and simultaneously. The chatbot is built on natural language modeling and uses cosine similarity to measure how closely a user's input matches entries in the database. This research can help PT KAI's customer service staff answer customer needs automatically.
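A minimal sketch of the retrieval step described above: match a customer message against stored question-answer pairs using TF-IDF vectors and cosine similarity. The example FAQ entries and the similarity threshold are invented for illustration and are not taken from the study.

```python
# Match a customer message against stored FAQ questions with TF-IDF vectors
# and cosine similarity, returning the answer of the closest question.
# The FAQ content and threshold below are placeholders.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

faq_questions = [
    "How do I cancel a ticket?",
    "What are the check-in rules?",
    "How can I change my travel date?",
]
faq_answers = [
    "Tickets can be cancelled via the app up to three hours before departure.",
    "Check-in opens two hours before departure at the station counter.",
    "Travel dates can be changed once, subject to seat availability.",
]

vectorizer = TfidfVectorizer()
faq_matrix = vectorizer.fit_transform(faq_questions)


def answer(user_message: str, threshold: float = 0.2) -> str:
    query_vec = vectorizer.transform([user_message])
    scores = cosine_similarity(query_vec, faq_matrix)[0]
    best = scores.argmax()
    if scores[best] < threshold:
        return "Sorry, I did not understand. A human agent will follow up."
    return faq_answers[best]


print(answer("how to change the date of my trip"))
```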


1979 ◽  
Vol 15 (1) ◽  
pp. 39-47 ◽  
Author(s):  
Geoffrey Sampson

Many contemporary linguists hold that an adequate description of a natural language must represent many of its vocabulary items as syntactically and/or semantically complex. A sentence containing the word kill, for instance, will on this view be assigned a ‘deep syntactic structure’ or ‘semantic representation’ in which kill is represented by a portion or portions of tree-structure, the lowest nodes of which are labelled with ‘semantic primitives’ such as CAUSE and DIE, or CAUSE, BECOME, NOT and ALIVE. In the case of words such as cats or walked, which are formed in accordance with productive rules of ‘inflexional’ rather than ‘derivational’ morphology, there is little dispute that their composite status will be reflected at most or all levels of linguistic representation. (That is why I refer, above, to ‘vocabulary items’: cat and cats may be called different ‘words’, but not different elements of the English vocabulary.) When morphologically simple words such as kill are treated as composite at a ‘deeper’ level, I, for one, find my credulity strained to breaking point. (The case of words formed in accordance with productive or non-productive rules of derivational morphology, such as killer or kingly, is an intermediate one and I shall briefly return to it below.)
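For concreteness, the decomposition under discussion can be written out directly. In the sketch below the primitives CAUSE, BECOME, NOT, and ALIVE come from the passage, while representing the decomposition tree as nested tuples is purely an illustrative choice.

```python
# The lexical decomposition of "kill" discussed above, written as a nested
# structure whose nodes are labelled with semantic primitives.
KILL = ("CAUSE", ("BECOME", ("NOT", "ALIVE")))


def primitives(node):
    """Collect the semantic primitives labelling the decomposition tree."""
    if isinstance(node, str):
        return [node]
    result = []
    for child in node:
        result.extend(primitives(child))
    return result


print(primitives(KILL))  # ['CAUSE', 'BECOME', 'NOT', 'ALIVE']
```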


2008 ◽  
Vol 34 (4) ◽  
pp. 597-614 ◽  
Author(s):  
Trevor Cohn ◽  
Chris Callison-Burch ◽  
Mirella Lapata

Automatic paraphrasing is an important component in many natural language processing tasks. In this article we present a new parallel corpus with paraphrase annotations. We adopt a definition of paraphrase based on word alignments and show that it yields high inter-annotator agreement. As Kappa is suited to nominal data, we employ an alternative agreement statistic which is appropriate for structured alignment tasks. We discuss how the corpus can be usefully employed in evaluating paraphrase systems automatically (e.g., by measuring precision, recall, and F1) and also in developing linguistically rich paraphrase models based on syntactic structure.
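As a concrete illustration of the evaluation the authors mention, the sketch below scores a predicted word alignment against a gold annotation with precision, recall, and F1; representing an alignment as a set of (source, target) index pairs is an assumption made here, not the article's exact formulation.

```python
# Score a predicted word alignment against a gold alignment, where each
# alignment is a set of (source_index, target_index) pairs.
def alignment_prf(predicted: set, gold: set):
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


gold = {(0, 0), (1, 2), (2, 1), (3, 3)}
predicted = {(0, 0), (1, 2), (3, 4)}
print(alignment_prf(predicted, gold))  # roughly (0.67, 0.50, 0.57)
```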

