Text Genre Detection Using Doc2Vec Word-embedding Language Model

Dongsung Kim

doi:10.29403/li.23.2.2

A Latent Variable Model for Learning Distributional Relation Vectors

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/682 ◽

2019 ◽

Author(s):

Jose Camacho-Collados ◽

Luis Espinosa-Anke ◽

Shoaib Jameel ◽

Steven Schockaert

Keyword(s):

Latent Variable ◽

Language Model ◽

Word Embedding ◽

Latent Variable Model ◽

Variable Model ◽

The Given ◽

The Relationship ◽

Unsupervised Approaches ◽

Target Words

Recently a number of unsupervised approaches have been proposed for learning vectors that capture the relationship between two words. Inspired by word embedding models, these approaches rely on co-occurrence statistics that are obtained from sentences in which the two target words appear. However, the number of such sentences is often quite small, and most of the words that occur in them are not relevant for characterizing the considered relationship. As a result, standard co-occurrence statistics typically lead to noisy relation vectors. To address this issue, we propose a latent variable model that aims to explicitly determine what words from the given sentences best characterize the relationship between the two target words. Relation vectors then correspond to the parameters of a simple unigram language model which is estimated from these words.

Download Full-text

Fracture Mechanics Method for Word Embedding Generation of Neural Probabilistic Linguistic Model

Computational Intelligence and Neuroscience ◽

10.1155/2016/3506261 ◽

2016 ◽

Vol 2016 ◽

pp. 1-11

Author(s):

Size Bi ◽

Xiao Liang ◽

Ting-lei Huang

Keyword(s):

Fracture Mechanics ◽

Language Model ◽

Named Entity Recognition ◽

Word Embedding ◽

Entity Recognition ◽

Semantic Role Labeling ◽

Part Of Speech ◽

Linguistic Model ◽

Traditional Language ◽

Long Time

Word embedding, a lexical vector representation generated via the neural linguistic model (NLM), is empirically demonstrated to be appropriate for improvement of the performance of traditional language model. However, the supreme dimensionality that is inherent in NLM contributes to the problems of hyperparameters and long-time training in modeling. Here, we propose a force-directed method to improve such problems for simplifying the generation of word embedding. In this framework, each word is assumed as a point in the real world; thus it can approximately simulate the physical movement following certain mechanics. To simulate the variation of meaning in phrases, we use the fracture mechanics to do the formation and breakdown of meaning combined by a 2-gram word group. With the experiments on the natural linguistic tasks of part-of-speech tagging, named entity recognition and semantic role labeling, the result demonstrated that the 2-dimensional word embedding can rival the word embeddings generated by classic NLMs, in terms of accuracy, recall, and text visualization.

Download Full-text

G2Basy: A framework to improve the RNN language model and ease overfitting problem

PLoS ONE ◽

10.1371/journal.pone.0249820 ◽

2021 ◽

Vol 16 (4) ◽

pp. e0249820

Author(s):

Lu Yuwen ◽

Shuyu Chen ◽

Xiaohan Yuan

Keyword(s):

Language Model ◽

Word Embedding ◽

Language Models ◽

Batch Size ◽

Training Process ◽

Improve Performance ◽

Step Size ◽

Learning Rates ◽

Speed Up ◽

Overfitting Problem

Recurrent neural networks are efficient ways of training language models, and various RNN networks have been proposed to improve performance. However, with the increase of network scales, the overfitting problem becomes more urgent. In this paper, we propose a framework—G2Basy—to speed up the training process and ease the overfitting problem. Instead of using predefined hyperparameters, we devise a gradient increasing and decreasing technique that changes the parameters training batch size and input dropout simultaneously by a user-defined step size. Together with a pretrained word embedding initialization procedure and the introduction of different optimizers at different learning rates, our framework speeds up the training process dramatically and improves performance compared with a benchmark model of the same scale. For the word embedding initialization, we propose the concept of “artificial features” to describe the characteristics of the obtained word embeddings. We experiment on two of the most often used corpora—the Penn Treebank and WikiText-2 datasets—and both outperform the benchmark results and show potential towards further improvement. Furthermore, our framework shows better results with the larger and more complicated WikiText-2 corpus than with the Penn Treebank. Compared with other state-of-the-art results, we achieve comparable results with network scales hundreds of times smaller and within fewer training epochs.

Download Full-text

MenuNER: Domain-Adapted BERT Based NER Approach for a Domain with Limited Dataset and Its Application to Food Menu Domain

Applied Sciences ◽

10.3390/app11136007 ◽

2021 ◽

Vol 11 (13) ◽

pp. 6007

Author(s):

Muzamil Hussain Syed ◽

Sun-Tae Chung

Keyword(s):

Domain Adaptation ◽

Language Model ◽

Named Entity Recognition ◽

Word Embedding ◽

Fine Tuning ◽

Entity Recognition ◽

Language Models ◽

Feature Vectors ◽

Named Entity ◽

Domain Specific

Entity-based information extraction is one of the main applications of Natural Language Processing (NLP). Recently, deep transfer-learning utilizing contextualized word embedding from pre-trained language models has shown remarkable results for many NLP tasks, including Named-entity recognition (NER). BERT (Bidirectional Encoder Representations from Transformers) is gaining prominent attention among various contextualized word embedding models as a state-of-the-art pre-trained language model. It is quite expensive to train a BERT model from scratch for a new application domain since it needs a huge dataset and enormous computing time. In this paper, we focus on menu entity extraction from online user reviews for the restaurant and propose a simple but effective approach for NER task on a new domain where a large dataset is rarely available or difficult to prepare, such as food menu domain, based on domain adaptation technique for word embedding and fine-tuning the popular NER task network model ‘Bi-LSTM+CRF’ with extended feature vectors. The proposed NER approach (named as ‘MenuNER’) consists of two step-processes: (1) Domain adaptation for target domain; further pre-training of the off-the-shelf BERT language model (BERT-base) in semi-supervised fashion on a domain-specific dataset, and (2) Supervised fine-tuning the popular Bi-LSTM+CRF network for downstream task with extended feature vectors obtained by concatenating word embedding from the domain-adapted pre-trained BERT model from the first step, character embedding and POS tag feature information. Experimental results on handcrafted food menu corpus from customers’ review dataset show that our proposed approach for domain-specific NER task, that is: food menu named-entity recognition, performs significantly better than the one based on the baseline off-the-shelf BERT-base model. The proposed approach achieves 92.5% F1 score on the YELP dataset for the MenuNER task.

Download Full-text

Structured Word Embedding for Low Memory Neural Network Language Model

10.21437/interspeech.2018-1057 ◽

2018 ◽

Author(s):

Kaiyu Shi ◽

Kai Yu

Keyword(s):

Neural Network ◽

Language Model ◽

Word Embedding ◽

Network Language

Download Full-text

A Recurrent Neural Network Language Model Based on Word Embedding

Web and Big Data - Lecture Notes in Computer Science ◽

10.1007/978-3-030-01298-4_30 ◽

2018 ◽

pp. 368-377 ◽

Cited By ~ 2

Author(s):

Shuaimin Li ◽

Jungang Xu

Keyword(s):

Neural Network ◽

Recurrent Neural Network ◽

Language Model ◽

Word Embedding ◽

Model Based ◽

Network Language

Download Full-text

Word Embedding based Generalized Language Model for Information Retrieval

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '15 ◽

10.1145/2766462.2767780 ◽

2015 ◽

Cited By ~ 86

Author(s):

Debasis Ganguly ◽

Dwaipayan Roy ◽

Mandar Mitra ◽

Gareth J.F. Jones

Keyword(s):

Information Retrieval ◽

Language Model ◽

Word Embedding

Download Full-text

Character n-Gram Embeddings to Improve RNN Language Models

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33015074 ◽

2019 ◽

Vol 33 ◽

pp. 5074-5082 ◽

Cited By ~ 2

Author(s):

Sho Takase ◽

Jun Suzuki ◽

Masaaki Nagata

Keyword(s):

Neural Network ◽

Machine Translation ◽

Recurrent Neural Network ◽

Language Model ◽

Language Modeling ◽

Word Embedding ◽

Experimental Results ◽

Language Models ◽

Word Embeddings ◽

N Gram

This paper proposes a novel Recurrent Neural Network (RNN) language model that takes advantage of character information. We focus on character n-grams based on research in the field of word embedding construction (Wieting et al. 2016). Our proposed method constructs word embeddings from character ngram embeddings and combines them with ordinary word embeddings. We demonstrate that the proposed method achieves the best perplexities on the language modeling datasets: Penn Treebank, WikiText-2, and WikiText-103. Moreover, we conduct experiments on application tasks: machine translation and headline generation. The experimental results indicate that our proposed method also positively affects these tasks

Download Full-text

Neural Machine Translation with Word Embedding Transferred from Language Model

Journal of Digital Contents Society ◽

10.9728/dcs.2019.20.11.2211 ◽

2019 ◽

Vol 20 (11) ◽

pp. 2211-2216 ◽

Cited By ~ 1

Author(s):

Chanung Jeong ◽

Heeyoul Choi

Keyword(s):

Machine Translation ◽

Language Model ◽

Word Embedding ◽

Neural Machine Translation

Download Full-text

Class Language Model based on Word Embedding and POS Tagging

KIISE Transactions on Computing Practices ◽

10.5626/ktcp.2016.22.7.315 ◽

2016 ◽

Vol 22 (7) ◽

pp. 315-319 ◽

Cited By ~ 2

Author(s):

Euisok Chung ◽

Jeon-Gue Park

Keyword(s):

Language Model ◽

Word Embedding ◽

Pos Tagging ◽

Model Based

Download Full-text