Pronunciation-Enhanced Chinese Word Embedding

Author(s):  
Qinjuan Yang ◽  
Haoran Xie ◽  
Gary Cheng ◽  
Fu Lee Wang ◽  
Yanghui Rao

Abstract Chinese word embeddings have recently garnered considerable attention. Chinese characters and their sub-character components, which contain rich semantic information, have been incorporated to learn Chinese word embeddings. A Chinese character represents a combination of meaning, structure, and pronunciation. However, existing embedding learning methods focus only on the structure and meaning of Chinese characters. In this study, we aim to develop an embedding learning method that makes full use of the information represented by Chinese characters, including phonology, morphology, and semantics. Specifically, we propose a pronunciation-enhanced Chinese word embedding learning method, in which the pronunciations of context characters and target characters are simultaneously encoded into the embeddings. Evaluations on word similarity, word analogy reasoning, text classification, and sentiment analysis validate the effectiveness of the proposed method.
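The core idea of folding pronunciation into the context representation can be pictured with a minimal sketch in the spirit of CBOW/skip-gram composition. The lookup tables, dimensions, and pinyin tokens below are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch (not the paper's implementation): combine word, character,
# and pronunciation (pinyin) embeddings into one context representation.
import numpy as np

rng = np.random.default_rng(0)
DIM = 100

# Toy lookup tables: one vector per word, per character, and per pinyin syllable.
word_emb = {"苹果": rng.normal(size=DIM)}
char_emb = {"苹": rng.normal(size=DIM), "果": rng.normal(size=DIM)}
pron_emb = {"ping2": rng.normal(size=DIM), "guo3": rng.normal(size=DIM)}

def compose(word, chars, prons):
    """Average the word vector with its character and pronunciation vectors."""
    parts = [word_emb[word]]
    parts += [char_emb[c] for c in chars]
    parts += [pron_emb[p] for p in prons]
    return np.mean(parts, axis=0)

context_vec = compose("苹果", ["苹", "果"], ["ping2", "guo3"])
print(context_vec.shape)  # (100,)
```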

Author(s):  
Julien Tissier ◽  
Christophe Gravier ◽  
Amaury Habrard

Word embeddings are commonly used as a starting point in many NLP models to achieve state-of-the-art performance. However, with a large vocabulary and many dimensions, these floating-point representations are expensive in terms of both memory and computation, which makes them unsuitable for use on low-resource devices. The method proposed in this paper transforms real-valued embeddings into binary embeddings while preserving semantic information, requiring only 128 or 256 bits for each vector. This leads to a small memory footprint and fast vector operations. The model is based on an autoencoder architecture, which also allows the original vectors to be reconstructed from the binary ones. Experimental results on semantic similarity, text classification, and sentiment analysis tasks show that binarizing word embeddings leads to a loss of only ∼2% in accuracy while vector size is reduced by 97%. Furthermore, a top-k benchmark demonstrates that using these binary vectors is 30 times faster than using real-valued vectors.
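The speed-up comes from replacing cosine similarity over floats with Hamming distance over packed bits. The sketch below uses a random projection instead of the paper's learned autoencoder, and all sizes are placeholders; it only illustrates how 256-bit codes enable fast top-k retrieval with bit operations.

```python
# Illustrative sketch (not the paper's exact model): binarize real-valued
# embeddings via the sign of a linear projection, pack them into bits, and
# run top-k nearest-neighbour search under Hamming distance.
import numpy as np

rng = np.random.default_rng(0)
vocab, dim, n_bits = 1000, 300, 256

real_emb = rng.normal(size=(vocab, dim)).astype(np.float32)
W = rng.normal(size=(dim, n_bits)).astype(np.float32)   # stand-in for the learned encoder

codes = (real_emb @ W > 0).astype(np.uint8)              # (vocab, 256) in {0, 1}
packed = np.packbits(codes, axis=1)                      # 32 bytes = 256 bits per word

def top_k(query_idx, k=5):
    """Return the k nearest words to `query_idx` under Hamming distance."""
    xor = np.bitwise_xor(packed, packed[query_idx])
    dists = np.unpackbits(xor, axis=1).sum(axis=1)
    return np.argsort(dists)[1:k + 1]                    # skip the query itself

print(top_k(0))
```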


Author(s):  
Xiang Lisa Li ◽  
Jason Eisner

Pre-trained word embeddings like ELMo and BERT contain rich syntactic and semantic information, resulting in state-of-the-art performance on various tasks. We propose a very fast variational information bottleneck (VIB) method to nonlinearly compress these embeddings, keeping only the information that helps a discriminative parser. We compress each word embedding to either a discrete tag or a continuous vector. In the discrete version, our automatically compressed tags form an alternative tag set: we show experimentally that our tags capture most of the information in traditional POS tag annotations, but our tag sequences can be parsed more accurately at the same level of tag granularity. In the continuous version, we show experimentally that moderately compressing the word embeddings by our method yields a more accurate parser in 8 of 9 languages, unlike simple dimensionality reduction.
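A rough sketch of the continuous VIB variant is given below, assuming PyTorch. The layer sizes and the beta weight are illustrative placeholders, and the discriminative parser loss is only indicated in a comment; this is not the authors' released implementation.

```python
# Sketch of a continuous variational information bottleneck (VIB) layer that
# compresses a contextual word embedding into a small stochastic code.
import torch
import torch.nn as nn

class VIBCompressor(nn.Module):
    def __init__(self, in_dim=1024, bottleneck=64):
        super().__init__()
        self.mu = nn.Linear(in_dim, bottleneck)       # mean of q(z | embedding)
        self.logvar = nn.Linear(in_dim, bottleneck)   # log-variance of q(z | embedding)

    def forward(self, emb):
        mu, logvar = self.mu(emb), self.logvar(emb)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization
        # KL divergence from q(z | embedding) to a standard normal prior.
        kl = 0.5 * (mu.pow(2) + logvar.exp() - 1.0 - logvar).sum(dim=-1).mean()
        return z, kl

vib = VIBCompressor()
emb = torch.randn(32, 1024)           # e.g. a batch of ELMo/BERT vectors
z, kl = vib(emb)
beta = 1e-3                           # bottleneck strength (illustrative)
# total_loss = parser_loss(z, gold_trees) + beta * kl   # parser loss not shown
print(z.shape, kl.item())
```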


2020 ◽  
Author(s):  
Masashi Sugiyama

Recently, word embeddings have been used successfully in many natural language processing problems, and how to train a robust and accurate word embedding system efficiently is a popular research area. Since many, if not all, words have more than one sense, it is necessary to learn a separate vector for each sense of a word. Therefore, in this project, we have explored two multi-sense word embedding models: the Multi-Sense Skip-gram (MSSG) model and the Non-parametric Multi-Sense Skip-gram (NP-MSSG) model. Furthermore, we propose an extension of the Multi-Sense Skip-gram model, called the Incremental Multi-Sense Skip-gram (IMSSG) model, which learns the vectors of all senses of a word incrementally. We evaluate all the systems on a word similarity task and show that IMSSG outperforms the other models.
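The mechanism the MSSG family shares is deciding which sense of a word the current context activates. The sketch below is a simplified, assumed version of that sense-selection step (cosine similarity between the averaged context and per-sense cluster centroids); the dimensions, number of senses, and toy data are placeholders.

```python
# Simplified sketch of the sense-selection step in an MSSG-style model: each
# word keeps K sense vectors and K context-cluster centroids, and the sense
# whose centroid is closest to the current context gets updated.
import numpy as np

rng = np.random.default_rng(0)
DIM, K = 50, 3

sense_vecs = rng.normal(size=(K, DIM))       # K sense embeddings for one word
cluster_mu = rng.normal(size=(K, DIM))       # K context-cluster centroids

def select_sense(context_vecs):
    """Pick the sense whose cluster centroid best matches the averaged context."""
    ctx = np.mean(context_vecs, axis=0)
    sims = cluster_mu @ ctx / (np.linalg.norm(cluster_mu, axis=1) * np.linalg.norm(ctx))
    return int(np.argmax(sims))

context = rng.normal(size=(4, DIM))          # embeddings of surrounding words
k = select_sense(context)
# cluster_mu[k] and sense_vecs[k] would then be updated by the skip-gram step
print("selected sense:", k)
```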


2020 ◽  
pp. 1-35
Author(s):  
N. Pittaras ◽  
G. Giannakopoulos ◽  
G. Papadakis ◽  
V. Karkaletsis

Abstract The recent breakthroughs in deep neural architectures across multiple machine learning fields have led to the widespread use of deep neural models. These learners are often applied as black-box models that ignore or insufficiently utilize a wealth of preexisting semantic information. In this study, we focus on the text classification task, investigating methods for augmenting the input to deep neural networks (DNNs) with semantic information. We extract semantics for the words in the preprocessed text from the WordNet semantic graph, in the form of weighted concept terms that form a semantic frequency vector. Concepts are selected via a variety of semantic disambiguation techniques, including a basic, a part-of-speech-based, and a semantic embedding projection method. Additionally, we consider a weight propagation mechanism that exploits semantic relationships in the concept graph and conveys a spreading-activation component. We enrich word2vec embeddings with the resulting semantic vector through concatenation or replacement and apply the semantically augmented word embeddings to the classification task via a DNN. Experimental results over established datasets demonstrate that our approach of semantic augmentation in the input space boosts classification performance significantly, with concatenation offering the best performance. We also report additional findings regarding the behavior of term frequency-inverse document frequency (TF-IDF) normalization on semantic vectors, along with the potential for radical dimensionality reduction with negligible performance loss.
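The concatenation variant can be pictured with a small sketch: build a bag-of-concepts frequency vector from WordNet synsets (using only the basic first-sense disambiguation here) and append it to a word2vec-based representation. It assumes nltk with the WordNet corpus installed; the concept vocabulary and the word2vec vector are toy placeholders, not the paper's pipeline.

```python
# Hedged sketch of the "concatenation" setting: WordNet concept frequencies
# appended to a word2vec document vector.
import numpy as np
from collections import Counter
from nltk.corpus import wordnet as wn   # requires nltk.download("wordnet")

def concept_frequency_vector(tokens, concept_index):
    """Count the first-sense synset of each token over a fixed concept vocabulary."""
    counts = Counter()
    for tok in tokens:
        synsets = wn.synsets(tok)
        if synsets:                              # basic (first-sense) disambiguation
            counts[synsets[0].name()] += 1
    vec = np.zeros(len(concept_index))
    for concept, freq in counts.items():
        if concept in concept_index:
            vec[concept_index[concept]] = freq
    return vec

tokens = ["bank", "money", "river"]
concept_index = {"bank.n.01": 0, "money.n.01": 1, "river.n.01": 2}   # toy concept set
semantic_vec = concept_frequency_vector(tokens, concept_index)

doc_word2vec = np.random.default_rng(0).normal(size=300)   # placeholder word2vec vector
augmented = np.concatenate([doc_word2vec, semantic_vec])    # semantically augmented input
print(augmented.shape)
```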


2019 ◽  
Author(s):  
William Jin

Recently, word embeddings have been used successfully in many natural language processing problems, and how to train a robust and accurate word embedding system efficiently is a popular research area. Since many, if not all, words have more than one sense, it is necessary to learn a separate vector for each sense of a word. Therefore, in this project, we have explored two multi-sense word embedding models: the Multi-Sense Skip-gram (MSSG) model and the Non-parametric Multi-Sense Skip-gram (NP-MSSG) model. Furthermore, we propose an extension of the Multi-Sense Skip-gram model, called the Incremental Multi-Sense Skip-gram (IMSSG) model, which learns the vectors of all senses of a word incrementally. We evaluate all the systems on a word similarity task and show that IMSSG outperforms the other models.
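To complement the MSSG sense-selection sketch shown earlier, the following illustrates the non-parametric idea behind NP-MSSG (and, applied online, IMSSG): a new sense is created whenever the current context is sufficiently dissimilar from every existing sense cluster. The threshold and toy data are assumptions, not taken from the project.

```python
# Sketch of non-parametric sense creation: spawn a new sense when no existing
# cluster matches the current context well enough.
import numpy as np

rng = np.random.default_rng(1)
DIM, LAMBDA = 50, 0.3                     # LAMBDA: similarity threshold (assumed)

cluster_mu = [rng.normal(size=DIM)]       # start with a single sense cluster
sense_vecs = [rng.normal(size=DIM)]

def assign_or_create(context_vecs):
    """Return the index of the matching sense, creating a new one if needed."""
    ctx = np.mean(context_vecs, axis=0)
    sims = [ctx @ mu / (np.linalg.norm(ctx) * np.linalg.norm(mu)) for mu in cluster_mu]
    if max(sims) < LAMBDA:                # context unlike every known sense
        cluster_mu.append(ctx.copy())
        sense_vecs.append(rng.normal(size=DIM))
        return len(cluster_mu) - 1
    return int(np.argmax(sims))

k = assign_or_create(rng.normal(size=(4, DIM)))
print("assigned sense:", k, "of", len(sense_vecs))
```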


2021 ◽  
pp. 233-252
Author(s):  
Upendar Rao Rayala ◽  
Karthick Seshadri

Sentiment analysis is perceived to be a multi-disciplinary research domain that draws on machine learning, artificial intelligence, deep learning, image processing, and social networks. Sentiment analysis can be used to determine public opinion about products and to capture customers' interests and feedback through social networks. To perform any natural language processing task, the input text/comments must be represented in numerical form. Word embeddings represent the given text/sentences/words as vectors that can be employed in subsequent natural language processing tasks. In this chapter, the authors discuss different techniques that can improve the performance of sentiment analysis using concepts and techniques such as traditional word embeddings, sentiment embeddings, emoticons, lexicons, and neural networks. The chapter also traces the evolution of word embedding techniques with a chronological discussion of recent research advancements in the area.
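As a point of reference for the techniques surveyed in the chapter, the baseline pipeline (embed, pool, classify) can be sketched as follows. The embeddings, comments, and labels are toy placeholders; real systems would use pretrained or sentiment-specific embeddings as discussed in the chapter.

```python
# Minimal illustration: represent each comment as the average of its word
# embeddings and feed that vector to a classifier.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
DIM = 50
emb = {w: rng.normal(size=DIM) for w in ["good", "great", "bad", "awful", "movie"]}

def comment_vector(tokens):
    """Average the embeddings of known tokens (zero vector if none are known)."""
    vecs = [emb[t] for t in tokens if t in emb]
    return np.mean(vecs, axis=0) if vecs else np.zeros(DIM)

X = np.stack([comment_vector(c.split()) for c in
              ["good movie", "great movie", "bad movie", "awful movie"]])
y = np.array([1, 1, 0, 0])                       # 1 = positive, 0 = negative

clf = LogisticRegression().fit(X, y)
print(clf.predict([comment_vector("great good movie".split())]))  # expect [1]
```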


2015 ◽  
Vol 37 (3) ◽  
pp. 507-527 ◽  
Author(s):  
JIE ZHANG ◽  
HONG LI ◽  
QIONG DONG ◽  
JIE XU ◽  
ELIZABETH SHOLAR

Abstract This study investigated whether beginning nonnative learners of Chinese can use the phonological and semantic information of radicals to learn the sounds and meanings of new Chinese characters. Thirty-four seventh- and eighth-grade American adolescents, who had received intensive Chinese instruction for one semester, were taught 16 compound pseudocharacters paired with novel pictures over three learning trials. After each learning trial, students were asked to produce the sounds and meanings of the pseudocharacters, in which the semantic transparency and phonetic regularity of the radicals were manipulated. Results showed a facilitation effect of transparent semantic radicals on learning character meanings in early trials. There was a trend for students to learn to read regular and transparent characters better than irregular and opaque ones. The ability to learn orthography-pronunciation associations uniquely predicted Chinese word reading after controlling for semantic and phonetic radical knowledge. These findings suggest a predominant use of semantic strategies and the importance of orthography-to-phonology mappings in learning to read Chinese for beginning nonnative learners.

