Generic framework for multilingual short text categorization using convolutional neural network

Convolutional neural network (CNN) has revolutionized the field of natural language processing, which is considerably efficient at semantics analysis that underlies difficult natural language processing problems in a variety of domains. The deceptive opinion detection is an important application of the existing CNN models. The detection mechanism based on CNN models has better self-adaptability and can effectively identify all kinds of deceptive opinions. Online opinions are quite short, varying in their types and content. In order to effectively identify deceptive opinions, we need to comprehensively study the characteristics of deceptive opinions and explore novel characteristics besides the textual semantics and emotional polarity that have been widely used in text analysis. In this paper, we optimize the convolutional neural network model by embedding the word order characteristics in its convolution layer and pooling layer, which makes convolutional neural network more suitable for short text classification and deceptive opinions detection. The TensorFlow-based experiments demonstrate that the proposed detection mechanism achieves more accurate deceptive opinion detection results.

Download Full-text

A character-level convolutional neural network with dynamic input length for Thai text categorization

2017 9th International Conference on Knowledge and Smart Technology (KST) ◽

10.1109/kst.2017.7886102 ◽

2017 ◽

Cited By ~ 2

Author(s):

Thanabhat Koomsubha ◽

Peerapon Vateekul

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Text Categorization

Download Full-text

A new short text sentimental classification method based on multi-mixed convolutional neural network

2018 IEEE 3rd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA) ◽

10.1109/icccbda.2018.8386493 ◽

2018 ◽

Author(s):

Hao Lidong ◽

Zhao Hui

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Classification Method ◽

Short Text

Download Full-text

Verbal aggression detection on Twitter comments: convolutional neural network for short-text sentiment analysis

Neural Computing and Applications ◽

10.1007/s00521-018-3442-0 ◽

2018 ◽

Vol 32 (15) ◽

pp. 10809-10818 ◽

Cited By ~ 6

Author(s):

Junyi Chen ◽

Shankai Yan ◽

Ka-Chun Wong

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Sentiment Analysis ◽

Verbal Aggression ◽

Short Text ◽

Text Sentiment Analysis

Download Full-text

Short text sentiment analysis based on convolutional neural network

2018 14th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob) ◽

10.1109/wimob.2018.8589127 ◽

2018 ◽

Cited By ~ 3

Author(s):

Weisen Li ◽

Zhiqing Li ◽

Xupeng Fang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Sentiment Analysis ◽

Short Text ◽

Text Sentiment Analysis

Download Full-text

Incorporating Context-Relevant Knowledge into Convolutional Neural Networks for Short Text Classification

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.330110067 ◽

2019 ◽

Vol 33 ◽

pp. 10067-10068 ◽

Cited By ~ 2

Author(s):

Jingyun Xu ◽

Yi Cai

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Convolutional Neural Networks ◽

Text Classification ◽

Classification Methods ◽

Short Text ◽

Proposed Model ◽

High Level ◽

Context Features

Some text classification methods don’t work well on short texts due to the data sparsity. What’s more, they don’t fully exploit context-relevant knowledge. In order to tackle these problems, we propose a neural network to incorporate context-relevant knowledge into a convolutional neural network for short text classification. Our model consists of two modules. The first module utilizes two layers to extract concept and context features respectively and then employs an attention layer to extract those context-relevant concepts. The second module utilizes a convolutional neural network to extract high-level features from the word and the contextrelevant concept features. The experimental results on three datasets show that our proposed model outperforms the stateof-the-art models.

Download Full-text

A Robust Morpheme Sequence and Convolutional Neural Network-Based Uyghur and Kazakh Short Text Classification

Information ◽

10.3390/info10120387 ◽

2019 ◽

Vol 10 (12) ◽

pp. 387 ◽

Cited By ~ 2

Author(s):

Sardar Parhat ◽

Mijit Ablimit ◽

Askar Hamdulla

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Text Classification ◽

High Reliability ◽

Short Text ◽

Low Resource ◽

Linguistic Resources ◽

Text Corpora ◽

A Value ◽

Text Content

In this paper, based on the multilingual morphological analyzer, we researched the similar low-resource languages, Uyghur and Kazakh, short text classification. Generally, the online linguistic resources of these languages are noisy. So a preprocessing is necessary and can significantly improve the accuracy. Uyghur and Kazakh are the languages with derivational morphology, in which words are coined by stems concatenated with suffixes. Usually, terms are used as the representation of text content while excluding functional parts as stop words in these languages. By extracting stems we can collect necessary terms and exclude stop words. Morpheme segmentation tool can split text into morphemes with 95% high reliability. After preparing both word- and morpheme-based training text corpora, we apply convolutional neural network (CNN) as a feature selection and text classification algorithm to perform text classification tasks. Experimental results show that the morpheme-based approach outperformed the word-based approach. Word embedding technique is frequently used in text representation both in the framework of neural networks and as a value expression, and can map language units into a sequential vector space based on context, and it is a natural way to extract and predict out-of-vocabulary (OOV) from context information. Multilingual morphological analysis has provided a convenient way for processing tasks of low resource languages like Uyghur and Kazakh.

Download Full-text