Text Classification Based on Enriched Vector Space Model

This paper proposes a text classification method based on a fuzzy vector space model and neural networks, addressing the problem that a text can belong to several categories at once. The method uses fuzzy theory to treat the position at which a feature item occurs in the text as its degree of importance (membership) in reflecting the text's subject. Position information is fully considered during feature extraction, and fuzzy feature vectors are constructed from it, so that the classification comes close to manual classification. The network consists of an input layer, a hidden layer, and an output layer: the input layer receives the classification samples, the hidden layer extracts the implicit pattern features of the input samples, and the output layer outputs the classification results. Finally, experiments on documents from Wan Fang data demonstrate the effectiveness of the method. (Abstract)
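The abstract does not say how position is turned into a membership degree, so the following is only a minimal sketch of the general idea. The position names, the membership values, and the rule of keeping the highest membership per term are all assumptions made for illustration, not the authors' method.

```python
# Hypothetical membership degrees: how strongly a position signals the text's subject.
POSITION_MEMBERSHIP = {"title": 1.0, "abstract": 0.8, "body": 0.5}

def fuzzy_feature_vector(sections, vocabulary, membership=POSITION_MEMBERSHIP):
    """Give each vocabulary term the highest membership among the positions where it occurs."""
    weights = {term: 0.0 for term in vocabulary}
    for position, text in sections.items():
        degree = membership.get(position, 0.0)  # unknown positions contribute nothing
        for token in text.lower().split():
            if token in weights:
                weights[token] = max(weights[token], degree)
    return [weights[term] for term in vocabulary]

# Illustrative input only; "text" occurs in both title and body, so it keeps 1.0.
sections = {"title": "fuzzy text classification", "body": "neural networks classify text"}
vector = fuzzy_feature_vector(sections, ["fuzzy", "text", "neural", "svm"])
print(vector)  # [1.0, 1.0, 0.5, 0.0]
```

A vector of this kind could then be fed to the input layer of a three-layer network like the one the abstract describes.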


A Method for Chinese Text Classification Based on Three-Dimensional Vector Space Model

2012 International Conference on Computer Science and Service System ◽ 10.1109/csss.2012.334 ◽ 2012 ◽ Cited By ~ 2

Author(s): Jixian Zhang ◽ Qinglin Wang ◽ Yuan Li ◽ Dongmei Li ◽ Yuexing Hao

Keyword(s): Vector Space ◽ Chinese Text ◽ Text Classification ◽ Three Dimensional ◽ Vector Space Model ◽ Dimensional Vector ◽ Dimensional Vector Space ◽ Space Model ◽ Chinese Text Classification


Design and analysis of a general vector space model for data classification in Internet of Things

EURASIP Journal on Wireless Communications and Networking ◽ 10.1186/s13638-019-1581-3 ◽ 2019 ◽ Vol 2019 (1) ◽ Cited By ~ 3

Author(s): Jinguo Sang ◽ Shanchen Pang ◽ Yang Zha ◽ Fan Yang

Keyword(s): Internet Of Things ◽ Vector Space ◽ Text Classification ◽ Vector Space Model ◽ Classification Algorithm ◽ Space Model ◽ Amount Of Information ◽ Access Information ◽ Weighting Methods ◽ General Vector

Abstract: The amount of information in the Internet of Things is increasing explosively as more and more data are sensed by a large number of sensors. This explosive growth makes it difficult to access information efficiently, so text classification is an effective way to reduce the amount of information transferred over the network. This paper proposes a new text classification algorithm based on the vector space model. The algorithm improves the feature selection and weighting methods of traditional text classification algorithms by introducing synonym replacement. Experimental results show that the proposed algorithm considerably improves the precision and recall of classification.
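To make the idea of synonym replacement concrete, here is a minimal sketch of mapping synonyms to one canonical term before building a term-frequency vector. The synonym table, the function names, and the whitespace tokenizer are assumptions for illustration; the abstract does not describe the paper's actual procedure.

```python
from collections import Counter

# Hypothetical synonym table: each variant maps to one canonical term.
SYNONYMS = {"automobile": "car", "vehicle": "car", "picture": "image", "photo": "image"}

def normalize(tokens, synonyms=SYNONYMS):
    """Replace each token with its canonical synonym, if one is listed."""
    return [synonyms.get(token, token) for token in tokens]

def term_vector(text, synonyms=SYNONYMS):
    """Build a term-frequency vector (a Counter) after synonym replacement."""
    tokens = text.lower().split()
    return Counter(normalize(tokens, synonyms))

vec = term_vector("Automobile photo and car picture")
print(vec["car"], vec["image"], len(vec))  # 2 2 3
```

Merging synonyms this way shrinks the feature space and concentrates counts on fewer dimensions, which is one plausible reason such a step could help feature selection and weighting.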


Analysis of Text Classification with various Term Weighting Schemes in Vector Space Model

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽ 10.35940/ijitee.d1938.0891020 ◽ 2020 ◽ Vol 9 (10) ◽ pp. 390-393

Keyword(s): Vector Space ◽ Text Classification ◽ Naive Bayes ◽ Information Gain ◽ Vector Space Model ◽ Naïve Bayes ◽ Weighting Scheme ◽ Term Weighting ◽ Space Model ◽ Weighting Methods

A term weighting scheme (TWS) is a key component of the matching mechanism when using the vector space model. In the context of information retrieval (IR) from text documents, this paper describes a new approach to term weighting that improves classification performance. We propose an effective term weighting scheme that gives the highest accuracy compared with the other text classification methods tested. We compared the performance of KNN and Naïve Bayes classification with different weighting methods, information gain weighting, SVM, and the proposed method. We implemented many term-weighting methods (TWM) on Amazon data collections in combination with information gain and the SVM, KNN, and Naïve Bayes algorithms.
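The abstract does not specify the proposed weighting scheme, so the sketch below shows only the standard TF-IDF baseline that term weighting schemes are commonly compared against. The function name, the sample documents, and the choice of length-normalized term frequency with an unsmoothed `log(N / df)` factor are assumptions; documents are assumed to be non-empty.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        tf = Counter(tokens)
        weights.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return weights

# Illustrative documents only, not taken from the Amazon collections.
docs = ["good battery good price", "bad battery", "good screen"]
w = tf_idf(docs)
print(sorted(w[0], key=w[0].get, reverse=True))  # terms of doc 0, heaviest first
```

Terms that occur in few documents (such as "price" here) receive larger weights than terms spread across the collection, which is the behavior that alternative weighting schemes try to improve on.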


STUDY ON TERM CO-OCCURRENCE BASED ON VECTOR SPACE MODEL AND ITS APPLICATION IN TEXT CLASSIFICATION

Proceedings of the 11th Joint International Computer Conference ◽ 10.1142/9789812701534_0087 ◽ 2005

Author(s): Yueheng SUN ◽ Pilian HE ◽ Lanlan CHENG ◽ Guangyuan WU

Keyword(s): Vector Space ◽ Text Classification ◽ Vector Space Model ◽ Space Model


Beyond vector space model for hierarchical Arabic text classification: A Markov chain approach

Information Processing & Management ◽ 10.1016/j.ipm.2017.10.003 ◽ 2018 ◽ Vol 54 (1) ◽ pp. 105-115 ◽ Cited By ~ 9

Author(s): Fawaz S. Al-Anzi ◽ Dia AbuZeina

Keyword(s): Markov Chain ◽ Vector Space ◽ Text Classification ◽ Vector Space Model ◽ Arabic Text ◽ Space Model ◽ Markov Chain Approach ◽ Arabic Text Classification


A Kind of Self-Constructed Category Dictionary in Chinese Text Classification

Applied Mechanics and Materials ◽ 10.4028/www.scientific.net/amm.644-650.2206 ◽ 2014 ◽ Vol 644-650 ◽ pp. 2206-2210

Author(s): Kun Zhou ◽ Ya Ping Dai ◽ Feng Gao ◽ Ji Hong Zou

Keyword(s): Vector Space ◽ Chinese Text ◽ Text Classification ◽ Feature Vector ◽ Vector Space Model ◽ Recall Rate ◽ Support Vector ◽ Space Model ◽ Chinese Text Classification ◽ Feature Vector Space

A self-constructed category dictionary (SCC-dictionary) for Chinese text classification is proposed, built by applying the word-segmentation technology of the TRIP database and counting in detail each word that appears in a database. To address the high dimensionality and sparseness of the vector space model, a four-dimensional feature vector space model (FFVSM) is presented. The text classifier is designed with the Support Vector Machine (SVM) algorithm. Experimental results show two achievements: first, the SCC-dictionary can replace a manually written dictionary with the same effect; second, the FFVSM reduces the computing load compared with a high-dimensional feature vector space model while keeping classification precision at 86.87%, recall at 95.12%, and the F1 value at 90.81%.
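As a quick consistency check on the reported figures, the F1 value can be recomputed from the stated precision and recall, assuming the usual definition of F1 as their harmonic mean (the function name is illustrative).

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)

# Precision and recall as reported in the abstract.
f1 = f1_score(0.8687, 0.9512)
print(f"{f1:.2%}")  # 90.81%
```

The result agrees with the 90.81% F1 value given in the abstract.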


A Kind of Text Classification Method Based on Fuzzy Vector Space Model and Neural Networks

Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013) ◽ 10.2991/iccsee.2013.492 ◽ 2013

Author(s): JunHui PAN ◽ Hui LI

Keyword(s): Neural Networks ◽ Vector Space ◽ Text Classification ◽ Vector Space Model ◽ Classification Method ◽ Space Model ◽ Fuzzy Vector
