Improved Term Weighting Factors for Keyword Extraction in Hierarchical Category Structure and Thai Text Classification

Advances in Intelligent Systems and Computing - Advances in Intelligent Informatics, Smart Technology and Natural Language Processing ◽

10.1007/978-3-319-94703-7_6 ◽

2018 ◽

pp. 58-67

Author(s):

Boonthida Chiraratanasopha ◽

Thanaruk Theeramunkong ◽

Salin Boonbrahm

Keyword(s):

Text Classification ◽

Category Structure ◽

Keyword Extraction ◽

Term Weighting ◽

Weighting Factors

Download Full-text

Effect of Term Weighting on Keyword Extraction in Hierarchical Category Structure

Computing and Informatics ◽

10.31577/cai_2021_1_57 ◽

2021 ◽

Vol 40 (1) ◽

pp. 57-82

Author(s):

Boonthida Chiraratanasopha ◽

Salin Boonbrahm ◽

Thanaruk Theeramunkong

Keyword(s):

Category Structure ◽

Keyword Extraction ◽

Term Weighting

Download Full-text

Improving Term Weighting Schemes for Short Text Classification in Vector Space Model

IEEE Access ◽

10.1109/access.2019.2953918 ◽

2019 ◽

Vol 7 ◽

pp. 166578-166592

Author(s):

Surender Singh Samant ◽

N. L. Bhanu Murthy ◽

Aruna Malapati

Keyword(s):

Vector Space ◽

Text Classification ◽

Vector Space Model ◽

Term Weighting ◽

Weighting Schemes ◽

Short Text ◽

Space Model

Download Full-text

Grammatical Dependency-Based Relations for Term Weighting in Text Classification

Advances in Knowledge Discovery and Data Mining - Lecture Notes in Computer Science ◽

10.1007/978-3-642-20841-6_39 ◽

2011 ◽

pp. 476-487 ◽

Cited By ~ 1

Author(s):

Dat Huynh ◽

Dat Tran ◽

Wanli Ma ◽

Dharmendra Sharma

Keyword(s):

Text Classification ◽

Term Weighting

Download Full-text

A novel term weighting scheme for text classification: TF-MONO

Journal of Informetrics ◽

10.1016/j.joi.2020.101076 ◽

2020 ◽

Vol 14 (4) ◽

pp. 101076 ◽

Cited By ~ 1

Author(s):

Turgut Dogan ◽

Alper Kursat Uysal

Keyword(s):

Text Classification ◽

Weighting Scheme ◽

Term Weighting

Download Full-text

An effective term weighting method using random walk model for text classification

2008 11th International Conference on Computer and Information Technology ◽

10.1109/iccitechn.2008.4803000 ◽

2008 ◽

Cited By ~ 1

Author(s):

Md. Rafiqul Islam ◽

Md. Rakibul Islam

Keyword(s):

Random Walk ◽

Text Classification ◽

Random Walk Model ◽

Term Weighting ◽

Weighting Method

Download Full-text

Domain identification and keyword extraction of radio news using term weighting

1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings ◽

10.1109/asru.1997.659134 ◽

2002 ◽

Author(s):

Y. Suzuki ◽

F. Fukumoto ◽

Y. Sekiguchi

Keyword(s):

Keyword Extraction ◽

Term Weighting ◽

Domain Identification ◽

Radio News

Download Full-text

RANDOM WALK TERM WEIGHTING FOR IMPROVED TEXT CLASSIFICATION

International Journal of Semantic Computing ◽

10.1142/s1793351x07000263 ◽

2007 ◽

Vol 01 (04) ◽

pp. 421-439 ◽

Cited By ~ 17

Author(s):

SAMER HASSAN ◽

RADA MIHALCEA ◽

CARMEN BANEA

Keyword(s):

Random Walk ◽

Text Classification ◽

Feature Weighting ◽

Random Walk Model ◽

Term Weighting ◽

New Approach ◽

Standard Classification ◽

Term Weights ◽

Traditional Term ◽

Word Feature

This paper describes a new approach for estimating term weights in a document, and shows how the new weighting scheme can be used to improve the accuracy of a text classifier. The method uses term co-occurrence as a measure of dependency between word features. A random walk model is applied on a graph encoding words and co-occurrence dependencies, resulting in scores that represent a quantification of how a particular word feature contributes to a given context. Experiments performed on three standard classification datasets show that the new random walk based approach outperforms the traditional term frequency approach of feature weighting.

Download Full-text

High Relevance Keyword Extraction facility for Bayesian text classification on different domains of varying characteristic

Expert Systems with Applications ◽

10.1016/j.eswa.2011.07.116 ◽

2012 ◽

Vol 39 (1) ◽

pp. 1147-1155 ◽

Cited By ~ 17

Author(s):

Lam Hong Lee ◽

Dino Isa ◽

Wou Onn Choo ◽

Wen Yeen Chue

Keyword(s):

Text Classification ◽

Keyword Extraction

Download Full-text

An Improved Algorithm to Term Weighting in Text Classification

2010 International Conference on Multimedia Technology ◽

10.1109/icmult.2010.5630962 ◽

2010 ◽

Cited By ~ 4

Author(s):

Ran Li ◽

Xianjiu Guo

Keyword(s):

Text Classification ◽

Term Weighting ◽

Improved Algorithm

Download Full-text

Hierarchical text classification using Relative Inverse Document Frequency

ECTI Transactions on Computer and Information Technology (ECTI-CIT) ◽

10.37936/ecti-cit.2021152.240515 ◽

2021 ◽

Vol 15 (2) ◽

pp. 166-176

Author(s):

Boonthida Chiraratanasopha ◽

Thanaruk Theeramunkong ◽

Salin Boonbrahm

Keyword(s):

Text Classification ◽

Term Weighting ◽

Hierarchical Tree ◽

Inverse Document Frequency ◽

Document Frequency ◽

Relative Inverse ◽

The Hierarchical Structure ◽

Family Based ◽

Hierarchical Text Classification

Automatic hierarchical text classification has been a challenging and in-needed task with an increasing of hierarchical taxonomy from the booming of knowledge organization. The hierarchical structure identifies the relationships of dependence between different categories in which can be overlapped of generalized and specific concepts within the tree. This paper presents the use of frequency of the occurring term in related categories among the hierarchical tree to help in document classification. The four extended term weighting of Relative Inverse Document Frequency (IDFr) including its located category, its parent category, its sibling categories and its child categories are exploited to generate a classifier model using centroid-based technique. From the experiment on hierarchical text classification of Thai documents, the IDFr achieved the best accuracy and F-measure as 53.65% and 50.80% in Top-n features set from family-based evaluation in which are higher than TF-IDF for 2.35% and 1.15% in the same settings, respectively.

Download Full-text