Document Categorization Using Graph Structuring

Use of A Domain-Specific Ontology to Support Automated Document Categorization at the Concept Level: Method Development and Evaluation

Expert Systems with Applications ◽

10.1016/j.eswa.2021.114681 ◽

2021 ◽

pp. 114681

Author(s):

Yen-Hsien Lee ◽

Paul Jen-Hwa Hu ◽

Wan-Jung Tsao ◽

Liang Li

Keyword(s):

Method Development ◽

Domain Specific ◽

Document Categorization ◽

Level Method

Download Full-text

The Effect of Preprocessing on Arabic Document Categorization

Algorithms ◽

10.3390/a9020027 ◽

2016 ◽

Vol 9 (2) ◽

pp. 27 ◽

Cited By ~ 20

Author(s):

Abdullah Ayedh ◽

Guanzheng TAN ◽

Khaled Alwesabi ◽

Hamdi Rajeh

Keyword(s):

Document Categorization

Download Full-text

Application of hierarchical temporal memory theory for document categorization

2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI) ◽

10.1109/uic-atc.2017.8397473 ◽

2017 ◽

Author(s):

Deven Shah ◽

Pinak Ghate ◽

Manali Paranjape ◽

Amit Kumar

Keyword(s):

Temporal Memory ◽

Memory Theory ◽

Hierarchical Temporal Memory ◽

Document Categorization

Download Full-text

An Efficient Document Categorization Model Based on LSA and BPNN

Sixth International Conference on Advanced Language Processing and Web Information Technology (ALPIT 2007) ◽

10.1109/alpit.2007.88 ◽

2007 ◽

Cited By ~ 2

Author(s):

Cheng Hua Li ◽

Soon Cheol Park

Keyword(s):

Model Based ◽

Document Categorization ◽

Categorization Model

Download Full-text

Hierarchical document categorization with k-NN and concept-based thesauri

Information Processing & Management ◽

10.1016/j.ipm.2005.04.003 ◽

2006 ◽

Vol 42 (2) ◽

pp. 387-406 ◽

Cited By ~ 21

Author(s):

Sun Lee Bang ◽

Jae Dong Yang ◽

Hyung Jeong Yang

Keyword(s):

Document Categorization

Download Full-text

Integrated Approach of ANN and GA for Document Categorization

Advances in Soft Computing - International Symposium on Distributed Computing and Artificial Intelligence 2008 (DCAI 2008) ◽

10.1007/978-3-540-85863-8_51 ◽

2008 ◽

pp. 434-442

Author(s):

Karina Leyto-Delgado ◽

Ivan Lopez-Arevalo ◽

Victor Sosa-Sosa

Keyword(s):

Integrated Approach ◽

Document Categorization

Download Full-text

An Efficient Approach for Document Categorization Using Weighted Sum

Lecture Notes in Electrical Engineering - ICDSMLA 2019 ◽

10.1007/978-981-15-1420-3_73 ◽

2020 ◽

pp. 686-695

Author(s):

Vimuktha E. Salis ◽

Ranjana S. Chakrasali ◽

Chowdaiah Pathanjali

Keyword(s):

Weighted Sum ◽

Efficient Approach ◽

Document Categorization

Download Full-text

Semantics-Based Document Categorization Employing Semi-Supervised Learning

Advances in Linguistics and Communication Studies - Modern Computational Models of Semantic Discovery in Natural Language ◽

10.4018/978-1-4666-8690-8.ch005 ◽

2015 ◽

pp. 112-140 ◽

Cited By ~ 1

Author(s):

Jan Žižka ◽

František Dařena

Keyword(s):

Machine Learning ◽

Unsupervised Learning ◽

Supervised Learning ◽

Real World ◽

Supervised Machine Learning ◽

The Internet ◽

Learning Method ◽

Label Information ◽

Document Categorization

The automated categorization of unstructured textual documents according to their semantic contents plays important role particularly linked with the ever growing volume of such data originating from the Internet. Having a sufficient number of labeled examples, a suitable supervised machine learning-based classifier can be trained. When no labeling is available, an unsupervised learning method can be applied, however, the missing label information often leads to worse classification results. This chapter demonstrates a method based on semi-supervised learning when a smallish set of manually labeled examples improves the categorization process in comparison with clustering, and the results are comparable with the supervised learning output. For the illustration, a real-world dataset coming from the Internet is used as the input of the supervised, unsupervised, and semi-supervised learning. The results are shown for different number of the starting labeled samples used as “seeds” to automatically label the remaining volume of unlabeled items.

Download Full-text

Medical Document Categorization Using a Priori Knowledge

Artificial Neural Networks: Biological Inspirations – ICANN 2005 - Lecture Notes in Computer Science ◽

10.1007/11550822_99 ◽

2005 ◽

pp. 641-646 ◽

Cited By ~ 2

Author(s):

Lukasz Itert ◽

Włodzisław Duch ◽

John Pestian

Keyword(s):

A Priori ◽

A Priori Knowledge ◽

Document Categorization ◽

Medical Document ◽

Priori Knowledge

Download Full-text

Local and Global Latent Semantic Analysis for Text Categorization

International Journal of Information Retrieval Research ◽

10.4018/ijirr.2014070101 ◽

2014 ◽

Vol 4 (3) ◽

pp. 1-13

Author(s):

Khadoudja Ghanem

Keyword(s):

Information Retrieval ◽

High Precision ◽

Latent Semantic Analysis ◽

Classification Accuracy ◽

Text Categorization ◽

Semantic Analysis ◽

Experimental Results ◽

Semantic Approach ◽

Document Categorization ◽

Second Use

In this paper the authors propose a semantic approach to document categorization. The idea is to create for each category a semantic index (representative term vector) by performing a local Latent Semantic Analysis (LSA) followed by a clustering process. A second use of LSA (Global LSA) is adopted on a term-Class matrix in order to retrieve the class which is the most similar to the query (document to classify) in the same way where the LSA is used to retrieve documents which are the most similar to a query in Information Retrieval. The proposed system is evaluated on a popular dataset which is 20 Newsgroup corpus. Obtained results show the effectiveness of the method compared with those obtained with the classic KNN and SVM classifiers as well as with methods presented in the literature. Experimental results show that the new method has high precision and recall rates and classification accuracy is significantly improved.

Download Full-text