Comparison of SVM and Some Older Classification Algorithms in Text Classification Tasks

Headnote Prediction Using Machine Learning

The International Arab Journal of Information Technology ◽

10.34028/iajit/18/5/7 ◽

2021 ◽

Vol 18 (5) ◽

Author(s):

Sarmad Mahar ◽

Sahar Zafar ◽

Kamran Nishat

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Active Learning ◽

Text Classification ◽

Extraction Methods ◽

Text Summarization ◽

Training Data ◽

Second Step ◽

Support Vector ◽

Classification Algorithms

Headnotes are the precise explanation and summary of legal points in an issued judgment. Law journals hire experienced lawyers to write these headnotes. These headnotes help the reader quickly determine the issue discussed in the case. Headnotes comprise two parts. The first part comprises the topic discussed in the judgment, and the second part contains a summary of that judgment. In this thesis, we design, develop and evaluate headnote prediction using machine learning, without involving human involvement. We divided this task into a two steps process. In the first step, we predict law points used in the judgment by using text classification algorithms. The second step generates a summary of the judgment using text summarization techniques. To achieve this task, we created a Databank by extracting data from different law sources in Pakistan. We labelled training data generated based on Pakistan law websites. We tested different feature extraction methods on judiciary data to improve our system. Using these feature extraction methods, we developed a dictionary of terminology for ease of reference and utility. Our approach achieves 65% accuracy by using Linear Support Vector Classification with tri-gram and without stemmer. Using active learning our system can continuously improve the accuracy with the increased labelled examples provided by the users of the system.

Download Full-text

Applying Text Classification Algorithms in Web Services Robustness Testing

2010 29th IEEE Symposium on Reliable Distributed Systems ◽

10.1109/srds.2010.36 ◽

2010 ◽

Cited By ~ 9

Author(s):

Nuno Laranjeiro ◽

Rui Oliveira ◽

Marco Vieira

Keyword(s):

Web Services ◽

Text Classification ◽

Classification Algorithms ◽

Robustness Testing

Download Full-text

Meta-learning of Text Classification Tasks

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-030-33904-3_10 ◽

2019 ◽

pp. 107-119

Author(s):

Jorge G. Madrid ◽

Hugo Jair Escalante

Keyword(s):

Text Classification ◽

Meta Learning ◽

Classification Tasks

Download Full-text

Analysis of Text Classification Algorithms: A Review

International Journal of Trend in Scientific Research and Development ◽

10.31142/ijtsrd21448 ◽

2019 ◽

Vol Volume-3 (Issue-2) ◽

pp. 579-581

Author(s):

Nida Zafar Khan ◽

Prof. S. R. Yadav ◽

Keyword(s):

Text Classification ◽

Classification Algorithms

Download Full-text

Understanding the Influence of Hyperparameters on Text Embeddings for Text Classification Tasks

Research and Advanced Technology for Digital Libraries - Lecture Notes in Computer Science ◽

10.1007/978-3-319-67008-9_16 ◽

2017 ◽

pp. 193-204 ◽

Cited By ~ 3

Author(s):

Nils Witt ◽

Christin Seifert

Keyword(s):

Text Classification ◽

Classification Tasks

Download Full-text

Sentiment Analysis of Text Classification Algorithms Using Confusion Matrix

Communications in Computer and Information Science - Cyberspace Data and Intelligence, and Cyber-Living, Syndrome, and Health ◽

10.1007/978-981-15-1922-2_16 ◽

2019 ◽

pp. 231-241

Author(s):

Babacar Gaye ◽

Aziguli Wulamu

Keyword(s):

Sentiment Analysis ◽

Text Classification ◽

Confusion Matrix ◽

Classification Algorithms

Download Full-text

Deep Domain Adaptation for Low-Resource Cross-Lingual Text Classification Tasks

Communications in Computer and Information Science - Computational Linguistics ◽

10.1007/978-981-15-6168-9_14 ◽

2020 ◽

pp. 155-168

Author(s):

Guan-Yuan Chen ◽

Von-Wun Soo

Keyword(s):

Text Classification ◽

Domain Adaptation ◽

Low Resource ◽

Classification Tasks ◽

Cross Lingual

Download Full-text

Explicit Interaction Model towards Text Classification

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33016359 ◽

2019 ◽

Vol 33 ◽

pp. 6359-6366 ◽

Cited By ~ 3

Author(s):

Cunxiao Du ◽

Zhaozheng Chen ◽

Fuli Feng ◽

Lei Zhu ◽

Tian Gan ◽

...

Keyword(s):

Language Processing ◽

Text Classification ◽

Deep Neural Networks ◽

Interaction Mechanism ◽

Interaction Model ◽

Classification Task ◽

Fine Grained ◽

Word Level ◽

Benchmark Datasets ◽

Classification Tasks

Text classification is one of the fundamental tasks in natural language processing. Recently, deep neural networks have achieved promising performance in the text classification task compared to shallow models. Despite of the significance of deep models, they ignore the fine-grained (matching signals between words and classes) classification clues since their classifications mainly rely on the text-level representations. To address this problem, we introduce the interaction mechanism to incorporate word-level matching signals into the text classification task. In particular, we design a novel framework, EXplicit interAction Model (dubbed as EXAM), equipped with the interaction mechanism. We justified the proposed approach on several benchmark datasets including both multilabel and multi-class text classification tasks. Extensive experimental results demonstrate the superiority of the proposed method. As a byproduct, we have released the codes and parameter settings to facilitate other researches.

Download Full-text

Fusing Logical Relationship Information of Text in Neural Network for Text Classification

Mathematical Problems in Engineering ◽

10.1155/2020/5426795 ◽

2020 ◽

Vol 2020 ◽

pp. 1-16 ◽

Cited By ~ 1

Author(s):

Heyong Wang ◽

Dehang Zeng

Keyword(s):

Neural Network ◽

Text Classification ◽

Information Science ◽

Classification Algorithms ◽

Human Beings ◽

Central Idea ◽

Logical Relationship ◽

The Relationship ◽

Different Parts ◽

Better Than

With the development of computer science and information science, text classification technology has been greatly developed and its application scenarios have been widened. In traditional process of text classification, the existing method will lose much logical relationship information of text. The logical relationship information of a text refers to the relationship information among different logical parts of the text, such as title, abstract, and body. When human beings are reading, they will take title as an important part to remind the central idea of the article, abstract as a brief summary of the content of the article, and body as a detailed description of the article. In most of the text classification studies, researchers concern more about the relationship among words (word frequency, semantics, etc.) and neglect the logical relationship information of text. It will lose information about the relationship among different parts (title, body, etc.) and have an influence on the performance of text classification. Therefore, we propose a text classification algorithm—fusing the logical relationship information of text in neural network (FLRIOTINN), which complements the logical relationship information into text classification algorithms. Experiments show that the effect of FLRIOTINN is better than the conventional backpropagation neural networks which does not consider the logical relationship information of text.

Download Full-text

Towards Robust Text Classification with Semantics-Aware Recurrent Neural Architecture

Machine Learning and Knowledge Extraction ◽

10.3390/make1020034 ◽

2019 ◽

Vol 1 (2) ◽

pp. 575-589 ◽

Cited By ~ 1

Author(s):

Blaž Škrlj ◽

Jan Kralj ◽

Nada Lavrač ◽

Senja Pollak

Keyword(s):

Text Mining ◽

Language Processing ◽

Text Classification ◽

Deep Neural Networks ◽

Semantic Knowledge ◽

Text Documents ◽

Neural Architecture ◽

Classification Tasks ◽

And Gender ◽

Semantic Resources

Deep neural networks are becoming ubiquitous in text mining and natural language processing, but semantic resources, such as taxonomies and ontologies, are yet to be fully exploited in a deep learning setting. This paper presents an efficient semantic text mining approach, which converts semantic information related to a given set of documents into a set of novel features that are used for learning. The proposed Semantics-aware Recurrent deep Neural Architecture (SRNA) enables the system to learn simultaneously from the semantic vectors and from the raw text documents. We test the effectiveness of the approach on three text classification tasks: news topic categorization, sentiment analysis and gender profiling. The experiments show that the proposed approach outperforms the approach without semantic knowledge, with highest accuracy gain (up to 10%) achieved on short document fragments.

Download Full-text