CoPHE: A Count-Preserving Hierarchical Evaluation Metric in Large-Scale Multi-Label Text Classification

Text Classification for a Large-Scale Taxonomy Using Dynamically Mixed Local and Global Models for a Node

Lecture Notes in Computer Science - Advances in Information Retrieval ◽

10.1007/978-3-642-20161-5_4 ◽

2011 ◽

pp. 7-18 ◽

Cited By ~ 1

Author(s):

Heung-Seon Oh ◽

Yoonjung Choi ◽

Sung-Hyon Myaeng

Keyword(s):

Text Classification ◽

Large Scale ◽

Global Models

Download Full-text

Adjusting BERT’s Pooling Layer for Large-Scale Multi-Label Text Classification

Text, Speech, and Dialogue - Lecture Notes in Computer Science ◽

10.1007/978-3-030-58323-1_23 ◽

2020 ◽

pp. 214-221 ◽

Cited By ~ 1

Author(s):

Jan Lehečka ◽

Jan Švec ◽

Pavel Ircing ◽

Luboš Šmídl

Keyword(s):

Text Classification ◽

Large Scale

Download Full-text

Large-scale text classification with deeper and wider convolution neural network

International Journal of Simulation and Process Modelling ◽

10.1504/ijspm.2020.106977 ◽

2020 ◽

Vol 15 (1/2) ◽

pp. 120

Author(s):

Min Huang ◽

Wei Huang

Keyword(s):

Neural Network ◽

Text Classification ◽

Large Scale ◽

Convolution Neural Network

Download Full-text

CharTeC-Net: An Efficient and Lightweight Character-Based Convolutional Network for Text Classification

Journal of Electrical and Computer Engineering ◽

10.1155/2020/9701427 ◽

2020 ◽

Vol 2020 ◽

pp. 1-7 ◽

Cited By ~ 2

Author(s):

Aboubakar Nasser Samatin Njikam ◽

Huan Zhao

Keyword(s):

Text Classification ◽

Building Block ◽

Large Scale ◽

State Of The Art ◽

Building Blocks ◽

Training Data ◽

Superior Performance ◽

Classification Problems ◽

Computationally Efficient ◽

Convolutional Network

This paper introduces an extremely lightweight (with just over around two hundred thousand parameters) and computationally efficient CNN architecture, named CharTeC-Net (Character-based Text Classification Network), for character-based text classification problems. This new architecture is composed of four building blocks for feature extraction. Each of these building blocks, except the last one, uses 1 × 1 pointwise convolutional layers to add more nonlinearity to the network and to increase the dimensions within each building block. In addition, shortcut connections are used in each building block to facilitate the flow of gradients over the network, but more importantly to ensure that the original signal present in the training data is shared across each building block. Experiments on eight standard large-scale text classification and sentiment analysis datasets demonstrate CharTeC-Net’s superior performance over baseline methods and yields competitive accuracy compared with state-of-the-art methods, although CharTeC-Net has only between 181,427 and 225,323 parameters and weighs less than 1 megabyte.

Download Full-text

Hierarchical Graph Transformer-Based Deep Learning Model for Large-Scale Multi-Label Text Classification

IEEE Access ◽

10.1109/access.2020.2972751 ◽

2020 ◽

Vol 8 ◽

pp. 30885-30896 ◽

Cited By ~ 3

Author(s):

Jibing Gong ◽

Hongyuan Ma ◽

Zhiyong Teng ◽

Qi Teng ◽

Hekai Zhang ◽

...

Keyword(s):

Deep Learning ◽

Text Classification ◽

Large Scale ◽

Learning Model ◽

Hierarchical Graph ◽

Deep Learning Model

Download Full-text

Large Scale Text Classification Using Map Reduce and Naive Bayes Algorithm for Domain Specified Ontology Building

2015 7th International Conference on Intelligent Human-Machine Systems and Cybernetics ◽

10.1109/ihmsc.2015.24 ◽

2015 ◽

Cited By ~ 5

Author(s):

Joan Santoso ◽

Eko Mulyanto Yuniarno ◽

Mochamad Hariadi

Keyword(s):

Text Classification ◽

Large Scale ◽

Naive Bayes ◽

Naïve Bayes ◽

Map Reduce ◽

Ontology Building ◽

Bayes Algorithm

Download Full-text

An Empirical Comparative Study on Two Large-Scale Hierarchical Text Classification Approaches

International Journal of Computer Processing Of Languages ◽

10.1142/s1793840611002383 ◽

2011 ◽

Vol 23 (04) ◽

pp. 309-325

Author(s):

JIAN ZHANG ◽

HAI ZHAO ◽

LIQING ZHANG ◽

BAO-LIANG LU

Keyword(s):

Comparative Study ◽

Text Classification ◽

Large Scale ◽

Hierarchical Text Classification

Download Full-text

Large-Scale Hierarchical Text Classification Based on Path Semantic Vector and Prior Information

2009 International Conference on Computational Intelligence and Security ◽

10.1109/cis.2009.38 ◽

2009 ◽

Author(s):

Feng Gao ◽

Weiming Fu ◽

Yiping Zhong ◽

Danfeng Zhao

Keyword(s):

Text Classification ◽

Large Scale ◽

Prior Information ◽

Hierarchical Text Classification

Download Full-text

A Weakly Supervised and Deep Learning Method for an Additive Topic Analysis of Large Corpora

10.31235/osf.io/nfr3p ◽

2019 ◽

Cited By ~ 2

Author(s):

Yair Fogel-Dror ◽

Shaul R. Shenhav ◽

Tamir Sheafer

Keyword(s):

Content Analysis ◽

Deep Learning ◽

Text Classification ◽

Large Scale ◽

Low Cost ◽

Initial Number ◽

Analysis Method ◽

Training Set ◽

Topic Analysis ◽

Weakly Supervised

The collaborative effort of theory-driven content analysis can benefit significantly from the use of topic analysis methods, which allow researchers to add more categories while developing or testing a theory. This additive approach enables the reuse of previous efforts of analysis or even the merging of separate research projects, thereby making these methods more accessible and increasing the discipline’s ability to create and share content analysis capabilities. This paper proposes a weakly supervised topic analysis method that uses both a low-cost unsupervised method to compile a training set and supervised deep learning as an additive and accurate text classification method. We test the validity of the method, specifically its additivity, by comparing the results of the method after adding 200 categories to an initial number of 450. We show that the suggested method provides a foundation for a low-cost solution for large-scale topic analysis.

Download Full-text

Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00347 ◽

2020 ◽

Vol 8 ◽

pp. 810-827

Author(s):

Ananya B. Sai ◽

Akash Kumar Mohankumar ◽

Siddhartha Arora ◽

Mitesh M. Khapra

Keyword(s):

Large Scale ◽

Evaluation Metrics ◽

Model Based ◽

Evaluation Models ◽

Adversarial Examples ◽

N Gram ◽

Evaluation Metric ◽

Better Than

There is an increasing focus on model-based dialog evaluation metrics such as ADEM, RUBER, and the more recent BERT-based metrics. These models aim to assign a high score to all relevant responses and a low score to all irrelevant responses. Ideally, such models should be trained using multiple relevant and irrelevant responses for any given context. However, no such data is publicly available, and hence existing models are usually trained using a single relevant response and multiple randomly selected responses from other contexts (random negatives). To allow for better training and robust evaluation of model-based metrics, we introduce the DailyDialog++ dataset, consisting of (i) five relevant responses for each context and (ii) five adversarially crafted irrelevant responses for each context. Using this dataset, we first show that even in the presence of multiple correct references, n-gram based metrics and embedding based metrics do not perform well at separating relevant responses from even random negatives. While model-based metrics perform better than n-gram and embedding based metrics on random negatives, their performance drops substantially when evaluated on adversarial examples. To check if large scale pretraining could help, we propose a new BERT-based evaluation metric called DEB, which is pretrained on 727M Reddit conversations and then finetuned on our dataset. DEB significantly outperforms existing models, showing better correlation with human judgments and better performance on random negatives (88.27% accuracy). However, its performance again drops substantially when evaluated on adversarial responses, thereby highlighting that even large-scale pretrained evaluation models are not robust to the adversarial examples in our dataset. The dataset 1 and code 2 are publicly available.

Download Full-text