A Novel Feature-based Bayesian Model for Query Focused Multi-document Summarization

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00212 ◽

2013 ◽

Vol 1 ◽

pp. 89-98 ◽

Cited By ~ 6

Author(s):

Jiwei Li ◽

Sujian Li

Keyword(s):

Supervised Learning ◽

Bayesian Model ◽

Topic Model ◽

Topic Models ◽

Experimental Results ◽

Learning Methods ◽

Document Summarization ◽

Feature Based

Supervised learning methods and LDA based topic model have been successfully applied in the field of multi-document summarization. In this paper, we propose a novel supervised approach that can incorporate rich sentence features into Bayesian topic models in a principled way, thus taking advantages of both topic model and feature based supervised learning methods. Experimental results on DUC2007, TAC2008 and TAC2009 demonstrate the effectiveness of our approach.

Download Full-text

A Method for Constructing Supervised Topic Model Based on Term Frequency-Inverse Topic Frequency

Symmetry ◽

10.3390/sym11121486 ◽

2019 ◽

Vol 11 (12) ◽

pp. 1486

Author(s):

Zhinan Gou ◽

Zheng Huo ◽

Yuanzhen Liu ◽

Yi Yang

Keyword(s):

Topic Modeling ◽

Topic Model ◽

State Of The Art ◽

Topic Models ◽

Document Classification ◽

Experimental Results ◽

Tag Recommendation ◽

Term Frequency ◽

Series Of Experiments ◽

Dirichlet Prior

Supervised topic modeling has been successfully applied in the fields of document classification and tag recommendation in recent years. However, most existing models neglect the fact that topic terms have the ability to distinguish topics. In this paper, we propose a term frequency-inverse topic frequency (TF-ITF) method for constructing a supervised topic model, in which the weight of each topic term indicates the ability to distinguish topics. We conduct a series of experiments with not only the symmetric Dirichlet prior parameters but also the asymmetric Dirichlet prior parameters. Experimental results demonstrate that the result of introducing TF-ITF into a supervised topic model outperforms several state-of-the-art supervised topic models.

Download Full-text

A Self-Aggregated Hierarchical Topic Model for Short Texts

10.5121/csit.2021.111212 ◽

2021 ◽

Author(s):

Yue Niu ◽

Hongjie Zhang

Keyword(s):

Real World ◽

Semantic Information ◽

Topic Model ◽

Topic Models ◽

The Internet ◽

Ontology Learning ◽

Learning Methods ◽

Structured Information ◽

Topic Hierarchy

With the growth of the internet, short texts such as tweets from Twitter, news titles from the RSS, or comments from Amazon have become very prevalent. Many tasks need to retrieve information hidden from the content of short texts. So ontology learning methods are proposed for retrieving structured information. Topic hierarchy is a typical ontology that consists of concepts and taxonomy relations between concepts. Current hierarchical topic models are not specially designed for short texts. These methods use word co-occurrence to construct concepts and general-special word relations to construct taxonomy topics. But in short texts, word cooccurrence is sparse and lacking general-special word relations. To overcome this two problems and provide an interpretable result, we designed a hierarchical topic model which aggregates short texts into long documents and constructing topics and relations. Because long documents add additional semantic information, our model can avoid the sparsity of word cooccurrence. In experiments, we measured the quality of concepts by topic coherence metric on four real-world short texts corpus. The result showed that our topic hierarchy is more interpretable than other methods.

Download Full-text

Unsupervised Trademark Retrieval Method Based on Attention Mechanism

Sensors ◽

10.3390/s21051894 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1894

Author(s):

Jiangzhong Cao ◽

Yunfei Huang ◽

Qingyun Dai ◽

Wing-Kuen Ling

Keyword(s):

Supervised Learning ◽

Attention Mechanism ◽

Feature Representation ◽

Experimental Results ◽

Learning Methods ◽

Retrieval Method ◽

Average Rank ◽

Good Feature ◽

Key Features ◽

Better Than

Aiming at the high cost of data labeling and ignoring the internal relevance of features in existing trademark retrieval methods, this paper proposes an unsupervised trademark retrieval method based on attention mechanism. In the proposed method, the instance discrimination framework is adopted and a lightweight attention mechanism is introduced to allocate a more reasonable learning weight to key features. With an unsupervised way, this proposed method can obtain good feature representation of trademarks and improve the performance of trademark retrieval. Extensive comparative experiments on the METU trademark dataset are conducted. The experimental results show that the proposed method is significantly better than traditional trademark retrieval methods and most existing supervised learning methods. The proposed method obtained a smaller value of NAR (Normalized Average Rank) at 0.051, which verifies the effectiveness of the proposed method in trademark retrieval.

Download Full-text

Semi-supervised Max-margin Topic Model with Manifold Posterior Regularization

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/259 ◽

2017 ◽

Cited By ~ 2

Author(s):

Wenbo Hu ◽

Jun Zhu ◽

Hang Su ◽

Jingwei Zhuo ◽

Bo Zhang

Keyword(s):

Supervised Learning ◽

Topic Model ◽

Topic Models ◽

Stochastic Gradient ◽

Mcmc Method ◽

Tight Coupling ◽

Label Information ◽

Latent Topic ◽

Latent Topics ◽

Qualitative Performance

Supervised topic models leverage label information to learn discriminative latent topic representations. As collecting a fully labeled dataset is often time-consuming, semi-supervised learning is of high interest. In this paper, we present an effective semi-supervised max-margin topic model by naturally introducing manifold posterior regularization to a regularized Bayesian topic model, named LapMedLDA. The model jointly learns latent topics and a related classifier with only a small fraction of labeled documents. To perform the approximate inference, we derive an efficient stochastic gradient MCMC method. Unlike the previous semi-supervised topic models, our model adopts a tight coupling between the generative topic model and the discriminative classifier. Extensive experiments demonstrate that such tight coupling brings significant benefits in quantitative and qualitative performance.

Download Full-text

Analysing the supervised learning methods for prediction of healthcare data in cloud environment A Survey

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i3.434438 ◽

2018 ◽

Vol 6 (3) ◽

pp. 434-438

Author(s):

N.M. Annigeri ◽

◽

S. Shetty ◽

A.P. Patil ◽

◽

...

Keyword(s):

Supervised Learning ◽

Cloud Environment ◽

Learning Methods ◽

Healthcare Data

Download Full-text

Two Ensemble-CNN Approaches for Colorectal Cancer Tissue Type Classification

Journal of Imaging ◽

10.3390/jimaging7030051 ◽

2021 ◽

Vol 7 (3) ◽

pp. 51

Author(s):

Emanuela Paladini ◽

Edoardo Vantaggiato ◽

Fares Bougourzi ◽

Cosimo Distante ◽

Abdenour Hadid ◽

...

Keyword(s):

Colorectal Cancer ◽

Deep Learning ◽

Digital Pathology ◽

Texture Features ◽

Tissue Type ◽

Cancer Tissue ◽

Learning Methods ◽

Feature Based ◽

Type Classification ◽

Whole Slide Images

In recent years, automatic tissue phenotyping has attracted increasing interest in the Digital Pathology (DP) field. For Colorectal Cancer (CRC), tissue phenotyping can diagnose the cancer and differentiate between different cancer grades. The development of Whole Slide Images (WSIs) has provided the required data for creating automatic tissue phenotyping systems. In this paper, we study different hand-crafted feature-based and deep learning methods using two popular multi-classes CRC-tissue-type databases: Kather-CRC-2016 and CRC-TP. For the hand-crafted features, we use two texture descriptors (LPQ and BSIF) and their combination. In addition, two classifiers are used (SVM and NN) to classify the texture features into distinct CRC tissue types. For the deep learning methods, we evaluate four Convolutional Neural Network (CNN) architectures (ResNet-101, ResNeXt-50, Inception-v3, and DenseNet-161). Moreover, we propose two Ensemble CNN approaches: Mean-Ensemble-CNN and NN-Ensemble-CNN. The experimental results show that the proposed approaches outperformed the hand-crafted feature-based methods, CNN architectures and the state-of-the-art methods in both databases.

Download Full-text

Self-Supervised Learning for Autonomous Vehicles Perception: A Conciliation Between Analytical and Learning Methods

IEEE Signal Processing Magazine ◽

10.1109/msp.2020.2977269 ◽

2021 ◽

Vol 38 (1) ◽

pp. 31-41

Author(s):

Florent Chiaroni ◽

Mohamed-Cherif Rahal ◽

Nicolas Hueber ◽

Frederic Dufaux

Keyword(s):

Supervised Learning ◽

Autonomous Vehicles ◽

Learning Methods

Download Full-text

Assigning gene ontology categories (GO) to yeast genes using text-based supervised learning methods

Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004. ◽

10.1109/csb.2004.1332476 ◽

2004 ◽

Cited By ~ 2

Author(s):

T. Izumitani ◽

H. Taira ◽

H. Kazawa ◽

E. Maeda

Keyword(s):

Gene Ontology ◽

Supervised Learning ◽

Learning Methods ◽

Yeast Genes

Download Full-text

Efficient dynamic routing in Spectrally-Spatially Flexible Optical Networks based on traffic categorization and supervised learning methods

Optical Switching and Networking ◽

10.1016/j.osn.2021.100650 ◽

2021 ◽

pp. 100650

Author(s):

Róża Goścień ◽

Paweł Ksieniewicz

Keyword(s):

Supervised Learning ◽

Optical Networks ◽

Dynamic Routing ◽

Learning Methods

Download Full-text

Feature based cluster ranking approach for single document summarization

International Journal of Information Technology ◽

10.1007/s41870-021-00853-1 ◽

2022 ◽

Author(s):

Aakanksha Sharaff ◽

Mohit Jain ◽

Geethika Modugula

Keyword(s):

Document Summarization ◽

Feature Based ◽

Cluster Ranking

Download Full-text