SE4ExSum: An Integrated Semantic-aware Neural Approach with Graph Convolutional Network for Extractive Text Summarization

Recently, advanced techniques in deep learning such as recurrent neural network (GRU, LSTM and Bi-LSTM) and auto-encoding (attention-based transformer and BERT) have achieved great successes in multiple application domains including text summarization. Recent state-of-the-art encoding-based text summarization models such as BertSum, PreSum and DiscoBert have demonstrated significant improvements on extractive text summarization tasks. However, recent models still encounter common problems related to the language-specific dependency which requires the supports of the external NLP tools. Besides that, recent advanced text representation methods, such as BERT as the sentence-level textual encoder, also fail to fully capture the representation of a full-length document. To address these challenges, in this paper we proposed a novel s emantic-ware e mbedding approach for ex tractive text sum marization , called as: SE4ExSum. Our proposed SE4ExSum is an integration between the use of feature graph-of-words (FGOW) with BERT-based encoder for effectively learning the word/sentence-level representations of a given document. Then, the g raph c onvolutional n etwork (GCN) based encoder is applied to learn the global document's representation which is then used to facilitate the text summarization task. Extensive experiments on benchmark datasets show the effectiveness of our proposed model in comparing with recent state-of-the-art text summarization models.

Download Full-text

Two-Level Transformer and Auxiliary Coherence Modeling for Improved Text Segmentation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6284 ◽

2020 ◽

Vol 34 (05) ◽

pp. 7797-7804

Author(s):

Goran Glavašš ◽

Swapna Somasundaran

Keyword(s):

State Of The Art ◽

Language Transfer ◽

Text Segmentation ◽

Word Embeddings ◽

Neural Architecture ◽

Text Coherence ◽

Sentence Level ◽

Proposed Model ◽

Benchmark Datasets ◽

Cross Lingual

Breaking down the structure of long texts into semantically coherent segments makes the texts more readable and supports downstream applications like summarization and retrieval. Starting from an apparent link between text coherence and segmentation, we introduce a novel supervised model for text segmentation with simple but explicit coherence modeling. Our model – a neural architecture consisting of two hierarchically connected Transformer networks – is a multi-task learning model that couples the sentence-level segmentation objective with the coherence objective that differentiates correct sequences of sentences from corrupt ones. The proposed model, dubbed Coherence-Aware Text Segmentation (CATS), yields state-of-the-art segmentation performance on a collection of benchmark datasets. Furthermore, by coupling CATS with cross-lingual word embeddings, we demonstrate its effectiveness in zero-shot language transfer: it can successfully segment texts in languages unseen in training.

Download Full-text

FLEX: Faithful Linguistic Explanations for Neural Net Based Model Decisions

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33012539 ◽

2019 ◽

Vol 33 ◽

pp. 2539-2546

Author(s):

Sandareka Wickramanayake ◽

Wynne Hsu ◽

Mong Li Lee

Keyword(s):

Neural Network ◽

Deep Learning ◽

State Of The Art ◽

Neural Net ◽

End User ◽

Learning Network ◽

Benchmark Datasets ◽

Deep Learning Network ◽

Post Hoc ◽

User Trust

Explaining the decisions of a Deep Learning Network is imperative to safeguard end-user trust. Such explanations must be intuitive, descriptive, and faithfully explain why a model makes its decisions. In this work, we propose a framework called FLEX (Faithful Linguistic EXplanations) that generates post-hoc linguistic justifications to rationalize the decision of a Convolutional Neural Network. FLEX explains a model’s decision in terms of features that are responsible for the decision. We derive a novel way to associate such features to words, and introduce a new decision-relevance metric that measures the faithfulness of an explanation to a model’s reasoning. Experiment results on two benchmark datasets demonstrate that the proposed framework can generate discriminative and faithful explanations compared to state-of-the-art explanation generators. We also show how FLEX can generate explanations for images of unseen classes as well as automatically annotate objects in images.

Download Full-text

Batik Classification using Deep Convolutional Network Transfer Learning

Jurnal Ilmu Komputer dan Informasi ◽

10.21609/jiki.v11i2.507 ◽

2018 ◽

Vol 11 (2) ◽

pp. 59 ◽

Cited By ~ 5

Author(s):

Yohanes Gultom ◽

Aniati Murni Arymurthy ◽

Rian Josua Masikome

Keyword(s):

Neural Network ◽

Deep Learning ◽

Cultural Heritage ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Data Representation ◽

Convolutional Network ◽

Proposed Model ◽

Feature Extractor ◽

Research Task

Batik fabric is one of the most profound cultural heritage in Indonesia. Hence, continuous research on understanding it is necessary to preserve it. Despite of being one of the most common research task, Batik’s pattern automatic classification still requires some improvement especially in regards to invariance dilemma. Convolutional neural network (ConvNet) is one of deep learning architecture which able to learn data representation by combining local receptive inputs, weight sharing and convolutions in order to solve invariance dilemma in image classification. Using dataset of 2,092 Batik patches (5 classes), the experiments show that the proposed model, which used deep ConvNet VGG16 as feature extractor (transfer learning), achieves slightly better average of 89 ± 7% accuracy than SIFT and SURF-based that achieve 88 ± 10% and 88 ± 8% respectively. Despite of that, SIFT reaches around 5% better accuracy in rotated and scaled dataset.

Download Full-text

Fake or Genuine? Contextualised Text Representation for Fake Review Detection

10.5121/csit.2021.112311 ◽

2021 ◽

Author(s):

Rami Mohawesh ◽

Shuxiang Xu ◽

Matthew Springer ◽

Muna Al-Hawawreh ◽

Sumbal Maqsood

Keyword(s):

State Of The Art ◽

Online Reviews ◽

Learning Approaches ◽

Text Representation ◽

Purchasing Decisions ◽

Linguistic Features ◽

Proposed Model ◽

Benchmark Datasets ◽

Hidden Patterns ◽

Fake Reviews

Online reviews have a significant influence on customers' purchasing decisions for any products or services. However, fake reviews can mislead both consumers and companies. Several models have been developed to detect fake reviews using machine learning approaches. Many of these models have some limitations resulting in low accuracy in distinguishing between fake and genuine reviews. These models focused only on linguistic features to detect fake reviews and failed to capture the semantic meaning of the reviews. To deal with this, this paper proposes a new ensemble model that employs transformer architecture to discover the hidden patterns in a sequence of fake reviews and detect them precisely. The proposed approach combines three transformer models to improve the robustness of fake and genuine behaviour profiling and modelling to detect fake reviews. The experimental results using semi-real benchmark datasets showed the superiority of the proposed model over state-of-the-art models.

Download Full-text

Improved optic disc and cup segmentation in Glaucomatic images using deep learning architecture

Multimedia Tools and Applications ◽

10.1007/s11042-020-10430-6 ◽

2021 ◽

Author(s):

Partha Sarathi Mangipudi ◽

Hari Mohan Pandey ◽

Ankur Choudhary

Keyword(s):

Deep Learning ◽

Optic Disc ◽

Vision Loss ◽

State Of The Art ◽

Key Factors ◽

Glaucoma Diagnosis ◽

Proposed Model ◽

Effective System ◽

Benchmark Datasets ◽

Multiple Experts

AbstractGlaucoma is an ailment causing permanent vision loss but can be prevented through the early detection. Optic disc to cup ratio is one of the key factors for glaucoma diagnosis. But accurate segmentation of disc and cup is still a challenge. To mitigate this challenge, an effective system for optic disc and cup segmentation using deep learning architecture is presented in this paper. Modified Groundtruth is utilized to train the proposed model. It works as fused segmentation marking by multiple experts that helps in improving the performance of the system. Extensive computer simulations are conducted to test the efficiency of the proposed system. For the implementation three standard benchmark datasets such as DRISHTI-GS, DRIONS-DB and RIM-ONE v3 are used. The performance of the proposed system is validated against the state-of-the-art methods. Results indicate an average overlapping score of 96.62%, 96.15% and 98.42% respectively for optic disc segmentation and an average overlapping score of 94.41% is achieved on DRISHTI-GS which is significant for optic cup segmentation.

Download Full-text

A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/619 ◽

2018 ◽

Cited By ~ 17

Author(s):

Li Wang ◽

Junlin Yao ◽

Yunzhe Tao ◽

Li Zhong ◽

Wei Liu ◽

...

Keyword(s):

Deep Learning ◽

Experimental Evaluation ◽

State Of The Art ◽

Text Summarization ◽

The Other ◽

Learning Approach ◽

Automatic Summarization ◽

Word Level ◽

Proposed Model ◽

Abstractive Summarization

In this paper, we propose a deep learning approach to tackle the automatic summarization tasks by incorporating topic information into the convolutional sequence-to-sequence (ConvS2S) model and using self-critical sequence training (SCST) for optimization. Through jointly attending to topics and word-level alignment, our approach can improve coherence, diversity, and informativeness of generated summaries via a biased probability generation mechanism. On the other hand, reinforcement training, like SCST, directly optimizes the proposed model with respect to the non-differentiable metric ROUGE, which also avoids the exposure bias during inference. We carry out the experimental evaluation with state-of-the-art methods over the Gigaword, DUC-2004, and LCSTS datasets. The empirical results demonstrate the superiority of our proposed method in the abstractive summarization.

Download Full-text

Semantic Sentence Matching with Densely-Connected Recurrent and Co-Attentive Information

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33016586 ◽

2019 ◽

Vol 33 ◽

pp. 6586-6593 ◽

Cited By ~ 20

Author(s):

Seonhoon Kim ◽

Inho Kang ◽

Nojun Kwak

Keyword(s):

Neural Network ◽

Natural Language ◽

Question Answering ◽

State Of The Art ◽

Attention Mechanism ◽

Semantic Relationship ◽

Convolutional Network ◽

Benchmark Datasets ◽

Feature Information ◽

Sentence Matching

Sentence matching is widely used in various natural language tasks such as natural language inference, paraphrase identification, and question answering. For these tasks, understanding logical and semantic relationship between two sentences is required but it is yet challenging. Although attention mechanism is useful to capture the semantic relationship and to properly align the elements of two sentences, previous methods of attention mechanism simply use a summation operation which does not retain original features enough. Inspired by DenseNet, a densely connected convolutional network, we propose a densely-connected co-attentive recurrent neural network, each layer of which uses concatenated information of attentive features as well as hidden features of all the preceding recurrent layers. It enables preserving the original and the co-attentive feature information from the bottommost word embedding layer to the uppermost recurrent layer. To alleviate the problem of an ever-increasing size of feature vectors due to dense concatenation operations, we also propose to use an autoencoder after dense concatenation. We evaluate our proposed architecture on highly competitive benchmark datasets related to sentence matching. Experimental results show that our architecture, which retains recurrent and attentive features, achieves state-of-the-art performances for most of the tasks.

Download Full-text

Multi-scale 3D-convolutional neural network for hyperspectral image classification

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v25.i1.pp307-316 ◽

2022 ◽

Vol 25 (1) ◽

pp. 307

Author(s):

Murali Kanthi ◽

Thogarcheti Hitendra Sarma ◽

Chigarapalle Shoba Bindu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Classification Accuracy ◽

Hyperspectral Image ◽

State Of The Art ◽

Spatial Dimension ◽

Multi Scale ◽

Proposed Model ◽

Spectral Channels

Deep Learning methods are state-of-the-art approaches for pixel-based hyperspectral images (HSI) classification. High classification accuracy has been achieved by extracting deep features from both spatial-spectral channels. However, the efficiency of such spatial-spectral approaches depends on the spatial dimension of each patch and there is no theoretically valid approach to find the optimum spatial dimension to be considered. It is more valid to extract spatial features by considering varying neighborhood scales in spatial dimensions. In this regard, this article proposes a deep convolutional neural network (CNN) model wherein three different multi-scale spatial-spectral patches are used to extract the features in both the spatial and spectral channels. In order to extract these potential features, the proposed deep learning architecture takes three patches various scales in spatial dimension. 3D convolution is performed on each selected patch and the process runs through entire image. The proposed is named as multi-scale three-dimensional convolutional neural network (MS-3DCNN). The efficiency of the proposed model is being verified through the experimental studies on three publicly available benchmark datasets including Pavia University, Indian Pines, and Salinas. It is empirically proved that the classification accuracy of the proposed model is improved when compared with the remaining state-of-the-art methods.

Download Full-text

Fully Symmetric Convolutional Network for Effective Image Denoising

Applied Sciences ◽

10.3390/app9040778 ◽

2019 ◽

Vol 9 (4) ◽

pp. 778 ◽

Cited By ~ 3

Author(s):

Steffi Priyanka ◽

Yuan-Kai Wang

Keyword(s):

Neural Network ◽

Image Processing ◽

Image Denoising ◽

Gpu Computing ◽

State Of The Art ◽

Large Data ◽

Convolutional Network ◽

Proposed Model ◽

Feature Extractor ◽

A Chain

Neural-network-based image denoising is one of the promising approaches to deal with problems in image processing. In this work, a deep fully symmetric convolutional–deconvolutional neural network (FSCN) is proposed for image denoising. The proposed model comprises a novel architecture with a chain of successive symmetric convolutional–deconvolutional layers. This framework learns convolutional–deconvolutional mappings from corrupted images to the clean ones in an end-to-end fashion without using image priors. The convolutional layers act as feature extractor to encode primary components of the image contents while eliminating corruptions, and the deconvolutional layers then decode the image abstractions to recover the image content details. An adaptive moment optimizer is used to minimize the reconstruction loss as it is appropriate for large data and noisy images. Extensive experiments were conducted for image denoising to evaluate the FSCN model against the existing state-of-the-art denoising algorithms. The results show that the proposed model achieves superior denoising, both qualitatively and quantitatively. This work also presents the efficient implementation of the FSCN model by using GPU computing which makes it easy and attractive for practical denoising applications.

Download Full-text

A novel fully convolutional network for visual saliency prediction

PeerJ Computer Science ◽

10.7717/peerj-cs.280 ◽

2020 ◽

Vol 6 ◽

pp. e280

Author(s):

Bashir Muftah Ghariba ◽

Mohamed S. Shehata ◽

Peter McGuire

Keyword(s):

Deep Learning ◽

Visual Saliency ◽

Superior Performance ◽

Convolutional Network ◽

Fully Convolutional Network ◽

Proposed Model ◽

Saliency Prediction ◽

Benchmark Datasets ◽

The Many ◽

Deep Learning Model

A human Visual System (HVS) has the ability to pay visual attention, which is one of the many functions of the HVS. Despite the many advancements being made in visual saliency prediction, there continues to be room for improvement. Deep learning has recently been used to deal with this task. This study proposes a novel deep learning model based on a Fully Convolutional Network (FCN) architecture. The proposed model is trained in an end-to-end style and designed to predict visual saliency. The entire proposed model is fully training style from scratch to extract distinguishing features. The proposed model is evaluated using several benchmark datasets, such as MIT300, MIT1003, TORONTO, and DUT-OMRON. The quantitative and qualitative experiment analyses demonstrate that the proposed model achieves superior performance for predicting visual saliency.

Download Full-text