Aspect-Aware Multimodal Summarization for Chinese E-Commerce Products

Haoran Li; Peng Yuan; Song Xu; Youzheng Wu; Xiaodong He; Bowen Zhou

doi:10.1609/aaai.v34i05.6332

Aspect-Aware Multimodal Summarization for Chinese E-Commerce Products

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6332 ◽

2020 ◽

Vol 34 (05) ◽

pp. 8188-8195

Author(s):

Haoran Li ◽

Peng Yuan ◽

Song Xu ◽

Youzheng Wu ◽

Xiaodong He ◽

...

Keyword(s):

Energy Efficiency ◽

Significant Role ◽

Visual Information ◽

Large Scale ◽

Product Information ◽

Comparative Methods ◽

Text Summarization ◽

Experimental Results ◽

Summarization System ◽

Abstractive Summarization

We present an abstractive summarization system that produces summary for Chinese e-commerce products. This task is more challenging than general text summarization. First, the appearance of a product typically plays a significant role in customers' decisions to buy the product or not, which requires that the summarization model effectively use the visual information of the product. Furthermore, different products have remarkable features in various aspects, such as “energy efficiency” and “large capacity” for refrigerators. Meanwhile, different customers may care about different aspects. Thus, the summarizer needs to capture the most attractive aspects of a product that resonate with potential purchasers. We propose an aspect-aware multimodal summarization model that can effectively incorporate the visual information and also determine the most salient aspects of a product. We construct a large-scale Chinese e-commerce product summarization dataset that contains approximately 1.4 million manually created product summaries that are paired with detailed product information, including an image, a title, and other textual descriptions for each product. The experimental results on this dataset demonstrate that our models significantly outperform the comparative methods in terms of both the ROUGE score and manual evaluations.

Download Full-text

Abstractive Text Summarization by Incorporating Reader Comments

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33016399 ◽

2019 ◽

Vol 33 ◽

pp. 6399-6406 ◽

Cited By ~ 4

Author(s):

Shen Gao ◽

Xiuying Chen ◽

Piji Li ◽

Zhaochun Ren ◽

Lidong Bing ◽

...

Keyword(s):

Real World ◽

Large Scale ◽

Text Summarization ◽

Experimental Results ◽

Semantic Gap ◽

Main Aspect ◽

Adversarial Learning ◽

Large Scale Dataset ◽

Reader Comments ◽

Abstractive Summarization

In neural abstractive summarization field, conventional sequence-to-sequence based models often suffer from summarizing the wrong aspect of the document with respect to the main aspect. To tackle this problem, we propose the task of reader-aware abstractive summary generation, which utilizes the reader comments to help the model produce better summary about the main aspect. Unlike traditional abstractive summarization task, reader-aware summarization confronts two main challenges: (1) Comments are informal and noisy; (2) jointly modeling the news document and the reader comments is challenging. To tackle the above challenges, we design an adversarial learning model named reader-aware summary generator (RASG), which consists of four components: (1) a sequence-to-sequence based summary generator; (2) a reader attention module capturing the reader focused aspects; (3) a supervisor modeling the semantic gap between the generated summary and reader focused aspects; (4) a goal tracker producing the goal for each generation step. The supervisor and the goal tacker are used to guide the training of our framework in an adversarial manner. Extensive experiments are conducted on our large-scale real-world text summarization dataset, and the results show that RASG achieves the stateof-the-art performance in terms of both automatic metrics and human evaluations. The experimental results also demonstrate the effectiveness of each module in our framework. We release our large-scale dataset for further research1.

Download Full-text

A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/591 ◽

2018 ◽

Cited By ~ 15

Author(s):

Shuming Ma ◽

Xu Sun ◽

Junyang Lin ◽

Xuancheng Ren

Keyword(s):

Hierarchical Structure ◽

Online Reviews ◽

Text Summarization ◽

Sentiment Classification ◽

Experimental Results ◽

Joint Learning ◽

End To End ◽

Abstractive Summarization ◽

Main Ideas ◽

Different Levels

Text summarization and sentiment classification both aim to capture the main ideas of the text but at different levels. Text summarization is to describe the text within a few sentences, while sentiment classification can be regarded as a special type of summarization which ``summarizes'' the text into a even more abstract fashion, i.e., a sentiment class. Based on this idea, we propose a hierarchical end-to-end model for joint learning of text summarization and sentiment classification, where the sentiment classification label is treated as the further ``summarization'' of the text summarization output. Hence, the sentiment classification layer is put upon the text summarization layer, and a hierarchical structure is derived. Experimental results on Amazon online reviews datasets show that our model achieves better performance than the strong baseline systems on both abstractive summarization and sentiment classification.

Download Full-text

Exploiting Semantic Term Relations in Text Summarization

International Journal of Information Retrieval Research ◽

10.4018/ijirr.289607 ◽

2022 ◽

Vol 12 (1) ◽

pp. 0-0

Keyword(s):

Similarity Measure ◽

Text Summarization ◽

Experimental Results ◽

Semantic Relations ◽

Document Summarization ◽

Term Frequency ◽

Redundancy Removal ◽

Term Similarity ◽

Summarization System

The traditional frequency based approach to creating multi-document extractive summary ranks sentences based on scores computed by summing up TF*IDF weights of words contained in the sentences. In this approach, TF or term frequency is calculated based on how frequently a term (word) occurs in the input and TF calculated in this way does not take into account the semantic relations among terms. In this paper, we propose methods that exploits semantic term relations for improving sentence ranking and redundancy removal steps of a summarization system. Our proposed summarization system has been tested on DUC 2003 and DUC 2004 benchmark multi-document summarization datasets. The experimental results reveal that performance of our multi-document text summarizer is significantly improved when the distributional term similarity measure is used for finding semantic term relations. Our multi-document text summarizer also outperforms some well known summarization baselines to which it is compared.

Download Full-text

Approaches to Large-Scale User Opinion Summarization for the Web

Advances in Data Mining and Database Management - Innovative Document Summarization Techniques ◽

10.4018/978-1-4666-5019-0.ch007 ◽

2014 ◽

pp. 163-184

Author(s):

William Darling

Keyword(s):

Real World ◽

Domain Knowledge ◽

Large Scale ◽

Text Summarization ◽

User Comments ◽

Opinion Summarization ◽

Real World Problem ◽

World Opinion ◽

Summarization System ◽

The Web

This chapter discusses approaches to applying text summarization research to the real-world problem of opinion summarization of user comments. Following a brief overview of the history of research in text summarization, the authors consider large scale user opinion summarization on the Web, a summarization problem that is distinct from the traditional domain that the research has focused on until very recently. More specifically, they consider opinion summarization of large datasets that generally include large degrees of noise and little editorial structure. To deal with this kind of real-world problem, the chapter addresses three major areas that must be considered and adhered to when designing systems for this type of problem: simple techniques, domain knowledge, and evaluative testing. Each area is covered in detail, and throughout the chapter, the lessons are applied to a case study that aims to apply the recommendations to designing a real-world opinion summarization system for a fictional book publisher.

Download Full-text

Increasing Energy Efficiency of Mobile Nodes Using Distributed Energy-Adaptive Location Based Cooperative- Medium Access Control Protocol (DEL-CMAC) in Large Scale MANETs

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v9i10.984 ◽

2014 ◽

Vol 9 (10) ◽

pp. 1651

Author(s):

Vijayakumar Adaickalam ◽

K. Selvamani Kadhirvelu

Keyword(s):

Energy Efficiency ◽

Access Control ◽

Medium Access Control ◽

Large Scale ◽

Medium Access Control Protocol ◽

Mobile Nodes ◽

Distributed Energy ◽

Medium Access ◽

Control Protocol

Download Full-text

Engaging Supply-Chain Manufacturers to Optimize Delivery of Automation: Case Study and Lessons Learned from Optimizing BNR and Energy Efficiency within Large-Scale Aeration Automation at Tallman Island WPCP, New York City

Proceedings of the Water Environment Federation ◽

10.2175/193864715819542278 ◽

2015 ◽

Vol 2015 (10) ◽

pp. 3006-3014

Author(s):

R. J Kowalski ◽

J Finnigan ◽

A Kreel ◽

M Zaman

Keyword(s):

New York ◽

Energy Efficiency ◽

New York City ◽

Supply Chain ◽

York City ◽

Large Scale ◽

Lessons Learned

Download Full-text

Statistical and machine learning models for optimizing energy in parallel applications

The International Journal of High Performance Computing Applications ◽

10.1177/1094342019842915 ◽

2019 ◽

Vol 33 (6) ◽

pp. 1079-1097 ◽

Cited By ~ 2

Author(s):

Mark Endrei ◽

Chao Jin ◽

Minh Ngoc Dinh ◽

David Abramson ◽

Heidi Poxon ◽

...

Keyword(s):

Machine Learning ◽

Energy Efficiency ◽

High Performance ◽

Large Scale ◽

Energy Use ◽

Parallel Applications ◽

Learning Models ◽

Trade Off ◽

Time Required ◽

Machine Learning Models

Rising power costs and constraints are driving a growing focus on the energy efficiency of high performance computing systems. The unique characteristics of a particular system and workload and their effect on performance and energy efficiency are typically difficult for application users to assess and to control. Settings for optimum performance and energy efficiency can also diverge, so we need to identify trade-off options that guide a suitable balance between energy use and performance. We present statistical and machine learning models that only require a small number of runs to make accurate Pareto-optimal trade-off predictions using parameters that users can control. We study model training and validation using several parallel kernels and more complex workloads, including Algebraic Multigrid (AMG), Large-scale Atomic Molecular Massively Parallel Simulator, and Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. We demonstrate that we can train the models using as few as 12 runs, with prediction error of less than 10%. Our AMG results identify trade-off options that provide up to 45% improvement in energy efficiency for around 10% performance loss. We reduce the sample measurement time required for AMG by 90%, from 13 h to 74 min.

Download Full-text

A bootstrapping approach for robust topic analysis

Natural Language Engineering ◽

10.1017/s1351324902002929 ◽

2002 ◽

Vol 8 (2-3) ◽

pp. 209-233 ◽

Cited By ~ 1

Author(s):

OLIVIER FERRET ◽

BRIGITTE GRAU

Keyword(s):

Information Extraction ◽

Large Scale ◽

Text Summarization ◽

Great Precision ◽

Topic Analysis ◽

Structured Knowledge

Topic analysis is important for many applications dealing with texts, such as text summarization or information extraction. However, it can be done with great precision only if it relies on structured knowledge, which is difficult to produce on a large scale. In this paper, we propose using bootstrapping to solve this problem: a first topic analysis based on a weakly structured source of knowledge, a collocation network, is used for learning explicit topic representations that then support a more precise and reliable topic analysis.

Download Full-text

How the Multiplicity of Suggested Information Affects the Behavior of a User in a Recommender System

Electronics ◽

10.3390/electronics10060741 ◽

2021 ◽

Vol 10 (6) ◽

pp. 741

Author(s):

Yuseok Ban ◽

Kyungjae Lee

Keyword(s):

High Precision ◽

Recommender System ◽

Visual Information ◽

Experimental Results ◽

Supplementary Information ◽

Digital Platform ◽

Different Types ◽

Types Of Information

Many researchers have suggested improving the retention of a user in the digital platform using a recommender system. Recent studies show that there are many potential ways to assist users to find interesting items, other than high-precision rating predictions. In this paper, we study how the diverse types of information suggested to a user can influence their behavior. The types have been divided into visual information, evaluative information, categorial information, and narrational information. Based on our experimental results, we analyze how different types of supplementary information affect the performance of a recommender in terms of encouraging users to click more items or spend more time in the digital platform.

Download Full-text