Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management

The Development of Single-Document Abstractive Text Summarizer During the Last Decade

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch002
2020, pp. 32-60
Author(s): Amal M. Al-Numai, Aqil M. Azmi
Keyword(s): Text Summarization, Electronic Text, Original Structure, Text Documents, Text Document, Single Text, Evaluation Techniques, Automatic Text

As the number of electronic text documents increases, so does the need for automatic text summarizers. A summary can be extractive, compressive, or abstractive. In the first, the more important sentences are retained, more or less in their original structure; the second reduces the length of each sentence; the third requires fusing multiple sentences and/or paraphrasing. This chapter focuses on abstractive text summarization (ATS) of a single text document. It explains what ATS is and surveys the literature of the field. The datasets and evaluation techniques used to assess summarizers are also discussed. ATS is much more challenging than its extractive counterpart, and as a result relatively few works exist in this area in any language.
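To make the extractive case concrete, the following is a minimal sketch of sentence selection in Python. It assumes a naive word-frequency score and a tiny hand-picked stopword list; neither is taken from the chapter, and the function names are hypothetical. Abstractive summarization, the chapter's actual subject, would additionally have to fuse or paraphrase sentences, which this sketch does not attempt.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative stopword list, not a standard one.
STOPWORDS = {"the", "a", "an", "of", "is", "in", "and", "to", "it", "on"}

def split_sentences(text):
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def tokenize(sentence):
    """Lowercase word tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower()) if w not in STOPWORDS]

def extractive_summary(text, k=1):
    """Keep the k highest-scoring sentences, returned in document order."""
    sentences = split_sentences(text)
    freq = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence):
        tokens = tokenize(sentence)
        # Average corpus frequency of the sentence's words.
        return sum(freq[w] for w in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)[:k]
    return [sentences[i] for i in sorted(ranked)]
```

The selected sentences are returned unchanged and in their original order, which is what "more or less in their original structure" amounts to in the extractive setting.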

Named Entity Recognition in Document Summarization

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch005
2020, pp. 125-149
Author(s): Sandhya P., Mahek Laxmikant Kantesaria
Keyword(s): Information Extraction, Named Entity Recognition, Entity Recognition, Second Step, Original Text, Automatic Summarization, Document Summarization, Named Entity, Text Document, Different Types

Named entity recognition (NER) is a subtask of information extraction. An NER system reads the text and highlights the entities, separating them into different types according to the needs of the project. NER is a two-step process: detecting names and then classifying them. The first step further involves segmentation. The second step involves choosing an ontology that organizes things into categories. Document summarization, also called automatic summarization, is a process in which software creates a summary of a text document by selecting the important points of the original text. In this chapter, the authors explain how document summarization is performed using named entity recognition. They discuss the different types of summarization techniques, how NER works, and its applications. The libraries available for NER-based information extraction are explained. Finally, they explain how NER is applied to document summarization.
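The two steps described above, detection and classification, can be illustrated with a deliberately simple Python sketch. It assumes a regular expression that treats runs of capitalized words as candidate names and a toy gazetteer standing in for the ontology; both are illustrative assumptions, not the libraries or methods covered in the chapter.

```python
import re

# Assumption: a toy ontology mapping categories to known names.
ONTOLOGY = {
    "PERSON": {"Ada Lovelace", "Alan Turing"},
    "ORGANIZATION": {"IGI Global"},
    "LOCATION": {"London"},
}

def detect_names(text):
    """Step 1 (detection and segmentation): maximal runs of capitalized words."""
    return re.findall(r"\b[A-Z][A-Za-z]+(?:\s+[A-Z][A-Za-z]+)*\b", text)

def classify(name):
    """Step 2 (classification): look the segment up in the ontology."""
    for category, names in ONTOLOGY.items():
        if name in names:
            return category
    return "UNKNOWN"

def recognize(text):
    """Run both steps and return (entity, category) pairs in text order."""
    return [(name, classify(name)) for name in detect_names(text)]
```

A capitalization heuristic also picks up ordinary sentence-initial words, which then fall into the UNKNOWN category; practical NER systems rely on trained models rather than rules of this kind.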

Mining Scientific and Technical Literature

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch003
2020, pp. 61-87
Author(s): Junsheng Zhang, Wen Zeng
Keyword(s): Text Mining, Scientific Information, Relation Extraction, Knowledge Extraction, Information Services, Text Summarization, Intelligence Analysis, Technical Literature, Research Directions, Relational Network

In this chapter, the authors study text mining technologies such as knowledge extraction and summarization for scientific and technical literature. First, they analyze the needs of scientific information services and intelligence analysis over massive collections of scientific and technical literature. Second, they present terminology recognition and relation extraction as important tasks of knowledge extraction. Third, they study knowledge extraction based on terminology recognition and relation extraction. Fourth, based on terminology and relational networks, they study text summarization techniques and applications. Last, they comment on current research and applications in text summarization and give their views on possible future research directions.

Opinion Mining and Product Review Summarization in E-Commerce

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch008
2020, pp. 216-243
Author(s): Enakshi Jana, V. Uma
Keyword(s): User Experience, Opinion Mining, Online Shopping, Online Reviews, The Internet, Product Reviews, Product Review, Document Summarization, Review Summarization, Product Manufacture

With the immense increase in the number of internet users and the simultaneous massive expansion of e-commerce platforms, millions of products are sold online. To improve user experience and satisfaction, online shopping platforms enable every user to review each product they buy online. Reviews are long, and only a few sentences in each relate to a particular feature of the product, so it becomes very difficult for a user to understand other customers' views about different features of the product. Accurate opinion-based review summarization is therefore needed to help both customers and product manufacturers understand and focus on a particular aspect of the product. In this chapter, the authors discuss an abstractive document summarization method for summarizing e-commerce product reviews. The chapter gives an in-depth explanation of different types of document summarization and how they can be applied to e-commerce product reviews.

Scaling and Semantically-Enriching Language-Agnostic Summarization

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch009
2020, pp. 244-292
Author(s): George Giannakopoulos, George Kiomourtzis, Nikiforos Pittaras, Vangelis Karkaletsis
Keyword(s): Big Data, Semantic Information, Base Approach, Redundancy Removal, Recent Advances, Research Problems, Recent Developments, News Summarization, N Gram

This chapter describes the evolution of a real, multi-document, multilingual news summarization methodology and application, named NewSum, the research problems behind it, as well as the steps taken to solve these problems. The system uses the representation of n-gram graphs to perform sentence selection and redundancy removal towards summary generation. In addition, it tackles problems related to topic and subtopic detection (via clustering), demonstrates multi-lingual applicability, and—through recent advances—scalability to big data. Furthermore, recent developments over the algorithm allow it to utilize semantic information to better identify and outline events, so as to offer an overall improvement over the base approach.
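As a rough illustration of how an n-gram graph representation can support redundancy removal, the Python sketch below builds a graph whose edges link character n-grams that occur close together, then greedily drops sentences that are too similar to ones already kept. The window size, the Jaccard overlap on edge sets, and the threshold are simplifying assumptions made here; they are not NewSum's actual similarity measures or parameters.

```python
def ngram_graph(text, n=3, window=3):
    """Map (ngram, neighbor) edges to co-occurrence counts.

    Two character n-grams are linked when the second starts within
    `window` positions after the first.
    """
    grams = [text[i:i + n] for i in range(len(text) - n + 1)]
    edges = {}
    for i, gram in enumerate(grams):
        for j in range(i + 1, min(i + 1 + window, len(grams))):
            key = (gram, grams[j])
            edges[key] = edges.get(key, 0) + 1
    return edges

def similarity(g1, g2):
    """Jaccard overlap of the two edge sets (a simplification)."""
    if not g1 or not g2:
        return 0.0
    return len(g1.keys() & g2.keys()) / len(g1.keys() | g2.keys())

def remove_redundant(sentences, threshold=0.5):
    """Keep a sentence only if it is dissimilar to every sentence kept so far."""
    kept, graphs = [], []
    for sentence in sentences:
        graph = ngram_graph(sentence.lower())
        if all(similarity(graph, other) < threshold for other in graphs):
            kept.append(sentence)
            graphs.append(graph)
    return kept
```

Because the graph is built from characters rather than words, the same code runs unchanged on text in any language, which is in the spirit of the multilingual applicability the abstract mentions.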

Summarization in the Financial and Regulatory Domain

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch007
2020, pp. 187-215
Cited by: ~1
Author(s): Jochen L. Leidner
Keyword(s): Natural Language Processing, Natural Language, Language Processing, Entry Point, Special Consideration, Regulatory Domain, Automatic Summarization, Regulatory Domains

This chapter presents an introduction to automatic summarization techniques with special consideration of the financial and regulatory domains. It aims to provide an entry point to the field for readers interested in natural language processing (NLP) who are experts in the finance and/or regulatory domain, or for NLP researchers who would like to learn more about financial and regulatory applications. After some core summarization concepts and the two domains are introduced, some key methods and systems are described. Evaluation and quality concerns are also summarized. To conclude, some pointers for further reading are provided.

Text Classification and Topic Modeling for Online Discussion Forums

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch006
2020, pp. 151-186
Author(s): Xin Zhao, Zhe Jiang, Jeff Gray
Keyword(s): Machine Learning, Domain Knowledge, Online Discussion, Machine Learning Techniques, Learning Approaches, Discussion Forums, Online Forums, Unsupervised Machine Learning, Online Discussion Forums, Learning Techniques

Online discussion forums play an important role in building and sharing domain knowledge. An extensive amount of information can be found in online forums, covering every aspect of life and professional discourse. This chapter introduces the application of supervised and unsupervised machine learning techniques to the analysis of forum questions. It starts with supervised machine learning techniques that classify forum posts into pre-defined topic categories. As a supporting technique, web scraping is also discussed as a way to gather data from an online forum. The chapter then introduces unsupervised learning techniques that identify latent topics in documents. The combination of supervised and unsupervised machine learning approaches offers deeper insights into the data obtained from online forums. These techniques are demonstrated through a case study on a very large online discussion forum called LabVIEW, from the systems modeling community. Finally, the authors list future trends in applying machine learning to understand the expertise captured in online expert communities.
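The supervised step, classifying forum posts into pre-defined topic categories, can be sketched with a small multinomial Naive Bayes classifier in plain Python. The abstract does not say which classifier the chapter uses, so Naive Bayes is an assumption here, as are the class name and any category labels or example posts used with it.

```python
import math
import re
from collections import Counter, defaultdict

def tokenize(text):
    """Lowercase alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())

class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, posts, labels):
        self.word_counts = defaultdict(Counter)   # label -> word frequencies
        self.class_counts = Counter(labels)       # label -> number of posts
        for post, label in zip(posts, labels):
            self.word_counts[label].update(tokenize(post))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, post):
        total_posts = sum(self.class_counts.values())
        best_label, best_logp = None, -math.inf
        for label, n_posts in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            logp = math.log(n_posts / total_posts)  # class prior
            for word in tokenize(post):
                if word in self.vocab:              # ignore unseen words
                    logp += math.log((counts[word] + 1) / denom)
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label
```

Trained on a handful of labeled posts, the classifier assigns a new post to the category whose vocabulary it matches best. The unsupervised step, discovering latent topics without labels, needs a separate technique and is not covered by this sketch.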

Data Text Mining Based on Swarm Intelligence Techniques

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch004
2020, pp. 88-124
Author(s): Mohamed Atef Mosa
Keyword(s): Text Mining, Swarm Intelligence, Web Mining, Heuristic Algorithms, Optimization Techniques, Short Text, First Time, Automatic Text, Great Growth, The Web

Due to the rapid growth of data on the web, mining it to extract the most informative content as a concise summary would benefit many users. There is therefore great interest in developing automatic text summarization approaches. In this chapter, the authors highlight the use of swarm intelligence (SI) optimization techniques, for the first time, to solve the text summarization problem. In addition, they present a justification of why nature-inspired heuristic algorithms, especially ant colony optimization (ACO), are the best algorithms for solving complicated optimization tasks. Moreover, they observe that text summarization had not previously been formalized as a multi-objective optimization (MOO) task, even though many conflicting objectives need to be achieved. Nor had SI been employed before to support real-time tasks. A novel framework for short text summarization is therefore proposed to address this gap. Ultimately, the chapter aims to encourage researchers to give further consideration to SI algorithms for solving summarization tasks.
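The idea of treating summarization as a multi-objective task with conflicting objectives can be illustrated by a Pareto-dominance check over candidate summaries, sketched below in Python. The two objectives used in the usage note (coverage and diversity, both maximized) and the candidate scores are assumptions for illustration only; the sketch does not implement ACO or the chapter's framework.

```python
def dominates(a, b):
    """True if score tuple a is at least as good as b on every objective
    and strictly better on at least one (all objectives are maximized)."""
    return (all(x >= y for x, y in zip(a, b))
            and any(x > y for x, y in zip(a, b)))

def pareto_front(candidates):
    """Return the candidates that no other candidate dominates.

    `candidates` maps a candidate name to its tuple of objective scores.
    """
    return {
        name: scores
        for name, scores in candidates.items()
        if not any(dominates(other, scores)
                   for other_name, other in candidates.items()
                   if other_name != name)
    }
```

For example, with hypothetical (coverage, diversity) scores A = (0.9, 0.2), B = (0.6, 0.8), C = (0.5, 0.7), and D = (0.9, 0.1), candidates A and B form the Pareto front: neither is better than the other on both objectives, while C is dominated by B and D by A. An optimizer such as ACO would search for candidates on or near this front instead of collapsing the objectives into a single score.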

Combining Machine Learning and Natural Language Processing for Language-Specific, Multi-Lingual, and Cross-Lingual Text Summarization

Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
DOI: 10.4018/978-1-5225-9373-7.ch001
2020, pp. 1-31
Author(s): Luca Cagliero, Paolo Garza, Moreno La Quatra
Keyword(s): Machine Learning, Natural Language Processing, Natural Language, Language Processing, Research Community, Text Summarization, Web Based, Recent Advances, Pros And Cons, Cross Lingual

Recent advances in multimedia and web-based applications have eased access to large collections of textual documents. To automate the process of document analysis, the research community has put considerable effort into extracting short summaries of document content. However, most of the early summarization methods were tailored to English-language corpora or to collections of documents all written in the same language. More recently, the joint efforts of the machine learning and natural language processing communities have produced more portable and flexible solutions, which can be applied to documents written in different languages. This chapter first gives an overview of the most relevant language-specific summarization algorithms. Then, it presents the most recent advances in multi- and cross-lingual text summarization. The chapter classifies the presented methodologies, highlights their main pros and cons, and discusses the prospects for extending current research towards cross-lingual summarization systems.

Published By IGI Global
