Trends and Applications of Text Summarization Techniques - Advances in Data Mining and Database Management
Latest Publications


Total documents: 9 (five years: 9)

H-index: 1 (five years: 1)

Published By IGI Global

9781522593737, 9781522593751

Author(s):  
Amal M. Al-Numai ◽  
Aqil M. Azmi

As the number of electronic text documents increases, so does the need for automatic text summarizers. A summary can be extractive, compression-based, or abstractive. In the first, the more important sentences are retained, more or less in their original structure; the second reduces the length of each sentence; the third requires fusing multiple sentences and/or paraphrasing. This chapter focuses on abstractive text summarization (ATS) of a single text document. It explains what ATS is and surveys the literature of the field. The datasets and evaluation techniques used to assess summarizers are also discussed. ATS is much more challenging than its extractive counterpart, and as a result there are few works in this area for any language.
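To make the extractive case concrete, the following is a minimal sketch of frequency-based sentence selection. It is not the chapter's method (the chapter addresses abstractive summarization); the function names, the tiny stopword list, and the scoring rule (average corpus frequency of a sentence's non-stopword tokens) are all assumptions chosen for illustration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list used only for this sketch.
STOPWORDS = {"a", "an", "the", "is", "are", "of", "to", "in", "and", "or", "on"}


def split_sentences(text):
    """Split text into sentences on ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def content_words(text):
    """Lowercased alphabetic tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def extractive_summary(text, k=2):
    """Return the k highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence):
        tokens = content_words(sentence)
        # Average frequency, so long sentences are not favored automatically.
        return sum(freq[w] for w in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:k])]
```

Calling `extractive_summary(document_text, k=3)` returns up to three original sentences unchanged, which is what distinguishes this family of methods from the compression-based and abstractive approaches described above.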


Author(s):  
Sandhya P. ◽  
Mahek Laxmikant Kantesaria

Named entity recognition (NER) is a subtask of information extraction. An NER system reads text and highlights the entities in it, separating them into categories that depend on the project. NER is a two-step process: detecting names and then classifying them. The first step involves segmentation. The second requires choosing an ontology by which to organize the entities into categories. Document summarization, also called automatic summarization, is a process in which software creates a summary of a text document by selecting the important points of the original text. In this chapter, the authors explain how document summarization is performed using named entity recognition. They discuss the different types of summarization techniques, how NER works, and its applications. The libraries available for NER-based information extraction are explained. Finally, they explain how NER is applied to document summarization.


Author(s):  
Junsheng Zhang ◽  
Wen Zeng

In this chapter, the authors study text mining technologies such as knowledge extraction and summarization for scientific and technical literature. First, they analyze the needs of scientific information services and intelligence analysis over massive scientific and technical literature. Second, they introduce terminology recognition and relation extraction, which are important tasks of knowledge extraction. Third, they study knowledge extraction based on terminology recognition and relation extraction. Fourth, based on terminology and relational networks, they study text summarization techniques and applications. Last, they comment on current research and applications in text summarization and give their views on possible future research directions.


Author(s):  
Enakshi Jana ◽  
V. Uma

With the immense increase in the number of internet users and the simultaneous massive expansion of e-commerce platforms, millions of products are sold online. To improve user experience and satisfaction, online shopping platforms enable every user to review each product they buy. Reviews are often long, and only a few of their sentences relate to a particular feature of the product, so it becomes very difficult for a user to understand other customers' views on the product's different features. Accurate opinion-based review summarization is therefore needed to help both customers and product manufacturers understand and focus on a particular aspect of the product. In this chapter, the authors discuss an abstractive document summarization method for summarizing e-commerce product reviews. The chapter explains in depth the different types of document summarization and how they can be applied to e-commerce product reviews.


Author(s):  
George Giannakopoulos ◽  
George Kiomourtzis ◽  
Nikiforos Pittaras ◽  
Vangelis Karkaletsis

This chapter describes the evolution of a real, multi-document, multilingual news summarization methodology and application, named NewSum, the research problems behind it, as well as the steps taken to solve these problems. The system uses the representation of n-gram graphs to perform sentence selection and redundancy removal towards summary generation. In addition, it tackles problems related to topic and subtopic detection (via clustering), demonstrates multi-lingual applicability, and—through recent advances—scalability to big data. Furthermore, recent developments over the algorithm allow it to utilize semantic information to better identify and outline events, so as to offer an overall improvement over the base approach.
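As a rough illustration of how an n-gram graph representation can support redundancy removal, here is a simplified sketch. It is not the NewSum implementation: the graph construction (character trigrams linked when they occur within a small window), the similarity measure (a value-similarity-style ratio of shared edge weights), the greedy filtering loop, and every function name and default parameter are assumptions made for this example.

```python
from collections import Counter


def ngram_graph(text, n=3, window=3):
    """Map each undirected edge (ngram_a, ngram_b) to its co-occurrence count.

    Two character n-grams are linked when one starts within `window`
    positions after the other.
    """
    text = text.lower()
    grams = [text[i:i + n] for i in range(len(text) - n + 1)]
    edges = Counter()
    for i, gram in enumerate(grams):
        for j in range(i + 1, min(i + 1 + window, len(grams))):
            edges[tuple(sorted((gram, grams[j])))] += 1
    return edges


def value_similarity(g1, g2):
    """Similarity in [0, 1]: shared edges weighted by how close their counts are."""
    if not g1 or not g2:
        return 0.0
    shared = g1.keys() & g2.keys()
    total = sum(min(g1[e], g2[e]) / max(g1[e], g2[e]) for e in shared)
    return total / max(len(g1), len(g2))


def drop_redundant(sentences, threshold=0.5):
    """Greedily keep a sentence only if it is dissimilar to all kept so far."""
    kept, graphs = [], []
    for sentence in sentences:
        graph = ngram_graph(sentence)
        if all(value_similarity(graph, other) < threshold for other in graphs):
            kept.append(sentence)
            graphs.append(graph)
    return kept
```

Because the graphs are built from character n-grams rather than words, a comparison of this kind needs no language-specific tokenizer, which is one reason such representations suit multilingual settings. The threshold of 0.5 is arbitrary and would need tuning on real data.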


Author(s):  
Jochen L. Leidner

This chapter presents an introduction to automatic summarization techniques with special consideration of the financial and regulatory domains. It aims to provide an entry point to the field for readers interested in natural language processing (NLP) who are experts in the finance and/or regulatory domain, and for NLP researchers who would like to learn more about financial and regulatory applications. After some core summarization concepts and the two domains are introduced, some key methods and systems are described. Evaluation and quality concerns are also summarized. To conclude, some pointers for further reading are provided.


Author(s):  
Xin Zhao ◽  
Zhe Jiang ◽  
Jeff Gray

Online discussion forums play an important role in building and sharing domain knowledge. An extensive amount of information can be found in online forums, covering every aspect of life and professional discourse. This chapter introduces the application of supervised and unsupervised machine learning techniques to analyze forum questions. It starts with supervised machine learning techniques that classify forum posts into pre-defined topic categories. As a supporting technique, web scraping is also discussed as a way to gather data from an online forum. The chapter then introduces unsupervised learning techniques that identify latent topics in documents. The combination of supervised and unsupervised machine learning approaches offers deeper insights into the data obtained from online forums. The chapter demonstrates these techniques through a case study on a very large online discussion forum called LabVIEW from the systems modeling community. Finally, the authors list future trends in applying machine learning to understand the expertise captured in online expert communities.


Author(s):  
Mohamed Atef Mosa

Due to the rapid growth of data on the web, mining it to extract the most informative content as a concise brief would benefit many users. There is therefore great enthusiasm for developing automatic text summarization approaches. In this chapter, the authors highlight the use of swarm intelligence (SI) optimization techniques, for the first time, to solve the text summarization problem. In addition, a justification is given for why nature-inspired heuristic algorithms, especially ant colony optimization (ACO), are the best algorithms for solving complicated optimization tasks. Moreover, the text summarization problem had not previously been formalized as a multi-objective optimization (MOO) task, even though many conflicting objectives need to be achieved. Nor had SI previously been employed to support real-time tasks. A novel framework for short text summarization is therefore proposed to address this gap. Ultimately, this chapter aims to encourage researchers to give further consideration to SI algorithms for solving summarization tasks.


Author(s):  
Luca Cagliero ◽  
Paolo Garza ◽  
Moreno La Quatra

Recent advances in multimedia and web-based applications have eased access to large collections of textual documents. To automate the process of document analysis, the research community has devoted considerable effort to extracting short summaries of document content. However, most of the early summarization methods were tailored to English-language corpora or to collections of documents all written in the same language. More recently, the joint efforts of the machine learning and natural language processing communities have produced more portable and flexible solutions, which can be applied to documents written in different languages. This chapter first gives an overview of the most relevant language-specific summarization algorithms. Then, it presents the most recent advances in multi- and cross-lingual text summarization. The chapter classifies the presented methodologies, highlights their main pros and cons, and discusses the prospects for extending current research towards cross-lingual summarization systems.

