Extractive Based Single Document Text Summarization Using Clustering Approach

Author(s):  
Pankaj Kailas Bhole ◽  
A. J. Agrawal

Text summarization is a long-standing challenge in text mining that still demands researchers' attention in the areas of computational intelligence, machine learning, and natural language processing. Reading a full text every time is time consuming, and a clustering approach helps decide what kind of content a document contains. We extract a set of features from each sentence that helps identify its importance in the document. In this paper we introduce k-means clustering for natural language processing of text for word matching, and adopt data mining document clustering algorithms to extract meaningful information from large sets of offline documents.
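A minimal sketch of the clustering idea described above, assuming scikit-learn (not the authors' exact pipeline): sentences are embedded as TF-IDF vectors, grouped with k-means, and the sentence closest to each centroid is kept as the extractive summary.

```python
# Hedged sketch: extractive summarization via k-means over TF-IDF vectors.
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import pairwise_distances_argmin_min

def cluster_summarize(sentences, k=2):
    """Return one representative sentence per cluster, in document order."""
    vectors = TfidfVectorizer(stop_words="english").fit_transform(sentences)
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(vectors)
    # Index of the sentence nearest to each cluster centroid.
    closest, _ = pairwise_distances_argmin_min(km.cluster_centers_, vectors)
    return [sentences[i] for i in sorted(set(closest))]

doc = [
    "Text summarization condenses a document into its key points.",
    "Clustering groups similar sentences together.",
    "K-means assigns each sentence to the nearest centroid.",
    "Representative sentences form the extractive summary.",
]
print(cluster_summarize(doc, k=2))
```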

Author(s):  
Janjanam Prabhudas ◽  
C. H. Pradeep Reddy

The enormous increase of information, along with the computational abilities of machines, has created innovative applications in natural language processing driven by machine learning models. This chapter surveys the trends of natural language processing that employ machine learning and its models in the context of text summarization. It is organized to help the researcher understand the technical perspectives on feature representation and the models to consider before applying them to language-oriented tasks. Further, the chapter reviews the primary models of deep learning, their applications, and their performance in the context of language processing. Its primary focus is to present the technical research findings and gaps of text summarization based on deep learning, along with state-of-the-art deep learning models for TS.


2018 ◽  
Vol 7 (4.5) ◽  
pp. 728
Author(s):  
Rasmita Rautray ◽  
Lopamudra Swain ◽  
Rasmita Dash ◽  
Rajashree Dash

Text summarization is currently a popular and active field of research in both the Information Retrieval (IR) and Natural Language Processing (NLP) communities. Summarization is important for IR because it offers a means of identifying useful information by efficiently condensing documents drawn from a large corpus. In this study, different text summarization methods are presented, along with the strengths, limitations, and gaps within each.


2020 ◽  
Author(s):  
Michael Prendergast

Abstract – A Verification Cross-Reference Matrix (VCRM) is a table that depicts the verification methods for requirements in a specification. Usually requirement labels are rows, available test methods are columns, and an “X” in a cell indicates usage of a verification method for that requirement. Verification methods include Demonstration, Inspection, Analysis, and Test, and sometimes Certification, Similarity, and/or Analogy. VCRMs enable acquirers and stakeholders to quickly understand how a product’s requirements will be tested. Maintaining consistency of very large VCRMs can be challenging, and inconsistent verification methods can result in a large set of uncoordinated “spaghetti tests”. Natural language processing algorithms that can identify similarities between requirements offer promise in addressing this challenge. This paper applies and compares four natural language processing algorithms for automatically populating VCRMs from natural language requirements: (a) Naïve Bayesian inference, (b) Nearest Neighbor by weighted Dice similarity, (c) Nearest Neighbor with Latent Semantic Analysis similarity, and (d) an ensemble method combining the first three approaches. The VCRMs used for this study cover slot machine technical requirements derived from gaming regulations of Australia and New Zealand, the province of Nova Scotia (Canada), the state of Michigan (United States), and recommendations from the International Association of Gaming Regulators (IAGR).
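A hedged sketch of one of the four compared techniques, nearest-neighbor classification by weighted Dice similarity over requirement tokens. The IDF-style token weighting and the toy requirement data below are assumptions for illustration; the paper's exact weighting scheme and corpus may differ.

```python
# Sketch: assign a verification method to a new requirement by finding the
# most similar labeled requirement under weighted Dice similarity.
import math
from collections import Counter

def idf_weights(corpus_tokens):
    """Compute an IDF-style weight per token across a list of token sets."""
    df = Counter(tok for toks in corpus_tokens for tok in set(toks))
    n = len(corpus_tokens)
    return {tok: math.log(n / df[tok]) + 1.0 for tok in df}

def weighted_dice(a, b, w):
    """Weighted Dice similarity: 2*weight(A∩B) / (weight(A)+weight(B))."""
    inter = sum(w.get(t, 1.0) for t in a & b)
    total = sum(w.get(t, 1.0) for t in a) + sum(w.get(t, 1.0) for t in b)
    return 2.0 * inter / total if total else 0.0

# Hypothetical labeled requirements: token set -> verification method.
labeled = [
    ({"machine", "shall", "display", "payout", "table"}, "Demonstration"),
    ({"rng", "shall", "pass", "statistical", "tests"}, "Test"),
    ({"cabinet", "wiring", "shall", "meet", "standard"}, "Inspection"),
]
w = idf_weights([toks for toks, _ in labeled])
query = {"machine", "shall", "display", "credits"}
method = max(labeled, key=lambda kv: weighted_dice(query, kv[0], w))[1]
print(method)  # verification method of the nearest labeled requirement
```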


2020 ◽  
Vol 4 (1) ◽  
pp. 18-43
Author(s):  
Liuqing Li ◽  
Jack Geissinger ◽  
William A. Ingram ◽  
Edward A. Fox

Abstract – Natural language processing (NLP) covers a large number of topics and tasks related to data and information management, leading to a complex and challenging teaching process. Meanwhile, problem-based learning is a teaching technique specifically designed to motivate students to learn efficiently, work collaboratively, and communicate effectively. With this aim, we developed a problem-based learning course for both undergraduate and graduate students to teach NLP. We provided student teams with big data sets, basic guidelines, cloud computing resources, and other aids to help different teams in summarizing two types of big collections: Web pages related to events, and electronic theses and dissertations (ETDs). Student teams then deployed different libraries, tools, methods, and algorithms to solve the task of big data text summarization. Summarization is an ideal problem to address learning NLP since it involves all levels of linguistics, as well as many of the tools and techniques used by NLP practitioners. The evaluation results showed that all teams generated coherent and readable summaries. Many summaries were of high quality and accurately described their corresponding events or ETD chapters, and the teams produced them along with NLP pipelines in a single semester. Further, both undergraduate and graduate students gave statistically significant positive feedback, relative to other courses in the Department of Computer Science. Accordingly, we encourage educators in the data and information management field to use our approach or similar methods in their teaching and hope that other researchers will also use our data sets and synergistic solutions to approach the new and challenging tasks we addressed.


2019 ◽  
Vol 7 ◽  
pp. 581-596
Author(s):  
Yumo Xu ◽  
Mirella Lapata

In this paper we introduce domain detection as a new natural language processing task. We argue that the ability to detect textual segments that are domain-heavy (i.e., sentences or phrases that are representative of and provide evidence for a given domain) could enhance the robustness and portability of various text classification applications. We propose an encoder-detector framework for domain detection and bootstrap classifiers with multiple instance learning. The model is hierarchically organized and suited to multilabel classification. We demonstrate that despite learning with minimal supervision, our model can be applied to text spans of different granularities, languages, and genres. We also showcase the potential of domain detection for text summarization.
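A generic illustration of the multiple instance learning idea behind this kind of domain detection, not the authors' encoder-detector model: document-level domain labels are explained by aggregating sentence-level scores, so the highest-scoring sentences act as the "domain-heavy" evidence. The embeddings and per-domain weight vectors below are random stand-ins.

```python
# Sketch: multilabel domain detection by max-pooling sentence scores.
import numpy as np

def sentence_scores(sent_embeddings, domain_weights):
    """Score each sentence against each domain via a dot product."""
    return sent_embeddings @ domain_weights.T  # (n_sentences, n_domains)

def document_labels(scores, threshold=0.5):
    """Multilabel document prediction: max-pool sentence scores per domain."""
    doc_scores = scores.max(axis=0)
    return doc_scores > threshold, doc_scores

rng = np.random.default_rng(0)
sents = rng.normal(size=(5, 8))    # toy sentence embeddings
domains = rng.normal(size=(3, 8))  # toy per-domain weight vectors
scores = sentence_scores(sents, domains)
labels, doc_scores = document_labels(scores)
print(labels, scores.argmax(axis=0))  # which sentence is evidence per domain
```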


Text summarization is one of those applications of Natural Language Processing (NLP) that is bound to have a huge impact on our lives. Broadly, text summarization can be divided into two categories, extractive summarization and abstractive summarization; here, a seq2seq model for summarizing textual data is implemented using TensorFlow/Keras and demonstrated on Amazon and social media reviews, issue reports, and news stories. Text summarization is a subdomain of NLP that deals with extracting summaries from huge chunks of text. There are two main types of techniques used for text summarization: NLP-based techniques and deep-learning-based techniques. Accordingly, our aim is to compare the spaCy, Gensim, and NLTK summarization pipelines according to the input requirements. We present a simple NLP-based technique for text summarization that simply uses Python's NLTK library.
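A minimal sketch of the kind of simple NLP-based (non-deep-learning) NLTK summarizer described above: score sentences by normalized content-word frequency and keep the top-scoring ones. It assumes the NLTK tokenizer and stopword data have been downloaded (e.g. nltk.download("punkt"); nltk.download("stopwords")).

```python
# Hedged sketch: frequency-based extractive summarization with NLTK.
from collections import defaultdict
from nltk.corpus import stopwords
from nltk.tokenize import sent_tokenize, word_tokenize

def nltk_summarize(text, n_sentences=2):
    stops = set(stopwords.words("english"))
    # Word frequencies over content words only.
    freq = defaultdict(int)
    for w in word_tokenize(text.lower()):
        if w.isalpha() and w not in stops:
            freq[w] += 1
    max_f = max(freq.values())
    # Score each sentence by its normalized word frequencies.
    scores = {}
    for sent in sent_tokenize(text):
        scores[sent] = sum(freq[w] / max_f
                           for w in word_tokenize(sent.lower()) if w in freq)
    top = sorted(scores, key=scores.get, reverse=True)[:n_sentences]
    # Preserve the original sentence order in the summary.
    return " ".join(s for s in sent_tokenize(text) if s in top)

doc = ("Text summarization condenses long text. Extractive methods pick "
       "important sentences. Deep learning methods generate new sentences. "
       "NLTK supports a simple frequency-based approach.")
print(nltk_summarize(doc, n_sentences=2))
```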

