A Study on Tools and Techniques of Big Data Analytics for Text Summarization From Multi-Documents
Multi-document summarization extracts and summarizes the information without affecting its original context from the different sources of documents. It has been carried out using extractive text summarization and abstractive text summarization. Extractive summarization extracts summaries from verbatim lines, and abstractive summarization extracts new lines of summary from the source documents. Abstractive summarization is an advanced technology compared to extractive summarization. This research studies extractive summarization of multi documents from internet resources using word frequency counting and with maximum coverage using K-means clustering. In an internet search, the search algorithm shows the results from different websites using crawling and indexing. However, the search and text summary take place from hundreds, thousands, maybe millions of documents. To handle and manipulate these huge amounts of information, big data and its techniques are applied widely. This research also addresses big data techniques and tools that are available for multi-document summarization.