Knowledge Discovery in Text Mining Technique Using Association Rules Extraction

Author(s):  
Vaishali Bhujade ◽  
N.J. Janwe
2018 ◽  
Vol 22 (7) ◽  
pp. 1471-1488 ◽  
Author(s):  
Antonio Usai ◽  
Marco Pironti ◽  
Monika Mital ◽  
Chiraz Aouina Mejri

Purpose The aim of this work is to increase awareness of the potential of the technique of text mining to discover knowledge and further promote research collaboration between knowledge management and the information technology communities. Since its emergence, text mining has involved multidisciplinary studies, focused primarily on database technology, Web-based collaborative writing, text analysis, machine learning and knowledge discovery. However, owing to the large amount of research in this field, it is becoming increasingly difficult to identify existing studies and therefore suggest new topics. Design/methodology/approach This article offers a systematic review of 85 academic outputs (articles and books) focused on knowledge discovery derived from the text mining technique. The systematic review is conducted by applying “text mining at the term level, in which knowledge discovery takes place on a more focused collection of words and phrases that are extracted from and label each document” (Feldman et al., 1998, p. 1). Findings The results revealed that the keywords extracted to be associated with the main labels, id est, knowledge discovery and text mining, can be categorized in two periods: from 1998 to 2009, the term knowledge and text were always used. From 2010 to 2017 in addition to these terms, sentiment analysis, review manipulation, microblogging data and knowledgeable users were the other terms frequently used. Besides this, it is possible to notice the technical, engineering nature of each term present in the first decade. Whereas, a diverse range of fields such as business, marketing and finance emerged from 2010 to 2017 owing to a greater interest in the online environment. Originality/value This is a first comprehensive systematic review on knowledge discovery and text mining through the use of a text mining technique at term level, which offers to reduce redundant research and to avoid the possibility of missing relevant publications.


Author(s):  
Yi-fang Brook Wu ◽  
Xin Chen

This chapter presents a methodology for personalized knowledge discovery from text. Traditionally, problems with text mining are numerous rules derived and many already known to the user. Our proposed algorithm derives user’s background knowledge from a set of documents provided by the user, and exploits such knowledge in the process of knowledge discovery from text. Keywords are extracted from background documents and clustered into a concept hierarchy that captures the semantic usage of keywords and their relationships in the background documents. Target documents are retrieved by selecting documents that are relevant to the user’s background. Association rules are discovered among noun phrases extracted from target documents. Novelty of an association rule is defined as the semantic distance between the antecedent and the consequent of a rule in the background knowledge. The experiment shows that our novelty measure performs better than support and confidence in identifying novel knowledge.


2016 ◽  
pp. 713-732
Author(s):  
Asmae Dami ◽  
Mohamed Fakir ◽  
Belaid Bouikhalene

This chapter is located in the intersection of two research themes, namely: Information Retrieval and Knowledge Discovery from texts (Text mining). The purpose of this paper is two-fold: first, it focuses on Information Retrieval (IR) whose purpose is to implement a set of models and systems for selecting a set of documents satisfying user needs in terms of information expressed as a query. An information retrieval system is composed mainly of two processes the representation and retrieval process. The process of representation is called indexing, which allows representation of documents and queries by descriptors, or indexes. These descriptors reflect the contents of documents. The retrieval process consists on the comparison between documents representations and query representation. The second aim of this paper is to discover the relationships between terms (keywords) descriptors of documents in a document database. The correlations (relationships) between terms are extracted by using a technique of the Text mining, mainly association rules.


2014 ◽  
Vol 7 (4) ◽  
pp. 42-62
Author(s):  
Asmae Dami ◽  
Mohamed Fakir ◽  
Belaid Bouikhalene

This paper is located in the intersection of two research themes, namely: Information Retrieval and Knowledge Discovery from texts (Text mining). The purpose of this paper is two-fold: first, it focuses on Information Retrieval (IR) whose purpose is to implement a set of models and systems for selecting a set of documents satisfying user needs in terms of information expressed as a query. An information retrieval system is composed mainly of two processes the representation and retrieval process. The process of representation is called indexing, which allows representation of documents and queries by descriptors, or indexes. These descriptors reflect the contents of documents. The retrieval process consists on the comparison between documents representations and query representation. The second aim of this paper is to discover the relationships between terms (keywords) descriptors of documents in a document database. The correlations (relationships) between terms are extracted by using a technique of the Text mining, mainly association rules.


2013 ◽  
Vol 310 ◽  
pp. 567-571
Author(s):  
Arun Thotapalli Sundararaman

Visualization is an important technique for analysis of knowledge derived from text mining. While different approaches exist for visualization, this paper presents a novel way of visualizing the strength of association between multiple terms that summarizes association in the form of a matrix. This approach is expected to improve the way decision makers analyze insights from text mining.


PLoS ONE ◽  
2012 ◽  
Vol 7 (4) ◽  
pp. e33427 ◽  
Author(s):  
Anna Korhonen ◽  
Diarmuid Ó Séaghdha ◽  
Ilona Silins ◽  
Lin Sun ◽  
Johan Högberg ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document