Iterative visual clustering for unstructured text mining

Author(s):  
Qian You ◽  
Shiaofen Fang ◽  
Patricia Ebright
Author(s):  
Byung-Kwon Park ◽  
Il-Yeol Song

As the amount of data grows very fast inside and outside of an enterprise, it is getting important to seamlessly analyze both data types for total business intelligence. The data can be classified into two categories: structured and unstructured. For getting total business intelligence, it is important to seamlessly analyze both of them. Especially, as most of business data are unstructured text documents, including the Web pages in Internet, we need a Text OLAP solution to perform multidimensional analysis of text documents in the same way as structured relational data. We first survey the representative works selected for demonstrating how the technologies of text mining and information retrieval can be applied for multidimensional analysis of text documents, because they are major technologies handling text data. And then, we survey the representative works selected for demonstrating how we can associate and consolidate both unstructured text documents and structured relation data for obtaining total business intelligence. Finally, we present a future business intelligence platform architecture as well as related research topics. We expect the proposed total heterogeneous business intelligence architecture, which integrates information retrieval, text mining, and information extraction technologies all together, including relational OLAP technologies, would make a better platform toward total business intelligence.


Information ◽  
2019 ◽  
Vol 10 (9) ◽  
pp. 279
Author(s):  
Pablo Gamallo ◽  
Marcos Garcia

Natural language processing (NLP) and Text Mining (TM) are a set of overlapping strategies working on unstructured text [...]


Author(s):  
Micah J. Crowsey ◽  
Amanda R. Ram ◽  
David H. Gutierrez ◽  
Gregory W. Paladino ◽  
K. P. White

Data plays an important role in success of any organization, so organizations required more data to make decision for their planning to improvement. The data that are generating for any organization, in which 80 to 90 percent data belongs to unstructured data type.Text mining is the process that indicate retrieve appealing and unknown information from unstructured text. Social network sites also generate huge amounts of data,with the help of these data people’s behavior and thought easily determine but analysis of these data is a difficult task. This paper proposed an efficient approach for text mining using machine learning.


Author(s):  
Dan Sullivan

As the demand for more effective Business Intelligence (BI) techniques increases, BI practitioners find they must expand the scope of their data to include unstructured text. To exploit those information resources, techniques such as text mining are essential. This chapter describes three fundamental techniques for text mining in business intelligence: term extraction, information extraction, and link analysis. Term extraction, the most basic technique, identifies key terms and logical entities, such as the names of organizations, locations, dates, and monetary amounts. Information extraction builds on terms extracted from text to identify basic relationships, such as the roles of different companies in a merger or the promotion of a chemical reaction by an enzyme. Link analysis combines multiple relationships to form multistep models of complex processes such as metabolic pathways. The discussion of each technique includes an outline of the basic steps involved, characteristics of appropriate applications, and an overview of its limitations.


2017 ◽  
pp. 223-261
Author(s):  
Chaomei Chen ◽  
Min Song

Sign in / Sign up

Export Citation Format

Share Document