scholarly journals Visual Data Mining from Visualization to Visual Information Mining

Author(s):  
Herna L. Viktor ◽  
Eric Paquet

The current explosion of data and information, which are mainly caused by the continuous adoption of data warehouses and the extensive use of the Internet and its related technologies, has increased the urgent need for the development of techniques for intelligent data analysis. Data mining, which concerns the discovery and extraction of knowledge chunks from large data repositories, addresses this need. Data mining automates the discovery of hidden patterns and relationships that may not always be obvious. Data mining tools include classification techniques (such as decision trees, rule induction programs and neural networks) (Kou et al., 2007); clustering algorithms and association rule approaches, amongst others. Data mining has been fruitfully used in many of domains, including marketing, medicine, finance, engineering and bioinformatics. There still are, however, a number of factors that militate against the widespread adoption and use of this new technology. This is mainly due to the fact that the results of many data mining techniques are often difficult to understand. For example, the results of a data mining effort producing 300 pages of rules will be difficult to analyze. The visual representation of the knowledge embedded in such rules will help to heighten the comprehensibility of the results. The visualization of the data itself, as well as the data mining process should go a long way towards increasing the user’s understanding of and faith in the data mining process. That is, data and information visualization provide users with the ability to obtain new insights into the knowledge, as discovered from large repositories. This paper describes a number of important visual data mining issues and introduces techniques employed to improve the understandability of the results of data mining. Firstly, the visualization of data prior to, and during, data mining is addressed. Through data visualization, the quality of the data can be assessed throughout the knowledge discovery process, which includes data preprocessing, data mining and reporting. We also discuss information visualization, i.e. how the knowledge, as discovered by a data mining tool, may be visualized throughout the data mining process. This aspect includes visualization of the results of data mining as well as the learning process. In addition, the paper shows how virtual reality and collaborative virtual environments may be used to obtain an immersive perspective of the data and the data mining process as well as how visual data mining can be used to directly mine functionality with specific applications in the emerging field of proteomics.

2008 ◽  
pp. 1623-1630
Author(s):  
Herna L. Viktor ◽  
Eric Paquet

The current explosion of data and information, mainly caused by data warehousing technologies as well as the extensive use of the Internet and its related technologies, has increased the urgent need for the development of techniques for intelligent data analysis. Data mining, which concerns the discovery and extraction of knowledge chunks from large data repositories, is aimed at addressing this need. Data mining automates the discovery of hidden patterns and relationships that may not always be obvious. Data mining tools include classification techniques (such as decision trees, rule induction programs and neural networks) (Han & Kamber, 2001), clustering algorithms and association rule approaches, amongst others.


Author(s):  
Herna L. Viktor ◽  
Eric Paquet

The current explosion of data and information, mainly caused by data warehousing technologies as well as the extensive use of the Internet and its related technologies, has increased the urgent need for the development of techniques for intelligent data analysis. Data mining, which concerns the discovery and extraction of knowledge chunks from large data repositories, is aimed at addressing this need. Data mining automates the discovery of hidden patterns and relationships that may not always be obvious. Data mining tools include classification techniques (such as decision trees, rule induction programs and neural networks) (Han & Kamber, 2001), clustering algorithms and association rule approaches, amongst others.


Geophysics ◽  
2009 ◽  
Vol 74 (1) ◽  
pp. P1-P11 ◽  
Author(s):  
Iván Dimitri Marroquín ◽  
Jean-Jules Brault ◽  
Bruce S. Hart

Seismic facies analysis aims to identify clusters (groups) of similar seismic trace shapes, where each cluster can be considered to represent variability in lithology, rock properties, and/or fluid content of the strata being imaged. Unfortunately, it is not always clear whether the seismic data has a natural clustering structure. Cluster analysis consists of a family of approaches that have significant potential for classifying seismic trace shapes into meaningful clusters. The clustering can be performed using a supervised process (assigning a pattern to a predefined cluster) or an unsupervised process (partitioning a collection of patterns into groups without predefined clusters). We evaluate and compare different unsupervised clustering algorithms (e.g., partition, hierarchical, probabilistic, and soft competitive models) for pattern recognition based entirely on the characteristics of the seismic response. From validation results on simple data sets, we demonstrate that a self-organizing maps algorithm implemented in a visual data-mining approach outperforms all other clustering algorithms for interpreting the cluster structure. We apply this approach to 2D seismic models generated using a discrete, known number of different stratigraphic geometries. The visual strategy recovers the correct number of end-member seismic facies in the model tests, showing that it is suitable for pattern recognition in highly correlated and continuous seismic data.


Constant streaming of data for any instances at such high volumes provides insight in various organizations. Analyzing and identifying the pattern from the huge volumes of data has become difficult with its raw form of data. Visualization of information and visual data mining helps to deal with the flood of information. Constant streaming of data for any instances at such high volumes provides insight in various organizations. Analyzing and identifying the pattern from the huge volumes of data has become difficult with its raw form of data. Visualization of information and visual data mining helps to deal with the flood of information. Visual data representation takes the data and its results to all the stakeholders in a meaningful manner which comes out of the data mining process. Recent developments have brought a large number of information visualization techniques to explore the large data sets which can be converted into useful information and knowledge. Observations and inspection data gathered from chemical and gas industries are being piled up on a daily basis as raw data. Continuous analysis is a new term evolving in the industry which continuously performs on the streaming data to have real-time analysis and prediction on-live. In this paper, usage of the various graphing model as per the respective information obtained from the organization have been discussed and justified. It also describes the value addition in making the decisions by representations through graphs and charts for better understanding. Heatmap, Scattergram and customized Radar plots the analyzed data as in the required format to visualize the prediction done for the occupational incidents in chemical and gas industries. As a result of the graphing model, representation provides a higher level of confidence in the findings of the analysis. This fact takes a better visual representation technique and transforms them to provide better results with faster processing and understanding.


Sign in / Sign up

Export Citation Format

Share Document