scholarly journals Profiling visitors of a national park in Italy through unsupervised classification of mixed data

Author(s):  
Giulia Caruso ◽  
Adelia Evangelista ◽  
Stefano Antonio Gattone

Cluster analysis has for long been an effective tool for analysing data. Thus, several disciplines, such as marketing, psychology and computer sciences, just to mention a few, did take advantage from its contribution over time. Traditionally, this kind of algorithm concentrates only on numerical or categorical data at a time. In this work, instead, we analyse a dataset composed of mixed data, namely both numerical than categorical ones. More precisely, we focus on profiling visitors of the National Park of Majella in the Abruzzo region of Italy, which observations are characterized by variables such as gender, age, profession, expectations and satisfaction rate on park services. Applying a standard clustering procedure would be wholly inappropriate in this case. Therefore, we hereby propose an unsupervised classification of mixed data, a specific procedure capable of processing both numerical than categorical variables simultaneously, releasing truly precious information. In conclusion, our application therefore emphasizes how cluster analysis for mixed data can lead to discover particularly informative patterns, allowing to lay the groundwork for an accurate customers profiling, starting point for a detailed marketing analysis.

Author(s):  
Andreas Rauber ◽  
◽  
Jan Paralic ◽  

Cluster analysis is one of the most prominent methods for the analysis of large, unknown datasets. It provides a particularly suitable tool for obtaining a first overview of data, forming a prominent starting point for further evaluation. . In this paper, we present some lessons learned during the application of two clustering approaches to the analysis of castle admission ticket sales data. A Bayesian unsupervised classification based on AutoClass and an unsupervised neural network, the Self-Organizing Map, are used to obtain a first impression of the available data to form the basis for further exploration. We show that this type of cluster analysis provides a suitable first step in the knowledge discovery process. The different types of result representation and their suitability of providing a first insight into datasets are analyzed and compared.


Hydrology ◽  
2019 ◽  
Vol 6 (2) ◽  
pp. 53 ◽  
Author(s):  
Tedesche ◽  
Trochim ◽  
Fassnacht ◽  
Wolken

Perennial snowfields in Gates of the Arctic National Park and Preserve (GAAR) in the central Brooks Range of Alaska are a critical component of the cryosphere. They serve as habitat for an array of wildlife, including caribou, a species that is crucial as a food and cultural resource for rural subsistence hunters and Native Alaskans. Snowfields also influence hydrology, vegetation, permafrost, and have the potential to preserve valuable archaeological artifacts. By deriving time series maps using cloud computing and supervised classification of Landsat satellite imagery, we calculated areas and evaluated extent changes. We also derived changes in elevations of the perennial snowfields that remained stable for at least four years. For the study period of 1985 to 2017, we found that total areas of perennial snowfields in GAAR are decreasing, with most of the notable changes in the latter half of the study period. Equilibrium areas, or bright areas, of the snowfields are shrinking, while ablation, or dark areas, are growing. We also found that the snowfields occur at higher elevations over time. Climate change may be altering the distribution, elevation, and extent of perennial snowfields in GAAR, which could affect caribou populations and subsistence lifestyles in rural Alaska.


PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e4264 ◽  
Author(s):  
Gerardo Mendizabal-Ruiz ◽  
Israel Román-Godínez ◽  
Sulema Torres-Ramos ◽  
Ricardo A. Salido-Ruiz ◽  
Hugo Vélez-Pérez ◽  
...  

Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers to the unsupervised classification of patterns in data. In this paper, we propose a novel approach for performing cluster analysis of DNA sequences that is based on the use of GSP methods and the K-means algorithm. We also propose a visualization method that facilitates the easy inspection and analysis of the results and possible hidden behaviors. Our results support the feasibility of employing the proposed method to find and easily visualize interesting features of sets of DNA data.


Geografie ◽  
1997 ◽  
Vol 102 (1) ◽  
pp. 17-30
Author(s):  
Jaromír Kolejka ◽  
Jásim K. Shallal

Surface soil data have been processed using the unsupervised classification (cluster analysis). Three soil categories with different erosional characteristics have been detected: heavily, moderately and slightly/no damaged soils. The supervised satellite image classification (MLC) was based on the data taken from case study areas in the proximity of classified soil sample sites on the vegetation free-fields.


Author(s):  
Xiangji Huang

Clustering is the process of grouping a collection of objects (usually represented as points in a multidimensional space) into classes of similar objects. Cluster analysis is a very important tool in data analysis. It is a set of methodologies for automatic classification of a collection of patterns into clusters based on similarity. Intuitively, patterns within the same cluster are more similar to each other than patterns belonging to a different cluster. It is important to understand the difference between clustering (unsupervised classification) and supervised classification.


Author(s):  
Xiangji Huang

Clustering is the process of grouping a collection of objects (usually represented as points in a multidimensional space) into classes of similar objects. Cluster analysis is a very important tool in data analysis. It is a set of methodologies for automatic classification of a collection of patterns into clusters based on similarity. Intuitively, patterns within the same cluster are more similar to each other than patterns belonging to a different cluster. It is important to understand the difference between clustering (unsupervised classification) and supervised classification.


2006 ◽  
Vol 37 (01) ◽  
Author(s):  
W Hermann ◽  
T Villmann ◽  
HJ Kühn ◽  
P Baum ◽  
G Reichel ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document