Exemplars can Reciprocate Principal Components

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Yaping Li

Keyword(s):

Optimization Algorithm ◽

Categorical Data ◽

Clustering Algorithm ◽

Feature Vector ◽

Clustering Methods ◽

Clustering Method ◽

Swarm Optimization ◽

Tree Structures ◽

High Data ◽

Glowworm Swarm Optimization

The main objective of this paper is to present a new clustering algorithm for metadata trees based on K-prototypes algorithm, GSO (glowworm swarm optimization) algorithm, and maximal frequent path (MFP). Metadata tree clustering includes computing the feature vector of the metadata tree and the feature vector clustering. Therefore, traditional data clustering methods are not suitable directly for metadata trees. As the main method to calculate eigenvectors, the MFP method also faces the difficulties of high computational complexity and loss of key information. Generally, the K-prototypes algorithm is suitable for clustering of mixed-attribute data such as feature vectors, but the K-prototypes algorithm is sensitive to the initial clustering center. Compared with other swarm intelligence algorithms, the GSO algorithm has more efficient global search advantages, which are suitable for solving multimodal problems and also useful to optimize the K-prototypes algorithm. To address the clustering of metadata tree structures in terms of clustering accuracy and high data dimension, this paper combines the GSO algorithm, K-prototypes algorithm, and MFP together to study and design a new metadata structure clustering method. Firstly, MFP is used to describe metadata tree features, and the key parameter of categorical data is introduced into the feature vector of MFP to improve the accuracy of the feature vector to describe the metadata tree; secondly, GSO is combined with K-prototypes to design GSOKP for clustering the feature vector that contains numeric data and categorical data so as to improve the clustering accuracy; finally, tests are conducted with a set of metadata trees. The experimental results show that the designed metadata tree clustering method GSOKP-FP has certain advantages in respect to clustering accuracy and time complexity.

Download Full-text

A clustering algorithm for ipsative variables

DYNA ◽

10.15446/dyna.v86n211.77835 ◽

2019 ◽

Vol 86 (211) ◽

pp. 94-101

Author(s):

Jesica Rubiano Moreno ◽

Carlos Alonso Malaver ◽

Samuel Nucamendi Guillén ◽

Carlos López Hernández

Keyword(s):

Clustering Algorithm ◽

Data Distribution ◽

Extensive Study ◽

Clustering Method ◽

Motivational Profiles ◽

Random Groups

The aim of this study is to introduce a new clustering method for ipsatives variables. This method can be used for nominals or ordinals variables for which responses must be mutually exclusive, and it is independent of data distribution. The proposed method is applied to outline motivational profiles for individuals based on a declared preferences set. A case study is used to analyze the performance of the proposed algorithm by comparing proposed method results versus the PAM method. Results show that proposed method generate a better segmentation and differentiated groups. An extensive study was conducted to validate the performance clustering method against a set of random groups by clustering measures.

Download Full-text

Kernal Based Semi-Supervised Clustering and its Application in Leave Recognition of Bauhinia Blakeana Leaves

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.756-759.3849 ◽

2013 ◽

Vol 756-759 ◽

pp. 3849-3854

Author(s):

Xi Yang Yang ◽

Fu Sheng Yu

Keyword(s):

Fuzzy Clustering ◽

Principal Components ◽

Leaf Spot ◽

Clustering Algorithm ◽

Projection Algorithm ◽

Leaf Spot Disease ◽

Clustering Method ◽

Supervised Clustering ◽

Spot Disease ◽

Fuzzy Clustering Algorithm

A novel kernel based semi-supervised fuzzy clustering algorithm is proposed, and its iterative formula is given. This new algorithm can effectively improve the efficiency of the clustering algorithm. Combined with Fisher projection algorithm, two principal components are extracted from 7 hue statistics and 11 green value statistics, this new semi-supervised clustering method is applied to recognize the angular leaf spot disease of Bauhinia blakeana. The results showed that the consistent rate is 100% for the labeled leaves, and above 95% for other unlabeled leaves.

Download Full-text

Application of K-Means Clustering Algorithm for Determination of Fire-Prone Areas Utilizing Hotspots in West Kalimantan Province

International Journal of Advances in Data and Information Systems ◽

10.25008/ijadis.v1i1.7 ◽

2020 ◽

Vol 1 (1) ◽

pp. 9-16

Author(s):

Nabila Amalia Khairani ◽

Edi Sutoyo

Keyword(s):

Data Mining ◽

Forest Fires ◽

Clustering Algorithm ◽

Social Aspects ◽

Mining Method ◽

Clustering Method ◽

West Kalimantan ◽

A Value ◽

The Impact

Forest and land fires are disasters that often occur in Indonesia. In 2007, 2012 and 2015 forest fires that occurred in Sumatra and Kalimantan attracted global attention because they brought smog pollution to neighboring countries. One of the regions that has the highest fire hotspots is West Kalimantan Province. Forest and land fires have an impact on health, especially on the communities around the scene, as well as on the economic and social aspects. This must be overcome, one of them is by knowing the location of the area of ??fire and can analyze the causes of forest and land fires. With the impact caused by forest and land fires, the purpose of this study is to apply the clustering method using the k-means algorithm to be able to determine the hotspot prone areas in West Kalimantan Province. And evaluate the results of the cluster that has been obtained from the clustering method using the k-means algorithm. Data mining is a suitable method to be able to find out information on hotspot areas. The data mining method used is clustering because this method can process hotspot data into information that can inform areas prone to hotspots. This clustering uses k-means algorithm which is grouping data based on similar characteristics. The hotspots data obtained are grouped into 3 clusters with the results obtained for cluster 0 as many as 284 hotspots including hazardous areas, 215 hotspots including non-prone areas and 129 points that belong to very vulnerable areas. Then the clustering results were evaluated using the Davies-Bouldin Index (DBI) method with a value of 3.112 which indicates that the clustering results of 3 clusters were not optimal.

Download Full-text

Comparison of Clustering K-Means, Fuzzy C-Means, and Linkage for Nasa Active Fire Dataset

International Journal of Artificial Intelligence & Robotics (IJAIR) ◽

10.25139/ijair.v2i2.3030 ◽

2020 ◽

Vol 2 (2) ◽

pp. 34

Author(s):

Muchamad Kurniawan ◽

Rani Rotul Muhima ◽

Siti Agustini

Keyword(s):

Forest Fires ◽

Clustering Algorithm ◽

Hot Spot ◽

Clustering Methods ◽

Clustering Method ◽

Simple Method ◽

Fuzzy C Means ◽

Total Distance ◽

Active Fire ◽

Average Linkage

One of the causes of forest fires is the lack of speed of handling when a fire occurs. This can be anticipated by determining how many extinguishing units are in the center of the hot spot. To get hotspots, NASA has provided an active fire dataset. The clustering method is used to get the most optimal centroid point. The clustering methods we use are K-Means, Fuzzy C-Means (FCM), and Average Linkage. The reason for using K-means is a simple method and has been applied in various areas. FCM is a partition-based clustering algorithm which is a development of the K-means method. The hierarchical based clustering method is represented by the Average Linkage method. The measurement technique that uses is the sum of the internal distance of each cluster. Elbow evaluation is used to evaluate the optimal cluster. The results obtained after conducting the K-Means trial obtained the best results with a total distance of 145.35 km, and the best clusters from this method were 4 clusters. Meanwhile, the total distance values obtained from the FCM and Linkage methods were 154.13 km and 266.61 km.

Download Full-text

Application of K-Means Clustering Algorithm for Determination of Fire-Prone Areas Utilizing Hotspots in West Kalimantan Province

International Journal of Advances in Data and Information Systems ◽

10.25008/ijadis.v1i1.13 ◽

2020 ◽

Vol 1 (1) ◽

pp. 9-16 ◽

Cited By ~ 1

Author(s):

Nabila Amalia Khairani ◽

Edi Sutoyo

Keyword(s):

Data Mining ◽

Forest Fires ◽

Clustering Algorithm ◽

Social Aspects ◽

Mining Method ◽

Clustering Method ◽

West Kalimantan ◽

A Value ◽

The Impact

Forest and land fires are disasters that often occur in Indonesia. In 2007, 2012 and 2015 forest fires that occurred in Sumatra and Kalimantan attracted global attention because they brought smog pollution to neighboring countries. One of the regions that has the highest fire hotspots is West Kalimantan Province. Forest and land fires have an impact on health, especially on the communities around the scene, as well as on the economic and social aspects. This must be overcome, one of them is by knowing the location of the area of ??fire and can analyze the causes of forest and land fires. With the impact caused by forest and land fires, the purpose of this study is to apply the clustering method using the k-means algorithm to be able to determine the hotspot prone areas in West Kalimantan Province. And evaluate the results of the cluster that has been obtained from the clustering method using the k-means algorithm. Data mining is a suitable method to be able to find out information on hotspot areas. The data mining method used is clustering because this method can process hotspot data into information that can inform areas prone to hotspots. This clustering uses k-means algorithm which is grouping data based on similar characteristics. The hotspots data obtained are grouped into 3 clusters with the results obtained for cluster 0 as many as 284 hotspots including hazardous areas, 215 hotspots including non-prone areas and 129 points that belong to very vulnerable areas. Then the clustering results were evaluated using the Davies-Bouldin Index (DBI) method with a value of 3.112 which indicates that the clustering results of 3 clusters were not optimal.

Download Full-text

Teknik Data Mining Dalam Clustering Produksi Susu Segar Di Indonesia Dengan Algoritma K-Means

BRAHMANA: Jurnal Penerapan Kecerdasan Buatan ◽

10.30645/brahmana.v1i1.5 ◽

2019 ◽

Vol 1 (1) ◽

pp. 31-39

Author(s):

Ilham Safitra Damanik ◽

Sundari Retno Andani ◽

Dedi Sehendro

Keyword(s):

Data Mining ◽

Milk Production ◽

Clustering Algorithm ◽

Clustering Method ◽

Data Mining Techniques ◽

Low Level ◽

Fresh Milk ◽

Nutritional Needs ◽

High Level ◽

Level Cluster

Milk is an important intake to meet nutritional needs. Both consumed by children, and adults. Indonesia has many producers of fresh milk, but it is not sufficient for national milk needs. Data mining is a science in the field of computers that is widely used in research. one of the data mining techniques is Clustering. Clustering is a method by grouping data. The Clustering method will be more optimal if you use a lot of data. Data to be used are provincial data in Indonesia from 2000 to 2017 obtained from the Central Statistics Agency. The results of this study are in Clusters based on 2 milk-producing groups, namely high-dairy producers and low-milk producing regions. From 27 data on fresh milk production in Indonesia, two high-level provinces can be obtained, namely: West Java and East Java. And 25 others were added in 7 provinces which did not follow the calculation of the K-Means Clustering Algorithm, including in the low level cluster.

Download Full-text

A Virtual Laboratory to Practice Mobile Wireless Sensor Networks: A Case Study on Energy Efficient and Safe Weighted Clustering Algorithm

Journal of Information Processing Systems ◽

10.3745/jips.02.0019 ◽

2015 ◽

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Energy Efficient ◽

Clustering Algorithm ◽

Wireless Sensor ◽

Virtual Laboratory ◽

Mobile Wireless ◽

Weighted Clustering ◽

Mobile Wireless Sensor

Download Full-text

Transfer of knowledge in chemical equipment reliability

Collection of Czechoslovak Chemical Communications ◽

10.1135/cccc19892692 ◽

1989 ◽

Vol 54 (10) ◽

pp. 2692-2710 ◽

Cited By ~ 3

Author(s):

František Babinec ◽

Mirko Dohnal

Keyword(s):

Fuzzy Clustering ◽

Clustering Algorithm ◽

Chemical Equipment ◽

Transfer Of Knowledge ◽

Equipment Reliability ◽

Fuzzy Clustering Algorithm

The problem of transformation of data on the reliability of chemical equipment obtained in particular conditions to other equipment in other conditions is treated. A fuzzy clustering algorithm is defined for this problem. The method is illustrated on a case study.

Download Full-text

Spatial Analysis of the Drivers, Characteristics, and Effects of Forest Fragmentation

Sustainability ◽

10.3390/su13063246 ◽

2021 ◽

Vol 13 (6) ◽

pp. 3246

Author(s):

Zoe Slattery ◽

Richard Fenner

Keyword(s):

Forest Fragmentation ◽

Forest Fires ◽

Geographical Information ◽

Agricultural Expansion ◽

Forest Patches ◽

Mato Grosso ◽

Remote Imaging ◽

Landscape Characteristics ◽

High Level

Building on the existing literature, this study examines whether specific drivers of forest fragmentation cause particular fragmentation characteristics, and how these characteristics can be linked to their effects on forest-dwelling species. This research uses Landsat remote imaging to examine the changing patterns of forests. It focuses on areas which have undergone a high level of a specific fragmentation driver, in particular either agricultural expansion or commodity-driven deforestation. Seven municipalities in the states of Rondônia and Mato Grosso in Brazil are selected as case study areas, as these states experienced a high level of commodity-driven deforestation and agricultural expansion respectively. Land cover maps of each municipality are created using the Geographical Information System software ArcGIS Spatial Analyst extension. The resulting categorical maps are input into Fragstats fragmentation software to calculate quantifiable fragmentation metrics for each municipality. To determine the effects that these characteristics are likely to cause, this study uses a literature review to determine how species traits affect their responses to forest fragmentation. Results indicate that, in areas that underwent agricultural expansion, the remaining forest patches became more complex in shape with longer edges and lost a large amount of core area. This negatively affects species which are either highly dispersive or specialist to core forest habitat. In areas that underwent commodity-driven deforestation, it was more likely that forest patches would become less aggregated and create disjunct core areas. This negatively affects smaller, sedentary animals which do not naturally travel long distances. This study is significant in that it links individual fragmentation drivers to their landscape characteristics, and in turn uses these to predict effects on species with particular traits. This information will prove useful for forest managers, particularly in the case study municipalities examined in this study, in deciding which species require further protection measures. The methodology could be applied to other drivers of forest fragmentation such as forest fires.

Download Full-text

Exemplars can Reciprocate Principal Components

Glowworm Swarm Optimization Algorithm- and K-Prototypes Algorithm-Based Metadata Tree Clustering

A clustering algorithm for ipsative variables

Kernal Based Semi-Supervised Clustering and its Application in Leave Recognition of Bauhinia Blakeana Leaves

Application of K-Means Clustering Algorithm for Determination of Fire-Prone Areas Utilizing Hotspots in West Kalimantan Province

Comparison of Clustering K-Means, Fuzzy C-Means, and Linkage for Nasa Active Fire Dataset

Application of K-Means Clustering Algorithm for Determination of Fire-Prone Areas Utilizing Hotspots in West Kalimantan Province

Teknik Data Mining Dalam Clustering Produksi Susu Segar Di Indonesia Dengan Algoritma K-Means

A Virtual Laboratory to Practice Mobile Wireless Sensor Networks: A Case Study on Energy Efficient and Safe Weighted Clustering Algorithm

Transfer of knowledge in chemical equipment reliability

Spatial Analysis of the Drivers, Characteristics, and Effects of Forest Fragmentation

Export Citation Format