TEXTUAL-BASED CLUSTERING OF WEB DOCUMENTS

In this study we present an effective method for clustering Web pages. From flat HTML files we extract keywords, form feature vectors that represent the Web pages, and cluster them with the Fuzzy C-Means (FCM) algorithm. We describe an organized, systematic procedure for data collection: Web pages from various categories were retrieved from the ODP (Open Directory Project) to create our datasets. The clustering results show that the method performs well on all datasets. Finally, we present a comprehensive experimental study examining the behavior of the algorithm for different input parameters, the internal structure of the datasets, and classification experiments.
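The keyword-to-feature-vector step described above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the stop-word list, the relative-frequency weighting, and the names `TextExtractor`, `keywords`, and `feature_vectors` are all assumptions made for the example.

```python
from collections import Counter
from html.parser import HTMLParser
import re


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


# Hypothetical stop-word list; the paper's keyword selection is not given here.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "for", "on"}


def keywords(html):
    """Return lowercase alphabetic tokens from the page text, minus stop words."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    words = re.findall(r"[a-z]+", " ".join(parser.parts).lower())
    return [w for w in words if w not in STOP_WORDS]


def feature_vectors(pages):
    """Map each page to relative keyword frequencies over a shared vocabulary."""
    counts = [Counter(keywords(p)) for p in pages]
    vocab = sorted(set().union(*counts))
    vectors = []
    for c in counts:
        total = sum(c.values()) or 1  # avoid division by zero for empty pages
        vectors.append([c[t] / total for t in vocab])
    return vocab, vectors
```

Each row of `vectors` is one page expressed over the same vocabulary, which is the form a clustering algorithm such as FCM expects as input.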

Fuzzy C-Means Clustering Algorithm with Multiple Fuzzification Coefficients

Algorithms ◽

10.3390/a13070158 ◽

2020 ◽

Vol 13 (7) ◽

pp. 158

Author(s):

Tran Dinh Khang ◽

Nguyen Duc Vuong ◽

Manh-Kien Tran ◽

Michael Fowler

Keyword(s):

Fuzzy Clustering ◽

Clustering Algorithm ◽

Clustering Methods ◽

Clustering Method ◽

Machine Learning Technique ◽

Practical Applications ◽

Fuzzy C Means ◽

Fuzzy Clustering Method ◽

Learning Technique ◽

Fuzzy C Means Clustering

Clustering is an unsupervised machine learning technique with many practical applications that has attracted extensive research interest. Alongside deterministic and probabilistic techniques, fuzzy C-means clustering (FCM) is a common clustering technique. Since the advent of the FCM method, many improvements have been made to increase clustering efficiency. These improvements focus on adjusting the membership representation of elements in the clusters, on fuzzification and defuzzification techniques, or on the distance function between elements. This study proposes a novel fuzzy clustering algorithm that uses multiple fuzzification coefficients chosen according to the characteristics of each data sample. The proposed method follows calculation steps similar to those of FCM, with some modifications, and its formulas are derived to ensure convergence. The main contribution of this approach is the use of multiple fuzzification coefficients instead of the single coefficient in the original FCM algorithm. The new algorithm is evaluated experimentally on several common datasets, and the results show that it is more efficient than the original FCM as well as other clustering methods.
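For orientation, the standard single-coefficient FCM that the abstract takes as its baseline can be written as below. The symbols ($N$ samples $x_i$, $C$ centers $c_j$, memberships $u_{ij}$, fuzzification coefficient $m > 1$) are the usual textbook notation and are an assumption here; the per-sample coefficients and the modified update formulas derived in the paper are not given in the abstract and are not reproduced.

```latex
% Objective minimized by standard FCM
J_m = \sum_{i=1}^{N} \sum_{j=1}^{C} u_{ij}^{\,m} \, \lVert x_i - c_j \rVert^{2},
\qquad \text{subject to } \sum_{j=1}^{C} u_{ij} = 1 \ \text{ for every } i .

% Alternating updates
u_{ij} = \frac{1}{\displaystyle \sum_{k=1}^{C}
         \left( \frac{\lVert x_i - c_j \rVert}{\lVert x_i - c_k \rVert} \right)^{\frac{2}{m-1}}},
\qquad
c_j = \frac{\sum_{i=1}^{N} u_{ij}^{\,m} \, x_i}{\sum_{i=1}^{N} u_{ij}^{\,m}} .
```

In this baseline a single $m$ controls how soft every membership is; as $m$ approaches 1 the memberships approach hard assignments, and larger values spread membership more evenly across clusters.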

Technique Based on Fuzzy Logic for Cotton Bale Lay-down Management

Fibres and Textiles in Eastern Europe ◽

10.5604/12303666.1228163 ◽

2017 ◽

Vol 25 (0) ◽

pp. 30-33

Author(s):

Subhasis Das ◽

Anindya Ghosh

Keyword(s):

Fuzzy Logic ◽

Clustering Algorithm ◽

Fibre Content ◽

New Technique ◽

Clustering Method ◽

Fuzzy C Means ◽

Convenient Tool ◽

Fuzzy C Means Clustering ◽

Cotton Bale ◽

A New Technique

This paper proposes a new technique for cotton bale management using fuzzy logic. The fuzzy c-means clustering algorithm is applied to cluster 1200 randomly chosen bales of the J-34 variety into five categories. To cluster the bales, eight fibre properties of each bale are considered: strength, elongation, upper half mean length, length uniformity, short fibre content, micronaire, reflectance and yellowness. Unlike the K-means clustering method, the fuzzy c-means clustering method is able to handle the haziness that may be present in the boundaries between adjacent classes of cotton bales. This method may be used as a convenient tool for the consistent picking of different bale mixes from any number of bales in a warehouse.
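The contrast between fuzzy and hard assignment near a class boundary can be illustrated with a small sketch. It uses the standard FCM membership formula with fixed cluster centers; the two-dimensional toy points, the value m = 2, and the function names are assumptions for illustration and do not come from the paper, which works with eight fibre properties per bale.

```python
import math


def fcm_memberships(sample, centers, m=2.0):
    """Standard FCM membership of one sample in each cluster (fixed centers)."""
    dists = [math.dist(sample, c) for c in centers]
    # A sample that coincides with a center belongs fully to that cluster.
    zeros = [i for i, d in enumerate(dists) if d == 0.0]
    if zeros:
        return [1.0 / len(zeros) if i in zeros else 0.0
                for i in range(len(centers))]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_j / d_k) ** power for d_k in dists) for d_j in dists]


def hard_assignment(sample, centers):
    """K-means style assignment: index of the nearest center."""
    dists = [math.dist(sample, c) for c in centers]
    return dists.index(min(dists))


# Toy centers and a sample lying exactly on the boundary between them.
centers = [(0.0, 0.0), (4.0, 0.0)]
boundary_sample = (2.0, 0.0)
print(fcm_memberships(boundary_sample, centers))  # equal membership in both
print(hard_assignment(boundary_sample, centers))  # forced into one cluster
```

For the boundary sample the fuzzy memberships are split evenly, whereas the hard assignment must commit to a single cluster, which is the kind of boundary haziness the abstract refers to.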

Fuzzy c-means Clustering Algorithm for Brain Tumor Segmentation

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse/v7i6/0198 ◽

2017 ◽

Vol 7 (6) ◽

pp. 668-670

Author(s):

A. Florence ◽

J. G. R Sathiaseelan

Keyword(s):

Brain Tumor ◽

Clustering Algorithm ◽

Tumor Segmentation ◽

Brain Tumor Segmentation ◽

Fuzzy C Means ◽

Fuzzy C Means Clustering

The fall point identification of cluster warhead based on fuzzy C-Means clustering method

Advanced Control, Automation and Robotics ◽

10.2495/acar140451 ◽

2015 ◽

Author(s):

G.L. Wang ◽

S.Q. Dong ◽

X.L. Shen ◽

J.R. Lu

Keyword(s):

Clustering Method ◽

Fuzzy C Means ◽

Fuzzy C Means Clustering

Automatic measurement of traditional Chinese costume from its silhouette through Fuzzy c-means clustering method

Journal of Engineered Fibers and Fabrics ◽

10.1177/1558925020978323 ◽

2020 ◽

Vol 15 ◽

pp. 155892502097832

Author(s):

Jiaqin Zhang ◽

Jingan Wang ◽

Le Xing ◽

Hui’e Liang

Keyword(s):

Industrial Application ◽

Clustering Algorithm ◽

Color Space ◽

Automatic Measurement ◽

Feature Point ◽

Feature Points ◽

Point Location ◽

Fuzzy C Means ◽

Fuzzy C Means Clustering ◽

Environmental Robustness

As a precious part of the cultural heritage of the Chinese nation, traditional costumes are in urgent need of scientific research and protection. Studies of costume silhouettes are particularly scarce because of the need to protect cultural relics and the strong subjectivity of manual measurement, both of which limit the accuracy of quantitative research. This paper presents an automatic measurement method for traditional Chinese costume dimensions based on fuzzy C-means clustering and silhouette feature point location. The method consists of six steps: (1) costume image acquisition; (2) costume image preprocessing; (3) color space transformation; (4) object clustering segmentation; (5) costume silhouette feature point location; and (6) costume measurement. First, the relative total variation model was used to achieve environmental robustness and adaptability to costume color. Second, the FCM clustering algorithm was used to segment the image and extract the outer silhouette of the costume. Finally, automatic measurement of the costume silhouette was achieved by locating its feature points. The experimental results demonstrated that the proposed method could effectively segment the outer silhouette of a costume image and locate the feature points of the silhouette. The measurement accuracy could meet the requirements of industrial application, thus offering value for both costume culture research and industrial application.
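The clustering-based segmentation in step (4) can be sketched in a very reduced form as FCM over pixel intensities. This is only an illustration under stated assumptions: it works on a flat list of gray levels rather than the transformed color space used in the paper, it uses two clusters and m = 2, and the function names `fcm_1d` and `segment` are hypothetical.

```python
def fcm_1d(values, n_clusters=2, m=2.0, iters=50, tol=1e-6):
    """Plain FCM on scalar values; returns (centers, membership rows)."""
    lo, hi = min(values), max(values)
    # Deterministic start: centers spread evenly across the intensity range.
    centers = [lo + (hi - lo) * (j + 0.5) / n_clusters
               for j in range(n_clusters)]
    power = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(iters):
        # Membership update for every value given the current centers.
        memberships = []
        for v in values:
            dists = [abs(v - c) for c in centers]
            if 0.0 in dists:
                hits = dists.count(0.0)
                row = [1.0 / hits if d == 0.0 else 0.0 for d in dists]
            else:
                row = [1.0 / sum((dj / dk) ** power for dk in dists)
                       for dj in dists]
            memberships.append(row)
        # Center update: mean of the values weighted by membership^m.
        new_centers = []
        for j in range(n_clusters):
            weights = [row[j] ** m for row in memberships]
            new_centers.append(
                sum(w * v for w, v in zip(weights, values)) / sum(weights)
            )
        shift = max(abs(a - b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships


def segment(values, **kwargs):
    """Label each value with the cluster in which its membership is highest."""
    centers, memberships = fcm_1d(values, **kwargs)
    labels = [row.index(max(row)) for row in memberships]
    return centers, labels


# Toy gray levels: dark background pixels and bright foreground pixels.
pixels = [10, 12, 11, 200, 205, 198, 9, 202]
centers, labels = segment(pixels)
print(labels)
```

On a real image the labels would be reshaped back to the image grid, and the boundary of the foreground region would give the outer silhouette from which feature points are located.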
