Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method

Data Streams create new challenges for fuzzy clustering algorithms, specifically Interval Type-2 Fuzzy C-Means (IT2FCM). One problem associated with IT2FCM is that it tends to be sensitive to initialization conditions and therefore, fails to return global optima. This problem has been addressed by optimizing IT2FCM using Ant Colony Optimization approach. However, IT2FCM-ACO obtain clusters for the whole dataset which is not suitable for clustering large streaming datasets that may be coming continuously and evolves with time. Thus, the clusters generated will also evolve with time. Additionally, the incoming data may not be available in memory all at once because of its size. Therefore, to encounter the challenges of a large data stream environment we propose improvising IT2FCM-ACO to generate clusters incrementally. The proposed algorithm produces clusters by determining appropriate cluster centers on a certain percentage of available datasets and then the obtained cluster centroids are combined with new incoming data points to generate another set of cluster centers. The process continues until all the data are scanned. The previous data points are released from memory which reduces time and space complexity. Thus, the proposed incremental method produces data partitions comparable to IT2FCM-ACO. The performance of the proposed method is evaluated on large real-life datasets. The results obtained from several fuzzy cluster validity index measures show the enhanced performance of the proposed method over other clustering algorithms. The proposed algorithm also improves upon the run time and produces excellent speed-ups for all datasets.

Download Full-text

Towards Expert-Inspired Automatic Criterion to Cut a Dendrogram for Real-Industrial Applications

10.3233/faia210140 ◽

2021 ◽

Author(s):

Shikha Suman ◽

Ashutosh Karna ◽

Karina Gibert

Keyword(s):

Hierarchical Clustering ◽

Clustering Algorithms ◽

Computational Cost ◽

Real Life ◽

Ground Truth ◽

Industrial Applications ◽

Underlying Structure ◽

Cluster Validity ◽

Cluster Validity Index ◽

Number Of Clusters

Hierarchical clustering is one of the most preferred choices to understand the underlying structure of a dataset and defining typologies, with multiple applications in real life. Among the existing clustering algorithms, the hierarchical family is one of the most popular, as it permits to understand the inner structure of the dataset and find the number of clusters as an output, unlike popular methods, like k-means. One can adjust the granularity of final clustering to the goals of the analysis themselves. The number of clusters in a hierarchical method relies on the analysis of the resulting dendrogram itself. Experts have criteria to visually inspect the dendrogram and determine the number of clusters. Finding automatic criteria to imitate experts in this task is still an open problem. But, dependence on the expert to cut the tree represents a limitation in real applications like the fields industry 4.0 and additive manufacturing. This paper analyses several cluster validity indexes in the context of determining the suitable number of clusters in hierarchical clustering. A new Cluster Validity Index (CVI) is proposed such that it properly catches the implicit criteria used by experts when analyzing dendrograms. The proposal has been applied on a range of datasets and validated against experts ground-truth overcoming the results obtained by the State of the Art and also significantly reduces the computational cost.

Download Full-text

Interval type-2 fuzzy inference-based failure mode and effect analysis model in a group decision-making setting

Kybernetes ◽

10.1108/k-02-2021-0152 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

İlker Gölcük

Keyword(s):

Failure Modes ◽

Fuzzy Inference ◽

Real Life ◽

Effect Analysis ◽

Inference System ◽

Content Type ◽

Proposed Model ◽

Interval Type

PurposeThis paper proposes an integrated IT2F-FMEA model under a group decision-making setting. In risk assessment models, experts' evaluations are often aggregated beforehand, and necessary computations are performed, which in turn, may cause a loss of information and valuable individual opinions. The proposed integrated IT2F-FMEA model aims to calculate risk priority numbers from the experts' evaluations and then fuse experts' judgments using a novel integrated model.Design/methodology/approachThis paper presents a novel failure mode and effect analysis (FMEA) model by integrating the fuzzy inference system, best-worst method (BWM) and weighted aggregated sum-product assessment (WASPAS) methods under interval type-2 fuzzy (IT2F) environment. The proposed FMEA approach utilizes the Mamdani-type IT2F inference system to calculate risk priority numbers. The individual FMEA results are combined by using integrated IT2F-BWM and IT2F-WASPAS methods.FindingsThe proposed model is implemented in a real-life case study in the furniture industry. According to the case study, fifteen failure modes are considered, and the proposed integrated method is used to prioritize the failure modes.Originality/valueMamdani-type singleton IT2F inference model is employed in the FMEA. Additionally, the proposed model allows experts to construct their membership functions and fuzzy rules to capitalize on the experience and knowledge of the experts. The proposed group FMEA model aggregates experts' judgments by using IT2F-BWM and IT2F-WASPAS methods. The proposed model is implemented in a real-life case study in the furniture company.

Download Full-text

A New Optimization Approach to Clustering Fuzzy Data for Type-2 Fuzzy System Modeling

Cross-Disciplinary Applications of Artificial Intelligence and Pattern Recognition - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-61350-429-1.ch025 ◽

2012 ◽

pp. 499-508

Author(s):

Mohammad Hossein Fazel Zarandi ◽

Milad Avazbeigi

Keyword(s):

Fuzzy Clustering ◽

Fuzzy System ◽

Distance Measure ◽

System Model ◽

Rule Base ◽

Optimization Approach ◽

Fuzzy Data ◽

System Models ◽

Fuzzy System Models

This chapter presents a new optimization method for clustering fuzzy data to generate Type-2 fuzzy system models. For this purpose, first, a new distance measure for calculating the (dis)similarity between fuzzy data is proposed. Then, based on the proposed distance measure, Fuzzy c-Mean (FCM) clustering algorithm is modified. Next, Xie-Beni cluster validity index is modified to be able to valuate Type-2 fuzzy clustering approach. In this index, all operations are fuzzy and the minimization method is fuzzy ranking with Hamming distance. The proposed Type-2 fuzzy clustering method is used for development of indirect approach to Type-2 fuzzy modeling, where the rules are extracted from clustering fuzzy numbers (Zadeh, 1965). Then, the Type-2 fuzzy system is tuned by an inference algorithm for optimization of the main parameters of Type-2 parametric system. In this case, the parameters are: Schweizer and Sklar t-Norm and s-Norm, a-cut of rule-bases, combination of FATI and FITA inference approaches, and Yager parametric defuzzification. Finally, the proposed Type-2 fuzzy system model is applied in prediction of the steel additives in steelmaking process. It is shown that, the proposed Type-2 fuzzy system model is superior in comparison with multiple regressions and Type-1 fuzzy system model, in terms of the minimization the effect of uncertainty in the rule-base fuzzy system models an error reduction.

Download Full-text

Uncertain Fuzzy Clustering: Interval Type-2 Fuzzy Approach to $C$-Means

IEEE Transactions on Fuzzy Systems ◽

10.1109/tfuzz.2006.889763 ◽

2007 ◽

Vol 15 (1) ◽

pp. 107-120 ◽

Cited By ~ 276

Author(s):

Cheul Hwang ◽

Frank Chung-Hoon Rhee

Keyword(s):

Fuzzy Clustering ◽

Fuzzy Approach ◽

Interval Type

Download Full-text

Land cover classification using interval type-2 fuzzy clustering for multi-spectral satellite imagery

2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/icsmc.2012.6378097 ◽

2012 ◽

Cited By ~ 3

Author(s):

Long Thanh Ngo ◽

Dzung Dinh Nguyen

Keyword(s):

Land Cover ◽

Satellite Imagery ◽

Fuzzy Clustering ◽

Land Cover Classification ◽

Interval Type

Download Full-text

Fuzzy clustering and fuzzy c-means partition cluster analysis and validation studies on a subset of citescore dataset

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v9i4.pp2760-2770 ◽

2019 ◽

Vol 9 (4) ◽

pp. 2760

Author(s):

K. Varada Rajkumar ◽

Adimulam Yesubabu ◽

K. Subrahmanyam

Keyword(s):

Cluster Analysis ◽

Fuzzy Clustering ◽

Time Complexity ◽

Clustering Algorithm ◽

Fuzzy Cluster ◽

Fuzzy Cluster Analysis ◽

Fuzzy C Means ◽

A Value ◽

Data Points ◽

Partition Clustering

A hard partition clustering algorithm assigns equally distant points to one of the clusters, where each datum has the probability to appear in simultaneous assignment to further clusters. The fuzzy cluster analysis assigns membership coefficients of data points which are equidistant between two clusters so the information directs have a place toward in excess of one cluster in the meantime. For a subset of CiteScore dataset, fuzzy clustering (fanny) and fuzzy c-means (fcm) algorithms were implemented to study the data points that lie equally distant from each other. Before analysis, clusterability of the dataset was evaluated with Hopkins statistic which resulted in 0.4371, a value < 0.5, indicating that the data is highly clusterable. The optimal clusters were determined using NbClust package, where it is evidenced that 9 various indices proposed 3 cluster solutions as best clusters. Further, appropriate value of fuzziness parameter <em>m</em> was evaluated to determine the distribution of membership values with variation in <em>m</em> from 1 to 2. Coefficient of variation (CV), also known as relative variability was evaluated to study the spread of data. The time complexity of fuzzy clustering (fanny) and fuzzy c-means algorithms were evaluated by keeping data points constant and varying number of clusters.

Download Full-text

GIVING FUZZINESS TO SPATIAL CLUSTERS: A NEW INDEX FOR CHOOSING THE OPTIMAL NUMBER OF CLUSTERS

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213013500097 ◽

2013 ◽

Vol 22 (03) ◽

pp. 1350009 ◽

Cited By ~ 2

Author(s):

GEORGE GREKOUSIS

Keyword(s):

Fuzzy Clustering ◽

Spatial Clustering ◽

Clustering Algorithms ◽

Optimal Number ◽

Fuzzy Cluster ◽

Cluster Validation ◽

Number Of Clusters ◽

A Value ◽

Membership Value ◽

Optimal Number Of Clusters

Choosing the optimal number of clusters is a key issue in cluster analysis. Especially when dealing with more spatial clustering, things tend to be more complicated. Cluster validation helps to determine the appropriate number of clusters present in a dataset. Furthermore, cluster validation evaluates and assesses the results of clustering algorithms. There are numerous methods and techniques for choosing the optimal number of clusters via crisp and fuzzy clustering. In this paper, we introduce a new index for fuzzy clustering to determine the optimal number of clusters. This index is not another metric for calculating compactness or separation among partitions. Instead, the index uses several existing indices to give a degree, or fuzziness, to the optimal number of clusters. In this way, not only do the objects in a fuzzy cluster get a membership value, but the number of clusters to be partitioned is given a value as well. The new index is used in the fuzzy c-means algorithm for the geodemographic segmentation of 285 postal codes.

Download Full-text

Image Segmentation Based on Interval Type 2 Fuzzy Clustering and Differential Immune Clone Algorithm

2020 Chinese Automation Congress (CAC) ◽

10.1109/cac51589.2020.9327883 ◽

2020 ◽

Author(s):

Kun Shu ◽

Di Li ◽

Xiangjian Chen

Keyword(s):

Image Segmentation ◽

Fuzzy Clustering ◽

Interval Type

Download Full-text

The integrated PFMEA approach with interval type-2 fuzzy sets and FBWM: A case study in the automotive industry

Proceedings of the Institution of Mechanical Engineers Part D Journal of Automobile Engineering ◽

10.1177/09544070211034799 ◽

2021 ◽

pp. 095440702110347

Author(s):

Nikola Komatina ◽

Danijela Tadić ◽

Aleksandar Aleksić ◽

Nikola Banduka

Keyword(s):

Risk Factors ◽

Risk Factor ◽

Automotive Industry ◽

Real Life ◽

Relative Importance ◽

Effect Analysis ◽

Stable Conditions ◽

Interval Type

The change of market’s demand could be predictable to a certain degree at stable conditions but it may vary due to disruptive events. This research contributes by establishing the improvement of PFMEA (Process Failure Mode and Effect Analysis) analysis in the domain of assessment and determining severity risk factor, as well as identifying of failure priority. According to the researchers’ and practitioners’ suggestions, severity needs to be considered from the multiple aspects. The risk factor severity is considered from the aspects of product importance, quality, and cost. These aspects have different relative importance, which is determined in an exact way. The relative importance of the aspects, as well as the values of the risk factors, was described by linguistic expressions that are modeled by using the Interval type-2 trapezoidal fuzzy numbers (IT2TrFNs). IT2FBWM was used to determine weight vectors of risk factors. The priority of failures is determined according to the Action priority model which proposed by AIAG & VDA (Automotive Industry Action Group and German Association of the Automotive Industry). The proposed methodology is tested in a Case study where the real-life data originated from a company from the Republic of Serbia that operates as a part of an automotive supply chain.

Download Full-text

A Combined Interval Type-2 Fuzzy MCDM Framework for the Resilient Supplier Selection Problem

Mathematics ◽

10.3390/math10010044 ◽

2021 ◽

Vol 10 (1) ◽

pp. 44

Author(s):

Seyed Amirali Hoseini ◽

Sarfaraz Hashemkhani Zolfani ◽

Paulius Skačkauskas ◽

Alireza Fallahpour ◽

Sara Saberi

Keyword(s):

Distance Measure ◽

Hamming Distance ◽

Real Life ◽

Fuzzy Environment ◽

Proposed Model ◽

Order Preference ◽

Simple Additive Weighting ◽

Interval Type ◽

High Degree

Selecting the most resilient supplier is a crucial problem for organizations and managers in the supply chain. However, due to the inherited high degree of uncertainty in real-life projects, developing a decision-making framework in a crisp or fuzzy environment may not present accurate or reliable results for the managers. For this reason, it is better to evaluate the potential suppliers in an Interval Type-2 Fuzzy (IT2F) environment for better dealing with this ambiguity. This study developed an improved combined IT2F Best Worst Method (BWM) and IT2F technique for Order Preference by Similarity to Ideal Solution (TOPSIS) model “Atieh Sazan” Co. as a case study, such that the IT2FBWM was employed for obtaining the weight of criteria. The IT2FTOPSIS was utilized for ranking the potential suppliers based on Hamming distance measure. In both phases, the opinions of experts as IT2F linguistic terms were employed for weighting the criteria and obtaining the relative importance of the alternatives in terms of the evaluative criteria. After obtaining the final results, the proposed model was validated by replacing Analytical Hierarchy Process (AHP) and Simple Additive Weighting (SAW) approaches separately instead of BWM for weighting the criteria. After executing both new models, it was found that the final ranking was similar to the final ranking of the proposed model, representing the reliability and accuracy of the obtained results. Moreover, it was concluded that the resilient criteria of “Reorganization” and “Redundancy” are the most determinant measures for selecting the best supplier rather than measures in the Iranian Construction Industry.

Download Full-text