scholarly journals Clustering Uncertain Data Objects using Jeffreys-Divergence and Maximum Bipartite Matching based Similarity Measure

IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Krishna Kumar Sharma ◽  
Ayan Seal ◽  
Anis Yazidi ◽  
Ali Selamat ◽  
Ondrej Krejcar
2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Ezgi Türkarslan ◽  
Jun Ye ◽  
Mehmet Ünver ◽  
Murat Olgun

The main purpose of this study is to construct a base for a new fuzzy set concept that is called consistency fuzzy set (CFS) which expresses the multidimensional uncertain data quite successfully. Our motive is to reduce the complexity and difficulty caused by the information contained in the truth sequence in a fuzzy multiset (FMS) and to present the data of the truth sequence in a more understandable and compact manner. Therefore, this paper introduces the concept of CFS that is characterized with a truth function defined on a universal set 0,1 2 . The first component of the truth pair of a CFS is the average value of the truth sequence of a FMS and the second component is the consistency degree, that is, the fuzzy complement of the standard deviation of the truth sequence of the same FMS. The main contribution of a CFS is the reflection of both the level of the average of the data that can be expressed with the different sequence lengths and the degree of the reasonable information in data via consistency degree. To develop this new concept, this paper also presents a correlation coefficient and a cosine similarity measure between CFSs. Furthermore, the proposed correlation coefficient and cosine similarity measure are applied to a multiperiod medical diagnosis problem. Finally, a comparison analysis is given between the obtained results and the existing results in literature to show the efficiency and rationality of the proposed correlation coefficient and cosine similarity measure.


Author(s):  
I. Voyiatzis ◽  
K. Axiotis ◽  
N. Papaspyrou ◽  
H. Antonopoulou ◽  
C. Efstathiou

2013 ◽  
Vol 278-280 ◽  
pp. 1287-1291 ◽  
Author(s):  
Hong Xin Wan ◽  
Yun Peng

A fuzzy algorithm of customers evaluation based on attributes reduction is presented. The evaluation from the data objects based on key attributes can reduce the data size and algorithm complexity. After Clustering analysis of customers, then the evaluation analysis will process to the clustering data. There are a lot of uncertain data of customer cluster, so the traditional method of classification and evaluation to the incomplete data is very difficult. Superposition evaluation algorithm based on fuzzy set can improve the reliability and accuracy of e-commerce customer evaluation. Evaluation of the e-commerce customer also can improve efficiency, service quality and profitability of e-commerce businesses.


For representing and manipulating uncertain information like fuzzy, incomplete, inconsistent or imprecise, Neutrosophic relation database model is a more general platform, in the human decision-making process. Neutrosophic sets can easily handle real world problems. A new correlation method is introduced in this paper to construct similarity measure, by which decision making problem that exist in real world situation can be easily handled in regard of multiple existing criteria’s or incomplete or inconsistent information. The selection of the best option of alternative can be done by ranking all the other options as per similarity measure depending on concept of similarity. Later in this paper, an explanatory example is given of the proposed method and the comparison results are also presented to show the effective output.. The application in certain domains of medical diagnosis problems having multiple criteria’s in decision making are also discussed in the end of the proposed method.


Author(s):  
Holger Fröhlich

Prediction models for absorption, distribution, metabolic and excretion properties of chemical compounds play a crucial rule in the drug discovery process. Often such models are derived via machine learning techniques. Kernel based learning algorithms, like the well known support vector machine (SVM) have gained a growing interest during the last years for this purpose. One of the key concepts of SVMs is a kernel function, which can be thought of as a special similarity measure. In this Chapter the author describes optimal assignment kernels for multi-labeled molecular graphs. The optimal assignment kernel is based on the idea of a maximal weighted bipartite matching of the atoms of a pair of molecules. At the same time the physico-chemical properties of each single atom are considered as well as the neighborhood in the molecular graph. Later on our similarity measure is extended to deal with reduced graph representations, in which certain structural elements, like rings, donors or acceptors, are condensed in one single node of the graph. Comparisons of the optimal assignment kernel with other graph kernels as well as with classical descriptor based models show a significant improvement in prediction accuracy.


2014 ◽  
Vol 989-994 ◽  
pp. 1775-1778
Author(s):  
Hong Xin Wan ◽  
Yun Peng

The evaluation algorithm is based on the attributes of data objects. There is a certain correlation between attributes, and attributes are divided into key attributes and secondary attributes. This paper proposes an algorithm of attribute reduction based on rough set and the clustering algorithm based on fuzzy set. The algorithm of attributes reduction based on rough set is described in detail first. There are a lot of uncertain data of customer clustering, so traditional method of classification to the incomplete data will be very complex. Clustering algorithm based on fuzzy set can improve the reliability and accuracy of web customers.


Sign in / Sign up

Export Citation Format

Share Document