cosine similarity measure
Recently Published Documents


TOTAL DOCUMENTS

59
(FIVE YEARS 24)

H-INDEX

8
(FIVE YEARS 3)

Author(s):  
Shaha Al-Otaibi ◽  
Nourah Altwoijry ◽  
Alanoud Alqahtani ◽  
Latifah Aldheem ◽  
Mohrah Alqhatani ◽  
...  

Social media have become a discussion platform for individuals and groups. Hence, users belonging to different groups can communicate together. Positive and negative messages as well as media are circulated between those users. Users can form special groups with people who they already know in real life or meet through social networking after being suggested by the system. In this article, we propose a framework for recommending communities to users based on their preferences; for example, a community for people who are interested in certain sports, art, hobbies, diseases, age, case, and so on. The framework is based on a feature extraction algorithm that utilizes user profiling and combines the cosine similarity measure with term frequency to recommend groups or communities. Once the data is received from the user, the system tracks their behavior, the relationships are identified, and then the system recommends one or more communities based on their preferences. Finally, experimental studies are conducted using a prototype developed to test the proposed framework, and results show the importance of our framework in recommending people to communities.


2021 ◽  
pp. 1-11
Author(s):  
Alireza Fakharzadeh Jahromi ◽  
Mehdi Hajiloei ◽  
Yeganeh Dehghani ◽  
Sara Lahoninezhad

To overcome curse of dimensionality for outlier detecting in high dimensional dataset, axis-parallel subspace (SOD) and angle-based outlier detection (ABOD) methods were presented. These methods are also friendly used distance-based to detect outliers. In this regard, based on the reality of fuzzy data for explaining the world phenomena, this paper introduces an extended version of both methods for fuzzy dataset. First, the basic concepts of both methods are explained. Next we provide two metrics based on Euclidean and analytic distance to measure distance between fuzzy objects; also Cosine similarity measure formula for calculating the cosine of angle between two difference vectors in high-dimensional fuzzy dataset is illustrated. Then the algorithms to determine outliers of fuzzy datasets by using these metrics and Cosine similarity measure, based on ABOD and SOD algorithms, are presented. Some numerical experimental examples are also presented, in which both real and synthesis datasets are used, For a real numerical examination, we have applied proposed algorithms to data from 15 Iranian petrochemical companies in a fully fuzzy environment. The obtained results show the significant properties of the new methods in detecting outliers.


2021 ◽  
pp. 1-11
Author(s):  
Carolina Martín-del-Campo-Rodríguez ◽  
Grigori Sidorov ◽  
Ildar Batyrshin

This paper presents a computational model for the unsupervised authorship attribution task based on a traditional machine learning scheme. An improvement over the state of the art is achieved by comparing different feature selection methods on the PAN17 author clustering dataset. To achieve this improvement, specific pre-processing and features extraction methods were proposed, such as a method to separate tokens by type to assign them to only one category. Similarly, special characters are used as part of the punctuation marks to improve the result obtained when applying typed character n-grams. The Weighted cosine similarity measure is applied to improve the B 3 F-score by reducing the vector values where attributes are exclusive. This measure is used to define distances between documents, which later are occupied by the clustering algorithm to perform authorship attribution.


2021 ◽  
Vol 2 (5) ◽  
pp. 9-16
Author(s):  
Hans Eric Ramaroson ◽  
René Rakotomanana ◽  
Hery Zo Andriamanohisoa

Cosine similarity measure plays a significant role in various fields. Literature consultation confirms that the theory of cosine similarity measure has received a great interest and attention from the researchers in the world. The concept of Interval Valued Bipolar Neutrosophic Hesitant Fuzzy Sets (IVBNHFS) is recently presented and very interesting. Every element in IVBNHFS is characterized by six independent membership functions (three positive and three negative). There is no investigation on the Cosine Similarity Measure (CSM) of IVBNHFS. In this study, we firstly define a CSM and a weighted CSM between two IVBNHFS and their applications to Multi-Attribute Decision Making (MADM) process in the Interval Valued Bipolar Neutrosophic Hesitant Fuzzy (IVBNHF) setting. And, we establish some properties of CSM and a weighted CSM. We use this strategy to find out the best alternative in MADM case. Then, the new approach to clarify MADM problems in IVBNHF setting is presented in algorithmic form. And, we solve an illustrative case of MADM to demonstrate the effectiveness, workability, and appropriateness of the proposed approach. Finally, the main conclusion and future opportunity of research paper are overviewed and compiled.


2021 ◽  
pp. 1-10
Author(s):  
Yan Gao ◽  
Chenchen Liu ◽  
Liangyu Zhao ◽  
Kun Zhang

The q-rung orthopair fuzzy set is a powerful and useful tool to deal with uncertainty, but in actual decision-making process, decision-makers are usually required to analyze the actual problem dynamically. Therefore in this paper, we consider the time-series q-rung orthopair fuzzy decision making. First, we introduce the new cosine similarity measure of q-ROFS which combines the cosine similarity measure and the Euclidean distance measure. Then, we combine the advantages of projection method and grey correlation degree, establishing the nonlinear programming model to calculate the weights of attributes. Furthermore, we use the exponential decay model to get the weights formulas of q-ROFS at different times. Then we replace the distance function with grey relational projection and extend TOPSIS method. Based on these, we propose a new MAGDM approach to deal with time-series q-rung orthopair fuzzy problem not only from the point of view of geometry but also from the point of view of algebra. Finally, we give a practical example to illustrate effectiveness and feasibility of the new method.


2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Ezgi Türkarslan ◽  
Jun Ye ◽  
Mehmet Ünver ◽  
Murat Olgun

The main purpose of this study is to construct a base for a new fuzzy set concept that is called consistency fuzzy set (CFS) which expresses the multidimensional uncertain data quite successfully. Our motive is to reduce the complexity and difficulty caused by the information contained in the truth sequence in a fuzzy multiset (FMS) and to present the data of the truth sequence in a more understandable and compact manner. Therefore, this paper introduces the concept of CFS that is characterized with a truth function defined on a universal set 0,1 2 . The first component of the truth pair of a CFS is the average value of the truth sequence of a FMS and the second component is the consistency degree, that is, the fuzzy complement of the standard deviation of the truth sequence of the same FMS. The main contribution of a CFS is the reflection of both the level of the average of the data that can be expressed with the different sequence lengths and the degree of the reasonable information in data via consistency degree. To develop this new concept, this paper also presents a correlation coefficient and a cosine similarity measure between CFSs. Furthermore, the proposed correlation coefficient and cosine similarity measure are applied to a multiperiod medical diagnosis problem. Finally, a comparison analysis is given between the obtained results and the existing results in literature to show the efficiency and rationality of the proposed correlation coefficient and cosine similarity measure.


There are huge tons of transactions being accomplished online every day. This implies that ecommerce is facing the problem of data and information overloads. While customers are shopping via websites, they spend a lot of time to search for the required products based on their needs. This problem can easily be alleviated by having an accurate recommendation system based on a strong algorithm and confident measures in this regard. There are two main techniques for products recommendation; content-based filtering and collaborative filtering. If one of these two techniques implemented on the e-commerce system, a lot of limitations and weak points will appear. This paper aims at generating an optimal list of product, which, in turn, generates an accurate and reliable list of items. The new approach is composed of three components; clustering algorithm, user-based collaborative filtering, and the Cosine similarity measure. This approach implemented using a real dataset of past experienced users. The accuracy of the search results is a matter to users, it recommends the most appropriate products to users of the e-commerce website. This approach shows trustworthy results and achieved a high level of accuracy for recommending products to users.


Sign in / Sign up

Export Citation Format

Share Document