A Comparative Analysis of Euclidean, Jaccard and Cosine Similarity Measure and Arabic Wordnet for Automated Arabic Essay Scoring

Author(s):  
Mohammad Alobed ◽  
Abdallah M M Altrad ◽  
Zainab Binti Abu Bakar
2019 ◽  
Vol 71 (1) ◽  
pp. 18-37 ◽  
Author(s):  
Güleda Doğan ◽  
Umut Al

Purpose The purpose of this paper is to analyze the similarity of intra-indicators used in research-focused international university rankings (Academic Ranking of World Universities (ARWU), NTU, University Ranking by Academic Performance (URAP), Quacquarelli Symonds (QS) and Round University Ranking (RUR)) over years, and show the effect of similar indicators on overall rankings for 2015. The research questions addressed in this study in accordance with these purposes are as follows: At what level are the intra-indicators used in international university rankings similar? Is it possible to group intra-indicators according to their similarities? What is the effect of similar intra-indicators on overall rankings? Design/methodology/approach Indicator-based scores of all universities in five research-focused international university rankings for all years they ranked form the data set of this study for the first and second research questions. The authors used a multidimensional scaling (MDS) and cosine similarity measure to analyze similarity of indicators and to answer these two research questions. Indicator-based scores and overall ranking scores for 2015 are used as data and Spearman correlation test is applied to answer the third research question. Findings Results of the analyses show that the intra-indicators used in ARWU, NTU and URAP are highly similar and that they can be grouped according to their similarities. The authors also examined the effect of similar indicators on 2015 overall ranking lists for these three rankings. NTU and URAP are affected least from the omitted similar indicators, which means it is possible for these two rankings to create very similar overall ranking lists to the existing overall ranking using fewer indicators. Research limitations/implications CWTS, Mapping Scientific Excellence, Nature Index, and SCImago Institutions Rankings (until 2015) are not included in the scope of this paper, since they do not create overall ranking lists. Likewise, Times Higher Education, CWUR and US are not included because of not presenting indicator-based scores. Required data were not accessible for QS for 2010 and 2011. Moreover, although QS ranks more than 700 universities, only first 400 universities in 2012–2015 rankings were able to be analyzed. Although QS’s and RUR’s data were analyzed in this study, it was statistically not possible to reach any conclusion for these two rankings. Practical implications The results of this study may be considered mainly by ranking bodies, policy- and decision-makers. The ranking bodies may use the results to review the indicators they use, to decide on which indicators to use in their rankings, and to question if it is necessary to continue overall rankings. Policy- and decision-makers may also benefit from the results of this study by thinking of giving up using overall ranking results as an important input in their decisions and policies. Originality/value This study is the first to use a MDS and cosine similarity measure for revealing the similarity of indicators. Ranking data is skewed that require conducting nonparametric statistical analysis; therefore, MDS is used. The study covers all ranking years and all universities in the ranking lists, and is different from the similar studies in the literature that analyze data for shorter time intervals and top-ranked universities in the ranking lists. It can be said that the similarity of intra-indicators for URAP, NTU and RUR is analyzed for the first time in this study, based on the literature review.


2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Ezgi Türkarslan ◽  
Jun Ye ◽  
Mehmet Ünver ◽  
Murat Olgun

The main purpose of this study is to construct a base for a new fuzzy set concept that is called consistency fuzzy set (CFS) which expresses the multidimensional uncertain data quite successfully. Our motive is to reduce the complexity and difficulty caused by the information contained in the truth sequence in a fuzzy multiset (FMS) and to present the data of the truth sequence in a more understandable and compact manner. Therefore, this paper introduces the concept of CFS that is characterized with a truth function defined on a universal set 0,1 2 . The first component of the truth pair of a CFS is the average value of the truth sequence of a FMS and the second component is the consistency degree, that is, the fuzzy complement of the standard deviation of the truth sequence of the same FMS. The main contribution of a CFS is the reflection of both the level of the average of the data that can be expressed with the different sequence lengths and the degree of the reasonable information in data via consistency degree. To develop this new concept, this paper also presents a correlation coefficient and a cosine similarity measure between CFSs. Furthermore, the proposed correlation coefficient and cosine similarity measure are applied to a multiperiod medical diagnosis problem. Finally, a comparison analysis is given between the obtained results and the existing results in literature to show the efficiency and rationality of the proposed correlation coefficient and cosine similarity measure.


Author(s):  
Barış Ergen

This chapter investigates how students attending environmental science classes in the Department of Urban and Regional Planning at Bozok University in 2010-2011 and 2011-2012 fall semesters learn concepts related to environmental science through a comparison of two different classes, using the Cosine Similarity Measure (CSM) method. The study demonstrates that the students lack the necessary knowledge about the concepts used in urban and regional planning literature and international conventions.


2018 ◽  
Vol 7 (4.15) ◽  
pp. 277 ◽  
Author(s):  
Madhusree Kuanr ◽  
Bikram Kesari Rath ◽  
Sachi Nandan Mohanty

Recommender systems provide suggestions to the users for choosing particular items from a large pool of items. The purpose of this study is to design a collaborative recommender system for the farmers for recommending giving prior idea regarding a crop which is suitable according to the location of the farmer based on weather condition of the previous months. The proposed system also recommends other seeds, pesticides and instruments according to the preferences in farming and location of the farmers while purchasing the seeds through online. It uses cosine similarity measure to find the similar user according the location of the farmer and fuzzy logic for predicting the yield of rice crop for Kharif season in state Odisha, India. The proposed system is implemented in Mamdani Fuzzy Inference model. The results reveal that it provides prior idea regarding a crop before sowing of seeds.  


2020 ◽  
Vol 39 (5) ◽  
pp. 7863-7880
Author(s):  
Yuanxiang Dong ◽  
Xiaoting Cheng ◽  
Weijie Chen ◽  
Hongbo Shi ◽  
Ke Gong

In actual life, uncertain and inconsistent information exists widely. How to deal with the information so that it can be better applied is a problem that has to be solved. Neutrosophic soft sets can process uncertain and inconsistent information. Also, Dempster-Shafer evidence theory has the advantage of dealing with uncertain information, and it can synthesize uncertain information and deal with subjective judgments effectively. Therefore, this paper creatively combines the Dempster-Shafer evidence theory with the neutrosophic soft sets, and proposes a cosine similarity measure for multi-criteria group decision making. Different from the previous studies, the proposed similarity measure is utilized to measure the similarity between two objects in the structure of neutrosophic soft set, rather than two neutrosophic soft sets. We also propose the objective degree and credibility degree which reflect the decision makers’ subjective preference based on the similarity measure. Then parameter weights are calculated by the objective degree. Additionally, based on credibility degree and parameter weights, we propose the modified score function, modified accuracy function, and modified certainty function, which can be employed to obtain partial order relation and make decisions. Later, we construct an aggregation algorithm for multi-criteria group decision making based on Dempster’s rule of combination and apply the algorithm to a case of medical diagnosis. Finally, by testing and comparing the algorithm, the results demonstrate that the proposed algorithm can solve the multi-criteria group decision making problems effectively.


Author(s):  
Jun Ye

The paper proposes a generalized ordered weighted simplified neutrosophic cosine similarity (GOWSNCS) measure by combining the cosine similarity measure of simplified neutrosophic sets (SNSs) with the generalized ordered weighted averaging (GOWA) operator and investigates its properties and special cases. Then, the author develops a simplified neutrosophic group decision-making method based on the GOWSNCS measure to handle multiple attribute group decision-making problems with simplified neutrosophic information. The prominent characteristics of the GOWSNCS measure are that it not only is a generalization of the cosine similarity measure but also considers the associated weights for attributes and decision makers in the aggregation of the cosine similarity measures of SNSs to alleviate the influence of unduly large or small similarities in the process of information aggregation. Finally, an illustrative example of investment alternatives is provided to demonstrate the application and effectiveness of the developed approach.


Sign in / Sign up

Export Citation Format

Share Document