scholarly journals DBpedia SPARQL Benchmark – Performance Assessment with Real Queries on Real Data

Author(s):  
Mohamed Morsey ◽  
Jens Lehmann ◽  
Sören Auer ◽  
Axel-Cyrille Ngonga Ngomo
2021 ◽  
Author(s):  
Masaki Uto

AbstractPerformance assessment, in which human raters assess examinee performance in a practical task, often involves the use of a scoring rubric consisting of multiple evaluation items to increase the objectivity of evaluation. However, even when using a rubric, assigned scores are known to depend on characteristics of the rubric’s evaluation items and the raters, thus decreasing ability measurement accuracy. To resolve this problem, item response theory (IRT) models that can estimate examinee ability while considering the effects of these characteristics have been proposed. These IRT models assume unidimensionality, meaning that a rubric measures one latent ability. In practice, however, this assumption might not be satisfied because a rubric’s evaluation items are often designed to measure multiple sub-abilities that constitute a targeted ability. To address this issue, this study proposes a multidimensional IRT model for rubric-based performance assessment. Specifically, the proposed model is formulated as a multidimensional extension of a generalized many-facet Rasch model. Moreover, a No-U-Turn variant of the Hamiltonian Markov chain Monte Carlo algorithm is adopted as a parameter estimation method for the proposed model. The proposed model is useful not only for improving the ability measurement accuracy, but also for detailed analysis of rubric quality and rubric construct validity. The study demonstrates the effectiveness of the proposed model through simulation experiments and application to real data.


Author(s):  
Ditdit Nugeraha Utama ◽  
◽  
Deddy Kurniawan

— In the education domain, especially in higher education, a standard of performance assessment is determined by the grade point average (GPA). In its use in the evaluation process, GPA assessment standards have some shortcomings, such as focusing only on the improvement of the GPA value regardless of other factors that can affect an evaluation result, so that the GPA standard cannot be used as one and only one basic reference in determining the results of performance evaluation ideally. The study is a completed version of the previous one. The purpose of this study is to construct a decision support model (DSM) that can conduct performance assessment by considering various parameters that are considered to affect the academic performance results of students. The model was constructed by using three methods: fuzzy logic of Sugeno, conventional, and mathematical. This study uses quantitative data analysis because the data used is real data in the form of personal data and academic value data of students in the field. The predetermined methods were implemented in this study to calculate the academic performance value of students by considering various other parameters outside the GPA parameters. So that the assessment results will be more ideal and can strengthen the standard of GPA assessment in conducting the process of evaluating student performance. Also, finally, the verification and validation values of the model are represented.


Sensors ◽  
2021 ◽  
Vol 21 (10) ◽  
pp. 3559
Author(s):  
Rana Massoud ◽  
Riccardo Berta ◽  
Stefan Poslad ◽  
Alessandro De Gloria ◽  
Francesco Bellotti

Internet of Things technologies are spurring new types of instructional games, namely reality-enhanced serious games (RESGs), that support training directly in the field. This paper investigates a key feature of RESGs, i.e., user performance evaluation using real data, and studies an application of RESGs for promoting fuel-efficient driving, using fuel consumption as an indicator of driver performance. In particular, we propose a reference model for supporting a novel smart sensing dataflow involving the combination of two modules, based on machine learning, to be employed in RESGs in parallel and in real-time. The first module concerns quantitative performance assessment, while the second one targets verbal recommendation. For the assessment module, we compared the performance of three well-established machine learning algorithms: support vector regression, random forest and artificial neural networks. The experiments show that random forest achieves a slightly better performance assessment correlation than the others but requires a higher inference time. The instant recommendation module, implemented using fuzzy logic, triggers advice when inefficient driving patterns are detected. The dataflow has been tested with data from the enviroCar public dataset, exploiting on board diagnostic II (OBD II) standard vehicular interface information. The data covers various driving environments and vehicle models, which makes the system robust for real-world conditions. The results show the feasibility and effectiveness of the proposed approach, attaining a high estimation correlation (R2 = 0.99, with random forest) and punctual verbal feedback to the driver. An important word of caution concerns users’ privacy, as the modules rely on sensitive personal data, and provide information that by no means should be misused.


2021 ◽  
Vol 7 ◽  
pp. 7297-7314
Author(s):  
Said Bentouba ◽  
Mahmoud Bourouis ◽  
Nadjet Zioui ◽  
Arumugam Pirashanthan ◽  
Dhayalan Velauthapillai

2019 ◽  
Vol 26 (1) ◽  
pp. 76-91 ◽  
Author(s):  
Sergio Magdaleno ◽  
Elisabet Lacarra ◽  
Carlos de la Casa ◽  
Manuel López ◽  
Roberto Roldán ◽  
...  

Abstract The European Geostationary Navigation Overlay Service (EGNOS) augments the open public service offered by the GPS in Europe making suitable the use of GPS for safety critical applications. EGNOS is designed according to the same standard [ICAO SARPs, 2018] such as US WAAS, Japanese MSAS, GAGAN in India, SDCM in Russia and KAAS in South Korea and provides over Europe both corrections and integrity information about the GPS system. As the European SBAS, EGNOS offers three services: Open Service, Safety-of-life Service and EDAS. In general, the EGNOS Safety-of-life (SoL) Service is intended for transport applications in different domains (and currently in use by Aviation) where lives could be endangered if the performance of the navigation system is degraded below specific accuracy limits without giving notice in the specified time to alert. This requires that the relevant authority of the particular transport domain determines specific requirements for the navigation service based on the needs of that domain. Even if the main objective of the SBAS systems is the civil aviation community, the advantages provided by this technology are very useful to users from other domains. In this sense, a new EGNOS service for maritime is currently under development with the objective to complement the existing maritime radionavigation systems (e.g. DGNSS) in the European region for enhanced accuracy and integrity information where there is no backup infrastructure or in poorly covered environments. One of the steps needed for the development of this new EGNOS maritime service is the definition of a minimum set of recommendations for receiver manufacturers to provide them with a clear view on how to design their SBAS receivers to be compliant with the requirements defined for such a service. For that, EC, GSA, ESA and ESSP SAS have been working together since 2016 to develop guidelines for manufacturers for the implementation of SBAS in shipborne receiver. These guidelines, developed in the frame of the SBAS Working Group created in the Special Committee (SC) 104 on Differential Global Navigation Satellite Systems (DGNSS) of Radio Technical Commission for Maritime Services (RTCM), define a minimum set SBAS messages to be compliant with the International Maritime Organization (IMO) Resolution A.1046 and additionally provide a test specifications. This paper presents a summary of these SBAS guidelines as well as the preliminary list of tests that must be fulfilled to be compliant. Additionally, a preliminary performance assessment of the EGNOS maritime service based on IMO Res. A.1046 (27) for a 24-months period during 2016, 2017 and 2018 is presented. The performance parameters are calculated using real data to show what level of performance was attained by EGNOS. The assessment was done using both EGNOS ground monitoring stations (RIMS) and fault-free receivers, based on these guidelines, fed with actual data. The performance is shown for each performance parameter defined in the IMO Res. A.1046 (27) and for navigation in Ocean Waters and coastal waters, harbour entrances and harbour approaches. The paper also includes Service Coverage maps representing where EGNOS maritime service based on IMO Res. A.1046 (27) is fulfilling the requirements. Furthermore, GSA and ESSP, with the collaboration of The Norwegian Coastal Administration and Hurtigruten Cruises, carried out a GNSS data collection campaign of 10 days along the Norwegian coast with a trajectory through Trondheim to Kirkenes and Kirkenes to Bergen in February 2018. The aim of this data campaign was to assess EGNOS performance at user level in the maritime domain at high latitudes in Europe. The data campaign includes the navigation outside the MT27 region defined in EGNOS at that moment (70ºN). A performance assessment of EGNOS using some commercial receivers and a software receiver in line with the SBAS guidelines will be presented, showing the observed accuracy and availably results of the EGNOS solution.


2019 ◽  
Vol 35 (1) ◽  
pp. 126-136 ◽  
Author(s):  
Tour Liu ◽  
Tian Lan ◽  
Tao Xin

Abstract. Random response is a very common aberrant response behavior in personality tests and may negatively affect the reliability, validity, or other analytical aspects of psychological assessment. Typically, researchers use a single person-fit index to identify random responses. This study recommends a three-step person-fit analysis procedure. Unlike the typical single person-fit methods, the three-step procedure identifies both global misfit and local misfit individuals using different person-fit indices. This procedure was able to identify more local misfit individuals than single-index method, and a graphical method was used to visualize those particular items in which random response behaviors appear. This method may be useful to researchers in that it will provide them with more information about response behaviors, allowing better evaluation of scale administration and development of more plausible explanations. Real data were used in this study instead of simulation data. In order to create real random responses, an experimental test administration was designed. Four different random response samples were produced using this experimental system.


Sign in / Sign up

Export Citation Format

Share Document