Automatic Evaluation
Recently Published Documents

Total documents: 858 (last five years: 205)
H-index: 36 (last five years: 3)

2022 ◽  
Author(s):  
Zhu Li ◽  
Lu Kang ◽  
Miao Cai ◽  
Xiaoli Liu ◽  
Yanwen Wang ◽  
...  

Abstract
Purpose: The assessment of dyskinesia in Parkinson's disease (PD) using artificial-intelligence techniques is a significant and challenging task. At present, clinicians usually rate severity with the MDS-UPDRS scale, which is time-consuming and laborious and subject to inter-rater variability. Sensor-based evaluation is also widely used, but it is expensive and requires professional guidance, making it unsuitable for remote evaluation and patient self-examination. In addition, patient data are difficult to collect in medical research, so an objective, automatic assessment method for Parkinsonian dyskinesia that works on small samples is of great significance.
Methods: We design an automatic evaluation method that combines hand-crafted features with a convolutional neural network (CNN) and is suited to small-sample classification. From finger-tapping videos of Parkinson's patients, a pose-estimation model extracts skeleton information, from which motion-feature data are computed. The model is trained with 5-fold cross-validation to achieve an optimal trade-off between bias and variance, and a fully connected network (FCN) performs the final multi-class prediction.
Results: The proposed method achieves an accuracy of 79.7%, the best reported in this line of research. Compared with the latest related methods, it is superior in accuracy, parameter count, and FLOPs.
Conclusion: The method does not require patients to wear sensor devices and therefore has clear advantages for remote clinical evaluation. Training a CNN on motion-feature data yields the best accuracy, mitigates the difficulty of data acquisition in medicine, and offers a new approach to small-sample classification.
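A minimal sketch of the pipeline described above, assuming per-frame motion features (e.g., fingertip distances derived from a pose-estimation skeleton) have already been extracted from the tapping videos; the network shape, the five severity classes (MDS-UPDRS item scores 0-4), and all hyperparameters are illustrative assumptions, not the authors' exact implementation:

```python
# Sketch: 1D CNN over motion-feature sequences, evaluated with 5-fold CV.
# Feature extraction from pose skeletons is assumed to have happened upstream.
import numpy as np
import torch
import torch.nn as nn
from sklearn.model_selection import KFold

class TappingCNN(nn.Module):
    def __init__(self, n_features: int, n_classes: int = 5):  # 5 classes assumed
        super().__init__()
        # 1D convolutions over the time axis of the motion-feature sequence
        self.conv = nn.Sequential(
            nn.Conv1d(n_features, 32, kernel_size=5, padding=2), nn.ReLU(),
            nn.Conv1d(32, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        # Fully connected head for the multi-class prediction
        self.fc = nn.Linear(64, n_classes)

    def forward(self, x):  # x: (batch, n_features, n_frames)
        return self.fc(self.conv(x).squeeze(-1))

def cross_validate(X: np.ndarray, y: np.ndarray, epochs: int = 30) -> float:
    """Mean 5-fold CV accuracy; X has shape (n_samples, n_features, n_frames)."""
    accs = []
    for train_idx, test_idx in KFold(n_splits=5, shuffle=True, random_state=0).split(X):
        model = TappingCNN(n_features=X.shape[1])
        opt = torch.optim.Adam(model.parameters(), lr=1e-3)
        loss_fn = nn.CrossEntropyLoss()
        xb = torch.tensor(X[train_idx], dtype=torch.float32)
        yb = torch.tensor(y[train_idx], dtype=torch.long)
        for _ in range(epochs):               # full-batch training, kept simple
            opt.zero_grad()
            loss_fn(model(xb), yb).backward()
            opt.step()
        with torch.no_grad():
            pred = model(torch.tensor(X[test_idx], dtype=torch.float32)).argmax(1)
        accs.append((pred.numpy() == y[test_idx]).mean())
    return float(np.mean(accs))
```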


Author(s):  
Ahrii Kim ◽  
Jinhyun Kim

SacreBLEU, which incorporates a text-normalization step in its pipeline, has been well received as an automatic evaluation metric in recent years. For agglutinative languages such as Korean, however, the metric cannot produce a meaningful score without customized pre-tokenization. This paper therefore examines the influence of diversified pre-tokenization schemes (word, morpheme, character, and subword) on the metric by performing a meta-evaluation against manually constructed into-Korean human evaluation data. Our empirical study demonstrates that the correlation of SacreBLEU with human judgment fluctuates consistently with the token type. Some tokenizations even degrade the metric's reliability, and MeCab is no exception. Guiding the proper choice of tokenizer for the metric, we stress the significance of the character level and the insignificance of the Jamo level in MT evaluation.
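As a hedged illustration of the tokenization effect the paper measures, the sketch below scores a toy Korean pair with two tokenizers that ship with the sacrebleu package ("13a" and "char"); the sentences are invented, and morpheme-level (MeCab) tokenization would have to be applied to the text beforehand:

```python
# Same hypothesis/reference pair, two built-in tokenizers, two different scores.
from sacrebleu.metrics import BLEU

hyps = ["나는 학교에 갔다"]      # system output (toy example)
refs = [["나는 학교에 갔었다"]]   # one reference stream, aligned with hyps

for tok in ("13a", "char"):     # word-level-ish vs. character-level tokenization
    bleu = BLEU(tokenize=tok)
    print(tok, bleu.corpus_score(hyps, refs))
```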


2021 ◽  
Vol 2021 ◽  
pp. 1-8
Author(s):  
Junlong Ren

To address the low confidence of traditional methods for the automatic evaluation of spoken English, this study designs an evaluation method based on multimodal discourse analysis theory. The method collects spoken-English pronunciation signals with sound sensors, decomposes the speech signals with a multilayer wavelet feature-scale transform, and performs adaptive filter detection and spectrum analysis on the signals according to the results of the feature decomposition. Guided by multimodal discourse analysis theory, it then extracts automatic evaluation features of spoken English and recognizes speech quality from the results. Experiments show that, compared with a control group, the proposed method has clear advantages in evaluation confidence and resolves the low-confidence problem of traditional automatic evaluation of spoken English.
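A minimal sketch of the wavelet-decomposition and spectrum-analysis steps, using PyWavelets and NumPy on a synthetic stand-in signal; the wavelet family ("db4"), decomposition depth, and sampling rate are assumptions, not the paper's settings:

```python
# Multilayer wavelet decomposition plus FFT-based spectrum analysis of a signal.
import numpy as np
import pywt

fs = 16_000                                   # assumed sampling rate (Hz)
t = np.arange(fs) / fs
signal = np.sin(2 * np.pi * 220 * t)          # stand-in for a recorded utterance

# Multilayer wavelet transform: coeffs[0] is the coarsest approximation,
# coeffs[1:] are the detail coefficients, one array per level.
coeffs = pywt.wavedec(signal, "db4", level=4)

# Simple per-level energy features for a downstream evaluation model.
energies = [float(np.sum(c ** 2)) for c in coeffs]

# Spectrum analysis of the original signal via the real FFT.
spectrum = np.abs(np.fft.rfft(signal))
freqs = np.fft.rfftfreq(signal.size, d=1 / fs)

print("per-level wavelet energy:", energies)
print("dominant frequency (Hz):", freqs[spectrum.argmax()])
```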


2021 ◽  
Author(s):  
Yunshu Zhu ◽  
Ting Song ◽  
Ping Yu

With the popularity of the Internet, consumers are likely to turn to websites for dementia information, yet they may lack the knowledge or experience to distinguish quality information from opinion pieces. This study investigated the development methods, instruments, and parameters used to evaluate the content quality of dementia websites. Reviewing 18 existing instruments from the relevant literature, we identified four development methods: questionnaire survey, automatic evaluation, the Delphi method, and focus-group discussion. The instruments cover six content-quality parameters: reliability, currency, readability, disclosure, objectivity, and relevance. Given the significant social and economic impact of dementia, developing instruments specifically to measure the content quality of dementia websites is necessary.
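Of the six parameters, readability is the one most readily scored by automatic evaluation; below is a minimal sketch using the textstat package (an illustrative assumption, not one of the 18 reviewed instruments):

```python
# Two standard readability scores for a snippet of website text.
import textstat

page_text = (
    "Dementia is a general term for a decline in mental ability "
    "severe enough to interfere with daily life."
)
print("Flesch Reading Ease:", textstat.flesch_reading_ease(page_text))
print("Flesch-Kincaid grade:", textstat.flesch_kincaid_grade(page_text))
```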


2021 ◽  
Author(s):  
Denis Baručić ◽  
Jan Kybic ◽  
Olga Teplá ◽  
Zinovij Topurko ◽  
Irena Kratochvílová

Author(s):  
Lucía Díaz-Vilariño ◽  
José Luis González-Cespón ◽  
José Antonio Alonso-Rodríguez ◽  
Antonio Fernández-Álvarez

Author(s):  
Inmaculada Pou Schmidt ◽  
Alejandro Rodríguez Ortega ◽  
Francisco Albert Gil ◽  
Nuria Aleixos Borrás

Author(s):  
Sarthak Kagliwal

Abstract: The automatic assessment of subjective answers requires natural language processing and automated scoring. Ontology, semantic-similarity matching, and statistical approaches are among the strategies employed, and most existing methods follow an unsupervised approach. The proposed system is likewise unsupervised and is divided into two modules: the first extracts the essential data through text summarization; the second applies various natural-language models to the retrieved text and assigns marks.
Keywords: Automatic Evaluation, NLP, Text Summarization, Similarity Measure, Marks Scoring
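A hedged sketch of the second module, scoring a student answer by similarity to a model answer; TF-IDF cosine similarity stands in for the paper's unnamed language models, the summarization module is elided, and the 10-mark scale is an assumption:

```python
# Similarity-based marks scoring: compare student text against a model answer.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

model_answer = "Photosynthesis converts light energy into chemical energy in plants."
student_answer = "Plants use photosynthesis to turn light into chemical energy."

# Module 2: semantic-similarity matching between the (pre-summarized) texts.
tfidf = TfidfVectorizer().fit_transform([model_answer, student_answer])
similarity = cosine_similarity(tfidf[0], tfidf[1])[0, 0]

print(f"similarity = {similarity:.2f}, marks = {round(similarity * 10)} / 10")
```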


2021 ◽  
Author(s):  
Lucas Mendonça de Souza ◽  
Igor Moreira Felix ◽  
Bernardo Martins Ferreira ◽  
Anarosa Alves Franco Brandão ◽  
Leônidas de Oliveira Brandão

The outbreak of the COVID-19 pandemic caused a surge in enrollments in online courses, and this boost in student numbers strained teachers' ability to evaluate exercises and resolve doubts. In this context, tools that evaluate and provide feedback on code solutions can be used in programming courses to reduce teachers' workload. Even with such tools, however, the literature shows that learning to program is a challenging task: programming is complex, and the programming language employed can also affect students' outcomes. Well-designed exercises can therefore reduce students' difficulty in identifying the problem and help mitigate syntax challenges. This research applies learning-analytics processes to the interaction logs and code solutions of automatic evaluation tools to find metrics capable of identifying problematic exercises and their difficulty. Here an exercise is considered problematic if students have trouble interpreting its description or if its solution requires complex programming structures such as loops, conditionals, and recursion. The data come from online introductory programming courses. Results show that the computed metrics can identify problematic exercises, as well as those that are merely challenging.
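A minimal sketch of the learning-analytics step, deriving two per-exercise difficulty metrics from a toy interaction log; the log schema and column names are illustrative assumptions, not the tool's actual format:

```python
# Per-exercise metrics from submission logs: mean attempts and pass rate.
import pandas as pd

log = pd.DataFrame({
    "student":  ["a", "a", "a", "b", "b", "c"],
    "exercise": ["ex1", "ex1", "ex2", "ex1", "ex2", "ex2"],
    "passed":   [False, True, True, False, False, True],
})

# Submissions per student-exercise pair, then averaged per exercise.
attempts = (log.groupby(["exercise", "student"]).size()
               .groupby(level="exercise").mean())
pass_rate = log.groupby("exercise")["passed"].mean()

metrics = pd.DataFrame({"mean_attempts": attempts, "pass_rate": pass_rate})
print(metrics)  # high attempts plus low pass rate flag a problematic exercise
```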

