An automatic scoring method for Chinese-English spoken translation based on attention LSTM

An LVCSR Based Automatic Scoring Method in English Reading Tests

2012 4th International Conference on Intelligent Human-Machine Systems and Cybernetics ◽

10.1109/ihmsc.2012.14 ◽

2012 ◽

Cited By ~ 3

Author(s):

Junbo Zhang ◽

Fuping Pan ◽

Yonghong Yan

Keyword(s):

Scoring Method ◽

Reading Tests ◽

English Reading ◽

Automatic Scoring

Download Full-text

Automatic scoring method of English composition based on language depth perception

Journal of Physics Conference Series ◽

10.1088/1742-6596/1486/4/042045 ◽

2020 ◽

Vol 1486 ◽

pp. 042045 ◽

Cited By ~ 1

Author(s):

Tang Dan ◽

Sun Yu

Keyword(s):

Depth Perception ◽

English Composition ◽

Scoring Method ◽

Automatic Scoring

Download Full-text

Automatic Scoring Method for Subjective Questions Based on ALBERT and Cilin

Computer Science and Application ◽

10.12677/csa.2020.109177 ◽

2020 ◽

Vol 10 (09) ◽

pp. 1673-1682

Author(s):

展鑫张

Keyword(s):

Scoring Method ◽

Automatic Scoring

Download Full-text

Automatic scoring method for open answer task in the SJ-CAT speaking test considering utterance difficulty level

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific ◽

10.1109/apsipa.2014.7041583 ◽

2014 ◽

Author(s):

Hao Lu ◽

Takeshi Yamada ◽

Shingo Imai ◽

Takahiro Shinozaki ◽

Ryuichi Nisimura ◽

...

Keyword(s):

Difficulty Level ◽

Scoring Method ◽

Speaking Test ◽

Automatic Scoring

Download Full-text

Research on the Automatic Scoring Method of English Essay based on the Improved K-Nearest Neighbor Algorithm

Proceedings of the 2016 International Conference on Education, Management and Computing Technology (ICEMCT-16) ◽

10.2991/icemct-16.2016.275 ◽

2016 ◽

Author(s):

Hao Jiang ◽

Yaru Jin

Keyword(s):

Nearest Neighbor ◽

K Nearest Neighbor ◽

Scoring Method ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm ◽

Automatic Scoring

Download Full-text

Automatic scoring method considering quality and content of speech for scat Japanese speaking test

2012 International Conference on Speech Database and Assessments ◽

10.1109/icsda.2012.6422460 ◽

2012 ◽

Cited By ~ 1

Author(s):

Naoko Okubo ◽

Yuto Yamahata ◽

Takeshi Yamada ◽

Shingo Imai ◽

Kenkichi Ishizuka ◽

...

Keyword(s):

Scoring Method ◽

Speaking Test ◽

Automatic Scoring

Download Full-text

Machine-Scored Syntax: Comparison of the CLAN Automatic Scoring Program to Manual Scoring

Language Speech and Hearing Services in Schools ◽

10.1044/2019_lshss-19-00056 ◽

2020 ◽

Vol 51 (2) ◽

pp. 479-493

Author(s):

Jenny A. Roberts ◽

Evelyn P. Altenberg ◽

Madison Hunter

Keyword(s):

Data Exchange ◽

Child Language ◽

Exchange System ◽

Absolute Point ◽

Language Analysis ◽

Search Patterns ◽

Language Data ◽

Report Accuracy ◽

Point To Point ◽

Automatic Scoring

Purpose The results of automatic machine scoring of the Index of Productive Syntax from the Computerized Language ANalysis (CLAN) tools of the Child Language Data Exchange System of TalkBank (MacWhinney, 2000) were compared to manual scoring to determine the accuracy of the machine-scored method. Method Twenty transcripts of 10 children from archival data of the Weismer Corpus from the Child Language Data Exchange System at 30 and 42 months were examined. Measures of absolute point difference and point-to-point accuracy were compared, as well as points erroneously given and missed. Two new measures for evaluating automatic scoring of the Index of Productive Syntax were introduced: Machine Item Accuracy (MIA) and Cascade Failure Rate— these measures further analyze points erroneously given and missed. Differences in total scores, subscale scores, and individual structures were also reported. Results Mean absolute point difference between machine and hand scoring was 3.65, point-to-point agreement was 72.6%, and MIA was 74.9%. There were large differences in subscales, with Noun Phrase and Verb Phrase subscales generally providing greater accuracy and agreement than Question/Negation and Sentence Structures subscales. There were significantly more erroneous than missed items in machine scoring, attributed to problems of mistagging of elements, imprecise search patterns, and other errors. Cascade failure resulted in an average of 4.65 points lost per transcript. Conclusions The CLAN program showed relatively inaccurate outcomes in comparison to manual scoring on both traditional and new measures of accuracy. Recommendations for improvement of the program include accounting for second exemplar violations and applying cascaded credit, among other suggestions. It was proposed that research on machine-scored syntax routinely report accuracy measures detailing erroneous and missed scores, including MIA, so that researchers and clinicians are aware of the limitations of a machine-scoring program. Supplemental Material https://doi.org/10.23641/asha.11984364

Download Full-text

The Relation Between Linguistic Awareness Skills and Spelling in Adults: A Comparison Among Scoring Procedures

Journal of Speech Language and Hearing Research ◽

10.1044/2020_jslhr-19-00120 ◽

2020 ◽

Vol 63 (4) ◽

pp. 1240-1253

Author(s):

Victoria S. Henbest ◽

Lisa Fitton ◽

Krystal L. Werfel ◽

Kenn Apel

Keyword(s):

Phonemic Awareness ◽

Concurrent Validity ◽

Word Reading ◽

Continuous Measure ◽

Scoring Methods ◽

Syntactic Awareness ◽

Scoring Method ◽

Spelling Performance ◽

Linguistic Awareness ◽

Average Readers

Purpose Spelling is a skill that relies on an individual's linguistic awareness, the ability to overtly manipulate language. The ability to accurately spell is important for academic and career success into adulthood. The spelling skills of adults have received some attention in the literature, but there is limited information regarding which approach for analyzing adults' spelling is optimal for guiding instruction or intervention for those who struggle. Thus, we aimed to examine the concurrent validity of four different scoring methods for measuring adults' spellings (a dichotomous scoring method and three continuous methods) and to determine whether adults' linguistic awareness skills differentially predict spelling outcomes based on the scoring method employed. Method Sixty undergraduate college students who were determined to be average readers as measured by a word reading and contextual word reading task were administered a spelling task as well as morphological, orthographic, phonemic, and syntactic awareness tasks. Results All four scoring methods were highly correlated suggesting high concurrent validity among the measures. Two linguistic awareness skills, morphological awareness and syntactic awareness, predicted spelling performance on both the dichotomous and continuous scoring methods. Contrastively, phonemic awareness and orthographic awareness predicted spelling performance only when spelling was scored using a continuous measure error analysis. Conclusions The results of this study confirm that multiple linguistic awareness skills are important for spelling in adults who are average readers. The results also highlight the need for using continuous measures of spelling when planning intervention or instruction, particularly in the areas of orthographic and phonemic awareness.

Download Full-text

Using Principal Component Scores to Enhance the Validity and Reliability of Big Five Personality Measures

Journal of Individual Differences ◽

10.1027/1614-0001/a000225 ◽

2017 ◽

Vol 38 (2) ◽

pp. 83-93

Author(s):

Jeffrey M. Cucina ◽

Nicholas L. Vasilopoulos ◽

Arwen H. DeCostanza

Keyword(s):

Big Five ◽

Discriminant Validity ◽

Reliability And Validity ◽

Principal Component ◽

Big Five Personality ◽

Validity And Reliability ◽

Scoring Method ◽

Five Factors ◽

Big Five Factors ◽

Component Scores

Abstract. Varimax rotated principal component scores (VRPCS) have previously been offered as a possible solution to the non-orthogonality of scores for the Big Five factors. However, few researchers have examined the reliability and validity of VRPCS. To address this gap, we use a lab study and a field study to investigate whether using VRPCS increase orthogonality, reliability, and criterion-related validity. Compared to the traditional unit-weighting scoring method, the use of VRPCS enhanced the reliability and discriminant validity of the Big Five factors, although there was little improvement in criterion-related validity. Results are discussed in terms of the benefit of using VRPCS instead of traditional unit-weighted sum scores.

Download Full-text

Confound It!

European Journal of Psychological Assessment ◽

10.1027/1015-5759/a000459 ◽

2019 ◽

Vol 35 (6) ◽

pp. 855-867 ◽

Cited By ~ 2

Author(s):

John T. Kulas ◽

Rachael Klahr ◽

Lindsey Knights

Keyword(s):

Social Desirability ◽

Response Pattern ◽

Experimental Manipulation ◽

Scoring Method ◽

Method Effect ◽

Coding Method ◽

The Social ◽

Current Article ◽

Method Factors ◽

Psychological Inventory

Abstract. Many investigators have noted “reverse-coding” method factors when exploring response pattern structure with psychological inventory data. The current article probes for the existence of a confound in these investigations, whereby an item’s level of saturation with socially desirable content tends to covary with the item’s substantive scale keying. We first investigate its existence, demonstrating that 15 of 16 measures that have been previously implicated as exhibiting a reverse-scoring method effect can also be reasonably characterized as exhibiting a scoring key/social desirability confound. A second set of analyses targets the extent to which the confounding variable may confuse interpretation of factor analytic results and documents strong social desirability associations. The results suggest that assessment developers perhaps consider the social desirability scale value of indicators when constructing scale aggregates (and possibly scales when investigating inter-construct associations). Future investigations would ideally disentangle the confound via experimental manipulation.

Download Full-text