Coherent category training enhances generalization and increases reliance on prototype representations

2021 ◽  
Author(s):  
Caitlin Bowman ◽  
Dagmar Zeithamova

A major question for the study of learning and memory is how to tailor learning experiences to promote knowledge that generalizes to new situations. Using category learning as a representative domain, the present study tested two factors thought to influence acquisition of conceptual knowledge: the number of training examples (set size) and the similarity of training examples to the category average (set coherence). Across participants, size and coherence of category training sets were varied in a fully-crossed design. After training, participants demonstrated the breadth of their category knowledge by categorizing novel examples varying in their distance from the category center. Results showed better generalization following more coherent training sets, even when categorizing items furthest from the category center. There was little effect of set size. We also tested the types of representations underlying categorization decisions by fitting formal prototype and exemplar models. Prototype models posit abstract category representations based on the category’s central tendency, whereas exemplar models posit that categories are represented by individual category members. We show that more subjects rely on a prototype strategy following high coherence training, suggesting that more coherent training sets facilitate extraction of the category average. Together, these results provide strong evidence for the benefit of training on examples that are similar to one another and to the category center.

2019 ◽  
Author(s):  
Caitlin Bowman ◽  
Dagmar Zeithamova

Building conceptual knowledge that generalizes to novel situations is a key function of human memory. Category-learning paradigms have long been used to understand the mechanisms of knowledge generalization. In the present study, we tested the conditions that promote formation of new concepts. Participants underwent one of six training conditions that differed in the number of examples per category (set size) and their relative similarity to the category average (set coherence). Performance metrics included rates of category learning, ability to generalize categories to new items of varying similarity to prototypes, and recognition memory for individual examples. In categorization, high set coherence led to faster learning and better generalization, while set size had little effect. Recognition did not differ reliably among conditions. We also tested the nature of memory representations used for categorization and recognition decisions using quantitative prototype and exemplar models fit to behavioral responses. Prototype models posit abstract category representations based on the category’s central tendency, whereas exemplar models posit that categories are represented by individual category members. Prototype strategy use during categorization increased with increasing set coherence, suggesting that coherent training sets facilitate extraction of commonalities within a category. We conclude that learning from a coherent set of examples is an efficient means of forming abstract knowledge that generalizes broadly.


2022 ◽  
Author(s):  
Nabeel Durrani ◽  
Damjan Vukovic ◽  
Maria Antico ◽  
Jeroen van der Burgt ◽  
Ruud JG van van Sloun ◽  
...  

<div>Our automated deep learning-based approach identifies consolidation/collapse in LUS images to aid in the diagnosis of late stages of COVID-19 induced pneumonia, where consolidation/collapse is one of the possible associated pathologies. A common challenge in training such models is that annotating each frame of an ultrasound video requires high labelling effort. This effort in practice becomes prohibitive for large ultrasound datasets. To understand the impact of various degrees of labelling precision, we compare labelling strategies to train fully supervised models (frame-based method, higher labelling effort) and inaccurately supervised models (video-based methods, lower labelling effort), both of which yield binary predictions for LUS videos on a frame-by-frame level. We moreover introduce a novel sampled quaternary method which randomly samples only 10% of the LUS video frames and subsequently assigns (ordinal) categorical labels to all frames in the video based on the fraction of positively annotated samples. This method outperformed the inaccurately supervised video-based method of our previous work on pleural effusions. More surprisingly, this method outperformed the supervised frame-based approach with respect to metrics such as precision-recall area under curve (PR-AUC) and F1 score that are suitable for the class imbalance scenario of our dataset despite being a form of inaccurate learning. This may be due to the combination of a significantly smaller data set size compared to our previous work and the higher complexity of consolidation/collapse compared to pleural effusion, two factors which contribute to label noise and overfitting; specifically, we argue that our video-based method is more robust with respect to label noise and mitigates overfitting in a manner similar to label smoothing. Using clinical expert feedback, separate criteria were developed to exclude data from the training and test sets respectively for our ten-fold cross validation results, which resulted in a PR-AUC score of 73% and an accuracy of 89%. While the efficacy of our classifier using the sampled quaternary method must be verified on a larger consolidation/collapse dataset, when considering the complexity of the pathology, our proposed classifier using the sampled quaternary video-based method is clinically comparable with trained experts and improves over the video-based method of our previous work on pleural effusions.</div>


2019 ◽  
Vol 11 (4) ◽  
pp. 451-467
Author(s):  
Miroslav Nemčok

AbstractParties can not only actively adjust the electoral rules to reach more favourable outcomes, as is most often recognized in political science, but they also passively create an environment that systematically influences electoral competition. This link is theorized and included in the wider framework capturing the mutual dependence of electoral systems and party systems. The impact of passive influence is successfully tested on one out of two factors closely related to party systems: choice set size (i.e., number of options provided to voters) and degree of ideological polarization. The research utilizes established datasets (i.e., Constituency-Level Elections Archive, Party System Polarization Index, Chapel Hill Expert Survey, and Manifesto Project Database) and via regression analysis with clustered robust standard errors concludes that the choice set size constitutes an attribute with passive influence over electoral systems. Thus, it must be reflected when outcomes of electoral systems are estimated or compared across various contexts.


Author(s):  
Fabien Mathy ◽  
Jacob Feldman

Abstract. This study of supervised categorization shows how different kinds of category representations are influenced by the order in which training examples are presented. We used the well-studied 5-4 category structure of Medin and Schaffer (1978) , which allows transfer of category learning to new stimuli to be discriminated as a function of rule-based or similarity-based category knowledge. In the rule-based training condition (thought to facilitate the learning of abstract logical rules and hypothesized to produce rule-based classification), items were grouped by subcategories and randomized within each subcategory. In the similarity-based training condition (thought to facilitate associative learning and hypothesized to produce exemplar classification), transitions between items within the same category were determined by their featural similarity and subcategories were ignored. We found that transfer patterns depended on whether the presentation order was similarity-based, or rule-based, with the participants particularly capitalizing on the rule-based order.


2022 ◽  
Author(s):  
Nabeel Durrani ◽  
Damjan Vukovic ◽  
Maria Antico ◽  
Jeroen van der Burgt ◽  
Ruud JG van van Sloun ◽  
...  

<div>Our automated deep learning-based approach identifies consolidation/collapse in LUS images to aid in the diagnosis of late stages of COVID-19 induced pneumonia, where consolidation/collapse is one of the possible associated pathologies. A common challenge in training such models is that annotating each frame of an ultrasound video requires high labelling effort. This effort in practice becomes prohibitive for large ultrasound datasets. To understand the impact of various degrees of labelling precision, we compare labelling strategies to train fully supervised models (frame-based method, higher labelling effort) and inaccurately supervised models (video-based methods, lower labelling effort), both of which yield binary predictions for LUS videos on a frame-by-frame level. We moreover introduce a novel sampled quaternary method which randomly samples only 10% of the LUS video frames and subsequently assigns (ordinal) categorical labels to all frames in the video based on the fraction of positively annotated samples. This method outperformed the inaccurately supervised video-based method of our previous work on pleural effusions. More surprisingly, this method outperformed the supervised frame-based approach with respect to metrics such as precision-recall area under curve (PR-AUC) and F1 score that are suitable for the class imbalance scenario of our dataset despite being a form of inaccurate learning. This may be due to the combination of a significantly smaller data set size compared to our previous work and the higher complexity of consolidation/collapse compared to pleural effusion, two factors which contribute to label noise and overfitting; specifically, we argue that our video-based method is more robust with respect to label noise and mitigates overfitting in a manner similar to label smoothing. Using clinical expert feedback, separate criteria were developed to exclude data from the training and test sets respectively for our ten-fold cross validation results, which resulted in a PR-AUC score of 73% and an accuracy of 89%. While the efficacy of our classifier using the sampled quaternary method must be verified on a larger consolidation/collapse dataset, when considering the complexity of the pathology, our proposed classifier using the sampled quaternary video-based method is clinically comparable with trained experts and improves over the video-based method of our previous work on pleural effusions.</div>


Author(s):  
Rosa Silva ◽  
Elzbieta Bobrowicz-Campos ◽  
Paulo Santos-Costa ◽  
Isabel Gil ◽  
Hugo Neves ◽  
...  

Background: This study aimed to translate and adapt the Quality of the Carer–Patient Relationship (QCPR) scale into Portuguese and analyse both its psychometric properties and correlation with sociodemographic and clinical variables. Methods: Phase (1) Translate and culturally adapt the scale. Phase (2) Assess the scale’s confirmatory factorial analysis, internal consistency, construct validity, and correlations. Results: The experts classified the overall quality of the translation as adequate. A total of 53 dyads (cared-for person and carer) were assessed. In both versions, measures of central tendency and symmetry were also adequate, and the two factors under investigation had appropriate reliability, although in the conflict/critical factor, this was more fragile. Cronbach’s alpha values were 0.89 for the cared-for person version and 0.91 for the carer version. Conclusions: The QCPR scale showed satisfactory to good values of reliability. The assessment is essential to guarantee structured interventions by health professionals, since the quality of the dyads’ relationship seems to influence both older adults’ quality of life and carers’ health status. This study is a significant contribution to the introduction of the QCPR scale in the Portuguese clinical and scientific culture but also an opportunity to increase its use internationally.


1998 ◽  
Vol 10 (8) ◽  
pp. 2201-2217 ◽  
Author(s):  
Peter Sollich ◽  
David Barber

We analyze online gradient descent learning from finite training sets at noninfinitesimal learning rates η. Exact results are obtained for the time-dependent generalization error of a simple model system: a linear network with a large number of weights N, trained on p = αN examples. This allows us to study in detail the effects of finite training set size α on, for example, the optimal choice of learning rate η. We also compare online and offline learning, for respective optimal settings of η at given final learning time. Online learning turns out to be much more robust to input bias and actually outperforms offline learning when such bias is present; for unbiased inputs, online and offline learning perform almost equally well.


1993 ◽  
Vol 04 (01) ◽  
pp. 15-25 ◽  
Author(s):  
MARTIN MØLLER

Efficient supervised learning on large redundant training sets requires algorithms where the amount of computation involved in preparing each weight update is independent of the training set size. Off-line algorithms like the standard conjugate gradient algorithms do not have this property while on-line algorithms like the stochastic backpropagation algorithm do. A new algorithm combining the good properties of off-line and on-line algorithms is introduced.


Author(s):  
Margaret Sampson ◽  
Nassr Nama ◽  
Katharine O'Hearn ◽  
Kimmo Murto ◽  
Ahmed Nasr ◽  
...  

Abstract Introduction Solutions like crowd screening and machine learning can assist systematic reviewers with heavy screening burdens but require training sets containing a mix of eligible and ineligible studies. This study explores using PubMed's Best Match algorithm to create small training sets containing at least five relevant studies. Methods Six systematic reviews were examined retrospectively. MEDLINE searches were converted and run in PubMed. The ranking of included studies was studied under both Best Match and Most Recent sort conditions. Results Retrieval sizes for the systematic reviews ranged from 151 to 5,406 records and the numbers of relevant records ranged from 8 to 763. The median ranking of relevant records was higher in Best Match for all six reviews, when compared with Most Recent sort. Best Match placed a total of thirty relevant records in the first fifty, at least one for each systematic review. Most Recent sorting placed only ten relevant records in the first fifty. Best Match sorting outperformed Most Recent in all cases and placed five or more relevant records in the first fifty in three of six cases. Discussion Using a predetermined set size such as fifty may not provide enough true positives for an effective systematic review training set. However, screening PubMed records ranked by Best Match and continuing until the desired number of true positives are identified is efficient and effective. Conclusions The Best Match sort in PubMed improves the ranking and increases the proportion of relevant records in the first fifty records relative to sorting by recency.


Sign in / Sign up

Export Citation Format

Share Document