Modeling Topics in the Alternative Uses Task
What is the representation of knowledge underlying human responses to alternative uses test items? This short paper describes an application of Latent Dirichlet Allocation (LDA, also known as topic modeling) to solve this problem of knowledge representation. For thissmall application, a document was de?ned as the set of responses given by a single participant to the alternative uses test brick prompt. This was chosen instead of single responses as the document unit, as single responses to alternative uses items are rather short, and LDA assumesthat documents are probabilistic mixtures of topics. The approach explored in this paper used LDA with Gibbs sampling, with the primary goal of model selection. The log likelihood of the data (log P(w | T)) was computed as to topics varied from 5 to 100. Results showed that the log likelihood increased to a peak at 15 topics and then steadily declined up to 100 topics. In the 15-topic model the most frequently appearing topic was that which gave the highest probability to the terms build, house, step, and smash. Documents best represented by that topic assignment were, on average, more similar to the dictionary de?finition of a brick based on vector cosines computed with Latent Semantic Analysis. Additional implications for using the topic model as a knowledge base for cognitive systems, and also as a tool for quantifying flexibility, the number of categories present in alternative uses response arrays, will also be discussed.