distributional semantic models Latest Research Papers

2021 ◽

Vol 72 ◽

pp. 1281-1305

Author(s):

Atefe Pakzad ◽

Morteza Analoui

Keyword(s):

Vector Space ◽

Selection Method ◽

Word Embedding ◽

Baseline Method ◽

Word Similarity ◽

Semantic Models ◽

Fusion Methods ◽

Distributional Semantic Models ◽

Word Selection ◽

Basis Vectors

Distributional semantic models represent the meaning of words as vectors. We introduce a selection method to learn a vector space that each of its dimensions is a natural word. The selection method starts from the most frequent words and selects a subset, which has the best performance. The method produces a vector space that each of its dimensions is a word. This is the main advantage of the method compared to fusion methods such as NMF, and neural embedding models. We apply the method to the ukWaC corpus and train a vector space of N=1500 basis words. We report tests results on word similarity tasks for MEN, RG-65, SimLex-999, and WordSim353 gold datasets. Also, results show that reducing the number of basis vectors from 5000 to 1500 reduces accuracy by about 1.5-2%. So, we achieve good interpretability without a large penalty. Interpretability evaluation results indicate that the word vectors obtained by the proposed method using N=1500 are more interpretable than word embedding models, and the baseline method. We report the top 15 words of 1500 selected basis words in this paper.

Download Full-text

ViSpa (Vision Spaces): A computer-vision-based representation system for individual images and concept prototypes, with large-scale evaluation

10.31234/osf.io/n4dmq ◽

2021 ◽

Author(s):

Fritz Guenther ◽

Marco Marelli ◽

Sam Tureski ◽

Marco A. Petilli

Keyword(s):

Large Scale ◽

Error Rates ◽

Behavioral Data ◽

Wide Audience ◽

Precise Control ◽

Processing Times ◽

On Line ◽

Image Pairs ◽

Distributional Semantic Models ◽

Visual Domain

Quantitative, data-driven models for mental representations have long enjoyed popularity and success in psychology (for example, distributional semantic models in the language domain), but have largely been missing for the visual domain. To overcome this, we present ViSpa (Vision Spaces), high-dimensional vector spaces that include vision-based representation for naturalistic images as well as concept prototypes. These vectors are derived directly from visual stimuli through a deep convolutional neural network (DCNN) trained to classify images, and allow us to compute vision-based similarity scores between any pair of images and/or concept prototypes. We successfully evaluate these similarities against human behavioral data in a series of large-scale studies, including off-line judgments – visual similarity judgments for the referents of word pairs (Study 1) and for image pairs (Study 2), and typicality judgments for images given a label (Study 3) – as well as on-line processing times and error rates in a discrimination (Study 4) and priming task (Study 5) with naturalistic image material. ViSpa similarities predict behavioral data across all tasks, which renders ViSpa a theoretically appealing model for vision-based representations and a valuable research tool for data analysis and the construction of experimental material: ViSpa allows for precise control over experimental material consisting of images (also in combination with words), and introduces a specifically vision-based similarity for word pairs. To make ViSpa available to a wide audience, this article a) includes (video) tutorials on how to use ViSpa in R, and b) presents a user-friendly web interface at http://vispa.fritzguenther.de.

Download Full-text

Semantic Memory Search and Retrieval in a Novel Cooperative Word Game: A Comparison of Associative and Distributional Semantic Models

Cognitive Science ◽

10.1111/cogs.13053 ◽

2021 ◽

Vol 45 (10) ◽

Author(s):

Abhilasha A. Kumar ◽

Mark Steyvers ◽

David A. Balota

Keyword(s):

Semantic Memory ◽

Memory Search ◽

Semantic Models ◽

Word Game ◽

Distributional Semantic Models ◽

Search And Retrieval

Download Full-text

Hands-on false memories: A combined study with distributional semantics and mouse-tracking

10.31234/osf.io/5gvwe ◽

2021 ◽

Author(s):

Daniele Gatti ◽

Marco Marelli ◽

Giuliana Mazzoni ◽

Tomaso Vecchi ◽

Luca Rinaldi

Keyword(s):

Decision Making ◽

False Memory ◽

Semantic Similarity ◽

Time Window ◽

Recognition Task ◽

Mouse Tracking ◽

Decision Making Processes ◽

Distributional Semantic Models ◽

Temporal Measures ◽

The Impact

Despite mouse-tracking has been taken as a real-time window into different aspects of human decision-making processes, currently little is known about how the decision process unfolds in veridical and false memory retrieval. Here, we directly investigated these processes by predicting participants’ performance in a mouse-tracking version of a typical Deese–Roediger–McDermott (DRM) task through distributional semantic models, a usage-based approach to meaning. Participants were required to study lists of associated words and then to perform a recognition task with the mouse. Results showed that mouse trajectories were extensively affected by the semantic similarity between the words presented in the recognition phase and the ones previously studied. In particular, the higher the semantic similarity, the larger the conflict driving the choice and the higher the irregularity in the trajectory when correctly rejecting new words (i.e., the false memory items). Conversely, on the temporal evolution of the decision, our results showed that semantic similarity affects more complex temporal measures indexing the online decision processes subserving task performance. Together, these findings demonstrate that semantic similarity can affect human behavior at the level of motor control, testifying its influence on online decision-making processes. More generally, our findings complement previous seminal theories on false memory and provide insights on the impact of the semantic memory structure on different decision-making components.

Download Full-text

Avoiding Conflict: When Speaker Coordination Does Not Require Conceptual Agreement

Frontiers in Artificial Intelligence ◽

10.3389/frai.2020.523920 ◽

2021 ◽

Vol 3 ◽

Author(s):

Alexandre Kabbach ◽

Aurélie Herbelot

Keyword(s):

Ad Hoc ◽

Semantic Space ◽

Social Contexts ◽

Linguistic Community ◽

Dense Matrix ◽

Lexical Representations ◽

Successful Communication ◽

Distributional Information ◽

Distributional Semantic Models ◽

Value Decomposition

In this paper we discuss the socialization hypothesis—the idea that speakers of the same (linguistic) community should share similar concepts given that they are exposed to similar environments and operate in highly-coordinated social contexts—and challenge the fact that it is assumed to constitute a prerequisite to successful communication. We do so using distributional semantic models of meaning (DSMs) which create lexical representations via latent aggregation of co-occurrence information between words and contexts. We argue that DSMs constitute particularly adequate tools for exploring the socialization hypothesis given that 1) they provide full control over the notion of background environment, formally characterized as the training corpus from which distributional information is aggregated; and 2) their geometric structure allows for exploiting alignment-based similarity metrics to measure inter-subject alignment over an entire semantic space, rather than a set of limited entries. We propose to model coordination between two different DSMs trained on two distinct corpora as dimensionality selection over a dense matrix obtained via Singular Value Decomposition This approximates an ad-hoc coordination scenario between two speakers as the attempt to align their similarity ratings on a set of word pairs. Our results underline the specific way in which linguistic information is spread across singular vectors, and highlight the need to distinguish agreement from mere compatibility in alignment-based notions of conceptual similarity. Indeed, we show that compatibility emerges from idiosyncrasy so that the unique and distinctive aspects of speakers’ background experiences can actually facilitate—rather than impede—coordination and communication between them. We conclude that the socialization hypothesis may constitute an unnecessary prerequisite to successful communication and that, all things considered, communication is probably best formalized as the cooperative act of avoiding conflict, rather than maximizing agreement.

Download Full-text

Decoding Brain Activity Associated with Literal and Metaphoric Sentence Comprehension Using Distributional Semantic Models

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00307 ◽

2020 ◽

Vol 8 ◽

pp. 231-246

Author(s):

Vesna G. Djokic ◽

Jean Maillard ◽

Luana Bulat ◽

Ekaterina Shutova

Keyword(s):

Language Processing ◽

Sentence Comprehension ◽

Brain Activity ◽

Activity Patterns ◽

Word Embeddings ◽

Semantic Models ◽

Distributional Semantic Models ◽

Metaphor Processing ◽

Meaning Representation ◽

The Brain

Recent years have seen a growing interest within the natural language processing (NLP) community in evaluating the ability of semantic models to capture human meaning representation in the brain. Existing research has mainly focused on applying semantic models to decode brain activity patterns associated with the meaning of individual words, and, more recently, this approach has been extended to sentences and larger text fragments. Our work is the first to investigate metaphor processing in the brain in this context. We evaluate a range of semantic models (word embeddings, compositional, and visual models) in their ability to decode brain activity associated with reading of both literal and metaphoric sentences. Our results suggest that compositional models and word embeddings are able to capture differences in the processing of literal and metaphoric sentences, providing support for the idea that the literal meaning is not fully accessible during familiar metaphor comprehension.

Download Full-text

Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets

International Journal of Cognitive Informatics and Natural Intelligence ◽

10.4018/ijcini.2020040103 ◽

2020 ◽

Vol 14 (2) ◽

pp. 35-60

Author(s):

Mudasir Mohd ◽

Rafiya Jan ◽

Nida Hakak

Keyword(s):

Opinion Mining ◽

Word Sense Disambiguation ◽

Automatic Annotation ◽

Word Sense ◽

The Novel ◽

Training Set ◽

Annotation Process ◽

Sense Disambiguation ◽

Supervised Learning Algorithms ◽

Distributional Semantic Models

Annotations are critical in various text mining tasks such as opinion mining, sentiment analysis, word sense disambiguation. Supervised learning algorithms start with the training of the classifier and require manually annotated datasets. However, manual annotations are often subjective, biased, onerous, and burdensome to develop; therefore, there is a need for automatic annotation. Automatic annotators automatically annotate the data for creating the training set for the supervised classifier, but lack subjectivity and ignore semantics of underlying textual structures. The objective of this research is to develop scalable and semantically rich automatic annotation system while incorporating domain dependent characteristics of the annotation process. The authors devised an enhanced bootstrapping algorithm for the automatic annotation of Tweets and employed distributional semantic models (LSA and Word2Vec) to augment the novel Bootstrapping algorithm and tested the proposed algorithm on the 12,000 crowd-sourced annotated Tweets and achieved a 68.56% accuracy which is higher than the baseline accuracy.

Download Full-text

Exploring patterns of stability and change in caregivers’ word usage across early childhood

10.31234/osf.io/fym86 ◽

2020 ◽

Author(s):

Hang Jiang ◽

Michael C. Frank ◽

Vivek Kulkarni ◽

Abdellah Fourtassi

Keyword(s):

Early Childhood ◽

Language Change ◽

Older Children ◽

Word Use ◽

Developmental Context ◽

New Concepts ◽

The World ◽

Linguistic Input ◽

The Individual ◽

Distributional Semantic Models

The linguistic input children receive across early childhood plays a crucial role in shaping their knowledge about the world. To study this input, researchers have begun applying distributional semantic models to large corpora of child-directed speech, extracting various patterns of word use/co-occurrence. Previous work using these models has not measured how these patterns may change over the course of development. In this work, we leverage NLP methods that were originally developed to study historical language change to compare caregivers' use of words when talking to younger and older children. Some words' usage changed more than others'; this variability could be predicted based on the word's properties at both the individual and category level. These findings suggest that the patterns of word use may be tuned to children's developmental context, perhaps scaffolding the acquisition of new concepts and skills.

Download Full-text

Emotion Mining Using Semantic Similarity

Natural Language Processing ◽

10.4018/978-1-7998-0951-7.ch053 ◽

2020 ◽

pp. 1115-1138 ◽

Cited By ~ 2

Author(s):

Rafiya Jan ◽

Afaq Alam Khan

Keyword(s):

Semantic Similarity ◽

Semantic Relatedness ◽

Semantic Features ◽

Emotion Detection ◽

Emotion Classification ◽

Textual Data ◽

Distributional Semantic Models ◽

Affective Information ◽

Emotion Mining

Social networks are considered as the most abundant sources of affective information for sentiment and emotion classification. Emotion classification is the challenging task of classifying emotions into different types. Emotions being universal, the automatic exploration of emotion is considered as a difficult task to perform. A lot of the research is being conducted in the field of automatic emotion detection in textual data streams. However, very little attention is paid towards capturing semantic features of the text. In this article, the authors present the technique of semantic relatedness for automatic classification of emotion in the text using distributional semantic models. This approach uses semantic similarity for measuring the coherence between the two emotionally related entities. Before classification, data is pre-processed to remove the irrelevant fields and inconsistencies and to improve the performance. The proposed approach achieved the accuracy of 71.795%, which is competitive considering as no training or annotation of data is done.

Download Full-text

WORD2VEC NOT DEAD: PREDICTING HYPERNYMS OF CO-HYPONYMS IS BETTER THAN READING DEFINITIONS

Computational Linguistics and Intellectual Technologies ◽

10.28995/2075-7182-2020-19-13-32 ◽

2020 ◽

Author(s):

N. V. Arefyev ◽

◽

M. V. Fedoseev ◽

A. V. Kabanov ◽

V. S. Zizov ◽

...

Keyword(s):

Language Models ◽

Shared Task ◽

Huge Amount ◽

Lexical Resources ◽

Human Labour ◽

Distributional Semantic Models ◽

The Cost ◽

Low Coverage ◽

Future Work ◽

Better Than

Expert-built lexical resources are known to provide information of good quality for the cost of low coverage. This property limits their applicability in modern NLP applications. Building descriptions of lexical-semantic relations manually in sufficient volume requires a huge amount of qualified human labour. However, given some initial version of a taxonomy is already built, automatic or semi-automatic taxonomy enrichment systems can greatly reduce the required efforts. We propose and experiment with two approaches to taxonomy enrichment, one utilizing information from word definitions and another from word usages, and also a combination of them. The first method retrieves co-hyponyms for the target word from distributional semantic models (word2vec) or language models (XLM-R), then looks for hypernyms of co-hyponyms in the taxonomy. The second method tries to extract hypernyms directly from Wiktionary definitions. The proposed methods were evaluated on the Dialogue-2020 shared task on taxonomy enrichment. We found that predicting hypernyms of cohyponyms achieves better results in this task. The combination of both methods improves results further and is among 3 best-performing systems for verbs. An important part of the work is detailed qualitative and error analysis of the proposed methods, which provide interesting observations of their behaviour and ideas for the future work.

Download Full-text

distributional semantic models
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

A Word Selection Method for Producing Interpretable Distributional Semantic Word Vectors

ViSpa (Vision Spaces): A computer-vision-based representation system for individual images and concept prototypes, with large-scale evaluation

Semantic Memory Search and Retrieval in a Novel Cooperative Word Game: A Comparison of Associative and Distributional Semantic Models

Hands-on false memories: A combined study with distributional semantics and mouse-tracking

Avoiding Conflict: When Speaker Coordination Does Not Require Conceptual Agreement

Decoding Brain Activity Associated with Literal and Metaphoric Sentence Comprehension Using Distributional Semantic Models

Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets

Exploring patterns of stability and change in caregivers’ word usage across early childhood

Emotion Mining Using Semantic Similarity

WORD2VEC NOT DEAD: PREDICTING HYPERNYMS OF CO-HYPONYMS IS BETTER THAN READING DEFINITIONS

Export Citation Format

distributional semantic modelsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

A Word Selection Method for Producing Interpretable Distributional Semantic Word Vectors

ViSpa (Vision Spaces): A computer-vision-based representation system for individual images and concept prototypes, with large-scale evaluation

Semantic Memory Search and Retrieval in a Novel Cooperative Word Game: A Comparison of Associative and Distributional Semantic Models

Hands-on false memories: A combined study with distributional semantics and mouse-tracking

Avoiding Conflict: When Speaker Coordination Does Not Require Conceptual Agreement

Decoding Brain Activity Associated with Literal and Metaphoric Sentence Comprehension Using Distributional Semantic Models

Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets

Exploring patterns of stability and change in caregivers’ word usage across early childhood

Emotion Mining Using Semantic Similarity

WORD2VEC NOT DEAD: PREDICTING HYPERNYMS OF CO-HYPONYMS IS BETTER THAN READING DEFINITIONS

distributional semantic models
Recently Published Documents