Local Context Discrimination in Signature Neural Networks

Author(s):  
Roberto Latorre ◽  
Francisco B. Rodríguez ◽  
Pablo Varona


Author(s):  
Zemin Liu ◽  
Yuan Fang ◽  
Chenghao Liu ◽  
Steven C.H. Hoi

Graph neural networks (GNNs) have emerged as a powerful family of representation learning models on graphs. To derive node representations, they rely on a global model that recursively aggregates information from neighboring nodes. However, different nodes reside in different parts of the graph, in different local contexts, so their distributions vary across the graph. Ideally, how a node receives its neighborhood information should be a function of its local context, rather than being dictated by a single global GNN model shared by all nodes. To exploit node locality without overfitting, we propose a node-wise localization of GNNs that accounts for both the global and local aspects of the graph. Globally, all nodes on the graph depend on an underlying global GNN to encode the general patterns across the graph; locally, each node is localized into a unique model as a function of the global model and its local context. Finally, we conduct extensive experiments on four benchmark graphs and consistently obtain promising performance surpassing state-of-the-art GNNs.
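
As an illustration of the node-wise localization idea, here is a minimal sketch in which a shared global GNN transformation is modulated by node-specific scale and shift parameters computed from each node's aggregated neighborhood. The mean-style aggregator and the FiLM-like modulation are illustrative assumptions, not necessarily the authors' exact formulation.

```python
# Minimal sketch of a node-wise localized GNN layer (assumed details:
# the mean-neighbor aggregator and FiLM-style scale/shift modulation are
# illustrative choices, not necessarily the paper's exact formulation).
import torch
import torch.nn as nn


class LocalizedGNNLayer(nn.Module):
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.global_weight = nn.Linear(in_dim, out_dim)  # global model shared by all nodes
        self.scale = nn.Linear(in_dim, out_dim)          # node-wise scaling from local context
        self.shift = nn.Linear(in_dim, out_dim)          # node-wise shifting from local context

    def forward(self, x, adj):
        # x: (num_nodes, in_dim), adj: (num_nodes, num_nodes), row-normalized
        neigh = adj @ x                                  # aggregate neighbor features
        h_global = self.global_weight(neigh)             # shared global transformation
        # the local context (here: the aggregated neighborhood) modulates the output
        gamma = torch.sigmoid(self.scale(neigh))
        beta = self.shift(neigh)
        return torch.relu(gamma * h_global + beta)


# Usage: 5 nodes, 8-dim features, dense row-normalized adjacency
x = torch.randn(5, 8)
adj = torch.softmax(torch.randn(5, 5), dim=-1)
out = LocalizedGNNLayer(8, 16)(x, adj)  # (5, 16)
```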


2018 ◽  
Vol 6 ◽  
pp. 687-702 ◽  
Author(s):  
Wenpeng Yin ◽  
Hinrich Schütze

In NLP, convolutional neural networks (CNNs) have benefited less than recurrent neural networks (RNNs) from attention mechanisms. We hypothesize that this is because attention in CNNs has mainly been implemented as attentive pooling (i.e., it is applied to pooling) rather than as attentive convolution (i.e., it is integrated into convolution). Convolution is the differentiator of CNNs in that it can powerfully model the higher-level representation of a word by taking into account its local fixed-size context in the input text t_x. In this work, we propose an attentive convolution network, ATTCONV. It extends the context scope of the convolution operation, deriving higher-level features for a word not only from the local context, but also from information extracted from nonlocal context by the attention mechanism commonly used in RNNs. This nonlocal context can come (i) from parts of the input text t_x that are distant or (ii) from extra (i.e., external) contexts t_y. Experiments on sentence modeling with zero-context (sentiment analysis), single-context (textual entailment), and multiple-context (claim verification) settings demonstrate the effectiveness of ATTCONV in sentence representation learning with the incorporation of context. In particular, attentive convolution outperforms attentive pooling and is a strong competitor to popular attentive RNNs.
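
As a rough sketch of attentive convolution under simplifying assumptions: each position in t_x attends over a context t_y (an external text, or distant parts of t_x), and the attended vector is combined with a standard convolution over the local window. The additive combination and dimensions below are illustrative choices, not the exact ATTCONV formulation.

```python
# Minimal sketch of attentive convolution: each position of t_x attends over
# a (possibly external) context t_y, and the attended vector enters the
# convolution output alongside the local window. Illustrative assumptions only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class AttentiveConv(nn.Module):
    def __init__(self, dim, window=3):
        super().__init__()
        self.local_conv = nn.Conv1d(dim, dim, window, padding=window // 2)
        self.context_proj = nn.Linear(dim, dim)

    def forward(self, tx, ty):
        # tx: (batch, len_x, dim), ty: (batch, len_y, dim)
        scores = torch.bmm(tx, ty.transpose(1, 2))            # (batch, len_x, len_y)
        attn = F.softmax(scores, dim=-1)
        context = torch.bmm(attn, ty)                         # attended nonlocal context
        local = self.local_conv(tx.transpose(1, 2)).transpose(1, 2)
        return torch.tanh(local + self.context_proj(context))


# Usage: a 10-word sentence attends over a 7-word external context
tx, ty = torch.randn(2, 10, 32), torch.randn(2, 7, 32)
out = AttentiveConv(32)(tx, ty)  # (2, 10, 32)
```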


Author(s):  
Hui Chen ◽  
Zijia Lin ◽  
Guiguang Ding ◽  
Jianguang Lou ◽  
Yusen Zhang ◽  
...  

The dominant approaches for named entity recognition (NER) mostly adopt complex recurrent neural networks (RNNs), e.g., long short-term memory (LSTM). However, RNNs are limited by their recurrent nature in terms of computational efficiency. In contrast, convolutional neural networks (CNNs) can fully exploit GPU parallelism with their feedforward architectures. However, little attention has been paid to performing NER with CNNs, mainly owing to their difficulty in capturing long-term context information in a sequence. In this paper, we propose a simple but effective CNN-based network for NER, the gated relation network (GRN), which is more capable than common CNNs of capturing long-term context. Specifically, in GRN we first employ CNNs to extract the local context features of each word. We then model the relations between words and use them as gates to fuse local context features into global ones for predicting labels. Without recurrent layers that process a sentence sequentially, GRN allows computations to be performed in parallel across the entire sentence. Experiments on two benchmark NER datasets (CoNLL-2003 and OntoNotes 5.0) show that the proposed GRN achieves state-of-the-art performance with or without external knowledge. It also enjoys lower training and testing time costs.
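
The sketch below illustrates the gated-relation idea under stated assumptions: a CNN produces local context features per word, pairwise relation scores between words act as gates, and the gated features are fused into a global per-word representation. The specific gating form and normalization are assumptions for illustration, not the published GRN architecture.

```python
# Minimal sketch of a gated relation layer: CNN local features per word,
# pairwise relation scores used as gates to fuse local features into a
# global, per-word representation. Illustrative assumptions throughout.
import torch
import torch.nn as nn


class GatedRelationLayer(nn.Module):
    def __init__(self, dim, window=3):
        super().__init__()
        self.local_cnn = nn.Conv1d(dim, dim, window, padding=window // 2)
        self.relation = nn.Linear(2 * dim, 1)

    def forward(self, x):
        # x: (batch, seq_len, dim)
        local = torch.relu(self.local_cnn(x.transpose(1, 2)).transpose(1, 2))
        n = local.size(1)
        # pairwise relation scores between all word positions
        a = local.unsqueeze(2).expand(-1, n, n, -1)
        b = local.unsqueeze(1).expand(-1, n, n, -1)
        gates = torch.sigmoid(self.relation(torch.cat([a, b], dim=-1)))  # (batch, n, n, 1)
        gates = gates / gates.sum(dim=2, keepdim=True)                   # normalize over words
        # fuse gated local features into a global representation per word
        return (gates * b).sum(dim=2)                                    # (batch, n, dim)


# Usage: batch of 2 sentences, 6 words, 32-dim embeddings
out = GatedRelationLayer(32)(torch.randn(2, 6, 32))  # (2, 6, 32)
```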


Author(s):  
Mengge Xue ◽  
Weiming Cai ◽  
Jinsong Su ◽  
Linfeng Song ◽  
Yubin Ge ◽  
...  

Benefiting from the excellent ability of neural networks to learn semantic representations, existing studies on entity linking (EL) have resorted to neural networks to exploit both the local mention-to-entity compatibility and the global interdependence between different EL decisions for target entity disambiguation. However, most neural collective EL methods depend entirely on neural networks to automatically model the semantic dependencies between different EL decisions, and thus lack guidance from external knowledge. In this paper, we propose a novel end-to-end neural network with recurrent random-walk layers for collective EL, which introduces external knowledge to model the semantic interdependence between different EL decisions. Specifically, we first establish a model based on local context features, and then stack random-walk layers to reinforce the evidence for related EL decisions into high-probability decisions, where the semantic interdependence between candidate entities is mainly induced from an external knowledge base. Finally, a semantic regularizer that preserves the consistency of collective EL decisions is incorporated into the conventional objective function, so that the external knowledge base can be fully exploited in collective EL decisions. Experimental results and in-depth analysis on various datasets show that our model achieves better performance than other state-of-the-art models. Our code and data are released at https://github.com/DeepLearnXMU/RRWEL.
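
As a minimal sketch of stacking random-walk layers on top of local EL scores, assume a row-normalized entity-to-entity transition matrix derived from a knowledge base and a random-walk-with-restart style update; the propagation coefficient and number of layers are illustrative, not the paper's exact design.

```python
# Minimal sketch of random-walk layers over local entity-linking scores:
# a KB-derived transition matrix repeatedly propagates evidence among
# candidate entities, with a restart on the local scores. Illustrative only.
import torch


def random_walk_layers(local_scores, transition, num_layers=3, lam=0.7):
    # local_scores: (num_candidates,) probabilities from the local model
    # transition:   (num_candidates, num_candidates), row-normalized KB relatedness
    scores = local_scores
    for _ in range(num_layers):
        scores = lam * transition.t() @ scores + (1 - lam) * local_scores
        scores = scores / scores.sum()  # keep a proper probability distribution
    return scores


# Usage: 4 candidate entities for a mention
local = torch.tensor([0.4, 0.3, 0.2, 0.1])
trans = torch.softmax(torch.randn(4, 4), dim=-1)
print(random_walk_layers(local, trans))
```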


Author(s):  
Glen E. Bodner ◽  
Rehman Mulji

Left/right “fixed” responses to arrow targets are influenced by whether a masked arrow prime is congruent or incongruent with the required target response. Left/right “free-choice” responses on trials with ambiguous targets that are mixed among fixed trials are also influenced by masked arrow primes. We show that the magnitude of masked priming of both fixed and free-choice responses is greater when the proportion of fixed trials with congruent primes is .8 rather than .2. Unconscious manipulation of context can thus influence both fixed and free choices. Sequential trial analyses revealed that these effects of the overall prime context on fixed and free-choice priming can be modulated by the local context (i.e., the nature of the previous trial). Our results support accounts of masked priming that posit a memory-recruitment, activation, or decision process that is sensitive to aspects of both the local and global context.

