Link Prediction Between Structured Geopolitical Events: Models and Experiments

Frontiers in Big Data ◽

10.3389/fdata.2021.779792 ◽

2021 ◽

Vol 4 ◽

Author(s):

Mayank Kejriwal

Keyword(s):

Learning Community ◽

Language Processing ◽

Link Prediction ◽

Representation Learning ◽

Research Community ◽

Computational Sciences ◽

Input Event ◽

Global Terrorism ◽

Open Question ◽

Global Terrorism Database

Often thought of as higher-order entities, events have recently become important subjects of research in the computational sciences, including within complex systems and natural language processing (NLP). One such application is event link prediction. Given an input event, event link prediction is the problem of retrieving a relevant set of events, similar to the problem of retrieving relevant documents on the Web in response to keyword queries. Since geopolitical events have complex semantics, it is an open question as to how to best model and represent events within the framework of event link prediction. In this paper, we formalize the problem and discuss how established representation learning algorithms from the machine learning community could potentially be applied to it. We then conduct a detailed empirical study on the Global Terrorism Database (GTD) using a set of metrics inspired by the information retrieval community. Our results show that, while there is considerable signal in both network-theoretic and text-centric models of the problem, classic text-only models such as bag-of-words prove surprisingly difficult to outperform. Our results establish both a baseline for event link prediction on GTD, and currently outstanding challenges for the research community to tackle in this space.

Download Full-text

Feature Extraction and Representation of Urban Road Networks Based on Travel Routes

Sustainability ◽

10.3390/su12229621 ◽

2020 ◽

Vol 12 (22) ◽

pp. 9621

Author(s):

Shichen Huang ◽

Chunfu Shao ◽

Juan Li ◽

Xiong Yang ◽

Xiaoyu Zhang ◽

...

Keyword(s):

Language Processing ◽

Traffic Safety ◽

Link Prediction ◽

Road Network ◽

Spatial Information ◽

Representation Learning ◽

Similarity Analysis ◽

Safety Planning ◽

Convolutional Network ◽

The Road

Extraction of traffic features constitutes a key research direction in traffic safety planning. In previous traffic tasks, road network features are extracted manually. In contrast, Network Representation Learning aims to automatically learn low-dimensional node representations. Enlightened by feature learning in Natural Language Processing, representation learning of urban nodes is studied as a supervised task in this paper. Following this line of thinking, a deep learning framework, called StreetNode2VEC, is proposed for learning feature representations for nodes in the road network based on travel routes, and then model parameter calibration is performed. We explain the effectiveness of features from visualization, similarity analysis, and link prediction. In visualization, the features of nodes naturally present a clustered pattern, and different clusters correspond to different regions in the road network. Meanwhile, the features of nodes still retain their spatial information in similarity analysis. The proposed method StreetNode2VEC obtains a AUC score of 0.813 in link prediction, which is greater than that obtained from Graph Convolutional Network (GCN) and Node2vec. This suggests that the features of nodes can be used to effectively and credibly predict whether a link should be established between two nodes. Overall, our work provides a new way of representing road nodes in the road network, which have potential in the traffic safety planning field.

Download Full-text

Suggestions for the Introduction of Korean-style Terrorism Database : Focusing on the Analysis of Global Terrorism Database

Korea Criminal Intelligence Review ◽

10.33563/kscia.2020.6.1.10 ◽

2020 ◽

Vol 6 (1) ◽

pp. 187-208

Author(s):

Seok Woo Jang ◽

Keyword(s):

Global Terrorism ◽

Global Terrorism Database

Download Full-text

W-MMP2Vec: Topic-driven network embedding model for link prediction in content-based heterogeneous information network

Intelligent Data Analysis ◽

10.3233/ida-205168 ◽

2021 ◽

Vol 25 (3) ◽

pp. 711-738

Author(s):

Phu Pham ◽

Phuc Do

Keyword(s):

Link Prediction ◽

Representation Learning ◽

Information Network ◽

Network Embedding ◽

Heterogeneous Information Network ◽

Heterogeneous Information ◽

Learning Framework ◽

Novel Approach ◽

Proposed Model ◽

Meta Path

Link prediction on heterogeneous information network (HIN) is considered as a challenge problem due to the complexity and diversity in types of nodes and links. Currently, there are remained challenges of meta-path-based link prediction in HIN. Previous works of link prediction in HIN via network embedding approach are mainly focused on exploiting features of node rather than existing relations in forms of meta-paths between nodes. In fact, predicting the existence of new links between non-linked nodes is absolutely inconvincible. Moreover, recent HIN-based embedding models also lack of thorough evaluations on the topic similarity between text-based nodes along given meta-paths. To tackle these challenges, in this paper, we proposed a novel approach of topic-driven multiple meta-path-based HIN representation learning framework, namely W-MMP2Vec. Our model leverages the quality of node representations by combining multiple meta-paths as well as calculating the topic similarity weight for each meta-path during the processes of network embedding learning in content-based HINs. To validate our approach, we apply W-TMP2Vec model in solving several link prediction tasks in both content-based and non-content-based HINs (DBLP, IMDB and BlogCatalog). The experimental outputs demonstrate the effectiveness of proposed model which outperforms recent state-of-the-art HIN representation learning models.

Download Full-text

Shall I Work with Them? A Knowledge Graph-Based Approach for Predicting Future Research Collaborations

Entropy ◽

10.3390/e23060664 ◽

2021 ◽

Vol 23 (6) ◽

pp. 664

Author(s):

Nikos Kanakaris ◽

Nikolaos Giarelis ◽

Ilias Siachos ◽

Nikos Karacapilidis

Keyword(s):

Language Processing ◽

Scientific Knowledge ◽

Link Prediction ◽

Performance Metrics ◽

Future Research ◽

Knowledge Graph ◽

Prediction Problem ◽

Textual Information ◽

Research Collaborations ◽

Processing Techniques

We consider the prediction of future research collaborations as a link prediction problem applied on a scientific knowledge graph. To the best of our knowledge, this is the first work on the prediction of future research collaborations that combines structural and textual information of a scientific knowledge graph through a purposeful integration of graph algorithms and natural language processing techniques. Our work: (i) investigates whether the integration of unstructured textual data into a single knowledge graph affects the performance of a link prediction model, (ii) studies the effect of previously proposed graph kernels based approaches on the performance of an ML model, as far as the link prediction problem is concerned, and (iii) proposes a three-phase pipeline that enables the exploitation of structural and textual information, as well as of pre-trained word embeddings. We benchmark the proposed approach against classical link prediction algorithms using accuracy, recall, and precision as our performance metrics. Finally, we empirically test our approach through various feature combinations with respect to the link prediction problem. Our experimentations with the new COVID-19 Open Research Dataset demonstrate a significant improvement of the abovementioned performance metrics in the prediction of future research collaborations.

Download Full-text

BioVerbNet: a large semantic-syntactic classification of verbs in biomedicine

Journal of Biomedical Semantics ◽

10.1186/s13326-021-00247-z ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Olga Majewska ◽

Charlotte Collins ◽

Simon Baker ◽

Jari Björne ◽

Susan Windisch Brown ◽

...

Keyword(s):

Natural Language ◽

Language Processing ◽

Model Performance ◽

Representation Learning ◽

Verb Classes ◽

Expert Annotation ◽

Biomedical Texts ◽

Time Required ◽

Verb Meaning

Abstract Background Recent advances in representation learning have enabled large strides in natural language understanding; However, verbal reasoning remains a challenge for state-of-the-art systems. External sources of structured, expert-curated verb-related knowledge have been shown to boost model performance in different Natural Language Processing (NLP) tasks where accurate handling of verb meaning and behaviour is critical. The costliness and time required for manual lexicon construction has been a major obstacle to porting the benefits of such resources to NLP in specialised domains, such as biomedicine. To address this issue, we combine a neural classification method with expert annotation to create BioVerbNet. This new resource comprises 693 verbs assigned to 22 top-level and 117 fine-grained semantic-syntactic verb classes. We make this resource available complete with semantic roles and VerbNet-style syntactic frames. Results We demonstrate the utility of the new resource in boosting model performance in document- and sentence-level classification in biomedicine. We apply an established retrofitting method to harness the verb class membership knowledge from BioVerbNet and transform a pretrained word embedding space by pulling together verbs belonging to the same semantic-syntactic class. The BioVerbNet knowledge-aware embeddings surpass the non-specialised baseline by a significant margin on both tasks. Conclusion This work introduces the first large, annotated semantic-syntactic classification of biomedical verbs, providing a detailed account of the annotation process, the key differences in verb behaviour between the general and biomedical domain, and the design choices made to accurately capture the meaning and properties of verbs used in biomedical texts. The demonstrated benefits of leveraging BioVerbNet in text classification suggest the resource could help systems better tackle challenging NLP tasks in biomedicine.

Download Full-text

A node representation learning approach for link prediction in social networks using game theory and K-core decomposition

The European Physical Journal B ◽

10.1140/epjb/e2019-100225-8 ◽

2019 ◽

Vol 92 (10) ◽

Cited By ~ 2

Author(s):

Elaheh Nasiri ◽

Asgarali Bouyer ◽

Esmaeil Nourani

Keyword(s):

Game Theory ◽

Social Networks ◽

Link Prediction ◽

Representation Learning ◽

Learning Approach

Download Full-text

RLPath: a knowledge graph link prediction method using reinforcement learning based attentive relation path searching and representation learning

Applied Intelligence ◽

10.1007/s10489-021-02672-0 ◽

2021 ◽

Author(s):

Ling Chen ◽

Jun Cui ◽

Xing Tang ◽

Yuntao Qian ◽

Yansheng Li ◽

...

Keyword(s):

Reinforcement Learning ◽

Link Prediction ◽

Prediction Method ◽

Representation Learning ◽

Knowledge Graph ◽

Graph Link

Download Full-text

Automated Source Code Generation and Auto-Completion Using Deep Learning: Comparing and Discussing Current Language Model-Related Approaches

AI ◽

10.3390/ai2010001 ◽

2021 ◽

Vol 2 (1) ◽

pp. 1-16

Author(s):

Juan Cruz-Benito ◽

Sanjay Vishwakarma ◽

Francisco Martin-Fernandez ◽

Ismael Faro

Keyword(s):

Deep Learning ◽

Learning Community ◽

Programming Languages ◽

Language Processing ◽

Code Generation ◽

Language Model ◽

Language Models ◽

Stochastic Gradient Descent ◽

Network Architectures ◽

Learning Architectures

In recent years, the use of deep learning in language models has gained much attention. Some research projects claim that they can generate text that can be interpreted as human writing, enabling new possibilities in many application areas. Among the different areas related to language processing, one of the most notable in applying this type of modeling is programming languages. For years, the machine learning community has been researching this software engineering area, pursuing goals like applying different approaches to auto-complete, generate, fix, or evaluate code programmed by humans. Considering the increasing popularity of the deep learning-enabled language models approach, we found a lack of empirical papers that compare different deep learning architectures to create and use language models based on programming code. This paper compares different neural network architectures like Average Stochastic Gradient Descent (ASGD) Weight-Dropped LSTMs (AWD-LSTMs), AWD-Quasi-Recurrent Neural Networks (QRNNs), and Transformer while using transfer learning and different forms of tokenization to see how they behave in building language models using a Python dataset for code generation and filling mask tasks. Considering the results, we discuss each approach’s different strengths and weaknesses and what gaps we found to evaluate the language models or to apply them in a real programming context.

Download Full-text

Lattice-Based Technique to Visualize and Compare Regional Terrorism Using the Global Terrorism Database

10.1109/idaacs53288.2021.9660877 ◽

2021 ◽

Author(s):

Linda Markowsky ◽

George Markowsky

Keyword(s):

Global Terrorism ◽

Global Terrorism Database

Download Full-text

Incrementality and intention-recognition in utterance processing

Dialogue & Discourse ◽

10.5087/dad.2011.109 ◽

2011 ◽

Vol 2 (1) ◽

pp. 199-233 ◽

Cited By ~ 25

Author(s):

Eleni Gregoromichelaki ◽

Ruth Kempson ◽

Matthew Purver ◽

Gregory J. Mills ◽

Ronnie Cann ◽

...

Keyword(s):

Predictive Model ◽

Language Processing ◽

Mental States ◽

Higher Order ◽

Low Level ◽

Intention Recognition ◽

Dialogue Modelling ◽

Open Question ◽

Dynamic Syntax ◽

Do So

Ever since dialogue modelling first developed relative to broadly Gricean assumptions about utter-ance interpretation (Clark, 1996), it has remained an open question whether the full complexity of higher-order intention computation is made use of in everyday conversation. In this paper we examine the phenomenon of split utterances, from the perspective of Dynamic Syntax, to further probe the necessity of full intention recognition/formation in communication: we do so by exploring the extent to which the interactive coordination of dialogue exchange can be seen as emergent from low-level mechanisms of language processing, without needing representation by interlocutors of each other’s mental states, or fully developed intentions as regards messages to be conveyed. We thus illustrate how many dialogue phenomena can be seen as direct consequences of the grammar architecture, as long as this is presented within an incremental, goal-directed/predictive model.

Download Full-text