Multimodal and Crossmodal Representation Learning from Textual and Visual Features with Bidirectional Deep Neural Networks for Video Hyperlinking

For nonconvex optimization in machine learning, this article proves that every local minimum achieves the globally optimal value of the perturbable gradient basis model at any differentiable point. As a result, nonconvex machine learning is theoretically as supported as convex machine learning with a handcrafted basis in terms of the loss at differentiable local minima, except in the case when a preference is given to the handcrafted basis over the perturbable gradient basis. The proofs of these results are derived under mild assumptions. Accordingly, the proven results are directly applicable to many machine learning models, including practical deep neural networks, without any modification of practical methods. Furthermore, as special cases of our general results, this article improves or complements several state-of-the-art theoretical results on deep neural networks, deep residual networks, and overparameterized deep neural networks with a unified proof technique and novel geometric insights. A special case of our results also contributes to the theoretical foundation of representation learning.

Download Full-text

Molecular Graph Contrastive Learning with Parameterized Explainable Augmentations

10.1101/2021.12.03.471150 ◽

2021 ◽

Author(s):

Yingheng Wang ◽

Yaosen Min ◽

Erzhuo Shao ◽

Ji Wu

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Structural Information ◽

Molecular Graph ◽

Representation Learning ◽

Graph Representation ◽

Input Graph ◽

Recent Success ◽

Real World Datasets ◽

Comparative Results

ABSTRACTLearning generalizable, transferable, and robust representations for molecule data has always been a challenge. The recent success of contrastive learning (CL) for self-supervised graph representation learning provides a novel perspective to learn molecule representations. The most prevailing graph CL framework is to maximize the agreement of representations in different augmented graph views. However, existing graph CL frameworks usually adopt stochastic augmentations or schemes according to pre-defined rules on the input graph to obtain different graph views in various scales (e.g. node, edge, and subgraph), which may destroy topological semantemes and domain prior in molecule data, leading to suboptimal performance. Therefore, designing parameterized, learnable, and explainable augmentation is quite necessary for molecular graph contrastive learning. A well-designed parameterized augmentation scheme can preserve chemically meaningful structural information and intrinsically essential attributes for molecule graphs, which helps to learn representations that are insensitive to perturbation on unimportant atoms and bonds. In this paper, we propose a novel Molecular Graph Contrastive Learning with Parameterized Explainable Augmentations, MolCLE for brevity, that self-adaptively incorporates chemically significative information from both topological and semantic aspects of molecular graphs. Specifically, we apply deep neural networks to parameterize the augmentation process for both the molecular graph topology and atom attributes, to highlight contributive molecular substructures and recognize underlying chemical semantemes. Comprehensive experiments on a variety of real-world datasets demonstrate that our proposed method consistently outperforms compared baselines, which verifies the effectiveness of the proposed framework. Detailedly, our self-supervised MolCLE model surpasses many supervised counterparts, and meanwhile only uses hundreds of thousands of parameters to achieve comparative results against the state-of-the-art baseline, which has tens of millions of parameters. We also provide detailed case studies to validate the explainability of augmented graph views.CCS CONCEPTS• Mathematics of computing → Graph algorithms; • Applied computing → Bioinformatics; • Computing methodologies → Neural networks; Unsupervised learning.

Download Full-text

Learning Eligibility in Cancer Clinical Trials Using Deep Neural Networks

Applied Sciences ◽

10.3390/app8071206 ◽

2018 ◽

Vol 8 (7) ◽

pp. 1206 ◽

Cited By ~ 5

Author(s):

Aurelia Bustos ◽

Antonio Pertusa

Keyword(s):

Neural Networks ◽

Clinical Trials ◽

Deep Neural Networks ◽

Medical Knowledge ◽

Clinical Information ◽

Representation Learning ◽

Free Text ◽

Cancer Clinical Trials ◽

Word Embeddings ◽

New Treatments

Interventional cancer clinical trials are generally too restrictive, and some patients are often excluded on the basis of comorbidity, past or concomitant treatments, or the fact that they are over a certain age. The efficacy and safety of new treatments for patients with these characteristics are, therefore, not defined. In this work, we built a model to automatically predict whether short clinical statements were considered inclusion or exclusion criteria. We used protocols from cancer clinical trials that were available in public registries from the last 18 years to train word-embeddings, and we constructed a dataset of 6M short free-texts labeled as eligible or not eligible. A text classifier was trained using deep neural networks, with pre-trained word-embeddings as inputs, to predict whether or not short free-text statements describing clinical information were considered eligible. We additionally analyzed the semantic reasoning of the word-embedding representations obtained and were able to identify equivalent treatments for a type of tumor analogous with the drugs used to treat other tumors. We show that representation learning using deep neural networks can be successfully leveraged to extract the medical knowledge from clinical trial protocols for potentially assisting practitioners when prescribing treatments.

Download Full-text

Bidirectional Joint Representation Learning with Symmetrical Deep Neural Networks for Multimodal and Crossmodal Applications

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval - ICMR '16 ◽

10.1145/2911996.2912064 ◽

2016 ◽

Cited By ~ 12

Author(s):

Vedran Vukotić ◽

Christian Raymond ◽

Guillaume Gravier

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Representation Learning ◽

Joint Representation

Download Full-text

Visual features versus categories: Explaining object representations in primate IT and deep neural networks with weighted representational modeling

Journal of Vision ◽

10.1167/16.12.511 ◽

2016 ◽

Vol 16 (12) ◽

pp. 511

Author(s):

Kamila Jozwik ◽

Nikolaus Kriegeskorte ◽

Radoslaw Cichy ◽

Marieke Mur

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Visual Features ◽

Object Representations

Download Full-text

Feature Representation Learning in Deep Neural Networks

Automatic Speech Recognition - Signals and Communication Technology ◽

10.1007/978-1-4471-5779-3_9 ◽

2014 ◽

pp. 157-175 ◽

Cited By ~ 1

Author(s):

Dong Yu ◽

Li Deng

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Representation Learning ◽

Feature Representation

Download Full-text

Representation Learning Using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval

10.3115/v1/n15-1092 ◽

2015 ◽

Cited By ~ 76

Author(s):

Xiaodong Liu ◽

Jianfeng Gao ◽

Xiaodong He ◽

Li Deng ◽

Kevin Duh ◽

...

Keyword(s):

Neural Networks ◽

Information Retrieval ◽

Deep Neural Networks ◽

Representation Learning ◽

Semantic Classification

Download Full-text

Real time detection system of driver drowsiness based on representation learning using deep neural networks

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-169909 ◽

2019 ◽

Vol 36 (3) ◽

pp. 1977-1985 ◽

Cited By ~ 5

Author(s):

Vineetha Vijayan ◽

Elizabeth Sherly

Keyword(s):

Neural Networks ◽

Real Time ◽

Deep Neural Networks ◽

Detection System ◽

Representation Learning ◽

Driver Drowsiness ◽

Real Time Detection

Download Full-text

Designing and Implementing Conversational Intelligent Chat-bot Using Natural Language Processing

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit217351 ◽

2021 ◽

pp. 262-266

Author(s):

Asoke Nath ◽

Rupamita Sarkar ◽

Swastik Mitra ◽

Rohitaswa Pradhan

Keyword(s):

Neural Networks ◽

Natural Language ◽

Language Processing ◽

Deep Neural Networks ◽

Natural Language Understanding ◽

Natural Language Generation ◽

Representation Learning ◽

Language Understanding ◽

Language Generation ◽

User Input

In the early days of Artificial Intelligence, it was observed that tasks which humans consider ‘natural’ and ‘commonplace’, such as Natural Language Understanding, Natural Language Generation and Vision were the most difficult task to carry over to computers. Nevertheless, attempts to crack the proverbial NLP nut were made, initially with methods that fall under ‘Symbolic NLP’. One of the products of this era was ELIZA. At present the most promising forays into the world of NLP are provided by ‘Neural NLP’, which uses Representation Learning and Deep Neural networks to model, understand and generate natural language. In the present paper the authors tried to develop a Conversational Intelligent Chatbot, a program that can chat with a user about any conceivable topic, without having domain-specific knowledge programmed into it. This is a challenging task, as it involves both ‘Natural Language Understanding’ (the task of converting natural language user input into representations that a machine can understand) and subsequently ‘Natural Language Generation’ (the task of generating an appropriate response to the user input in natural language). Several approaches exist for building conversational chatbots. In the present paper, two models have been used and their performance has been compared and contrasted. The first model is purely generative and uses a Transformer-based architecture. The second model is retrieval-based, and uses Deep Neural Networks.

Download Full-text