Approximate Manifold Regularization: Scalable Algorithm and Generalization Analysis

Graph-based semi-supervised learning is one of the most popular and successful semi-supervised learning approaches. Unfortunately, it suffers from high time and space complexity, at least quadratic with the number of training samples. In this paper, we propose an efficient graph-based semi-supervised algorithm with a sound theoretical guarantee. The proposed method combines Nystrom subsampling and preconditioned conjugate gradient descent, substantially improving computational efficiency and reducing memory requirements. Extensive empirical results reveal that our method achieves the state-of-the-art performance in a short time even with limited computing resources.

Download Full-text

Regularized Instance Embedding for Deep Multi-Instance Learning

Applied Sciences ◽

10.3390/app10010064 ◽

2019 ◽

Vol 10 (1) ◽

pp. 64

Author(s):

Yi Lin ◽

Honggang Zhang

Keyword(s):

Neural Network ◽

Big Data ◽

Supervised Learning ◽

Regularization Method ◽

Gradient Descent ◽

State Of The Art ◽

Stochastic Gradient Descent ◽

Learning Framework ◽

Weakly Supervised ◽

The Cost

In the era of Big Data, multi-instance learning, as a weakly supervised learning framework, has various applications since it is helpful to reduce the cost of the data-labeling process. Due to this weakly supervised setting, learning effective instance representation/embedding is challenging. To address this issue, we propose an instance-embedding regularizer that can boost the performance of both instance- and bag-embedding learning in a unified fashion. Specifically, the crux of the instance-embedding regularizer is to maximize correlation between instance-embedding and underlying instance-label similarities. The embedding-learning framework was implemented using a neural network and optimized in an end-to-end manner using stochastic gradient descent. In experiments, various applications were studied, and the results show that the proposed instance-embedding-regularization method is highly effective, having state-of-the-art performance.

Download Full-text

Semi-supervised multi-label classification using an extended graph-based manifold regularization

Complex & Intelligent Systems ◽

10.1007/s40747-021-00611-7 ◽

2022 ◽

Author(s):

Ding Li ◽

Scott Dick

Keyword(s):

Supervised Learning ◽

State Of The Art ◽

Ground Truth ◽

Classification Algorithms ◽

Data Sets ◽

Manifold Regularization ◽

Regularization Algorithm ◽

Weighting Strategy ◽

Extended Graph ◽

General Vector

AbstractGraph-based algorithms are known to be effective approaches to semi-supervised learning. However, there has been relatively little work on extending these algorithms to the multi-label classification case. We derive an extension of the Manifold Regularization algorithm to multi-label classification, which is significantly simpler than the general Vector Manifold Regularization approach. We then augment our algorithm with a weighting strategy to allow differential influence on a model between instances having ground-truth vs. induced labels. Experiments on four benchmark multi-label data sets show that the resulting algorithm performs better overall compared to the existing semi-supervised multi-label classification algorithms at various levels of label sparsity. Comparisons with state-of-the-art supervised multi-label approaches (which of course are fully labeled) also show that our algorithm outperforms all of them even with a substantial number of unlabeled examples.

Download Full-text

Building Attention and Edge Convolution Neural Networks for Bioactivity and Physical-Chemical Property Prediction

10.26434/chemrxiv.9873599.v2 ◽

2019 ◽

Cited By ~ 1

Author(s):

Michael Withnall ◽

Edvard Lindelöf ◽

Ola Engkvist ◽

Hongming Chen

Keyword(s):

Message Passing ◽

State Of The Art ◽

A Priori ◽

Model Performance ◽

Learning Approaches ◽

Physical Chemical ◽

Chemical Descriptor ◽

Derived Properties ◽

Memory Schemes ◽

Hyperparameter Selection

We introduce Attention and Edge Memory schemes to the existing Message Passing Neural Network framework for graph convolution, and benchmark our approaches against eight different physical-chemical and bioactivity datasets from the literature. We remove the need to introduce <i>a priori</i> knowledge of the task and chemical descriptor calculation by using only fundamental graph-derived properties. Our results consistently perform on-par with other state-of-the-art machine learning approaches, and set a new standard on sparse multi-task virtual screening targets. We also investigate model performance as a function of dataset preprocessing, and make some suggestions regarding hyperparameter selection.

Download Full-text

Resisting the Resistance in Cancer: Cheminformatics Studies on Short- Path Base Excision Repair Pathway Antagonists Using Supervised Learning Approaches

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207318666150626093648 ◽

2015 ◽

Vol 18 (9) ◽

pp. 881-891 ◽

Cited By ~ 4

Author(s):

Ritu Jain ◽

Salma Jamal ◽

Sukriti Goyal ◽

Divya Wahi ◽

Aditi Singh ◽

...

Keyword(s):

Supervised Learning ◽

Base Excision Repair ◽

Short Path ◽

Excision Repair ◽

Learning Approaches ◽

Repair Pathway ◽

Base Excision Repair Pathway ◽

Base Excision

Download Full-text

A State of Art Techniques on Machine Learning Algorithms: A Perspective of Supervised Learning Approaches in Data Classification

2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS) ◽

10.1109/iccons.2018.8663155 ◽

2018 ◽

Cited By ~ 15

Author(s):

R. Saravanan ◽

Pothula Sujatha

Keyword(s):

Machine Learning ◽

Supervised Learning ◽

Learning Algorithms ◽

Data Classification ◽

Machine Learning Algorithms ◽

Learning Approaches ◽

State Of Art ◽

Art Techniques

Download Full-text

Neuro-Inspired Computing with Spin-VCSELs

Applied Sciences ◽

10.3390/app11094232 ◽

2021 ◽

Vol 11 (9) ◽

pp. 4232

Author(s):

Krishan Harkhoe ◽

Guy Verschaffelt ◽

Guy Van der Sande

Keyword(s):

Processing Speed ◽

Delay Time ◽

High Speed ◽

State Of The Art ◽

Polarization Modulation ◽

Time Interval ◽

Computing Technique ◽

High Modulation ◽

Short Time ◽

Modulation Speed

Delay-based reservoir computing (RC), a neuromorphic computing technique, has gathered lots of interest, as it promises compact and high-speed RC implementations. To further boost the computing speeds, we introduce and study an RC setup based on spin-VCSELs, thereby exploiting the high polarization modulation speed inherent to these lasers. Based on numerical simulations, we benchmarked this setup against state-of-the-art delay-based RC systems and its parameter space was analyzed for optimal performance. The high modulation speed enabled us to have more virtual nodes in a shorter time interval. However, we found that at these short time scales, the delay time and feedback rate heavily influence the nonlinear dynamics. Therefore, and contrary to other laser-based RC systems, the delay time has to be optimized in order to obtain good RC performances. We achieved state-of-the-art performances on a benchmark timeseries prediction task. This spin-VCSEL-based RC system shows a ten-fold improvement in processing speed, which can further be enhanced in a straightforward way by increasing the birefringence of the VCSEL chip.

Download Full-text

Improving Semi-Supervised Learning for Audio Classification with FixMatch

Electronics ◽

10.3390/electronics10151807 ◽

2021 ◽

Vol 10 (15) ◽

pp. 1807

Author(s):

Sascha Grollmisch ◽

Estefanía Cano

Keyword(s):

Neural Networks ◽

Supervised Learning ◽

Transfer Learning ◽

Data Transfer ◽

State Of The Art ◽

Training Data ◽

Audio Classification ◽

Image Domain ◽

Full Dataset ◽

Audio Data

Including unlabeled data in the training process of neural networks using Semi-Supervised Learning (SSL) has shown impressive results in the image domain, where state-of-the-art results were obtained with only a fraction of the labeled data. The commonality between recent SSL methods is that they strongly rely on the augmentation of unannotated data. This is vastly unexplored for audio data. In this work, SSL using the state-of-the-art FixMatch approach is evaluated on three audio classification tasks, including music, industrial sounds, and acoustic scenes. The performance of FixMatch is compared to Convolutional Neural Networks (CNN) trained from scratch, Transfer Learning, and SSL using the Mean Teacher approach. Additionally, a simple yet effective approach for selecting suitable augmentation methods for FixMatch is introduced. FixMatch with the proposed modifications always outperformed Mean Teacher and the CNNs trained from scratch. For the industrial sounds and music datasets, the CNN baseline performance using the full dataset was reached with less than 5% of the initial training data, demonstrating the potential of recent SSL methods for audio data. Transfer Learning outperformed FixMatch only for the most challenging dataset from acoustic scene classification, showing that there is still room for improvement.

Download Full-text

Learning-free pattern detection for manuscript research:

International Journal on Document Analysis and Recognition (IJDAR) ◽

10.1007/s10032-021-00371-7 ◽

2021 ◽

Author(s):

Hussein Mohammed ◽

Volker Märgner ◽

Giovanni Ciotti

Keyword(s):

South Asian ◽

State Of The Art ◽

Research Question ◽

Software Tool ◽

Pattern Detection ◽

Asian Studies ◽

Medieval Manuscripts ◽

South Asian Studies ◽

Training Samples ◽

Nearest Neighbour Classifier

AbstractAutomatic pattern detection has become increasingly important for scholars in the humanities as the number of manuscripts that have been digitised has grown. Most of the state-of-the-art methods used for pattern detection depend on the availability of a large number of training samples, which are typically not available in the humanities as they involve tedious manual annotation by researchers (e.g. marking the location and size of words, drawings, seals and so on). This makes the applicability of such methods very limited within the field of manuscript research. We propose a learning-free approach based on a state-of-the-art Naïve Bayes Nearest-Neighbour classifier for the task of pattern detection in manuscript images. The method has already been successfully applied to an actual research question from South Asian studies about palm-leaf manuscripts. Furthermore, state-of-the-art results have been achieved on two extremely challenging datasets, namely the AMADI_LontarSet dataset of handwriting on palm leaves for word-spotting and the DocExplore dataset of medieval manuscripts for pattern detection. A performance analysis is provided as well in order to facilitate later comparisons by other researchers. Finally, an easy-to-use implementation of the proposed method is developed as a software tool and made freely available.

Download Full-text

An All-Batch Loss for Constructing Prediction Intervals

Applied Sciences ◽

10.3390/app11041728 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1728

Author(s):

Hua Zhong ◽

Li Xu

Keyword(s):

Gradient Descent ◽

State Of The Art ◽

Prediction Interval ◽

Feedforward Neural Networks ◽

Important Research ◽

Likelihood Principle ◽

High Quality ◽

Construction Methods ◽

Important Research Topic ◽

Benchmark Datasets

The prediction interval (PI) is an important research topic in reliability analyses and decision support systems. Data size and computation costs are two of the issues which may hamper the construction of PIs. This paper proposes an all-batch (AB) loss function for constructing high quality PIs. Taking the full advantage of the likelihood principle, the proposed loss makes it possible to train PI generation models using the gradient descent (GD) method for both small and large batches of samples. With the structure of dual feedforward neural networks (FNNs), a high-quality PI generation framework is introduced, which can be adapted to a variety of problems including regression analysis. Numerical experiments were conducted on the benchmark datasets; the results show that higher-quality PIs were achieved using the proposed scheme. Its reliability and stability were also verified in comparison with various state-of-the-art PI construction methods.

Download Full-text

Target Detection Network for SAR Images Based on Semi-Supervised Learning and Attention Mechanism

Remote Sensing ◽

10.3390/rs13142686 ◽

2021 ◽

Vol 13 (14) ◽

pp. 2686

Author(s):

Di Wei ◽

Yuang Du ◽

Lan Du ◽

Lu Li

Keyword(s):

Feature Extraction ◽

Supervised Learning ◽

Target Detection ◽

Scene Recognition ◽

Attention Mechanism ◽

Sar Image ◽

Target Level ◽

Sar Images ◽

Training Samples ◽

Pass Through

The existing Synthetic Aperture Radar (SAR) image target detection methods based on convolutional neural networks (CNNs) have achieved remarkable performance, but these methods require a large number of target-level labeled training samples to train the network. Moreover, some clutter is very similar to targets in SAR images with complex scenes, making the target detection task very difficult. Therefore, a SAR target detection network based on a semi-supervised learning and attention mechanism is proposed in this paper. Since the image-level label simply marks whether the image contains the target of interest or not, which is easier to be labeled than the target-level label, the proposed method uses a small number of target-level labeled training samples and a large number of image-level labeled training samples to train the network with a semi-supervised learning algorithm. The proposed network consists of a detection branch and a scene recognition branch with a feature extraction module and an attention module shared between these two branches. The feature extraction module can extract the deep features of the input SAR images, and the attention module can guide the network to focus on the target of interest while suppressing the clutter. During the semi-supervised learning process, the target-level labeled training samples will pass through the detection branch, while the image-level labeled training samples will pass through the scene recognition branch. During the test process, considering the help of global scene information in SAR images for detection, a novel coarse-to-fine detection procedure is proposed. After the coarse scene recognition determining whether the input SAR image contains the target of interest or not, the fine target detection is performed on the image that may contain the target. The experimental results based on the measured SAR dataset demonstrate that the proposed method can achieve better performance than the existing methods.

Download Full-text