Online Deep Learning: Learning Deep Neural Networks on the Fly

Author(s):  
Doyen Sahoo
Quang Pham
Jing Lu
Steven C. H. Hoi

Deep Neural Networks (DNNs) are typically trained by backpropagation in a batch setting, which requires the entire training data to be available before learning begins. This is not scalable for many real-world scenarios where new data arrives sequentially in a stream. We address the open challenge of "Online Deep Learning" (ODL): learning DNNs on the fly in an online setting. Unlike traditional online learning, which often optimizes a convex objective function with respect to a shallow model (e.g., a linear or kernel-based hypothesis), ODL is more challenging because the optimization objective is non-convex, and a regular DNN trained with standard backpropagation does not work well in practice in online settings. We present a new ODL framework that tackles these challenges by learning DNN models whose depth adapts dynamically as training data arrive in sequence. Specifically, we propose a novel Hedge Backpropagation (HBP) method for effectively updating the parameters of a DNN online, and validate its efficacy on large data sets covering both stationary and concept-drifting scenarios.
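
A minimal sketch of the idea behind HBP, assuming a feed-forward network in which every hidden layer feeds its own softmax classifier and the final prediction is a hedge-weighted vote over these per-layer outputs. The class and parameter names (HBPNet, alpha, beta) are illustrative, not the authors' code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HBPNet(nn.Module):
    def __init__(self, in_dim, hidden_dim, n_classes, n_layers, beta=0.99):
        super().__init__()
        self.hidden = nn.ModuleList(
            [nn.Linear(in_dim if i == 0 else hidden_dim, hidden_dim)
             for i in range(n_layers)])
        self.heads = nn.ModuleList(
            [nn.Linear(hidden_dim, n_classes) for _ in range(n_layers)])
        self.alpha = torch.full((n_layers,), 1.0 / n_layers)  # hedge weights
        self.beta = beta

    def forward(self, x):
        logits, h = [], x
        for layer, head in zip(self.hidden, self.heads):
            h = torch.relu(layer(h))
            logits.append(head(h))  # one classifier per depth
        return logits

    def update(self, x, y, opt):
        losses = torch.stack([F.cross_entropy(l, y) for l in self(x)])
        opt.zero_grad()
        (self.alpha * losses).sum().backward()  # hedge-weighted backprop
        opt.step()
        with torch.no_grad():
            # Hedge: multiplicatively shrink the weight of lossy depths,
            # keeping a small floor so other depths can recover later.
            self.alpha = self.alpha * self.beta ** losses.detach()
            self.alpha = torch.clamp(self.alpha, min=0.01 / len(self.alpha))
            self.alpha = self.alpha / self.alpha.sum()
```

At prediction time the hedge weights act as a soft depth selector: the weighted vote over per-layer classifiers favours whichever effective depth currently performs best on the stream.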

2020
Vol 24 (01)
pp. 003-011
Author(s):
Narges Razavian
Florian Knoll
Krzysztof J. Geras

Abstract Artificial intelligence (AI) has made stunning progress in the last decade, made possible largely by advances in training deep neural networks on large data sets. Many of these solutions, initially developed for natural images, speech, or text, are now becoming successful in medical imaging. In this article we briefly summarize, in an accessible way, the current state of the field of AI. Furthermore, we highlight the most promising approaches and describe the current challenges that must be solved to enable broad deployment of AI in clinical practice.


2018
Vol 2 (3)
pp. 324-335
Author(s):
Johannes Kvam
Lars Erik Gangsei
Jørgen Kongsro
Anne H Schistad Solberg

Abstract Computed tomography (CT) scanning of pigs has been shown to produce detailed phenotypes useful in pig breeding. Because of the large number of individuals scanned and the correspondingly large data sets, automatic tools are needed to analyze these data. In this paper, the feasibility of deep learning for fully automatic segmentation of the pig skeleton from CT volumes is explored. To maximize performance given the available training data, a series of problem simplifications is applied. The deep-learning approach can replace our currently used semiautomatic solution, with increased robustness and little or no need for manual control. Accuracy was highly dependent on the training data, and expanding the training set can further increase performance, making this approach especially promising.


Author(s):  
Reinhard Heckel
Wen Huang
Paul Hand
Vladislav Voroninski

Abstract Deep neural networks provide state-of-the-art performance for image denoising, where the goal is to recover a near noise-free image from a noisy observation. The underlying principle is that neural networks trained on large data sets have empirically been shown to generate natural images well from a low-dimensional latent representation of the image. Given such a generator network, a noisy image can be denoised by (i) finding the closest image in the range of the generator or (ii) passing it through an encoder-generator architecture (known as an autoencoder). However, there is little theory to justify this success, let alone to predict the denoising performance as a function of the network parameters. In this paper, we consider the problem of denoising an image corrupted by additive Gaussian noise using the two generator-based approaches. In both cases, we assume the image is well described by a deep neural network with ReLU activation functions, mapping a $k$-dimensional code to an $n$-dimensional image. In the case of the autoencoder, we show that the feedforward network reduces noise energy by a factor of $O(k/n)$. In the case of optimizing over the range of a generative model, we state and analyze a simple gradient algorithm that minimizes a non-convex loss function and provably reduces noise energy by a factor of $O(k/n)$. We also demonstrate in numerical experiments that this denoising performance is, indeed, achieved by generative priors learned from data.
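
As an illustration of approach (i), the sketch below denoises y by searching over the latent code of a pretrained generator G. The paper analyzes a specific gradient scheme with provable guarantees; this version uses a generic optimizer, and G, latent_dim, and the step counts are placeholder assumptions:

```python
import torch

def denoise_with_generator(G, y, latent_dim, steps=500, lr=0.05):
    """Find the image in the range of G closest to the noisy image y."""
    z = torch.zeros(1, latent_dim, requires_grad=True)
    opt = torch.optim.Adam([z], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = ((G(z) - y) ** 2).sum()  # non-convex in z
        loss.backward()
        opt.step()
    # For a k-dim code and n-pixel image, residual noise energy is O(k/n).
    return G(z).detach()
```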


2020
Vol 6
Author(s):
Jaime de Miguel Rodríguez
Maria Eugenia Villafañe
Luka Piškorec
Fernando Sancho Caparrini

Abstract This work presents a methodology for generating novel 3D objects resembling wireframes of building types. These result from reconstructing interpolated locations within the learnt distribution of variational autoencoders (VAEs), a deep generative machine learning model based on neural networks. The data set features a geometry representation scheme based on a 'connectivity map' that is especially suited to expressing the wireframe objects that compose it. Additionally, the input samples are generated through 'parametric augmentation', a strategy proposed in this study that creates coherent variations among data by enabling a set of parameters to alter representative features of a given building type. In the experiments described in this paper, more than 150,000 input samples belonging to two building types were processed during the training of a VAE model. The main contribution of this paper is to explore parametric augmentation for generating large data sets of 3D geometries, showcasing its problems and limitations in the context of neural networks and VAEs. Results show that generating interpolated hybrid geometries is a challenging task. Despite the difficulty of the endeavour, promising advances are presented.
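
A minimal sketch of the interpolation step described above, assuming a trained VAE that exposes encode/decode methods over the connectivity-map representation; the method names are hypothetical:

```python
import torch

def interpolate_hybrids(vae, x_a, x_b, n_steps=8):
    """Decode points on the latent line between two building-type samples."""
    with torch.no_grad():
        z_a = vae.encode(x_a)  # latent code of building type A (assumed API)
        z_b = vae.encode(x_b)  # latent code of building type B
        return [vae.decode((1 - t) * z_a + t * z_b)  # hybrid connectivity map
                for t in torch.linspace(0.0, 1.0, n_steps)]
```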


2022
pp. 27-50
Author(s):
Rajalaxmi Prabhu B.
Seema S.

A great deal of user-generated data is available these days from large platforms, blogs, websites, and review sites, and these data are usually unstructured. Automatically analyzing the sentiment they express is an important challenge. Several machine learning algorithms have been implemented to extract opinions from large data sets, and much research has gone into understanding machine learning approaches to sentiment analysis. Machine learning depends primarily on the data used for model building, so suitable feature extraction techniques also need to be carried out. This chapter addresses several deep learning approaches, their challenges, and open issues. Deep learning techniques are considered important for predicting the sentiments of users. The chapter aims to analyze deep-learning techniques for predicting sentiment and to show the importance of several approaches for mining opinions and determining sentiment polarity.
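
As a concrete instance of the kind of model surveyed here, a small sentiment classifier in which learned embeddings replace hand-crafted feature extraction; the architecture and sizes are illustrative assumptions, not taken from the chapter:

```python
import torch
import torch.nn as nn

class SentimentNet(nn.Module):
    def __init__(self, vocab_size, emb_dim=128, hidden=64, n_classes=2):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)  # learned features
        self.rnn = nn.LSTM(emb_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, n_classes)       # polarity logits

    def forward(self, token_ids):                     # (batch, seq_len)
        _, (h, _) = self.rnn(self.emb(token_ids))
        return self.out(h[-1])                        # last hidden state
```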


2021
Vol 12
Author(s):
Osval A. Montesinos-López
Abelardo Montesinos-López
Brandon A. Mosqueda-González
Alison R. Bentley
Morten Lillemo
...  

Genomic selection (GS) has the potential to revolutionize predictive plant breeding. A reference population is phenotyped and genotyped to train a statistical model, which is then used to perform genome-enabled predictions for new individuals that were only genotyped. In this vein, deep neural networks, a type of machine learning model, have been widely adopted in GS studies because, as nonparametric methods, they are more adept at capturing nonlinear patterns. However, training deep neural networks is very challenging because of the numerous hyper-parameters that need to be tuned, and imperfect tuning can result in biased predictions. In this paper we propose a simple method for calibrating (adjusting) the predictions of continuous response variables produced by deep learning applications. We evaluated the proposed deep learning calibration method (DL_M2) on four crop breeding data sets and compared its performance with the standard deep learning method (DL_M1) and the standard genomic Best Linear Unbiased Predictor (GBLUP). While GBLUP was the most accurate model overall, the proposed calibration method (DL_M2) increased genome-enabled prediction performance in all data sets relative to the traditional DL method (DL_M1). Taken together, we provide evidence for extending the use of the proposed calibration method to evaluate its potential and consistency for prediction in the context of GS applied to plant breeding.
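
The abstract does not spell out the calibration formula, so the sketch below shows one plausible reading as an assumption, not the DL_M2 procedure itself: a linear correction fitted on a held-out fold and applied to the test predictions:

```python
import numpy as np

def calibrate(y_val, yhat_val, yhat_test):
    # Fit slope b and intercept a on the validation fold: y ~ a + b * yhat.
    b, a = np.polyfit(yhat_val, y_val, deg=1)
    return a + b * yhat_test  # adjusted genome-enabled predictions
```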


2020
Vol 36 (4)
pp. 803-825
Author(s):  
Marco Fortini

Abstract Record linkage addresses the problem of identifying pairs of records that come from different sources but refer to the same unit of interest. Fellegi and Sunter proposed an optimal statistical test for assigning match status to candidate pairs, in which the required parameters are obtained through the EM algorithm applied directly to the set of candidate pairs, without recourse to training data. However, this procedure has quadratic complexity as the two lists to be matched grow. In addition, the EM-estimated parameters are severely biased in this case, so the problem is usually tackled by reducing the set of candidate pairs through filtering methods such as blocking. Unfortunately, such methods cannot assess the probability that an excluded pair is actually a true match. The present work proposes an efficient approach in which comparisons of records between lists are minimised, while the EM estimates are modified by modelling tables with structural zeros in order to obtain unbiased estimates of the parameters. The improvement achieved by the suggested method is shown by means of simulations and an application based on real data.
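
For context, a minimal sketch of the Fellegi-Sunter EM fit over binary agreement vectors that the abstract builds on; the structural-zero modelling that removes the bias is the paper's contribution and is not reproduced here:

```python
import numpy as np

def fs_em(gamma, n_iter=100):
    """EM for the match share p and per-field agreement rates m, u.
    gamma: (n_pairs, n_fields) binary agreement matrix."""
    n, k = gamma.shape
    p, m, u = 0.1, np.full(k, 0.9), np.full(k, 0.1)
    for _ in range(n_iter):
        # E-step: posterior probability that each candidate pair is a match.
        lm = p * np.prod(m**gamma * (1 - m)**(1 - gamma), axis=1)
        lu = (1 - p) * np.prod(u**gamma * (1 - u)**(1 - gamma), axis=1)
        g = lm / (lm + lu)
        # M-step: weighted agreement frequencies.
        p = g.mean()
        m = (g[:, None] * gamma).sum(axis=0) / g.sum()
        u = ((1 - g)[:, None] * gamma).sum(axis=0) / (1 - g).sum()
    return p, m, u
```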


2020
Vol 12 (20)
pp. 3358
Author(s):  
Vasileios Syrris
Ondrej Pesek
Pierre Soille

Automatic supervised classification with complex models such as deep neural networks requires representative training data sets. While a plethora of data sets exist that could be used for this purpose, they are usually very heterogeneous and not interoperable. In this context, the present work has a twofold objective: (i) to describe procedures for open-source training data management, integration, and retrieval, and (ii) to demonstrate the practical use of training data from varying sources for remote sensing image classification. For the former, we propose SatImNet, a collection of open training data structured and harmonized according to specific rules. For the latter, two modelling approaches based on convolutional neural networks have been designed and configured to deal with satellite image classification and segmentation.
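
As an illustration of the classification side, a minimal CNN for multi-band satellite patches; the actual SatImNet model configurations are not reproduced, and the band count, channel widths, and class count below are assumptions:

```python
import torch.nn as nn

class PatchCNN(nn.Module):
    def __init__(self, in_bands=4, n_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_bands, 32, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1))       # global pool over the patch
        self.classifier = nn.Linear(64, n_classes)

    def forward(self, x):                  # x: (batch, bands, H, W)
        return self.classifier(self.features(x).flatten(1))
```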

