Physics-Driven Regularization of Deep Neural Networks for Enhanced Engineering Design and Analysis

Abstract In this paper, we introduce a physics-driven regularization method for training of deep neural networks (DNNs) for use in engineering design and analysis problems. In particular, we focus on the prediction of a physical system, for which in addition to training data, partial or complete information on a set of governing laws is also available. These laws often appear in the form of differential equations, derived from first principles, empirically validated laws, or domain expertise, and are usually neglected in a data-driven prediction of engineering systems. We propose a training approach that utilizes the known governing laws and regularizes data-driven DNN models by penalizing divergence from those laws. The first two numerical examples are synthetic examples, where we show that in constructing a DNN model that best fits the measurements from a physical system, the use of our proposed regularization results in DNNs that are more interpretable with smaller generalization errors, compared with other common regularization methods. The last two examples concern metamodeling for a random Burgers’ system and for aerodynamic analysis of passenger vehicles, where we demonstrate that the proposed regularization provides superior generalization accuracy compared with other common alternatives.

Download Full-text

High accuracy data-driven heliostat calibration and state prediction with pretrained deep neural networks

Solar Energy ◽

10.1016/j.solener.2021.01.046 ◽

2021 ◽

Vol 218 ◽

pp. 48-56

Author(s):

Max Pargmann ◽

Daniel Maldonado Quinto ◽

Peter Schwarzbözl ◽

Robert Pitz-Paal

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

High Accuracy ◽

Data Driven ◽

State Prediction ◽

Accuracy Data

Download Full-text

Evaluation of Power Insulator Detection Efficiency with the Use of Limited Training Dataset

Applied Sciences ◽

10.3390/app10062104 ◽

2020 ◽

Vol 10 (6) ◽

pp. 2104

Author(s):

Michał Tomaszewski ◽

Paweł Michalski ◽

Jakub Osuchowski

Keyword(s):

Neural Network ◽

Neural Networks ◽

Object Detection ◽

Convolutional Neural Network ◽

Deep Neural Networks ◽

Detection Efficiency ◽

Training Data ◽

Training Dataset ◽

Training Set ◽

Convolutional Network

This article presents an analysis of the effectiveness of object detection in digital images with the application of a limited quantity of input. The possibility of using a limited set of learning data was achieved by developing a detailed scenario of the task, which strictly defined the conditions of detector operation in the considered case of a convolutional neural network. The described solution utilizes known architectures of deep neural networks in the process of learning and object detection. The article presents comparisons of results from detecting the most popular deep neural networks while maintaining a limited training set composed of a specific number of selected images from diagnostic video. The analyzed input material was recorded during an inspection flight conducted along high-voltage lines. The object detector was built for a power insulator. The main contribution of the presented papier is the evidence that a limited training set (in our case, just 60 training frames) could be used for object detection, assuming an outdoor scenario with low variability of environmental conditions. The decision of which network will generate the best result for such a limited training set is not a trivial task. Conducted research suggests that the deep neural networks will achieve different levels of effectiveness depending on the amount of training data. The most beneficial results were obtained for two convolutional neural networks: the faster region-convolutional neural network (faster R-CNN) and the region-based fully convolutional network (R-FCN). Faster R-CNN reached the highest AP (average precision) at a level of 0.8 for 60 frames. The R-FCN model gained a worse AP result; however, it can be noted that the relationship between the number of input samples and the obtained results has a significantly lower influence than in the case of other CNN models, which, in the authors’ assessment, is a desired feature in the case of a limited training set.

Download Full-text

Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks

10.21437/interspeech.2014-214 ◽

2014 ◽

Author(s):

Yan Huang ◽

Malcolm Slaney ◽

Michael L. Seltzer ◽

Yifan Gong

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Training Data ◽

Acoustic Modeling

Download Full-text

Framework for TCAD augmented machine learning on multi- I–V characteristics using convolutional neural network and multiprocessing

Journal of Semiconductors ◽

10.1088/1674-4926/42/12/124101 ◽

2021 ◽

Vol 42 (12) ◽

pp. 124101

Author(s):

Thomas Hirtz ◽

Steyn Huurman ◽

He Tian ◽

Yi Yang ◽

Tian-Ling Ren

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Information Technologies ◽

Deep Neural Networks ◽

State Of The Art ◽

Data Driven ◽

Sufficient Data ◽

Learning Models ◽

Simulation Tools ◽

New Information

Abstract In a world where data is increasingly important for making breakthroughs, microelectronics is a field where data is sparse and hard to acquire. Only a few entities have the infrastructure that is required to automate the fabrication and testing of semiconductor devices. This infrastructure is crucial for generating sufficient data for the use of new information technologies. This situation generates a cleavage between most of the researchers and the industry. To address this issue, this paper will introduce a widely applicable approach for creating custom datasets using simulation tools and parallel computing. The multi-I–V curves that we obtained were processed simultaneously using convolutional neural networks, which gave us the ability to predict a full set of device characteristics with a single inference. We prove the potential of this approach through two concrete examples of useful deep learning models that were trained using the generated data. We believe that this work can act as a bridge between the state-of-the-art of data-driven methods and more classical semiconductor research, such as device engineering, yield engineering or process monitoring. Moreover, this research gives the opportunity to anybody to start experimenting with deep neural networks and machine learning in the field of microelectronics, without the need for expensive experimentation infrastructure.

Download Full-text

Gaussian Process Emulation for Big Data in Data-Driven Metamaterials Design

Volume 2A: 45th Design Automation Conference ◽

10.1115/detc2019-98027 ◽

2019 ◽

Author(s):

Ramin Bostanabad ◽

Yu-Chin Chan ◽

Liwei Wang ◽

Ping Zhu ◽

Wei Chen

Keyword(s):

Big Data ◽

Gaussian Process ◽

Engineering Design ◽

Training Data ◽

Data Driven ◽

Training Dataset ◽

Massive Datasets ◽

Unit Cells ◽

Gaussian Process Emulation ◽

Novel Method

Abstract Our main contribution is to introduce a novel method for Gaussian process (GP) modeling of massive datasets. The key idea is to build an ensemble of independent GPs that use the same hyperparameters but distribute the entire training dataset among themselves. This is motivated by our observation that estimates of the GP hyperparameters change negligibly as the size of the training data exceeds a certain level, which can be found in a systematic way. For inference, the predictions from all GPs in the ensemble are pooled to efficiently exploit the entire training dataset for prediction. We name our modeling approach globally approximate Gaussian process (GAGP), which, unlike most largescale supervised learners such as neural networks and trees, is easy to fit and can interpret the model behavior. These features make it particularly useful in engineering design with big data. We use analytical examples to demonstrate that GAGP achieves very high predictive power that matches or exceeds that of state-of-the-art machine learning methods. We illustrate the application of GAGP in engineering design with a problem on data-driven metamaterials design where it is used to link reduced-dimension geometrical descriptors of unit cells and their properties. Searching for new unit cell designs with desired properties is then accomplished by employing GAGP in inverse optimization.

Download Full-text

Data-driven modeling for boiling heat transfer: Using deep neural networks and high-fidelity simulation results

Applied Thermal Engineering ◽

10.1016/j.applthermaleng.2018.08.041 ◽

2018 ◽

Vol 144 ◽

pp. 305-320 ◽

Cited By ~ 18

Author(s):

Yang Liu ◽

Nam Dinh ◽

Yohei Sato ◽

Bojan Niceno

Keyword(s):

Heat Transfer ◽

Neural Networks ◽

Boiling Heat Transfer ◽

Deep Neural Networks ◽

Data Driven ◽

High Fidelity ◽

High Fidelity Simulation ◽

Simulation Results ◽

Data Driven Modeling

Download Full-text

Language recognition using deep neural networks with very limited training data

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2016.7472795 ◽

2016 ◽

Cited By ~ 6

Author(s):

Shivesh Ranjan ◽

Chengzhu Yu ◽

Chunlei Zhang ◽

Finnian Kelly ◽

John H. L. Hansen

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Training Data ◽

Language Recognition

Download Full-text

Improving speech recognition using limited accent diverse British English training data with deep neural networks

2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP) ◽

10.1109/mlsp.2016.7738854 ◽

2016 ◽

Cited By ~ 1

Author(s):

Maryam Najafian ◽

Saeid Safavi ◽

John H. L. Hansen ◽

Martin Russell

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Training Data ◽

British English ◽

English Training

Download Full-text

Deep Neural Networks Techniques using for Learning Automata Based Incremental Learning Method

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i6.1268 ◽

2021 ◽

Vol 12 (6) ◽

pp. 69-73

Author(s):

C. Swetha Reddy Et.al

Keyword(s):

Neural Networks ◽

Language Processing ◽

Visual Recognition ◽

Deep Neural Networks ◽

Learning Automata ◽

Training Data ◽

Neuron Models ◽

Effectiveness Of Training ◽

Learning Machine ◽

Incremental Learning Method

Surprisingly comprehensive learning methods are implemented in many large learning machine data, such as visual recognition and visual language processing. Much of the success of advanced training in recent years is due to leadership training, which requires a set of information for specific tasks, before such training. However, in reality, selected tasks related to personal study are gradually accumulated over time as it is difficult to collect and submit training data manually. It provides a way to continue learning some information columns and examples of steps that are specific to the new class and called additional learning. In this post, we recommend the best machine training method for further training for deep neural networks. The basic idea is to learn a deep system with strong connections that can be "activated" or "turned off" at different stages. The approach you suggest allows you to reduce the distribution of old services as you learn new for example new training, which increases the effectiveness of training in the additional training phase. Experiments with MNIST and CIFAR-100 show that our approach can be implemented in other long-term phases in deep neuron models and achieve better results from zero-base training.

Download Full-text

Development of Vietnamese Speech Synthesis System using Deep Neural Networks

Journal of Computer Science and Cybernetics ◽

10.15625/1813-9663/34/4/13172 ◽

2019 ◽

Vol 34 (4) ◽

pp. 349-363 ◽

Cited By ~ 4

Author(s):

Thinh Van Nguyen ◽

Bao Quoc Nguyen ◽

Kinh Huy Phan ◽

Hai Van Do

Keyword(s):

Neural Networks ◽

Markov Model ◽

Hidden Markov Model ◽

Speech Synthesis ◽

Deep Neural Networks ◽

Hidden Markov ◽

Experimental Results ◽

Training Data ◽

The Internet ◽

Synthesis System

In this paper, we present our first Vietnamese speech synthesis system based on deep neural networks. To improve the training data collected from the Internet, a cleaning method is proposed. The experimental results indicate that by using deeper architectures we can achieve better performance for the TTS than using shallow architectures such as hidden Markov model. We also present the effect of using different amounts of data to train the TTS systems. In the VLSP TTS challenge 2018, our proposed DNN-based speech synthesis system won the first place in all three subjects including naturalness, intelligibility, and MOS.

Download Full-text