Temporal Consistency-Based Loss Function for Both Deep Q-Networks and Deep Deterministic Policy Gradients for Continuous Actions

Chayoung Kim

doi:10.3390/sym13122411

Temporal Consistency-Based Loss Function for Both Deep Q-Networks and Deep Deterministic Policy Gradients for Continuous Actions

Symmetry ◽

10.3390/sym13122411 ◽

2021 ◽

Vol 13 (12) ◽

pp. 2411

Author(s):

Chayoung Kim

Keyword(s):

Energy Management ◽

Loss Function ◽

Loss Functions ◽

Temporal Consistency ◽

Building Automation ◽

Target Network ◽

Network Update ◽

And Performance ◽

The Stability ◽

Continuous Actions

Artificial intelligence (AI) techniques in power grid control and energy management in building automation require both deep Q-networks (DQNs) and deep deterministic policy gradients (DDPGs) in deep reinforcement learning (DRL) as off-policy algorithms. Most studies on improving the stability of DRL have addressed these with replay buffers and a target network using a delayed temporal difference (TD) backup, which is known for minimizing a loss function at every iteration. The loss functions were developed for DQN and DDPG, and it is well-known that there have been few studies on improving the techniques of the loss functions used in both DQN and DDPG. Therefore, we modified the loss function based on a temporal consistency (TC) loss and adapted the proposed TC loss function for the target network update in both DQN and DDPG. The proposed TC loss function showed effective results, particularly in a critic network in DDPG. In this work, we demonstrate that, in OpenAI Gym, both “cart-pole” and “pendulum”, the proposed TC loss function shows enormously improved convergence speed and performance, particularly in the critic network in DDPG.

Get full-text (via PubEx)

Valence electron spectroscopy of inhomogeneous media

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100130389 ◽

1992 ◽

Vol 50 (2) ◽

pp. 1150-1151

Author(s):

A. Howie ◽

D.W. McComb

Keyword(s):

Specific Gravity ◽

Loss Function ◽

Electron Spectroscopy ◽

Valence Electron ◽

Peak Height ◽

Loss Functions ◽

Loss Peak ◽

High Energies ◽

Valence Electron Density ◽

Electron Microscopist

The bulk loss function Im(-l/ε (ω)), a well established tool for the interpretation of valence loss spectra, is being progressively adapted to the wide variety of inhomogeneous samples of interest to the electron microscopist. Proportionality between n, the local valence electron density, and ε-1 (Sellmeyer's equation) has sometimes been assumed but may not be valid even in homogeneous samples. Figs. 1 and 2 show the experimentally measured bulk loss functions for three pure silicates of different specific gravity ρ - quartz (ρ = 2.66), coesite (ρ = 2.93) and a zeolite (ρ = 1.79). Clearly, despite the substantial differences in density, the shift of the prominent loss peak is very small and far less than that predicted by scaling e for quartz with Sellmeyer's equation or even the somewhat smaller shift given by the Clausius-Mossotti (CM) relation which assumes proportionality between n (or ρ in this case) and (ε - 1)/(ε + 2). Both theories overestimate the rise in the peak height for coesite and underestimate the increase at high energies.

Get full-text (via PubEx)

Implementation and Performance Evaluation of IEC 61850 based Home Energy Management System

2019 IEEE 8th Global Conference on Consumer Electronics (GCCE) ◽

10.1109/gcce46687.2019.9015222 ◽

2019 ◽

Cited By ~ 1

Author(s):

Taha Selim Ustun ◽

S.M. Suhail Hussain

Keyword(s):

Performance Evaluation ◽

Energy Management ◽

Management System ◽

Energy Management System ◽

Iec 61850 ◽

Home Energy Management ◽

And Performance ◽

Home Energy Management System

Get full-text (via PubEx)

Study on Radar Echo-Filling in an Occlusion Area by a Deep Learning Algorithm

Remote Sensing ◽

10.3390/rs13091779 ◽

2021 ◽

Vol 13 (9) ◽

pp. 1779

Author(s):

Xiaoyan Yin ◽

Zhiqun Hu ◽

Jiafeng Zheng ◽

Boyong Li ◽

Yuanyuan Zuo

Keyword(s):

Deep Learning ◽

Loss Function ◽

Learning Algorithm ◽

Weather Radar ◽

Loss Functions ◽

Training Dataset ◽

Echo Intensity ◽

Common Mean ◽

Deep Learning Algorithm ◽

Radar Beam

Radar beam blockage is an important error source that affects the quality of weather radar data. An echo-filling network (EFnet) is proposed based on a deep learning algorithm to correct the echo intensity under the occlusion area in the Nanjing S-band new-generation weather radar (CINRAD/SA). The training dataset is constructed by the labels, which are the echo intensity at the 0.5° elevation in the unblocked area, and by the input features, which are the intensity in the cube including multiple elevations and gates corresponding to the location of bottom labels. Two loss functions are applied to compile the network: one is the common mean square error (MSE), and the other is a self-defined loss function that increases the weight of strong echoes. Considering that the radar beam broadens with distance and height, the 0.5° elevation scan is divided into six range bands every 25 km to train different models. The models are evaluated by three indicators: explained variance (EVar), mean absolute error (MAE), and correlation coefficient (CC). Two cases are demonstrated to compare the effect of the echo-filling model by different loss functions. The results suggest that EFnet can effectively correct the echo reflectivity and improve the data quality in the occlusion area, and there are better results for strong echoes when the self-defined loss function is used.

Get full-text (via PubEx)

A Densely Connected Network Based on U-Net for Medical Image Segmentation

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3446618 ◽

2021 ◽

Vol 17 (3) ◽

pp. 1-14

Author(s):

Zhenzhen Yang ◽

Pengfei Xu ◽

Yongpeng Yang ◽

Bing-Kun Bao

Keyword(s):

Feature Extraction ◽

Image Segmentation ◽

Loss Function ◽

Network Architecture ◽

Medical Image ◽

Medical Image Segmentation ◽

Cross Entropy ◽

Loss Functions ◽

Feature Maps ◽

Different Levels

The U-Net has become the most popular structure in medical image segmentation in recent years. Although its performance for medical image segmentation is outstanding, a large number of experiments demonstrate that the classical U-Net network architecture seems to be insufficient when the size of segmentation targets changes and the imbalance happens between target and background in different forms of segmentation. To improve the U-Net network architecture, we develop a new architecture named densely connected U-Net (DenseUNet) network in this article. The proposed DenseUNet network adopts a dense block to improve the feature extraction capability and employs a multi-feature fuse block fusing feature maps of different levels to increase the accuracy of feature extraction. In addition, in view of the advantages of the cross entropy and the dice loss functions, a new loss function for the DenseUNet network is proposed to deal with the imbalance between target and background. Finally, we test the proposed DenseUNet network and compared it with the multi-resolutional U-Net (MultiResUNet) and the classic U-Net networks on three different datasets. The experimental results show that the DenseUNet network has significantly performances compared with the MultiResUNet and the classic U-Net networks.

Get full-text (via PubEx)

Analytical Modeling and Design of Novel Conical Halbach Permanent Magnet Couplings for Underwater Propulsion

Journal of Marine Science and Engineering ◽

10.3390/jmse9030290 ◽

2021 ◽

Vol 9 (3) ◽

pp. 290

Author(s):

Yukai Li ◽

Yuli Hu ◽

Youguang Guo ◽

Baowei Song ◽

Zhaoyong Mao

Keyword(s):

Permanent Magnet ◽

Analytical Calculation ◽

Underwater Propulsion ◽

Conical Structure ◽

Torque Analysis ◽

Dynamic Seal ◽

And Performance ◽

The Stability ◽

Propulsion Unit ◽

Force Calculation

Permanent magnet couplings can convert a dynamic seal into a static seal, thereby greatly improving the stability of the underwater propulsion unit. In order to make full use of the tail space and improve the transmitted torque capability, a conical Halbach permanent magnet coupling (C-HPMC) is proposed in this paper. The C-HPMC combines multiple cylindrical HPMCs with different sizes into an approximately conical structure. Compared with the conical permanent magnet couplings in our previous work, the novel C-HPMC has better torque performance and is easy to process. The analytical calculation method of transmitted torque of C-HPMC is proposed on the basis of torque calculation of the three common types of HPMCs. The accuracy of the torque calculation of the three HPMCs is verified, and the torque performance of the three HPMCSs of different sizes is compared and discussed. The “optimal type selection” method is proposed and applied in the design of C-HPMC. Finally, on the basis of torque analysis calculation and axial force calculation, a complete flowchart of the design and performance analysis of C-HPMC is described.

Get full-text (via PubEx)

OPTIMAL REINSURANCE FROM THE VIEWPOINTS OF BOTH AN INSURER AND A REINSURER UNDER THE CVAR RISK MEASURE AND VAJDA CONDITION

Astin Bulletin ◽

10.1017/asb.2021.9 ◽

2021 ◽

pp. 1-29

Author(s):

Yanhong Chen

Keyword(s):

Loss Function ◽

Value At Risk ◽

Numerical Study ◽

Risk Measure ◽

Convex Combination ◽

Weighting Factor ◽

Loss Functions ◽

Conditional Value At Risk ◽

Optimal Reinsurance ◽

The Impact

ABSTRACT In this paper, we study the optimal reinsurance contracts that minimize the convex combination of the Conditional Value-at-Risk (CVaR) of the insurer’s loss and the reinsurer’s loss over the class of ceded loss functions such that the retained loss function is increasing and the ceded loss function satisfies Vajda condition. Among a general class of reinsurance premium principles that satisfy the properties of risk loading and convex order preserving, the optimal solutions are obtained. Our results show that the optimal ceded loss functions are in the form of five interconnected segments for general reinsurance premium principles, and they can be further simplified to four interconnected segments if more properties are added to reinsurance premium principles. Finally, we derive optimal parameters for the expected value premium principle and give a numerical study to analyze the impact of the weighting factor on the optimal reinsurance.

Get full-text (via PubEx)

Assessing the Impact of the Loss Function, Architecture and Image Type for Deep Learning-Based Wildfire Segmentation

Applied Sciences ◽

10.3390/app11157046 ◽

2021 ◽

Vol 11 (15) ◽

pp. 7046

Author(s):

Jorge Francisco Ciprián-Sánchez ◽

Gilberto Ochoa-Ruiz ◽

Lucile Rossi ◽

Frédéric Morandini

Keyword(s):

Deep Learning ◽

Loss Function ◽

State Of The Art ◽

Fire Detection ◽

Loss Functions ◽

Wildfire Spread ◽

Combine Information ◽

The Impact ◽

Image Type ◽

Segmentation Models

Wildfires stand as one of the most relevant natural disasters worldwide, particularly more so due to the effect of climate change and its impact on various societal and environmental levels. In this regard, a significant amount of research has been done in order to address this issue, deploying a wide variety of technologies and following a multi-disciplinary approach. Notably, computer vision has played a fundamental role in this regard. It can be used to extract and combine information from several imaging modalities in regard to fire detection, characterization and wildfire spread forecasting. In recent years, there has been work pertaining to Deep Learning (DL)-based fire segmentation, showing very promising results. However, it is currently unclear whether the architecture of a model, its loss function, or the image type employed (visible, infrared, or fused) has the most impact on the fire segmentation results. In the present work, we evaluate different combinations of state-of-the-art (SOTA) DL architectures, loss functions, and types of images to identify the parameters most relevant to improve the segmentation results. We benchmark them to identify the top-performing ones and compare them to traditional fire segmentation techniques. Finally, we evaluate if the addition of attention modules on the best performing architecture can further improve the segmentation results. To the best of our knowledge, this is the first work that evaluates the impact of the architecture, loss function, and image type in the performance of DL-based wildfire segmentation models.

Get full-text (via PubEx)

Adaptive attitude-tracking control of spacecraft considering on-orbit refuelling

Transactions of the Institute of Measurement and Control ◽

10.1177/0142331220973132 ◽

2020 ◽

pp. 014233122097313

Author(s):

Yiqi Xu

Keyword(s):

Tracking Control ◽

Closed Loop ◽

Closed Loop System ◽

Tracking Errors ◽

Attitude Tracking ◽

Control Scheme ◽

Inertia Model ◽

And Performance ◽

Attitude Tracking Control ◽

The Stability

This paper studies the attitude-tracking control problem of spacecraft considering on-orbit refuelling. A time-varying inertia model is developed for spacecraft on-orbit refuelling, which actually includes two processes: fuel in the transfer pipe and fuel in the tank. Based upon the inertia model, an adaptive attitude-tracking controller is derived to guarantee the stability of the resulted closed-loop system, as well as asymptotic convergence of the attitude-tracking errors, despite performing refuelling operations. Finally, numerical simulations illustrate the effectiveness and performance of the proposed control scheme.

Get full-text (via PubEx)

Quality loss functions and performance measures for a mixed bivariate response

Journal of Manufacturing Systems ◽

10.1016/s0278-6125(02)80095-4 ◽

2002 ◽

Vol 21 (6) ◽

pp. 476

Keyword(s):

Performance Measures ◽

Loss Functions ◽

Quality Loss ◽

And Performance

Get full-text (via PubEx)

Nondestructive Test on Typical Roadway Supports of a Mine via Drilling Core and Ground Penetrating Radar

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.368-373.2411 ◽

2011 ◽

Vol 368-373 ◽

pp. 2411-2416

Author(s):

Jian Ping Han ◽

Hai Peng Liu

Keyword(s):

Ground Penetrating Radar ◽

Geological Structure ◽

Surrounding Rock ◽

Potential Threat ◽

Real Performance ◽

Design Requirements ◽

And Performance ◽

The Stability ◽

Ground Penetrating ◽

Drilling Core

Temporary or permanent supports are necessary in underground construction for maintaining the stability and limiting the damage of surrounding rock. Due to the uncertainty of geological structure, the specificity of the underground environment as well as other factors, the quality and performance of supporting structure are often difficult to satisfy the design requirements, which not only seriously affects the normal construction and operation of mines but also has the potential threat to the safety of underground production. In order to investigate the influence of the unfavorable geologic environment on supporting concrete and evaluate the real performance of roadway supports of a mine, 17 typical projects were chosen and the strength of supporting concrete was detected by nondestructive drilling core method. The result shows that the strength is widely less than the design value. Furthermore, 4 projects of them were investigated by the ground penetrating radar (GPR) in order to evaluate the feasibility of GPR in the performance investigation of the roadway supports of a mine. The results indicate that ground penetrating radar is capable of measuring the thickness of the support, the distribution of rebars and the defects of the surrounding rock.

Get full-text (via PubEx)