scholarly journals Temporal Consistency-Based Loss Function for Both Deep Q-Networks and Deep Deterministic Policy Gradients for Continuous Actions

Symmetry ◽  
2021 ◽  
Vol 13 (12) ◽  
pp. 2411
Author(s):  
Chayoung Kim

Artificial intelligence (AI) techniques in power grid control and energy management in building automation require both deep Q-networks (DQNs) and deep deterministic policy gradients (DDPGs) in deep reinforcement learning (DRL) as off-policy algorithms. Most studies on improving the stability of DRL have addressed these with replay buffers and a target network using a delayed temporal difference (TD) backup, which is known for minimizing a loss function at every iteration. The loss functions were developed for DQN and DDPG, and it is well-known that there have been few studies on improving the techniques of the loss functions used in both DQN and DDPG. Therefore, we modified the loss function based on a temporal consistency (TC) loss and adapted the proposed TC loss function for the target network update in both DQN and DDPG. The proposed TC loss function showed effective results, particularly in a critic network in DDPG. In this work, we demonstrate that, in OpenAI Gym, both “cart-pole” and “pendulum”, the proposed TC loss function shows enormously improved convergence speed and performance, particularly in the critic network in DDPG.

Author(s):  
A. Howie ◽  
D.W. McComb

The bulk loss function Im(-l/ε (ω)), a well established tool for the interpretation of valence loss spectra, is being progressively adapted to the wide variety of inhomogeneous samples of interest to the electron microscopist. Proportionality between n, the local valence electron density, and ε-1 (Sellmeyer's equation) has sometimes been assumed but may not be valid even in homogeneous samples. Figs. 1 and 2 show the experimentally measured bulk loss functions for three pure silicates of different specific gravity ρ - quartz (ρ = 2.66), coesite (ρ = 2.93) and a zeolite (ρ = 1.79). Clearly, despite the substantial differences in density, the shift of the prominent loss peak is very small and far less than that predicted by scaling e for quartz with Sellmeyer's equation or even the somewhat smaller shift given by the Clausius-Mossotti (CM) relation which assumes proportionality between n (or ρ in this case) and (ε - 1)/(ε + 2). Both theories overestimate the rise in the peak height for coesite and underestimate the increase at high energies.


2021 ◽  
Vol 13 (9) ◽  
pp. 1779
Author(s):  
Xiaoyan Yin ◽  
Zhiqun Hu ◽  
Jiafeng Zheng ◽  
Boyong Li ◽  
Yuanyuan Zuo

Radar beam blockage is an important error source that affects the quality of weather radar data. An echo-filling network (EFnet) is proposed based on a deep learning algorithm to correct the echo intensity under the occlusion area in the Nanjing S-band new-generation weather radar (CINRAD/SA). The training dataset is constructed by the labels, which are the echo intensity at the 0.5° elevation in the unblocked area, and by the input features, which are the intensity in the cube including multiple elevations and gates corresponding to the location of bottom labels. Two loss functions are applied to compile the network: one is the common mean square error (MSE), and the other is a self-defined loss function that increases the weight of strong echoes. Considering that the radar beam broadens with distance and height, the 0.5° elevation scan is divided into six range bands every 25 km to train different models. The models are evaluated by three indicators: explained variance (EVar), mean absolute error (MAE), and correlation coefficient (CC). Two cases are demonstrated to compare the effect of the echo-filling model by different loss functions. The results suggest that EFnet can effectively correct the echo reflectivity and improve the data quality in the occlusion area, and there are better results for strong echoes when the self-defined loss function is used.


Author(s):  
Zhenzhen Yang ◽  
Pengfei Xu ◽  
Yongpeng Yang ◽  
Bing-Kun Bao

The U-Net has become the most popular structure in medical image segmentation in recent years. Although its performance for medical image segmentation is outstanding, a large number of experiments demonstrate that the classical U-Net network architecture seems to be insufficient when the size of segmentation targets changes and the imbalance happens between target and background in different forms of segmentation. To improve the U-Net network architecture, we develop a new architecture named densely connected U-Net (DenseUNet) network in this article. The proposed DenseUNet network adopts a dense block to improve the feature extraction capability and employs a multi-feature fuse block fusing feature maps of different levels to increase the accuracy of feature extraction. In addition, in view of the advantages of the cross entropy and the dice loss functions, a new loss function for the DenseUNet network is proposed to deal with the imbalance between target and background. Finally, we test the proposed DenseUNet network and compared it with the multi-resolutional U-Net (MultiResUNet) and the classic U-Net networks on three different datasets. The experimental results show that the DenseUNet network has significantly performances compared with the MultiResUNet and the classic U-Net networks.


2021 ◽  
Vol 9 (3) ◽  
pp. 290
Author(s):  
Yukai Li ◽  
Yuli Hu ◽  
Youguang Guo ◽  
Baowei Song ◽  
Zhaoyong Mao

Permanent magnet couplings can convert a dynamic seal into a static seal, thereby greatly improving the stability of the underwater propulsion unit. In order to make full use of the tail space and improve the transmitted torque capability, a conical Halbach permanent magnet coupling (C-HPMC) is proposed in this paper. The C-HPMC combines multiple cylindrical HPMCs with different sizes into an approximately conical structure. Compared with the conical permanent magnet couplings in our previous work, the novel C-HPMC has better torque performance and is easy to process. The analytical calculation method of transmitted torque of C-HPMC is proposed on the basis of torque calculation of the three common types of HPMCs. The accuracy of the torque calculation of the three HPMCs is verified, and the torque performance of the three HPMCSs of different sizes is compared and discussed. The “optimal type selection” method is proposed and applied in the design of C-HPMC. Finally, on the basis of torque analysis calculation and axial force calculation, a complete flowchart of the design and performance analysis of C-HPMC is described.


2021 ◽  
pp. 1-29
Author(s):  
Yanhong Chen

ABSTRACT In this paper, we study the optimal reinsurance contracts that minimize the convex combination of the Conditional Value-at-Risk (CVaR) of the insurer’s loss and the reinsurer’s loss over the class of ceded loss functions such that the retained loss function is increasing and the ceded loss function satisfies Vajda condition. Among a general class of reinsurance premium principles that satisfy the properties of risk loading and convex order preserving, the optimal solutions are obtained. Our results show that the optimal ceded loss functions are in the form of five interconnected segments for general reinsurance premium principles, and they can be further simplified to four interconnected segments if more properties are added to reinsurance premium principles. Finally, we derive optimal parameters for the expected value premium principle and give a numerical study to analyze the impact of the weighting factor on the optimal reinsurance.


2021 ◽  
Vol 11 (15) ◽  
pp. 7046
Author(s):  
Jorge Francisco Ciprián-Sánchez ◽  
Gilberto Ochoa-Ruiz ◽  
Lucile Rossi ◽  
Frédéric Morandini

Wildfires stand as one of the most relevant natural disasters worldwide, particularly more so due to the effect of climate change and its impact on various societal and environmental levels. In this regard, a significant amount of research has been done in order to address this issue, deploying a wide variety of technologies and following a multi-disciplinary approach. Notably, computer vision has played a fundamental role in this regard. It can be used to extract and combine information from several imaging modalities in regard to fire detection, characterization and wildfire spread forecasting. In recent years, there has been work pertaining to Deep Learning (DL)-based fire segmentation, showing very promising results. However, it is currently unclear whether the architecture of a model, its loss function, or the image type employed (visible, infrared, or fused) has the most impact on the fire segmentation results. In the present work, we evaluate different combinations of state-of-the-art (SOTA) DL architectures, loss functions, and types of images to identify the parameters most relevant to improve the segmentation results. We benchmark them to identify the top-performing ones and compare them to traditional fire segmentation techniques. Finally, we evaluate if the addition of attention modules on the best performing architecture can further improve the segmentation results. To the best of our knowledge, this is the first work that evaluates the impact of the architecture, loss function, and image type in the performance of DL-based wildfire segmentation models.


Author(s):  
Yiqi Xu

This paper studies the attitude-tracking control problem of spacecraft considering on-orbit refuelling. A time-varying inertia model is developed for spacecraft on-orbit refuelling, which actually includes two processes: fuel in the transfer pipe and fuel in the tank. Based upon the inertia model, an adaptive attitude-tracking controller is derived to guarantee the stability of the resulted closed-loop system, as well as asymptotic convergence of the attitude-tracking errors, despite performing refuelling operations. Finally, numerical simulations illustrate the effectiveness and performance of the proposed control scheme.


2011 ◽  
Vol 368-373 ◽  
pp. 2411-2416
Author(s):  
Jian Ping Han ◽  
Hai Peng Liu

Temporary or permanent supports are necessary in underground construction for maintaining the stability and limiting the damage of surrounding rock. Due to the uncertainty of geological structure, the specificity of the underground environment as well as other factors, the quality and performance of supporting structure are often difficult to satisfy the design requirements, which not only seriously affects the normal construction and operation of mines but also has the potential threat to the safety of underground production. In order to investigate the influence of the unfavorable geologic environment on supporting concrete and evaluate the real performance of roadway supports of a mine, 17 typical projects were chosen and the strength of supporting concrete was detected by nondestructive drilling core method. The result shows that the strength is widely less than the design value. Furthermore, 4 projects of them were investigated by the ground penetrating radar (GPR) in order to evaluate the feasibility of GPR in the performance investigation of the roadway supports of a mine. The results indicate that ground penetrating radar is capable of measuring the thickness of the support, the distribution of rebars and the defects of the surrounding rock.


Sign in / Sign up

Export Citation Format

Share Document