cross entropy Latest Research Papers

Session-based recommendation aims to generate recommendations merely based on the ongoing session, which is a challenging task. Previous methods mainly focus on modeling the sequential signals or the transition relations between items in the current session using RNNs or GNNs to identify user’s intent for recommendation. Such models generally ignore the dynamic connections between the local and global item transition patterns, although the global information is taken into consideration by exploiting the global-level pair-wise item transitions. Moreover, existing methods that mainly adopt the cross-entropy loss with softmax generally face a serious over-fitting problem, harming the recommendation accuracy. Thus, in this article, we propose a Graph Co-Attentive Recommendation Machine (GCARM) for session-based recommendation. In detail, we first design a Graph Co-Attention Network (GCAT) to consider the dynamic correlations between the local and global neighbors of each node during the information propagation. Then, the item-level dynamic connections between the output of the local and global graphs are modeled to generate the final item representations. After that, we produce the prediction scores and design a Max Cross-Entropy (MCE) loss to prevent over-fitting. Extensive experiments are conducted on three benchmark datasets, i.e., Diginetica, Gowalla, and Yoochoose. The experimental results show that GCARM can achieve the state-of-the-art performance in terms of Recall and MRR, especially on boosting the ranking of the target item.

Download Full-text

Hypergraph Convolution on Nodes-Hyperedges Network for Semi-Supervised Node Classification

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3494567 ◽

2022 ◽

Vol 16 (4) ◽

pp. 1-19

Author(s):

Hanrui Wu ◽

Michael K. Ng

Keyword(s):

Deep Learning ◽

Classification Problem ◽

Cross Entropy ◽

Learning Approaches ◽

Entropy Loss ◽

Order Relations ◽

Data Representations ◽

Node Classification ◽

The Cross ◽

Full Consideration

Hypergraphs have shown great power in representing high-order relations among entities, and lots of hypergraph-based deep learning methods have been proposed to learn informative data representations for the node classification problem. However, most of these deep learning approaches do not take full consideration of either the hyperedge information or the original relationships among nodes and hyperedges. In this article, we present a simple yet effective semi-supervised node classification method named Hypergraph Convolution on Nodes-Hyperedges network, which performs filtering on both nodes and hyperedges as well as recovers the original hypergraph with the least information loss. Instead of only reducing the cross-entropy loss over the labeled samples as most previous approaches do, we additionally consider the hypergraph reconstruction loss as prior information to improve prediction accuracy. As a result, by taking both the cross-entropy loss on the labeled samples and the hypergraph reconstruction loss into consideration, we are able to achieve discriminative latent data representations for training a classifier. We perform extensive experiments on the semi-supervised node classification problem and compare the proposed method with state-of-the-art algorithms. The promising results demonstrate the effectiveness of the proposed method.

Download Full-text

Quantify pixel-level detection of dam surface crack using deep learning

Measurement Science and Technology ◽

10.1088/1361-6501/ac4b8d ◽

2022 ◽

Author(s):

Bo Chen ◽

Hua Zhang ◽

Yonglong Li ◽

Shuang Wang ◽

Huaifang Zhou ◽

...

Keyword(s):

Deep Learning ◽

Surface Crack ◽

Crack Detection ◽

State Of The Art ◽

Contextual Information ◽

Semantic Segmentation ◽

Quantitative Information ◽

Cross Entropy ◽

Detection Methods ◽

Water Conservancy

Abstract An increasing number of detection methods based on computer vision are applied to detect cracks in water conservancy infrastructure. However, most studies directly use existing feature extraction networks to extract cracks information, which are proposed for open-source datasets. As the cracks distribution and pixel features are different from these data, the extracted cracks information is incomplete. In this paper, a deep learning-based network for dam surface crack detection is proposed, which mainly addresses the semantic segmentation of cracks on the dam surface. Particularly, we design a shallow encoding network to extract features of crack images based on the statistical analysis of cracks. Further, to enhance the relevance of contextual information, we introduce an attention module into the decoding network. During the training, we use the sum of Cross-Entropy and Dice Loss as the loss function to overcome data imbalance. The quantitative information of cracks is extracted by the imaging principle after using morphological algorithms to extract the morphological features of the predicted result. We built a manual annotation dataset containing 1577 images to verify the effectiveness of the proposed method. This method achieves the state-of-the-art performance on our dataset. Specifically, the precision, recall, IoU, F1_measure, and accuracy achieve 90.81%, 81.54%, 75.23%, 85.93%, 99.76%, respectively. And the quantization error of cracks is less than 4%.

Download Full-text

Multiscale Deep Network with Centerness-Aware Loss for Salient Object Detection

Advances in Multimedia ◽

10.1155/2022/2243927 ◽

2022 ◽

Vol 2022 ◽

pp. 1-14

Author(s):

Liangliang Duan

Keyword(s):

Object Detection ◽

Network Architecture ◽

State Of The Art ◽

Saliency Detection ◽

Salient Object Detection ◽

Cross Entropy ◽

Salient Object ◽

Prediction Module ◽

Saliency Prediction ◽

Salient Regions

Deep encoder-decoder networks have been adopted for saliency detection and achieved state-of-the-art performance. However, most existing saliency models usually fail to detect very small salient objects. In this paper, we propose a multitask architecture, M2Net, and a novel centerness-aware loss for salient object detection. The proposed M2Net aims to solve saliency prediction and centerness prediction simultaneously. Specifically, the network architecture is composed of a bottom-up encoder module, top-down decoder module, and centerness prediction module. In addition, different from binary cross entropy, the proposed centerness-aware loss can guide the proposed M2Net to uniformly highlight the entire salient regions with well-defined object boundaries. Experimental results on five benchmark saliency datasets demonstrate that M2Net outperforms state-of-the-art methods on different evaluation metrics.

Download Full-text

Adaptive Feature Pyramid Network to Predict Crisp Boundaries via NMS Layer and ODS F-Measure Loss Function

Information ◽

10.3390/info13010032 ◽

2022 ◽

Vol 13 (1) ◽

pp. 32

Author(s):

Gang Sun ◽

Hancheng Yu ◽

Xiangtao Jiang ◽

Mingkui Feng

Keyword(s):

Edge Detection ◽

Loss Function ◽

State Of The Art ◽

Cross Entropy ◽

Post Processing ◽

Multi Scale ◽

Feature Pyramid ◽

Multi Level ◽

Different Levels ◽

F Measure

Edge detection is one of the fundamental computer vision tasks. Recent methods for edge detection based on a convolutional neural network (CNN) typically employ the weighted cross-entropy loss. Their predicted results being thick and needing post-processing before calculating the optimal dataset scale (ODS) F-measure for evaluation. To achieve end-to-end training, we propose a non-maximum suppression layer (NMS) to obtain sharp boundaries without the need for post-processing. The ODS F-measure can be calculated based on these sharp boundaries. So, the ODS F-measure loss function is proposed to train the network. Besides, we propose an adaptive multi-level feature pyramid network (AFPN) to better fuse different levels of features. Furthermore, to enrich multi-scale features learned by AFPN, we introduce a pyramid context module (PCM) that includes dilated convolution to extract multi-scale features. Experimental results indicate that the proposed AFPN achieves state-of-the-art performance on the BSDS500 dataset (ODS F-score of 0.837) and the NYUDv2 dataset (ODS F-score of 0.780).

Download Full-text

TB-NET: A Two-Branch Neural Network for Direction of Arrival Estimation under Model Imperfections

Electronics ◽

10.3390/electronics11020220 ◽

2022 ◽

Vol 11 (2) ◽

pp. 220

Author(s):

Liyu Lin ◽

Chaoran She ◽

Yun Chen ◽

Ziyu Guo ◽

Xiaoyang Zeng

Keyword(s):

Neural Network ◽

Deep Learning ◽

Direction Of Arrival ◽

Doa Estimation ◽

Cross Entropy ◽

Data Driven ◽

Estimation Accuracy ◽

Model Based ◽

Classification And Regression ◽

Grid Based

For direction of arrival (DoA) estimation, the data-driven deep-learning method has an advantage over the model-based methods since it is more robust against model imperfections. Conventionally, networks are based singly on regression or classification and may lead to unstable training and limited resolution. Alternatively, this paper proposes a two-branch neural network (TB-Net) that combines classification and regression in parallel. The grid-based classification branch is optimized by binary cross-entropy (BCE) loss and provides a mask that indicates the existence of the DoAs at predefined grids. The regression branch refines the DoA estimates by predicting the deviations from the grids. At the output layer, the outputs of the two branches are combined to obtain final DoA estimates. To achieve a lightweight model, only convolutional layers are used in the proposed TB-Net. The simulation results demonstrated that compared with the model-based and existing deep-learning methods, the proposed method can achieve higher DoA estimation accuracy in the presence of model imperfections and only has a size of 1.8 MB.

Download Full-text

Automated Clinical Decision Support for Coronary Plaques Characterization from Optical Coherence Tomography Imaging with Fused Neural Networks

Optics ◽

10.3390/opt3010002 ◽

2022 ◽

Vol 3 (1) ◽

pp. 8-18

Author(s):

Haroon Zafar ◽

Junaid Zafar ◽

Faisal Sharif

Keyword(s):

Neural Networks ◽

Optical Coherence Tomography ◽

Decision Support ◽

Real Time ◽

Transfer Learning ◽

Clinical Decision Support ◽

Clinical Decision ◽

Cross Entropy ◽

Optical Coherence ◽

Arterial Plaques

Deep Neural Networks (DNNs) are nurturing clinical decision support systems for the detection and accurate modeling of coronary arterial plaques. However, efficient plaque characterization in time-constrained settings is still an open problem. The purpose of this study is to develop a novel automated classification architecture viable for the real-time clinical detection and classification of coronary artery plaques, and secondly, to use the novel dataset of OCT images for data augmentation. Further, the purpose is to validate the efficacy of transfer learning for arterial plaques classification. In this perspective, a novel time-efficient classification architecture based on DNNs is proposed. A new data set consisting of in-vivo patient Optical Coherence Tomography (OCT) images labeled by three trained experts was created and dynamically programmed. Generative Adversarial Networks (GANs) were used for populating the coronary aerial plaques dataset. We removed the fully connected layers, including softmax and the cross-entropy in the GoogleNet framework, and replaced them with the Support Vector Machines (SVMs). Our proposed architecture limits weight up-gradation cycles to only modified layers and computes the global hyper-plane in a timely, competitive fashion. Transfer learning was used for high-level discriminative feature learning. Cross-entropy loss was minimized by using the Adam optimizer for model training. A train validation scheme was used to determine the classification accuracy. Automated plaques differentiation in addition to their detection was found to agree with the clinical findings. Our customized fused classification scheme outperforms the other leading reported works with an overall accuracy of 96.84%, and multiple folds reduced elapsed time demonstrating it as a viable choice for real-time clinical settings.

Download Full-text

River Segmentation of Remote Sensing Images Based on Composite Attention Network

Complexity ◽

10.1155/2022/7750281 ◽

2022 ◽

Vol 2022 ◽

pp. 1-13

Author(s):

Zhiyong Fan ◽

Jianmin Hou ◽

Qiang Zang ◽

Yunjie Chen ◽

Fei Yan

Keyword(s):

Remote Sensing ◽

Semantic Segmentation ◽

Cross Entropy ◽

Important Research ◽

Dice Coefficient ◽

Remote Sensing Images ◽

Training Process ◽

Attention Network ◽

Evaluation Indexes ◽

Agricultural Planning

River segmentation of remote sensing images is of important research significance and application value for environmental monitoring, disaster warning, and agricultural planning in an area. In this study, we propose a river segmentation model in remote sensing images based on composite attention network to solve the problems of abundant river details in images and the interference of non-river information including bridges, shadows, and roads. To improve the segmentation efficiency, a composite attention mechanism is firstly introduced in the central region of the network to obtain the global feature dependence of river information. Next, in this study, we dynamically combine binary cross-entropy loss that is designed for pixel-wise segmentation and the Dice coefficient loss that measures the similarity of two segmentation objects into a weighted one to optimize the training process of the proposed segmentation network. The experimental results show that compared with other semantic segmentation networks, the evaluation indexes of the proposed method are higher than those of others, and the river segmentation effect of CoANet model is significantly improved. This method can segment rivers in remote sensing images more accurately and coherently, which can meet the needs of subsequent research.

Download Full-text

Neural collapse under cross-entropy loss

Applied and Computational Harmonic Analysis ◽

10.1016/j.acha.2021.12.011 ◽

2022 ◽

Author(s):

Jianfeng Lu ◽

Stefan Steinerberger

Keyword(s):

Cross Entropy ◽

Entropy Loss

Download Full-text

An adaptive sparse polynomial dimensional decomposition based on Bayesian compressive sensing and cross-entropy

Structural and Multidisciplinary Optimization ◽

10.1007/s00158-021-03120-w ◽

2022 ◽

Vol 65 (1) ◽

Author(s):

Wanxin He ◽

Gang Li ◽

Zhaokun Nie

Keyword(s):

Compressive Sensing ◽

Cross Entropy ◽

Sparse Polynomial ◽

Dimensional Decomposition ◽

Polynomial Dimensional Decomposition

Download Full-text

cross entropy
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Graph Co-Attentive Session-based Recommendation

Hypergraph Convolution on Nodes-Hyperedges Network for Semi-Supervised Node Classification

Quantify pixel-level detection of dam surface crack using deep learning

Multiscale Deep Network with Centerness-Aware Loss for Salient Object Detection

Adaptive Feature Pyramid Network to Predict Crisp Boundaries via NMS Layer and ODS F-Measure Loss Function

TB-NET: A Two-Branch Neural Network for Direction of Arrival Estimation under Model Imperfections

Automated Clinical Decision Support for Coronary Plaques Characterization from Optical Coherence Tomography Imaging with Fused Neural Networks

River Segmentation of Remote Sensing Images Based on Composite Attention Network

Neural collapse under cross-entropy loss

An adaptive sparse polynomial dimensional decomposition based on Bayesian compressive sensing and cross-entropy

Export Citation Format

cross entropyRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Graph Co-Attentive Session-based Recommendation

Hypergraph Convolution on Nodes-Hyperedges Network for Semi-Supervised Node Classification

Quantify pixel-level detection of dam surface crack using deep learning

Multiscale Deep Network with Centerness-Aware Loss for Salient Object Detection

Adaptive Feature Pyramid Network to Predict Crisp Boundaries via NMS Layer and ODS F-Measure Loss Function

TB-NET: A Two-Branch Neural Network for Direction of Arrival Estimation under Model Imperfections

Automated Clinical Decision Support for Coronary Plaques Characterization from Optical Coherence Tomography Imaging with Fused Neural Networks

River Segmentation of Remote Sensing Images Based on Composite Attention Network

Neural collapse under cross-entropy loss

An adaptive sparse polynomial dimensional decomposition based on Bayesian compressive sensing and cross-entropy

cross entropy
Recently Published Documents