A Calibrated Multiexit Neural Network for Detecting Urothelial Cancer Cells

2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
L. Lilli ◽  
E. Giarnieri ◽  
S. Scardapane

Deep convolutional networks have become a powerful tool for medical imaging diagnostics. In pathology, most efforts have focused on the subfield of histology, while cytopathology (which studies diagnostic tools at the cellular level) remains underexplored. In this paper, we propose a novel deep learning model for cancer detection from urinary cytopathology screening images. We leverage recent ideas from the field of multioutput neural networks to provide a model that can train efficiently even on small-scale datasets, such as those typically found in real-world scenarios. Additionally, we argue that calibration (i.e., providing confidence levels that are aligned with the ground-truth probability of an event) has been a major shortcoming of prior work, and we experiment with a number of techniques to provide a well-calibrated model. We evaluate the proposed algorithm on a novel dataset and show that the combination of focal loss, multiple outputs, and temperature scaling yields a model that is significantly more accurate and better calibrated than a baseline deep convolutional network.
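
As a hedged illustration of the temperature-scaling step mentioned above (a sketch, not the authors' implementation), the code below fits a single temperature T on held-out validation logits by minimizing the negative log-likelihood, then applies it at inference; all tensors, shapes, and the two-class setup are synthetic placeholders.

```python
# Temperature scaling sketch: learn one scalar T on validation logits.
# Not the paper's code; the data here is synthetic and for illustration only.
import torch
import torch.nn.functional as F

def fit_temperature(logits: torch.Tensor, labels: torch.Tensor) -> float:
    """Find the scalar T that minimizes NLL of softmax(logits / T) on validation data."""
    log_t = torch.zeros(1, requires_grad=True)          # optimize log(T) so T stays positive
    optimizer = torch.optim.LBFGS([log_t], lr=0.1, max_iter=50)

    def closure():
        optimizer.zero_grad()
        loss = F.cross_entropy(logits / log_t.exp(), labels)
        loss.backward()
        return loss

    optimizer.step(closure)
    return log_t.exp().item()

# Hypothetical two-class (cancer / non-cancer) validation set with over-confident logits.
val_logits = 3.0 * torch.randn(256, 2)
val_labels = torch.randint(0, 2, (256,))
T = fit_temperature(val_logits, val_labels)
calibrated_probs = F.softmax(val_logits / T, dim=1)      # confidences after calibration
```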

2019 ◽  
Author(s):  
Peter K. Koo ◽  
Matt Ploenzke

Abstract: Deep convolutional networks trained on regulatory genomic sequences tend to learn distributed representations of sequence motifs across many first-layer filters. This makes it challenging to decipher which features are biologically meaningful. Here we introduce the exponential activation, which, when applied to first-layer filters, leads to more interpretable representations of motifs, both visually and quantitatively, compared to rectified linear units. We demonstrate this on synthetic DNA sequences with known ground-truth motifs, using various convolutional networks, and then show that the phenomenon holds for in vivo DNA sequences.
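
A minimal sketch of the core idea follows: a first convolutional layer over one-hot encoded DNA whose output passes through an exponential instead of a ReLU. The filter count, filter length, and sequence length are illustrative assumptions, not the paper's configuration.

```python
# Sketch of an exponential first-layer activation for DNA sequence models.
# Layer sizes are illustrative; this is not the authors' architecture.
import torch
import torch.nn as nn

class FirstLayerExp(nn.Module):
    """1D convolution over one-hot DNA (4 channels) with an exponential activation."""
    def __init__(self, num_filters: int = 32, filter_len: int = 19):
        super().__init__()
        self.conv = nn.Conv1d(4, num_filters, kernel_size=filter_len, padding=filter_len // 2)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # exp(.) sharpens filter responses, encouraging each filter to align
        # with a single motif rather than a distributed representation.
        return torch.exp(self.conv(x))

seqs = torch.randn(8, 4, 200)          # batch of 8 (pseudo) one-hot sequences, length 200
activations = FirstLayerExp()(seqs)    # shape: (8, 32, 200)
```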


2020 ◽  
Vol 34 (04) ◽  
pp. 3529-3536 ◽  
Author(s):  
Weiqi Chen ◽  
Ling Chen ◽  
Yu Xie ◽  
Wei Cao ◽  
Yusong Gao ◽  
...  

Traffic forecasting is of great importance to transportation management and public safety, and it is very challenging due to the complicated spatial-temporal dependencies and inherent uncertainty introduced by the road network and traffic conditions. Recent studies mainly focus on modeling the spatial dependency by applying graph convolutional networks (GCNs) over a fixed weighted graph. However, edges, i.e., the correlations between pairs of nodes, are much more complicated and interact with each other. In this paper, we propose the Multi-Range Attentive Bicomponent GCN (MRA-BGCN), a novel deep learning model for traffic forecasting. We first build the node-wise graph according to road network distance and the edge-wise graph according to various edge interaction patterns. Then, we model the interactions of both nodes and edges using bicomponent graph convolution. A multi-range attention mechanism is introduced to aggregate information over different neighborhood ranges and automatically learn the importance of each range. Extensive experiments on two real-world road network traffic datasets, METR-LA and PEMS-BAY, show that MRA-BGCN achieves state-of-the-art results.
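
To make the multi-range idea concrete, the hedged sketch below computes several propagation hops over a normalized adjacency matrix and fuses them with learned per-range attention weights. This is not the MRA-BGCN code; the simple propagation rule, layer names, and shapes are assumptions.

```python
# Sketch of multi-range attention over stacked propagation hops (illustrative only).
import torch
import torch.nn as nn

class MultiRangeAttention(nn.Module):
    def __init__(self, hidden_dim: int, num_ranges: int = 3):
        super().__init__()
        self.score = nn.Linear(hidden_dim, 1)    # scores each range's representation per node
        self.num_ranges = num_ranges

    def forward(self, adj: torch.Tensor, h: torch.Tensor) -> torch.Tensor:
        # adj: normalized adjacency (N, N); h: node features (N, hidden_dim)
        range_outputs = []
        for _ in range(self.num_ranges):
            h = adj @ h                           # one additional hop of propagation
            range_outputs.append(h)
        stacked = torch.stack(range_outputs)      # (num_ranges, N, hidden_dim)
        attn = torch.softmax(self.score(stacked), dim=0)   # per-node weight for each range
        return (attn * stacked).sum(dim=0)        # attention-weighted fusion over ranges

nodes, dim = 207, 64                              # e.g., METR-LA has 207 traffic sensors
fused = MultiRangeAttention(dim)(torch.rand(nodes, nodes) / nodes, torch.randn(nodes, dim))
```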


2010 ◽  
Vol 22 (2) ◽  
pp. 511-538 ◽  
Author(s):  
Srinivas C. Turaga ◽  
Joseph F. Murray ◽  
Viren Jain ◽  
Fabian Roth ◽  
Moritz Helmstaedter ◽  
...  

Many image segmentation algorithms first generate an affinity graph and then partition it. We present a machine learning approach to computing an affinity graph using a convolutional network (CN) trained using ground truth provided by human experts. The CN affinity graph can be paired with any standard partitioning algorithm and improves segmentation accuracy significantly compared to standard hand-designed affinity functions. We apply our algorithm to the challenging 3D segmentation problem of reconstructing neuronal processes from volumetric electron microscopy (EM) and show that we are able to learn a good affinity graph directly from the raw EM images. Further, we show that our affinity graph improves the segmentation accuracy of both simple and sophisticated graph partitioning algorithms. In contrast to previous work, we do not rely on prior knowledge in the form of hand-designed image features or image preprocessing. Thus, we expect our algorithm to generalize effectively to arbitrary image types.
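
As a hedged end-to-end illustration of this pipeline (not the authors' CN or their partitioners), the sketch below uses a toy network to predict per-edge affinities from a raw EM volume and then applies the simplest possible partitioner, thresholded connected components.

```python
# Affinity-graph sketch: toy 3D network -> affinities -> connected components.
# The network and the partitioning step are illustrative stand-ins only.
import torch
import torch.nn as nn
from scipy.ndimage import label

affinity_net = nn.Sequential(                      # stand-in for the trained convolutional network
    nn.Conv3d(1, 16, 3, padding=1), nn.ReLU(),
    nn.Conv3d(16, 3, 3, padding=1), nn.Sigmoid(),  # 3 output channels: x-, y-, z-edge affinities
)

em_volume = torch.rand(1, 1, 32, 64, 64)           # raw EM patch: (batch, channel, z, y, x)
affinities = affinity_net(em_volume)[0].detach().numpy()

# Simplest partitioner: keep voxels whose weakest affinity is high, then label components.
foreground = affinities.min(axis=0) > 0.5
segments, num_segments = label(foreground)
```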


2021 ◽  
Vol 12 (3) ◽  
pp. 1-22
Author(s):  
Xi Wang ◽  
Yibo Chai ◽  
Hui Li ◽  
Wenbin Wang ◽  
Weishan Sun

Traffic congestion has become a significant obstacle to the development of mega cities in China. Although local governments have invested heavily in road infrastructure, it remains insufficient for the increasing traffic demand. As a first step toward optimizing real-time traffic control, this study uses Shanghai Expressways as a case study to predict incident-related congestion. We propose a graph convolutional network-based model to identify correlations in multi-dimensional sensor-detected data while simultaneously taking environmental, spatiotemporal, and network features into account when predicting traffic conditions immediately after a traffic incident. The average accuracy, average AUC, and average F1 score of the predictive model are 92.78%, 95.98%, and 88.78%, respectively, on small-scale ground-truth data. Furthermore, we improve the predictive model's performance using semi-supervised learning by including more unlabeled data instances. As a result, the accuracy, AUC, and F1 score of the model increase by 2.69%, 1.25%, and 4.72%, respectively. These findings have important implications for improving the management and development of expressways in Shanghai, as well as in other metropolitan areas in China.
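
The semi-supervised procedure is not spelled out in the abstract, so the following is only a hedged pseudo-labelling sketch of one common way to fold unlabeled records into training; the toy classifier, feature width, loss weight, and confidence threshold are all assumptions.

```python
# Pseudo-labelling sketch for mixing labeled incident data with unlabeled records.
import torch
import torch.nn as nn
import torch.nn.functional as F

clf = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 2))   # toy congestion classifier
opt = torch.optim.Adam(clf.parameters(), lr=1e-3)

x_lab, y_lab = torch.randn(128, 16), torch.randint(0, 2, (128,))      # labeled incidents
x_unl = torch.randn(512, 16)                                          # unlabeled sensor records

for epoch in range(10):
    loss = F.cross_entropy(clf(x_lab), y_lab)           # supervised loss on ground truth
    with torch.no_grad():
        probs = F.softmax(clf(x_unl), dim=1)
        conf, pseudo = probs.max(dim=1)
        keep = conf > 0.95                               # only keep confident pseudo-labels
    if keep.any():
        loss = loss + 0.5 * F.cross_entropy(clf(x_unl[keep]), pseudo[keep])
    opt.zero_grad()
    loss.backward()
    opt.step()
```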


2021 ◽  
Vol 11 (15) ◽  
pp. 6975
Author(s):  
Tao Zhang ◽  
Lun He ◽  
Xudong Li ◽  
Guoqing Feng

Lipreading aims to recognize sentences being spoken by a talking face. In recent years, lipreading methods have achieved high accuracy on large datasets and made breakthrough progress. However, lipreading is still far from solved: existing methods tend to have high error rates on in-the-wild data and suffer from vanishing training gradients and slow convergence. To overcome these problems, we propose an efficient end-to-end sentence-level lipreading model that uses an encoder based on a 3D convolutional network, ResNet50, and a Temporal Convolutional Network (TCN), with a CTC objective function as the decoder. More importantly, the proposed architecture incorporates the TCN as a feature learner to decode features. It can partly eliminate the vanishing-gradient and performance limitations of RNNs (LSTM, GRU), which yields a notable performance improvement as well as faster convergence. Experiments show that training and convergence are 50% faster than the state-of-the-art method, and accuracy improves by 2.4% on the GRID dataset.
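
A rough skeleton of such an encoder-decoder is sketched below under stated assumptions: a 3D convolutional front-end, a pooled stand-in for the ResNet50 backbone, dilated temporal convolutions, and per-frame character logits suitable for a CTC loss. Every layer size is illustrative, and this is not the authors' model.

```python
# Skeleton of a 3D-conv + temporal-conv lipreading encoder with CTC-ready outputs.
import torch
import torch.nn as nn

class LipreadSketch(nn.Module):
    def __init__(self, vocab_size: int = 28):                  # characters + blank (assumed)
        super().__init__()
        self.front3d = nn.Conv3d(3, 32, kernel_size=(5, 7, 7),
                                 stride=(1, 2, 2), padding=(2, 3, 3))
        self.pool = nn.AdaptiveAvgPool3d((None, 1, 1))          # stand-in for the ResNet50 backbone
        self.tcn = nn.Sequential(                               # two dilated temporal conv layers
            nn.Conv1d(32, 64, 3, padding=1, dilation=1), nn.ReLU(),
            nn.Conv1d(64, 64, 3, padding=2, dilation=2), nn.ReLU(),
        )
        self.head = nn.Linear(64, vocab_size)                   # per-frame logits for CTC decoding

    def forward(self, video):                                   # video: (B, 3, T, H, W)
        x = self.front3d(video)
        x = self.pool(x).squeeze(-1).squeeze(-1)                # (B, 32, T)
        x = self.tcn(x)                                         # (B, 64, T)
        # transpose to (T, B, vocab) before handing log-probs to nn.CTCLoss
        return self.head(x.transpose(1, 2)).log_softmax(-1)     # (B, T, vocab)

log_probs = LipreadSketch()(torch.randn(2, 3, 75, 64, 128))     # two 75-frame clips
```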


Sensors ◽  
2021 ◽  
Vol 21 (11) ◽  
pp. 3936
Author(s):  
Yannis Spyridis ◽  
Thomas Lagkas ◽  
Panagiotis Sarigiannidis ◽  
Vasileios Argyriou ◽  
Antonios Sarigiannidis ◽  
...  

Unmanned aerial vehicles (UAVs) in the role of flying anchor nodes have been proposed to assist the localisation of terrestrial Internet of Things (IoT) sensors and provide relay services in the context of the upcoming 6G networks. This paper considered the objective of tracing a mobile IoT device of unknown location, using a group of UAVs that were equipped with received signal strength indicator (RSSI) sensors. The UAVs employed measurements of the target’s radio frequency (RF) signal power to approach the target as quickly as possible. A deep learning model performed clustering in the UAV network at regular intervals, based on a graph convolutional network (GCN) architecture, which utilised information about the RSSI and the UAV positions. The number of clusters was determined dynamically at each instant using a heuristic method, and the partitions were determined by optimising an RSSI loss function. The proposed algorithm retained the clusters that approached the RF source more effectively, removing the rest of the UAVs, which returned to the base. Simulation experiments demonstrated the improvement of this method compared to a previous deterministic approach, in terms of the time required to reach the target and the total distance covered by the UAVs.
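
Purely as an illustration (not the paper's GCN, its heuristic for the number of clusters, or its RSSI loss), the sketch below runs one graph-convolution step over a fully connected UAV graph whose node features are positions and RSSI readings, then reads off a cluster assignment per UAV.

```python
# One graph-convolution step over UAV features followed by cluster assignment (sketch).
import torch
import torch.nn as nn

num_uavs, num_clusters = 12, 3
features = torch.cat([torch.rand(num_uavs, 3),          # (x, y, z) positions
                      torch.rand(num_uavs, 1)], dim=1)  # RSSI toward the RF target

adj = torch.ones(num_uavs, num_uavs) / num_uavs         # normalized fully connected graph
proj = nn.Linear(4, num_clusters)                       # projects mixed features to cluster logits

cluster_logits = proj(adj @ features)                   # one propagation + projection step
assignments = cluster_logits.argmax(dim=1)              # hard cluster index per UAV
```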


2021 ◽  
Vol 13 (14) ◽  
pp. 2794
Author(s):  
Shuhao Ran ◽  
Xianjun Gao ◽  
Yuanwei Yang ◽  
Shaohua Li ◽  
Guangbin Zhang ◽  
...  

Deep learning approaches have been widely used in automatic building extraction tasks and have made great progress in recent years. However, missed and false detections caused by spectral confusion remain a great challenge. Existing fully convolutional networks (FCNs) cannot effectively distinguish whether feature differences come from within one building or between a building and its adjacent non-building objects. To overcome these limitations, a building multi-feature fusion refined network (BMFR-Net) is presented in this paper to extract buildings accurately and completely. BMFR-Net is based on an encoding and decoding structure and mainly consists of two parts: the continuous atrous convolution pyramid (CACP) module and the multiscale output fusion constraint (MOFC) structure. The CACP module is positioned at the end of the contracting path and minimizes the loss of useful information during multiscale feature extraction and fusion by using parallel continuous small-scale atrous convolutions. To improve the ability to aggregate semantic information from the context, the MOFC structure performs predictive output at each stage of the expanding path and integrates the results into the network. Furthermore, the multilevel joint weighted loss function effectively updates parameters far from the output layer, enhancing the network's ability to learn low-level abstract features. The experimental results demonstrate that the proposed BMFR-Net outperforms five other state-of-the-art approaches in both visual interpretation and quantitative evaluation.
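
The sketch below shows a parallel atrous (dilated) convolution block in the spirit of the CACP module; the dilation rates, channel counts, and 1x1 fusion layer are illustrative assumptions rather than the published configuration.

```python
# Parallel atrous convolution pyramid sketch with 1x1 fusion (illustrative sizes).
import torch
import torch.nn as nn

class AtrousPyramid(nn.Module):
    def __init__(self, in_ch: int, out_ch: int, rates=(1, 2, 4, 8)):
        super().__init__()
        self.branches = nn.ModuleList(
            [nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=r, dilation=r) for r in rates]
        )
        self.fuse = nn.Conv2d(out_ch * len(rates), out_ch, kernel_size=1)

    def forward(self, x):
        # Run the input through parallel small-scale atrous convolutions,
        # then fuse the multiscale responses with a 1x1 convolution.
        return self.fuse(torch.cat([branch(x) for branch in self.branches], dim=1))

feat = torch.randn(1, 256, 32, 32)          # encoder feature map (illustrative)
fused = AtrousPyramid(256, 64)(feat)        # -> (1, 64, 32, 32)
```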


Author(s):  
Shengsheng Qian ◽  
Jun Hu ◽  
Quan Fang ◽  
Changsheng Xu

In this article, we focus on the fake news detection task and aim to automatically identify fake news from the vast amount of social media posts. To date, many approaches have been proposed to detect fake news, including traditional learning methods and deep learning-based models. However, there are three existing challenges: (i) how to represent social media posts effectively, since post content is varied and highly complicated; (ii) how to propose a data-driven method that increases the flexibility of the model to deal with samples in different contexts and news backgrounds; and (iii) how to fully utilize the additional auxiliary information (background knowledge and multi-modal information) of posts for better representation learning. To tackle the above challenges, we propose novel Knowledge-aware Multi-modal Adaptive Graph Convolutional Networks (KMAGCN) to capture semantic representations by jointly modeling textual information, knowledge concepts, and visual information in a unified framework for fake news detection. We model posts as graphs and use a knowledge-aware multi-modal adaptive graph learning principle for effective feature learning. Compared with existing methods, the proposed KMAGCN addresses the challenges from three aspects: (1) it models posts as graphs to capture non-consecutive and long-range semantic relations; (2) it proposes a novel adaptive graph convolutional network to handle the variability of graph data; and (3) it leverages textual information, knowledge concepts, and visual information jointly for model learning. We have conducted extensive experiments on three public real-world datasets, and the superior results demonstrate the effectiveness of KMAGCN compared with other state-of-the-art algorithms.
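
To illustrate the adaptive-graph idea in isolation (an assumption-laden sketch, not the KMAGCN implementation), the code below blends a given post graph with a learned, data-driven graph before propagating node features.

```python
# Adaptive graph convolution sketch: mix a fixed graph with a learned one.
import torch
import torch.nn as nn

class AdaptiveGraphConv(nn.Module):
    def __init__(self, num_nodes: int, in_dim: int, out_dim: int):
        super().__init__()
        self.node_emb = nn.Parameter(torch.randn(num_nodes, 16))   # learns a data-driven graph
        self.proj = nn.Linear(in_dim, out_dim)

    def forward(self, adj: torch.Tensor, x: torch.Tensor) -> torch.Tensor:
        learned = torch.softmax(self.node_emb @ self.node_emb.T, dim=1)
        mixed = 0.5 * adj + 0.5 * learned          # fixed post graph blended with the learned one
        return torch.relu(self.proj(mixed @ x))

nodes, dim = 30, 300                               # e.g., 30 word/concept nodes per post
layer = AdaptiveGraphConv(nodes, dim, 128)
out = layer(torch.eye(nodes), torch.randn(nodes, dim))   # (30, 128) node representations
```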


2020 ◽  
Vol 22 (Supplement_3) ◽  
pp. iii359-iii359
Author(s):  
Lydia Tam ◽  
Edward Lee ◽  
Michelle Han ◽  
Jason Wright ◽  
Leo Chen ◽  
...  

Abstract BACKGROUND Brain tumors are the most common solid malignancies in childhood, many of which develop in the posterior fossa (PF). Manual tumor measurements are frequently required to optimize registration into surgical navigation systems or for surveillance of nonresectable tumors after therapy. With recent advances in artificial intelligence (AI), automated MRI-based tumor segmentation is now feasible without requiring manual measurements. Our goal was to create a deep learning model for automated PF tumor segmentation that can register into navigation systems and provide volume output. METHODS 720 pre-surgical MRI scans from five pediatric centers were divided into training, validation, and testing datasets. The study cohort comprised four PF tumor types: medulloblastoma, diffuse midline glioma, ependymoma, and brainstem or cerebellar pilocytic astrocytoma. Manual segmentation of the tumors by an attending neuroradiologist served as “ground truth” labels for model training and evaluation. We used a 2D U-Net, an encoder-decoder convolutional neural network architecture, with a pre-trained ResNet50 encoder. We assessed tumor segmentation accuracy on a held-out test set using the Dice similarity coefficient (0–1) and compared tumor volume calculation between manual and model-derived segmentations using linear regression. RESULTS Compared to the expert ground-truth segmentation, the overall Dice score was 0.83 for automatic delineation of the 4 tumor types. CONCLUSIONS In this multi-institutional study, we present a deep learning algorithm that automatically delineates PF tumors and outputs volumetric information. Our results demonstrate applied AI that is clinically applicable, potentially augmenting radiologists, neuro-oncologists, and neurosurgeons for tumor evaluation, surveillance, and surgical planning.
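
For reference, a minimal sketch of the Dice similarity coefficient used to score the segmentations is given below; the mask shapes and smoothing constant are illustrative.

```python
# Dice similarity coefficient for binary segmentation masks (sketch).
import torch

def dice_coefficient(pred: torch.Tensor, target: torch.Tensor, eps: float = 1e-6) -> float:
    """Dice = 2|A ∩ B| / (|A| + |B|) for binary masks, in [0, 1]."""
    pred, target = pred.float().flatten(), target.float().flatten()
    intersection = (pred * target).sum()
    return ((2 * intersection + eps) / (pred.sum() + target.sum() + eps)).item()

model_mask = torch.randint(0, 2, (64, 256, 256))    # predicted tumor voxels (illustrative)
expert_mask = torch.randint(0, 2, (64, 256, 256))   # neuroradiologist ground-truth voxels
print(dice_coefficient(model_mask, expert_mask))
```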

