Fast Quaternion Product Units for Learning Disentangled Representations in SO(3)

10.36227/techrxiv.17791574 ◽

2022 ◽

Author(s):

Shaofei Qin ◽

Xuan Zhang ◽

Hongteng Xu ◽

Yi Xu

Keyword(s):

Neural Networks ◽

Real World ◽

Message Passing ◽

Point Clouds ◽

Rotation Group ◽

Structured Data ◽

Data Indexing ◽

Basic Module ◽

Real Neuron ◽

Rotation Groups

Real-world 3D structured data like point clouds and skeletons often can be represented as data in a 3D rotation group (denoted as $\mathbb{SO}(3)$). However, most existing neural networks are tailored for the data in the Euclidean space, which makes the 3D rotation data not closed under their algebraic operations and leads to sub-optimal performance in 3D-related learning tasks. To resolve the issues caused by the above mismatching between data and model, we propose a novel non-real neuron model called \textit{quaternion product unit} (QPU) to represent data on 3D rotation groups. The proposed QPU leverages quaternion algebra and the law of the 3D rotation group, representing 3D rotation data as quaternions and merging them via a weighted chain of Hamilton products. We demonstrate that the QPU mathematically maintains the $\mathbb{SO}(3)$ structure of the 3D rotation data during the inference process and disentangles the 3D representations into ``rotation-invariant'' features and ``rotation-equivariant'' features, respectively. Moreover, we design a fast QPU to accelerate the computation of QPU. The fast QPU applies a tree-structured data indexing process, and accordingly, leverages the power of parallel computing, which reduces the computational complexity of QPU in a single thread from $\mathcal{O}(N)$ to $\mathcal {O}(\log N)$. Taking the fast QPU as a basic module, we develop a series of quaternion neural networks (QNNs), including quaternion multi-layer perceptron (QMLP), quaternion message passing (QMP), and so on. In addition, we make the QNNs compatible with conventional real-valued neural networks and applicable for both skeletons and point clouds. Experiments on synthetic and real-world 3D tasks show that the QNNs based on our fast QPUs are superior to state-of-the-art real-valued models, especially in the scenarios requiring the robustness to random rotations.<br>

Download Full-text

USING SIMULATION DATA FROM GAMING ENVIRONMENTS FOR TRAINING A DEEP LEARNING ALGORITHM ON 3D POINT CLOUDS

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-viii-4-w2-2021-67-2021 ◽

2021 ◽

Vol VIII-4/W2-2021 ◽

pp. 67-74

Author(s):

S. Spiegel ◽

J. Chen

Keyword(s):

Neural Networks ◽

Real World ◽

Resource Constraints ◽

Learning Algorithm ◽

Point Clouds ◽

Training Data ◽

Lidar Data ◽

Learning Approaches ◽

Data Set ◽

3D Point Clouds

Abstract. Deep neural networks (DNNs) and convolutional neural networks (CNNs) have demonstrated greater robustness and accuracy in classifying two-dimensional images and three-dimensional point clouds compared to more traditional machine learning approaches. However, their main drawback is the need for large quantities of semantically labeled training data sets, which are often out of reach for those with resource constraints. In this study, we evaluated the use of simulated 3D point clouds for training a CNN learning algorithm to segment and classify 3D point clouds of real-world urban environments. The simulation involved collecting light detection and ranging (LiDAR) data using a simulated 16 channel laser scanner within the the CARLA (Car Learning to Act) autonomous vehicle gaming environment. We used this labeled data to train the Kernel Point Convolution (KPConv) and KPConv Segmentation Network for Point Clouds (KP-FCNN), which we tested on real-world LiDAR data from the NPM3D benchmark data set. Our results showed that high accuracy can be achieved using data collected in a simulator.

Download Full-text

Message Passing Attention Networks for Document Understanding

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6376 ◽

2020 ◽

Vol 34 (05) ◽

pp. 8544-8551 ◽

Cited By ~ 2

Author(s):

Giannis Nikolentzos ◽

Antoine Tixier ◽

Michalis Vazirgiannis

Keyword(s):

Neural Networks ◽

Text Classification ◽

Message Passing ◽

State Of The Art ◽

Structured Data ◽

Attention Networks ◽

Document Understanding ◽

Standard Text ◽

Graph Neural Networks ◽

The Impact

Graph neural networks have recently emerged as a very effective framework for processing graph-structured data. These models have achieved state-of-the-art performance in many tasks. Most graph neural networks can be described in terms of message passing, vertex update, and readout functions. In this paper, we represent documents as word co-occurrence networks and propose an application of the message passing framework to NLP, the Message Passing Attention network for Document understanding (MPAD). We also propose several hierarchical variants of MPAD. Experiments conducted on 10 standard text classification datasets show that our architectures are competitive with the state-of-the-art. Ablation studies reveal further insights about the impact of the different components on performance. Code is publicly available at: https://github.com/giannisnik/mpad.

Download Full-text

Improving Attention Mechanism in Graph Neural Networks via Cardinality Preservation

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/194 ◽

2020 ◽

Author(s):

Shuo Zhang ◽

Lei Xie

Keyword(s):

Neural Networks ◽

Theoretical Analysis ◽

Message Passing ◽

Representation Learning ◽

Attention Mechanism ◽

Structured Data ◽

Clear Understanding ◽

Graph Classification ◽

Competitive Performance ◽

Graph Neural Networks

Graph Neural Networks (GNNs) are powerful for the representation learning of graph-structured data. Most of the GNNs use a message-passing scheme, where the embedding of a node is iteratively updated by aggregating the information from its neighbors. To achieve a better expressive capability of node influences, attention mechanism has grown to be popular to assign trainable weights to the nodes in aggregation. Though the attention-based GNNs have achieved remarkable results in various tasks, a clear understanding of their discriminative capacities is missing. In this work, we present a theoretical analysis of the representational properties of the GNN that adopts the attention mechanism as an aggregator. Our analysis determines all cases when those attention-based GNNs can always fail to distinguish certain distinct structures. Those cases appear due to the ignorance of cardinality information in attention-based aggregation. To improve the performance of attention-based GNNs, we propose cardinality preserved attention (CPA) models that can be applied to any kind of attention mechanisms. Our experiments on node and graph classification confirm our theoretical analysis and show the competitive performance of our CPA models. The code is available online: https://github.com/zetayue/CPA.

Download Full-text

Human Activity Classification Based on Point Clouds Measured by Millimeter Wave MIMO Radar with Deep Recurrent Neural Networks

IEEE Sensors Journal ◽

10.1109/jsen.2021.3068388 ◽

2021 ◽

pp. 1-1

Author(s):

Youngwook Kim ◽

Ibrahim Alnujaim ◽

Daegun Oh

Keyword(s):

Neural Networks ◽

Millimeter Wave ◽

Human Activity ◽

Recurrent Neural Networks ◽

Point Clouds ◽

Mimo Radar ◽

Activity Classification

Download Full-text

Graph convolutional neural networks with node transition probability-based message passing and DropNode regularization

Expert Systems with Applications ◽

10.1016/j.eswa.2021.114711 ◽

2021 ◽

Vol 174 ◽

pp. 114711

Author(s):

Tien Huu Do ◽

Duc Minh Nguyen ◽

Giannis Bekoulis ◽

Adrian Munteanu ◽

Nikos Deligiannis

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Message Passing ◽

Transition Probability

Download Full-text

Learning Rotation-Invariant Representations of Point Clouds Using Aligned Edge Convolutional Neural Networks

2020 International Conference on 3D Vision (3DV) ◽

10.1109/3dv50981.2020.00030 ◽

2020 ◽

Author(s):

Junming Zhang ◽

Ming-Yuan Yu ◽

Ram Vasudevan ◽

Matthew Johnson-Roberson

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Point Clouds ◽

Rotation Invariant ◽

Invariant Representations

Download Full-text

Deep learning-enabled medical computer vision

npj Digital Medicine ◽

10.1038/s41746-020-00376-2 ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Andre Esteva ◽

Katherine Chou ◽

Serena Yeung ◽

Nikhil Naik ◽

Ali Madani ◽

...

Keyword(s):

Artificial Intelligence ◽

Neural Networks ◽

Computer Vision ◽

Deep Learning ◽

Medical Imaging ◽

Real World ◽

Recent Progress ◽

Medical Applications ◽

Modern Computer ◽

Medical Computer

AbstractA decade of unprecedented progress in artificial intelligence (AI) has demonstrated the potential for many fields—including medicine—to benefit from the insights that AI techniques can extract from data. Here we survey recent progress in the development of modern computer vision techniques—powered by deep learning—for medical applications, focusing on medical imaging, medical video, and clinical deployment. We start by briefly summarizing a decade of progress in convolutional neural networks, including the vision tasks they enable, in the context of healthcare. Next, we discuss several example medical imaging applications that stand to benefit—including cardiology, pathology, dermatology, ophthalmology–and propose new avenues for continued work. We then expand into general medical video, highlighting ways in which clinical workflows can integrate computer vision to enhance care. Finally, we discuss the challenges and hurdles required for real-world clinical deployment of these technologies.

Download Full-text