3d object classification
Recently Published Documents


TOTAL DOCUMENTS

81
(FIVE YEARS 32)

H-INDEX

11
(FIVE YEARS 2)

2021 ◽  
pp. 459-470
Author(s):  
Fouzia Adjailia ◽  
Andrinandrasana David Rasamoelina ◽  
Peter Sincak

Author(s):  
Wenju Wang ◽  
Yu Cai ◽  
Tao Wang

AbstractThe existing view-based 3D object classification and recognition methods ignore the inherent hierarchical correlation and distinguishability of views, making it difficult to further improve the classification accuracy. In order to solve this problem, this paper proposes an end-to-end multi-view dual attention network framework for high-precision recognition of 3D objects. On one hand, we obtain three feature layers of query, key, and value through the convolution layer. The spatial attention matrix is generated by the key-value pairs of query and key, and each feature in the value of the original feature space branch is assigned different importance, which clearly captures the prominent detail features in the view, generates the view space shape descriptor, and focuses on the detail part of the view with the feature of category discrimination. On the other hand, a channel attention vector is obtained by compressing the channel information in different views, and the attention weight of each view feature is scaled to find the correlation between the target views and focus on the view with important features in all views. Integrating the two feature descriptors together to generate global shape descriptors of the 3D model, which has a stronger response to the distinguishing features of the object model and can be used for high-precision 3D object recognition. The proposed method achieves an overall accuracy of 96.6% and an average accuracy of 95.5% on the open-source ModelNet40 dataset, compiled by Princeton University when using Resnet50 as the basic CNN model. Compared with the existing deep learning methods, the experimental results demonstrate that the proposed method achieves state-of-the-art performance in the 3D object classification accuracy.


2021 ◽  
Author(s):  
Liping Nong ◽  
Junyi Wang ◽  
Jiming Lin ◽  
Hongbing Qiu ◽  
Lin Zheng ◽  
...  

Displays ◽  
2021 ◽  
pp. 102076
Author(s):  
Weiwei Cai ◽  
Dong Liu ◽  
Xin Ning ◽  
Chen Wang ◽  
Guojie Xie

2021 ◽  
Author(s):  
Weihao Lu ◽  
Dezong Zhao ◽  
Cristiano Premebida ◽  
Wen-Hua Chen ◽  
Daxin Tian

2021 ◽  
Vol 38 (2) ◽  
pp. 321-330
Author(s):  
Imen Hamrouni Trimech ◽  
Ahmed Maalej ◽  
Najoua Essoukri Ben Amara

Point cloud-based Deep Neural Networks (DNNs) have gained increasing attention as an insightful solution in the study field of geometric deep learning. Point set aware DNNs have proven capable of dealing with the unstructured data type and successful in 3D data applications such as 3D object classification, segmentation and recognition. On the other hand, two major challenges remain understudied when it comes to the use of point cloud-based DNNs for 3D facial expression (FE) recognition. The first challenge is the lack of large labelled 3D facial data. The second is how to obtain a point-based discriminative representation of 3D faces. To address the first issue, we suggest to enlarge the used dataset by generating synthetic 3D FEs. For the second one, we propose to apply a level-curve based sampling strategy in order to exploit crucial geometric information. The conducted experiments show promising results reaching 97.23% on the enlarged BU-3DFE dataset.


Sensors ◽  
2021 ◽  
Vol 21 (8) ◽  
pp. 2644
Author(s):  
Long Hoang ◽  
Suk-Hwan Lee ◽  
Ki-Ryong Kwon

A vital and challenging task in computer vision is 3D Object Classification and Retrieval, with many practical applications such as an intelligent robot, autonomous driving, multimedia contents processing and retrieval, and augmented/mixed reality. Various deep learning methods were introduced for solving classification and retrieval problems of 3D objects. Almost all view-based methods use many views to handle spatial loss, although they perform the best among current techniques such as View-based, Voxelization, and Point Cloud methods. Many views make network structure more complicated due to the parallel Convolutional Neural Network (CNN). We propose a novel method that combines a Global Point Signature Plus with a Deep Wide Residual Network, namely GPSP-DWRN, in this paper. Global Point Signature Plus (GPSPlus) is a novel descriptor because it can capture more shape information of the 3D object for a single view. First, an original 3D model was converted into a colored one by applying GPSPlus. Then, a 32 × 32 × 3 matrix stored the obtained 2D projection of this color 3D model. This matrix was the input data of a Deep Residual Network, which used a single CNN structure. We evaluated the GPSP-DWRN for a retrieval task using the Shapnetcore55 dataset, while using two well-known datasets—ModelNet10 and ModelNet40 for a classification task. Based on our experimental results, our framework performed better than the state-of-the-art methods.


2021 ◽  
Vol 547 ◽  
pp. 984-995
Author(s):  
An-An Liu ◽  
Heyu Zhou ◽  
Weizhi Nie ◽  
Zhenguang Liu ◽  
Wu Liu ◽  
...  

2021 ◽  
Vol 30 ◽  
pp. 7486-7498
Author(s):  
Yuyang Liu ◽  
Yang Cong ◽  
Gan Sun ◽  
Tao Zhang ◽  
Jiahua Dong ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document