Identification of diatom taxonomy by a combination of region-based full convolutional network, online hard example mining, and shape priors of diatoms

Abstract. The use of multispectral imagery for monitoring biodiversity in ecosystems is becoming widespread. A key parameter of forest ecosystems is the distribution of dead wood. This work addresses the segmentation of individual dead tree crowns in nadir-view aerial infrared imagery. While dead vegetation produces a distinct spectral response in the near infrared band, separating adjacent trees within large swaths of dead stands remains a challenge. We tackle this problem by casting the segmentation task within the active contour framework, a mathematical formulation combining learned models of the object’s shape and appearance as prior information. We explore the use of a deep convolutional generative adversarial network (DCGAN) in the role of the shape model, replacing the original linear mixture-of-eigenshapes formulation. Also, we rely on probabilities obtained from a deep fully convolutional network (FCN) as the appearance prior. Experiments conducted on manually labeled reference polygons show that the DCGAN is able to learn a low-dimensional manifold of tree crown shapes, outperforming the eigenshape model with respect to the similarity of the reproduced and referenced shapes on about 45 % of the test samples. The DCGAN is successful mostly for less convex shapes, whereas the baseline remains superior for more regular tree crown polygons.

Download Full-text

Practically Feasible Design for Convolutional Network Code

IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences ◽

10.1587/transfun.e96.a.1895 ◽

2013 ◽

Vol E96.A (9) ◽

pp. 1895-1900 ◽

Cited By ~ 2

Author(s):

Songtao LIANG ◽

Haibin KAN

Keyword(s):

Network Code ◽

Convolutional Network

Download Full-text

Marine Isotope Stage (MIS) 5 on the Umnak Plateau, Bering Sea (IODP Site U1339): diatom taxonomy, grain size and isotopic composition of marine sediments as proxies for primary productivity and sea ice extent

10.31274/etd-180810-4257 ◽

2015 ◽

Author(s):

Derrick Ray Vaughn

Keyword(s):

Grain Size ◽

Sea Ice ◽

Isotopic Composition ◽

Marine Sediments ◽

Primary Productivity ◽

Bering Sea ◽

Marine Isotope Stage ◽

Sea Ice Extent ◽

Mis 5 ◽

Diatom Taxonomy

Download Full-text

Object Contour Tracking with Fusion of Color and Incremental Shape Priors

ACTA AUTOMATICA SINICA ◽

10.3724/sp.j.1004.2009.01394 ◽

2009 ◽

Vol 35 (11) ◽

pp. 1394-1402

Author(s):

Xue ZHOU ◽

Wei-Ming HU

Keyword(s):

Contour Tracking ◽

Shape Priors ◽

Object Contour ◽

Object Contour Tracking

Download Full-text

Knowledge and Geo-Object Based Graph Convolutional Network for Remote Sensing Semantic Segmentation

Sensors ◽

10.3390/s21113848 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3848

Author(s):

Wei Cui ◽

Meng Yao ◽

Yuanjie Hao ◽

Ziwei Wang ◽

Xin He ◽

...

Keyword(s):

Remote Sensing ◽

Prior Knowledge ◽

Contextual Information ◽

Information Aggregation ◽

Semantic Segmentation ◽

Spatial Correlations ◽

Convolutional Network ◽

Object Based ◽

Graph Neural Networks ◽

Salt And Pepper

Pixel-based semantic segmentation models fail to effectively express geographic objects and their topological relationships. Therefore, in semantic segmentation of remote sensing images, these models fail to avoid salt-and-pepper effects and cannot achieve high accuracy either. To solve these problems, object-based models such as graph neural networks (GNNs) are considered. However, traditional GNNs directly use similarity or spatial correlations between nodes to aggregate nodes’ information, which rely too much on the contextual information of the sample. The contextual information of the sample is often distorted, which results in a reduction in the node classification accuracy. To solve this problem, a knowledge and geo-object-based graph convolutional network (KGGCN) is proposed. The KGGCN uses superpixel blocks as nodes of the graph network and combines prior knowledge with spatial correlations during information aggregation. By incorporating the prior knowledge obtained from all samples of the study area, the receptive field of the node is extended from its sample context to the study area. Thus, the distortion of the sample context is overcome effectively. Experiments demonstrate that our model is improved by 3.7% compared with the baseline model named Cluster GCN and 4.1% compared with U-Net.

Download Full-text

SACN: A Novel Rotating Face Detector Based on Architecture Search

Electronics ◽

10.3390/electronics10050558 ◽

2021 ◽

Vol 10 (5) ◽

pp. 558

Author(s):

Anping Song ◽

Xiaokang Xu ◽

Xinyi Zhai

Keyword(s):

Face Detection ◽

Human Face ◽

Angle Error ◽

Rotation Invariant ◽

Convolutional Network ◽

Data Set ◽

Practical Applications ◽

Model Size ◽

Average Angle ◽

Face Detector

Rotation-Invariant Face Detection (RIPD) has been widely used in practical applications; however, the problem of the adjusting of the rotation-in-plane (RIP) angle of the human face still remains. Recently, several methods based on neural networks have been proposed to solve the RIP angle problem. However, these methods have various limitations, including low detecting speed, model size, and detecting accuracy. To solve the aforementioned problems, we propose a new network, called the Searching Architecture Calibration Network (SACN), which utilizes architecture search, fully convolutional network (FCN) and bounding box center cluster (CC). SACN was tested on the challenging Multi-Oriented Face Detection Data Set and Benchmark (MOFDDB) and achieved a higher detecting accuracy and almost the same speed as existing detectors. Moreover, the average angle error is optimized from the current 12.6° to 10.5°.

Download Full-text

Multi-Channel Temporal Graph Convolutional Network for Stock Return Prediction

2020 IEEE 18th International Conference on Industrial Informatics (INDIN) ◽

10.1109/indin45582.2020.9442196 ◽

2020 ◽

Author(s):

Jifeng Sun ◽

Jianwu Lin ◽

Yi Zhou

Keyword(s):

Stock Return ◽

Convolutional Network ◽

Stock Return Prediction ◽

Temporal Graph

Download Full-text

Proposing Gesture Recognition Algorithm Using Two-Stream Convolutional Network and LSTM

2020 IEEE Eighth International Conference on Communications and Electronics (ICCE) ◽

10.1109/icce48956.2021.9352147 ◽

2021 ◽

Author(s):

Phat Nguyen Huu ◽

Tien Luong Ngoc ◽

Quang Tran Minh

Keyword(s):

Gesture Recognition ◽

Recognition Algorithm ◽

Convolutional Network

Download Full-text

NPU RGB+D Dataset and a Feature-Enhanced LSTM-DGCN Method for Action Recognition of Basketball Players

Applied Sciences ◽

10.3390/app11104426 ◽

2021 ◽

Vol 11 (10) ◽

pp. 4426

Author(s):

Chunyan Ma ◽

Ji Fan ◽

Jinghao Yao ◽

Tao Zhang

Keyword(s):

Action Recognition ◽

Large Scale ◽

Short Term Memory ◽

Evaluation Criteria ◽

Image Data ◽

Basketball Player ◽

Basketball Players ◽

Convolutional Network ◽

Atomic Actions ◽

New Feature

Computer vision-based action recognition of basketball players in basketball training and competition has gradually become a research hotspot. However, owing to the complex technical action, diverse background, and limb occlusion, it remains a challenging task without effective solutions or public dataset benchmarks. In this study, we defined 32 kinds of atomic actions covering most of the complex actions for basketball players and built the dataset NPU RGB+D (a large scale dataset of basketball action recognition with RGB image data and Depth data captured in Northwestern Polytechnical University) for 12 kinds of actions of 10 professional basketball players with 2169 RGB+D videos and 75 thousand frames, including RGB frame sequences, depth maps, and skeleton coordinates. Through extracting the spatial features of the distances and angles between the joint points of basketball players, we created a new feature-enhanced skeleton-based method called LSTM-DGCN for basketball player action recognition based on the deep graph convolutional network (DGCN) and long short-term memory (LSTM) methods. Many advanced action recognition methods were evaluated on our dataset and compared with our proposed method. The experimental results show that the NPU RGB+D dataset is very competitive with the current action recognition algorithms and that our LSTM-DGCN outperforms the state-of-the-art action recognition methods in various evaluation criteria on our dataset. Our action classifications and this NPU RGB+D dataset are valuable for basketball player action recognition techniques. The feature-enhanced LSTM-DGCN has a more accurate action recognition effect, which improves the motion expression ability of the skeleton data.

Download Full-text

Effects of Pooling Operations on Prediction of Ligand Rotation‐Dependent Protein–Ligand Binding in 3D Graph Convolutional Network

Bulletin of the Korean Chemical Society ◽

10.1002/bkcs.12267 ◽

2021 ◽

Author(s):

Yeji Kim ◽

Jihoo Kim ◽

Won June Kim ◽

Eok Kyun Lee ◽

Insung S. Choi

Keyword(s):

Ligand Binding ◽

Convolutional Network ◽

Dependent Protein

Download Full-text