Temporal Motionless Analysis of Video using CNN in MPSoC

10.36227/techrxiv.12668831 ◽

2020 ◽

Author(s):

Somdip Dey ◽

Amit Singh ◽

Dilip Kumar Prasad ◽

Klaus D. Mcdonald-Maier

Keyword(s):

Neural Network ◽

Energy Consumption ◽

Convolutional Neural Network ◽

State Of The Art ◽

Training Image ◽

Experimental Results ◽

Current Frame ◽

Previous Image

<div><div><div><p>This paper proposes a novel human-inspired methodology called IRON-MAN (Integrated RatiONal prediction and Motionless ANalysis of videos) on mobile multi-processor systems-on-chips (MPSoCs). The methodology integrates analysis of the previous image frames of the video to represent the analysis of the current frame in order to perform Temporal Motionless Analysis of the Video (TMAV). This is the first work on TMAV using Convolutional Neural Network (CNN) for scene prediction in MPSoCs. Experimental results show that our methodology outperforms state-of-the-art. We also introduce a metric named, Energy Consumption per Training Image (ECTI) to assess the suitability of using a CNN model in mobile MPSoCs with a focus on energy consumption of the device.</p></div></div></div>

Download Full-text

PedNet: A Spatio-Temporal Deep Convolutional Neural Network for Pedestrian Segmentation

Journal of Imaging ◽

10.3390/jimaging4090107 ◽

2018 ◽

Vol 4 (9) ◽

pp. 107 ◽

Cited By ~ 5

Author(s):

Mohib Ullah ◽

Ahmed Mohammed ◽

Faouzi Alaya Cheikh

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Performance Metrics ◽

State Of The Art ◽

Temporal Information ◽

Feature Maps ◽

Current Frame ◽

Low Level ◽

Art Methods ◽

Spatio Temporal

Articulation modeling, feature extraction, and classification are the important components of pedestrian segmentation. Usually, these components are modeled independently from each other and then combined in a sequential way. However, this approach is prone to poor segmentation if any individual component is weakly designed. To cope with this problem, we proposed a spatio-temporal convolutional neural network named PedNet which exploits temporal information for spatial segmentation. The backbone of the PedNet consists of an encoder–decoder network for downsampling and upsampling the feature maps, respectively. The input to the network is a set of three frames and the output is a binary mask of the segmented regions in the middle frame. Irrespective of classical deep models where the convolution layers are followed by a fully connected layer for classification, PedNet is a Fully Convolutional Network (FCN). It is trained end-to-end and the segmentation is achieved without the need of any pre- or post-processing. The main characteristic of PedNet is its unique design where it performs segmentation on a frame-by-frame basis but it uses the temporal information from the previous and the future frame for segmenting the pedestrian in the current frame. Moreover, to combine the low-level features with the high-level semantic information learned by the deeper layers, we used long-skip connections from the encoder to decoder network and concatenate the output of low-level layers with the higher level layers. This approach helps to get segmentation map with sharp boundaries. To show the potential benefits of temporal information, we also visualized different layers of the network. The visualization showed that the network learned different information from the consecutive frames and then combined the information optimally to segment the middle frame. We evaluated our approach on eight challenging datasets where humans are involved in different activities with severe articulation (football, road crossing, surveillance). The most common CamVid dataset which is used for calculating the performance of the segmentation algorithm is evaluated against seven state-of-the-art methods. The performance is shown on precision/recall, F 1 , F 2 , and mIoU. The qualitative and quantitative results show that PedNet achieves promising results against state-of-the-art methods with substantial improvement in terms of all the performance metrics.

Download Full-text

Global and Local Attention-Based Free-Form Image Inpainting

Sensors ◽

10.3390/s20113204 ◽

2020 ◽

Vol 20 (11) ◽

pp. 3204

Author(s):

S. M. Nadim Uddin ◽

Yong Ju Jung

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

State Of The Art ◽

Image Inpainting ◽

Experimental Results ◽

Free Form ◽

Local Similarity ◽

Global And Local ◽

Similarity Information

Deep-learning-based image inpainting methods have shown significant promise in both rectangular and irregular holes. However, the inpainting of irregular holes presents numerous challenges owing to uncertainties in their shapes and locations. When depending solely on convolutional neural network (CNN) or adversarial supervision, plausible inpainting results cannot be guaranteed because irregular holes need attention-based guidance for retrieving information for content generation. In this paper, we propose two new attention mechanisms, namely a mask pruning-based global attention module and a global and local attention module to obtain global dependency information and the local similarity information among the features for refined results. The proposed method is evaluated using state-of-the-art methods, and the experimental results show that our method outperforms the existing methods in both quantitative and qualitative measures.

Download Full-text

ATG-PVD: Ticketing Parking Violations on A Drone

10.36227/techrxiv.12839156 ◽

2020 ◽

Author(s):

Hengli Wang ◽

Yuxuan Liu ◽

Huaiyang Huang ◽

Yuheng Pan ◽

Wenbin Yu ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Optical Flow ◽

State Of The Art ◽

Experimental Results ◽

Visual Slam ◽

Flow Estimation ◽

Optical Flow Estimation ◽

Car Detection ◽

Speed And Accuracy

In this paper, we introduce a novel suspect-and-investigate framework, which can be easily embedded in a drone for automated parking violation detection (PVD). Our proposed framework consists of: 1) SwiftFlow, an efficient and accurate convolutional neural network (CNN) for unsupervised optical flow estimation; 2) Flow-RCNN, a flow-guided CNN for car detection and classification; and 3) an illegally parked car (IPC) candidate investigation module developed based on visual SLAM. The proposed framework was successfully embedded in a drone from ATG Robotics. The experimental results demonstrate that, firstly, our proposed SwiftFlow outperforms all other state-of-the-art unsupervised optical flow estimation approaches in terms of both speed and accuracy; secondly, IPC candidates can be effectively and efficiently detected by our proposed Flow-RCNN, with a better performance than our baseline network, Faster-RCNN; finally, the actual IPCs can be successfully verified by our investigation module after drone re-localization.

Download Full-text

ATG-PVD: Ticketing Parking Violations on A Drone

10.36227/techrxiv.12839156.v1 ◽

2020 ◽

Author(s):

Hengli Wang ◽

Yuxuan Liu ◽

Huaiyang Huang ◽

Yuheng Pan ◽

Wenbin Yu ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Optical Flow ◽

State Of The Art ◽

Experimental Results ◽

Visual Slam ◽

Flow Estimation ◽

Optical Flow Estimation ◽

Car Detection ◽

Speed And Accuracy

In this paper, we introduce a novel suspect-and-investigate framework, which can be easily embedded in a drone for automated parking violation detection (PVD). Our proposed framework consists of: 1) SwiftFlow, an efficient and accurate convolutional neural network (CNN) for unsupervised optical flow estimation; 2) Flow-RCNN, a flow-guided CNN for car detection and classification; and 3) an illegally parked car (IPC) candidate investigation module developed based on visual SLAM. The proposed framework was successfully embedded in a drone from ATG Robotics. The experimental results demonstrate that, firstly, our proposed SwiftFlow outperforms all other state-of-the-art unsupervised optical flow estimation approaches in terms of both speed and accuracy; secondly, IPC candidates can be effectively and efficiently detected by our proposed Flow-RCNN, with a better performance than our baseline network, Faster-RCNN; finally, the actual IPCs can be successfully verified by our investigation module after drone re-localization.

Download Full-text

Concat Convolutional Neural Network for pulsar candidate selection

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/staa916 ◽

2020 ◽

Vol 494 (3) ◽

pp. 3110-3119

Author(s):

Qingguo Zeng ◽

Xiangru Li ◽

Haitao Lin

Keyword(s):

Neural Network ◽

Online Learning ◽

Convolutional Neural Network ◽

Radio Telescope ◽

State Of The Art ◽

Scientific Research ◽

Selection Method ◽

Experimental Results ◽

Candidate Selection ◽

Growth Speed

ABSTRACT Pulsar searching is essential for the scientific research in the field of physics and astrophysics. With the development of the radio telescope, the exploding volume and growth speed of candidates have brought about several challenges. Therefore, there is an urgent demand for developing an automatic, accurate, and efficient pulsar candidate selection method. To meet this need, this work designed a Concat Convolutional Neural Network (CCNN) to identify the candidates collected from the Five-hundred-meter Aperture Spherical Telescope (FAST) data. The CCNN extracts some ‘pulsar-like’ patterns from the diagnostic subplots using Convolutional Neural Network (CNN) and combines these CNN features by a concatenate layer. Therefore, the CCNN is an end-to-end learning model without any need for any intermediate labels, which makes CCNN suitable for the online learning pipeline of pulsar candidate selection. Experimental results on FAST data show that the CCNN outperforms the available state-of-the-art models in a similar scenario. In total, it misses only 4 real pulsars out of 326.

Download Full-text

Estimation of energy consumption of electric vehicles using Deep Convolutional Neural Network to reduce driver’s range anxiety

ISA Transactions ◽

10.1016/j.isatra.2019.08.055 ◽

2020 ◽

Vol 98 ◽

pp. 454-470 ◽

Cited By ~ 1

Author(s):

Shatrughan Modi ◽

Jhilik Bhattacharya ◽

Prasenjit Basak

Keyword(s):

Neural Network ◽

Energy Consumption ◽

Convolutional Neural Network ◽

Electric Vehicles ◽

Deep Convolutional Neural Network ◽

Range Anxiety

Download Full-text

Low-Rank and Sparse Based Deep-Fusion Convolutional Neural Network for Crowd Counting

Mathematical Problems in Engineering ◽

10.1155/2017/5046727 ◽

2017 ◽

Vol 2017 ◽

pp. 1-11 ◽

Cited By ~ 2

Author(s):

Siqi Tang ◽

Zhisong Pan ◽

Xingyu Zhou

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

State Of The Art ◽

Regression Method ◽

Low Rank ◽

Counting Method ◽

Direct Integral ◽

Crowd Counting ◽

Counting Methods ◽

Density Map

This paper proposes an accurate crowd counting method based on convolutional neural network and low-rank and sparse structure. To this end, we firstly propose an effective deep-fusion convolutional neural network to promote the density map regression accuracy. Furthermore, we figure out that most of the existing CNN based crowd counting methods obtain overall counting by direct integral of estimated density map, which limits the accuracy of counting. Instead of direct integral, we adopt a regression method based on low-rank and sparse penalty to promote accuracy of the projection from density map to global counting. Experiments demonstrate the importance of such regression process on promoting the crowd counting performance. The proposed low-rank and sparse based deep-fusion convolutional neural network (LFCNN) outperforms existing crowd counting methods and achieves the state-of-the-art performance.

Download Full-text

Performance Analysis of State of the Art Convolutional Neural Network Architectures in Bangla Handwritten Character Recognition

Pattern Recognition and Image Analysis ◽

10.1134/s1054661821010089 ◽

2021 ◽

Vol 31 (1) ◽

pp. 60-71

Author(s):

Tapotosh Ghosh ◽

Min-Ha-Zul Abedin ◽

Hasan Al Banna ◽

Nasirul Mumenin ◽

Mohammad Abu Yousuf

Keyword(s):

Neural Network ◽

Performance Analysis ◽

Convolutional Neural Network ◽

Character Recognition ◽

State Of The Art ◽

Network Architectures ◽

Handwritten Character Recognition ◽

Handwritten Character ◽

Neural Network Architectures

Download Full-text

Spam Detection on Social Media Using Semantic Convolutional Neural Network

International Journal of Knowledge Discovery in Bioinformatics ◽

10.4018/ijkdb.2018010102 ◽

2018 ◽

Vol 8 (1) ◽

pp. 12-26 ◽

Cited By ~ 16

Author(s):

Gauri Jain ◽

Manisha Sharma ◽

Basant Agarwal

Keyword(s):

Neural Network ◽

Social Media ◽

Convolutional Neural Network ◽

State Of The Art ◽

Spam Detection ◽

Learning Technology ◽

The Social ◽

Social Media Text ◽

Current Article ◽

Semantic Layer

This article describes how spam detection in the social media text is becoming increasing important because of the exponential increase in the spam volume over the network. It is challenging, especially in case of text within the limited number of characters. Effective spam detection requires more number of efficient features to be learned. In the current article, the use of a deep learning technology known as a convolutional neural network (CNN) is proposed for spam detection with an added semantic layer on the top of it. The resultant model is known as a semantic convolutional neural network (SCNN). A semantic layer is composed of training the random word vectors with the help of Word2vec to get the semantically enriched word embedding. WordNet and ConceptNet are used to find the word similar to a given word, in case it is missing in the word2vec. The architecture is evaluated on two corpora: SMS Spam dataset (UCI repository) and Twitter dataset (Tweets scrapped from public live tweets). The authors' approach outperforms the-state-of-the-art results with 98.65% accuracy on SMS spam dataset and 94.40% accuracy on Twitter dataset.

Download Full-text