Automated Individual Cattle Identification Using Video Data: A Unified Deep Learning Architecture Approach

2021 ◽  
Vol 2 ◽  
Author(s):  
Yongliang Qiao ◽  
Cameron Clark ◽  
Sabrina Lomax ◽  
He Kong ◽  
Daobilige Su ◽  
...  

Individual cattle identification is a prerequisite and foundation for precision livestock farming. Existing methods for cattle identification require radio-frequency or visual ear tags, all of which are prone to loss or damage. Here, we propose and implement a new unified deep learning approach to cattle identification using video analysis. The proposed deep learning framework is composed of a Convolutional Neural Network (CNN) and a Bidirectional Long Short-Term Memory (BiLSTM) network with a self-attention mechanism. More specifically, the Inception-V3 CNN was used to extract features from a cattle video dataset captured from the rear view in a feedlot. Extracted features were then fed to a BiLSTM layer to capture spatio-temporal information. Self-attention was then employed to place a different focus on the features captured by the BiLSTM for the final step of cattle identification. We used a total of 363 rear-view videos from 50 cattle, collected at three different times with an interval of one month between data collection periods. The proposed method achieved 93.3% identification accuracy using a 30-frame video length, outperforming current state-of-the-art methods (Inception-V3, MLP, SimpleRNN, LSTM, and BiLSTM). Furthermore, two different attention schemes, namely additive and multiplicative attention mechanisms, were compared. Our results show that the additive attention mechanism achieved 93.3% accuracy and 91.0% recall, greater than the multiplicative attention mechanism's 90.7% accuracy and 87.0% recall. Video length also impacted accuracy, with sequences of up to 30 frames enhancing identification performance. Overall, our approach can capture key spatio-temporal features to improve cattle identification accuracy, enabling automated cattle identification for precision livestock farming.
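The additive (Bahdanau-style) attention step over per-frame BiLSTM features can be sketched as follows; the feature sizes and weight values are illustrative assumptions, not the paper's parameters:

```python
import numpy as np

def additive_attention(H, W, v):
    """Additive attention over BiLSTM outputs H (T x d):
    score_t = v . tanh(W @ h_t); weights = softmax(scores);
    context = sum_t weights_t * h_t."""
    scores = np.tanh(H @ W.T) @ v             # (T,) one score per frame
    scores -= scores.max()                    # numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()
    context = weights @ H                     # (d,) attention-weighted summary
    return context, weights

rng = np.random.default_rng(0)
T, d, a = 30, 8, 4                            # 30 frames, toy feature sizes
H = rng.normal(size=(T, d))                   # per-frame BiLSTM features
W = rng.normal(size=(a, d))
v = rng.normal(size=a)
context, weights = additive_attention(H, W, v)
```

The context vector is what the final identification layer would consume; the weights reveal which frames the model focused on.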

2021 ◽  
Vol 64 (6) ◽  
pp. 1823-1833
Author(s):  
Yangyang Guo ◽  
Yongliang Qiao ◽  
Salah Sukkarieh ◽  
Lilong Chai ◽  
Dongjian He

Highlights:
- BiGRU-attention based cow behavior classification was proposed.
- Key spatial-temporal features were captured for behavior representation.
- BiGRU-attention achieved >82% classification accuracy on calf and adult cow datasets.
- The proposed method could be used for similar animal behavior classification.

Abstract. Animal behavior consists of time-series activities that can reflect animals' health and welfare status. Monitoring and classifying animal behavior facilitates management decisions to optimize animal performance, welfare, and environmental outcomes. In recent years, deep learning methods have been applied to monitor animal behavior worldwide. To achieve high behavior classification accuracy, a BiGRU-attention based method is proposed in this article to classify common behaviors such as exploring, feeding, grooming, standing, and walking. In our work, (1) Inception-V3 was first applied to extract convolutional neural network (CNN) features for each image frame in the videos, (2) a bidirectional gated recurrent unit (BiGRU) was used to further extract spatial-temporal features, (3) an attention mechanism was deployed to allocate weights to each of the extracted spatial-temporal features according to feature similarity, and (4) the weighted spatial-temporal features were fed to a Softmax layer for behavior classification. Experiments were conducted on two datasets (calf and adult cow), and the proposed method achieved 82.35% and 82.26% classification accuracy on the calf and adult cow datasets, respectively. In comparison with other methods, the proposed BiGRU-attention method outperformed long short-term memory (LSTM), bidirectional LSTM (BiLSTM), and BiGRU. Overall, the proposed BiGRU-attention method can capture key spatial-temporal features to significantly improve animal behavior classification, which is favorable for automatic behavior classification in precision livestock farming.
Keywords: BiGRU, Cow behavior, Deep learning, LSTM, Precision livestock farming.
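Steps (3) and (4) of the pipeline above can be sketched in NumPy. "Weights according to feature similarity" is read here as dot-product similarity to the mean timestep feature; this reading, and all sizes, are assumptions for illustration:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def similarity_attention(F):
    """Weight each timestep feature by its similarity to the mean
    feature, then pool into one weighted spatial-temporal feature."""
    query = F.mean(axis=0)
    alpha = softmax(F @ query)            # similarity-based weights
    return alpha @ F                      # weighted feature vector

def classify(feature, Wc, b):
    """Softmax layer over behavior classes."""
    return softmax(feature @ Wc + b)

rng = np.random.default_rng(1)
T, d, n_classes = 16, 6, 5                # 5 behaviors: exploring, feeding, ...
F = rng.normal(size=(T, d))               # BiGRU timestep features (toy)
probs = classify(similarity_attention(F),
                 rng.normal(size=(d, n_classes)), np.zeros(n_classes))
```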


2021 ◽  
Author(s):  
Yu Rang Park ◽  
Sang Ho Hwang ◽  
Yeonsoo Yu ◽  
Jichul Kim ◽  
Taeyeop Lee ◽  
...  

BACKGROUND: Early detection and intervention of developmental disabilities (DDs) are critical for improving the long-term outcomes of affected children. Mobile-based applications are easily accessible and may thus help the early identification of DDs.
OBJECTIVE: We aimed to identify facial expressions and head poses based on face landmark data extracted from face recording videos and to differentiate the characteristics between children with DDs and those without.
METHODS: Eighty-nine children (DD, n=33; typically developing, n=56) were included in the analysis. Using the mobile-based application, we extracted facial landmarks and head poses from the recorded videos and performed Long Short-Term Memory (LSTM)-based DD classification.
RESULTS: Stratified k-fold cross-validation showed that the average accuracy, precision, recall, and F1-score of the LSTM-based deep learning model were 88%, 91%, 72%, and 80%, respectively. Through the interpretation of prediction results using SHapley Additive exPlanations (SHAP), we confirmed that the nodding head angle was the most important variable. All of the top 10 variables by importance showed significant differences in distribution between children with DDs and those without (p<0.05).
CONCLUSIONS: Our results provide preliminary evidence that a deep-learning classification model using mobile-based children's video data could be used for the early detection of DDs in children.
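For reference, the reported precision/recall/F1 trio is related by the standard formulas; the confusion counts below are toy values chosen to echo the reported pattern (high precision, lower recall), not the study's actual confusion matrix:

```python
def precision_recall_f1(tp, fp, fn):
    """Precision, recall, and F1 from confusion counts for the DD class."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

# Hypothetical counts: 18 DD children correctly flagged, 2 false alarms,
# 7 DD children missed.
p, r, f1 = precision_recall_f1(tp=18, fp=2, fn=7)
# p = 0.90, r = 0.72, f1 = 0.80
```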


2021 ◽  
Vol 11 (21) ◽  
pp. 10249
Author(s):  
Chien-Nguyen Nhu ◽  
Minho Park

Cloud computing is currently considered the most cost-effective platform for offering business and consumer IT services over the Internet. However, it is prone to new vulnerabilities. A new type of attack, called an economic denial of sustainability (EDoS) attack, exploits the pay-per-use model by scaling up resource usage over time to the extent that the cloud user has to pay for the unexpected usage charges. To prevent EDoS attacks, a few solutions have been proposed, including hard-threshold and machine learning-based solutions. Among them, long short-term memory (LSTM)-based solutions achieve much higher accuracy and lower false-alarm rates than hard-threshold and other machine learning-based solutions. However, LSTM requires a long input sequence length, leading to degraded performance owing to increased computation, longer detection times, and heavy consumption of the defense system's computing resources. We therefore propose a two-phase deep learning-based EDoS detection scheme that uses an LSTM model to detect each abnormal flow in network traffic, but requires an input sequence length of only five. Thus, the proposed scheme can take advantage of the efficiency of the LSTM algorithm in detecting each abnormal flow in network traffic while reducing the required input sequence length. A comprehensive performance evaluation shows that our proposed scheme outperforms the existing solutions in terms of accuracy and resource consumption.
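The short-sequence idea amounts to slicing the flow-feature stream into windows of length five before feeding the LSTM. A minimal sketch, assuming per-flow feature vectors and overlapping windows (the paper's exact preprocessing may differ):

```python
import numpy as np

SEQ_LEN = 5   # the short input sequence length used by the proposed scheme

def to_sequences(flow_features, seq_len=SEQ_LEN):
    """Slice a stream of per-flow feature vectors into overlapping
    fixed-length sequences suitable for an LSTM detector."""
    n = len(flow_features) - seq_len + 1
    return np.stack([flow_features[i:i + seq_len] for i in range(n)])

stream = np.arange(40, dtype=float).reshape(10, 4)   # 10 flows, 4 features each
batches = to_sequences(stream)                        # shape (6, 5, 4)
```

Keeping the window at five bounds the per-prediction computation regardless of how long the traffic stream runs.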


Author(s):  
Yuqi Yu ◽  
Hanbing Yan ◽  
Yuan Ma ◽  
Hao Zhou ◽  
Hongchao Guan

Abstract. Hypertext Transfer Protocol (HTTP) accounts for a large portion of Internet application-layer traffic. Since the payload of HTTP traffic can record website status and user request information, many studies use HTTP traffic for web application attack detection. In this work, we propose DeepHTTP, an HTTP traffic detection framework based on deep learning. Unlike previous studies, this framework not only performs malicious traffic detection but also uses the deep learning model to mine malicious fields of the traffic payload. The detection model, called AT-Bi-LSTM, is based on Bidirectional Long Short-Term Memory (Bi-LSTM) with an attention mechanism. The attention mechanism improves the discriminative ability of the model and makes its results interpretable. To enhance the generalization ability of the model, this paper proposes a novel feature extraction method. Experiments show that DeepHTTP performs excellently in malicious traffic discrimination and pattern mining.
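The "mining malicious fields" idea rests on reading attention weights back onto payload tokens: the tokens the model attends to most are the candidate malicious fields. A toy sketch with hand-set weights (the tokens and weights are invented for illustration, not DeepHTTP output):

```python
import numpy as np

def top_attended_tokens(tokens, weights, k=2):
    """Return the k payload tokens with the largest attention weight,
    preserving their original order in the payload."""
    order = np.argsort(weights)[::-1][:k]
    return [tokens[i] for i in sorted(order)]

tokens = ["GET", "/login", "user=admin", "pass='+OR+1=1--", "HTTP/1.1"]
weights = np.array([0.05, 0.10, 0.15, 0.60, 0.10])   # hypothetical attention
flagged = top_attended_tokens(tokens, weights)
```

Here the SQL-injection-like field receives the largest weight and is surfaced first, which is the interpretability payoff the abstract describes.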


2020 ◽  
Vol 2020 ◽  
pp. 1-12
Author(s):  
Zhenbo Lu ◽  
Wei Zhou ◽  
Shixiang Zhang ◽  
Chen Wang

Quick and accurate crash detection is important for saving lives and improving traffic incident management. In this paper, a feature fusion-based deep learning framework was developed for video-based urban traffic crash detection, aiming to balance detection speed and accuracy under limited computing resources. In this framework, a residual neural network (ResNet) combined with attention modules was proposed to extract crash-related appearance features from urban traffic videos (i.e., a crash appearance feature extractor), which were further fed to a spatiotemporal feature fusion model, Conv-LSTM (Convolutional Long Short-Term Memory), to simultaneously capture appearance (static) and motion (dynamic) crash features. The proposed model was trained on a set of video clips covering 330 crash and 342 noncrash events. Overall, the proposed model achieved an accuracy of 87.78% on the testing dataset and an acceptable detection speed (FPS > 30 on a GTX 1060). Thanks to the attention module, the proposed model captures localized appearance features of crashes (e.g., vehicle damage and fallen pedestrians) better than conventional convolutional neural networks. The Conv-LSTM module outperformed conventional LSTM in capturing motion features of crashes, such as roadway congestion and pedestrians gathering after crashes. Compared to traditional motion-based crash detection models, the proposed model achieved higher detection accuracy. Moreover, it detected crashes much faster than other feature fusion-based models (e.g., C3D). The results show that the proposed model is a promising video-based urban traffic crash detection algorithm that could be used in practice in the future.
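The static/dynamic distinction above can be made concrete with a much cruder stand-in than Conv-LSTM: pair each frame's appearance features with a frame-difference motion cue. This is only a sketch of the fusion idea, not the paper's architecture:

```python
import numpy as np

def fuse_appearance_motion(frames):
    """Pair each frame's appearance features with a simple motion cue
    (frame difference), a rough stand-in for joint appearance/motion
    feature capture."""
    appearance = frames[1:]                       # static features per frame
    motion = frames[1:] - frames[:-1]             # dynamic change between frames
    return np.concatenate([appearance, motion], axis=-1)

clip = np.linspace(0.0, 1.0, 8 * 3).reshape(8, 3)   # 8 frames, 3 toy features
fused = fuse_appearance_motion(clip)                 # shape (7, 6)
```

A congested-but-static scene would show near-zero motion columns, while a crash moment spikes them; the appearance columns carry damage cues either way.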


2020 ◽  
Vol 81 ◽  
pp. 149-165
Author(s):  
H Apaydin ◽  
MT Sattari

It is well known that precipitation is essential for fauna and flora, and studies have shown that location and temporal factors have an effect on it. Accurate prediction of precipitation is very important for water resource management, and artificial intelligence methods are frequently used to make such predictions. In this study, a deep-learning and geographic information system (GIS) hybrid approach based on spatio-temporal variables was applied to model the amount of precipitation along Turkey's coastline. Latitude, longitude, altitude, distance to the sea, and aspect were taken from meteorological stations and used as spatial variables, while the change in monthly precipitation was taken as a temporal variable. Artificial intelligence methods such as Gaussian process regression, support vector regression, the Broyden-Fletcher-Goldfarb-Shanno artificial neural network, M5, random forest, and long short-term memory (LSTM) were used. According to the results of the study, in which different input variable alternatives were also evaluated, LSTM was the most successful method for predicting precipitation, with an R value of 0.93. The study shows that the amount of precipitation can be estimated, and a distribution map drawn, by using spatio-temporal data and the deep-learning and GIS hybrid method at points where measurements are not taken.
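The R metric used to rank the methods is the Pearson correlation between observed and predicted precipitation. A minimal sketch, with invented toy precipitation values:

```python
import numpy as np

def pearson_r(y_true, y_pred):
    """Pearson correlation coefficient between observations and predictions."""
    yt = y_true - y_true.mean()
    yp = y_pred - y_pred.mean()
    return (yt @ yp) / np.sqrt((yt @ yt) * (yp @ yp))

# Hypothetical monthly precipitation (mm) at five stations vs. model output.
obs = np.array([42.0, 55.0, 61.0, 30.0, 25.0])
pred = np.array([40.0, 57.0, 60.0, 33.0, 22.0])
r = pearson_r(obs, pred)
```

Note that R rewards agreement in pattern rather than in absolute level, which is why it is a common headline metric for station-based precipitation models.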

