Efficient and Fast Objects Detection Technique for Intelligent Video Surveillance Using Transfer Learning and Fine-Tuning

Mahmoud Ahmadi; Wael Ouarda; Adel M. Alimi

doi:10.1007/s13369-019-03969-6

Improving Street Object Detection Using Transfer Learning: From Generic Model to Specific Model

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2018.p0869 ◽

2018 ◽

Vol 22 (6) ◽

pp. 869-874

Author(s):

Wei Liu ◽

◽

Shu Chen ◽

Longsheng Wei

Keyword(s):

Object Detection ◽

Transfer Learning ◽

Specific Model ◽

Intelligent Vehicles ◽

Fine Tuning ◽

Generic Model ◽

Learning Methods ◽

Urban Street ◽

General Object ◽

Objects Detection

A high accuracy rate of street objects detection is significant in realizing intelligent vehicles. Algorithms based on convolution neural network (CNN) currently exhibit reasonable performance in general object detection. For example SSD and YOLO can detect a wide variety of objects in 2D images in real time; however the performance is not sufficient for street objects detection, especially in complex urban street environments. In this study, instead of proposing and training a new CNN model, we use transfer learning methods to enable our specific model to learn from a generic CNN model to achieve good performance. The transfer learning methods include fine-tuning the pretrained CNN model with a self-made dataset, and adjusting the CNN model structure. We analyze the transfer learning results based on fine-tuning SSD with self-made datasets. The experimental results based on the transfer learning method show that the proposed method is effective.

Download Full-text

Study on moving-objects detection technique in video surveillance system

2010 Chinese Control and Decision Conference ◽

10.1109/ccdc.2010.5498797 ◽

2010 ◽

Cited By ~ 3

Author(s):

Tao Zhang ◽

Zaiwen Liu ◽

Xiaofeng Lian ◽

Xiaoyi Wang

Keyword(s):

Video Surveillance ◽

Surveillance System ◽

Moving Objects ◽

Detection Technique ◽

Video Surveillance System ◽

Moving Objects Detection ◽

Objects Detection

Download Full-text

A Study on Optimization of Intelligent Video Surveillance System based on Embedded Module

Korean Institute of Smart Media ◽

10.30693/smj.2018.7.2.40 ◽

2018 ◽

Vol 7 (2) ◽

pp. 40-46 ◽

Cited By ~ 1

Author(s):

Jin Su Kim ◽

◽

Min-Gu Kim ◽

Sung Bum Pan ◽

◽

...

Keyword(s):

Video Surveillance ◽

Surveillance System ◽

Video Surveillance System ◽

Intelligent Video Surveillance ◽

Intelligent Video Surveillance System

Download Full-text

Deep Learning Techniques for Grape Plant Species Identification in Natural Images

Sensors ◽

10.3390/s19224850 ◽

2019 ◽

Vol 19 (22) ◽

pp. 4850 ◽

Cited By ~ 6

Author(s):

Carlos S. Pereira ◽

Raul Morais ◽

Manuel J. C. S. Reis

Keyword(s):

Transfer Learning ◽

Climatic Conditions ◽

Fine Tuning ◽

Variety Identification ◽

Test Accuracy ◽

Accuracy Score ◽

Learning Techniques ◽

Four Corners ◽

Integrated Software ◽

Grape Varieties

Frequently, the vineyards in the Douro Region present multiple grape varieties per parcel and even per row. An automatic algorithm for grape variety identification as an integrated software component was proposed that can be applied, for example, to a robotic harvesting system. However, some issues and constraints in its development were highlighted, namely, the images captured in natural environment, low volume of images, high similarity of the images among different grape varieties, leaf senescence, and significant changes on the grapevine leaf and bunch images in the harvest seasons, mainly due to adverse climatic conditions, diseases, and the presence of pesticides. In this paper, the performance of the transfer learning and fine-tuning techniques based on AlexNet architecture were evaluated when applied to the identification of grape varieties. Two natural vineyard image datasets were captured in different geographical locations and harvest seasons. To generate different datasets for training and classification, some image processing methods, including a proposed four-corners-in-one image warping algorithm, were used. The experimental results, obtained from the application of an AlexNet-based transfer learning scheme and trained on the image dataset pre-processed through the four-corners-in-one method, achieved a test accuracy score of 77.30%. Applying this classifier model, an accuracy of 89.75% on the popular Flavia leaf dataset was reached. The results obtained by the proposed approach are promising and encouraging in helping Douro wine growers in the automatic task of identifying grape varieties.

Download Full-text

Architecture of the intelligent video surveillance systems for fog environments based on embedded computers

2021 10th Mediterranean Conference on Embedded Computing (MECO) ◽

10.1109/meco52532.2021.9460270 ◽

2021 ◽

Author(s):

Alexey Subbotin ◽

Nataly Zhukova ◽

Tianxing Man

Keyword(s):

Video Surveillance ◽

Surveillance Systems ◽

Intelligent Video Surveillance

Download Full-text

New polyp image classification technique using transfer learning of network-in-network structure in endoscopic images

Scientific Reports ◽

10.1038/s41598-021-83199-9 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Young Jae Kim ◽

Jang Pyo Bae ◽

Jun-Won Chung ◽

Dong Kyun Park ◽

Kwang Gi Kim ◽

...

Keyword(s):

Colorectal Cancer ◽

Transfer Learning ◽

Test Data ◽

State Of The Art ◽

Early Stage ◽

Statistical Significance ◽

Recall Rate ◽

Training Data ◽

Fine Tuning ◽

Accuracy Evaluation

AbstractWhile colorectal cancer is known to occur in the gastrointestinal tract. It is the third most common form of cancer of 27 major types of cancer in South Korea and worldwide. Colorectal polyps are known to increase the potential of developing colorectal cancer. Detected polyps need to be resected to reduce the risk of developing cancer. This research improved the performance of polyp classification through the fine-tuning of Network-in-Network (NIN) after applying a pre-trained model of the ImageNet database. Random shuffling is performed 20 times on 1000 colonoscopy images. Each set of data are divided into 800 images of training data and 200 images of test data. An accuracy evaluation is performed on 200 images of test data in 20 experiments. Three compared methods were constructed from AlexNet by transferring the weights trained by three different state-of-the-art databases. A normal AlexNet based method without transfer learning was also compared. The accuracy of the proposed method was higher in statistical significance than the accuracy of four other state-of-the-art methods, and showed an 18.9% improvement over the normal AlexNet based method. The area under the curve was approximately 0.930 ± 0.020, and the recall rate was 0.929 ± 0.029. An automatic algorithm can assist endoscopists in identifying polyps that are adenomatous by considering a high recall rate and accuracy. This system can enable the timely resection of polyps at an early stage.

Download Full-text

COVID-19 diagnosis from chest X-ray images using transfer learning: Enhanced performance by debiasing dataloader

Journal of X-Ray Science and Technology ◽

10.3233/xst-200757 ◽

2021 ◽

Vol 29 (1) ◽

pp. 19-36

Author(s):

Çağín Polat ◽

Onur Karaman ◽

Ceren Karaman ◽

Güney Korkmaz ◽

Mehmet Can Balcı ◽

...

Keyword(s):

Transfer Learning ◽

Critical Time ◽

Image Data ◽

Mapping Method ◽

Fine Tuning ◽

Learning Stage ◽

X Ray ◽

Testing Dataset ◽

X Ray Imaging ◽

Chest X Ray

BACKGROUND: Chest X-ray imaging has been proved as a powerful diagnostic method to detect and diagnose COVID-19 cases due to its easy accessibility, lower cost and rapid imaging time. OBJECTIVE: This study aims to improve efficacy of screening COVID-19 infected patients using chest X-ray images with the help of a developed deep convolutional neural network model (CNN) entitled nCoV-NET. METHODS: To train and to evaluate the performance of the developed model, three datasets were collected from resources of “ChestX-ray14”, “COVID-19 image data collection”, and “Chest X-ray collection from Indiana University,” respectively. Overall, 299 COVID-19 pneumonia cases and 1,522 non-COVID 19 cases are involved in this study. To overcome the probable bias due to the unbalanced cases in two classes of the datasets, ResNet, DenseNet, and VGG architectures were re-trained in the fine-tuning stage of the process to distinguish COVID-19 classes using a transfer learning method. Lastly, the optimized final nCoV-NET model was applied to the testing dataset to verify the performance of the proposed model. RESULTS: Although the performance parameters of all re-trained architectures were determined close to each other, the final nCOV-NET model optimized by using DenseNet-161 architecture in the transfer learning stage exhibits the highest performance for classification of COVID-19 cases with the accuracy of 97.1 %. The Activation Mapping method was used to create activation maps that highlights the crucial areas of the radiograph to improve causality and intelligibility. CONCLUSION: This study demonstrated that the proposed CNN model called nCoV-NET can be utilized for reliably detecting COVID-19 cases using chest X-ray images to accelerate the triaging and save critical time for disease control as well as assisting the radiologist to validate their initial diagnosis.

Download Full-text

Intuitive real-time control strategy for high-density myoelectric hand prosthesis using deep and transfer learning

Scientific Reports ◽

10.1038/s41598-021-90688-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Simon Tam ◽

Mounir Boukadoum ◽

Alexandre Campeau-Lecours ◽

Benoit Gosselin

Keyword(s):

Real Time ◽

Transfer Learning ◽

Control Strategy ◽

High Density ◽

Fine Tuning ◽

Mode Switching ◽

Correct Prediction ◽

Learning Approach ◽

Predictive Values ◽

Hand Prosthesis

AbstractMyoelectric hand prostheses offer a way for upper-limb amputees to recover gesture and prehensile abilities to ease rehabilitation and daily life activities. However, studies with prosthesis users found that a lack of intuitiveness and ease-of-use in the human-machine control interface are among the main driving factors in the low user acceptance of these devices. This paper proposes a highly intuitive, responsive and reliable real-time myoelectric hand prosthesis control strategy with an emphasis on the demonstration and report of real-time evaluation metrics. The presented solution leverages surface high-density electromyography (HD-EMG) and a convolutional neural network (CNN) to adapt itself to each unique user and his/her specific voluntary muscle contraction patterns. Furthermore, a transfer learning approach is presented to drastically reduce the training time and allow for easy installation and calibration processes. The CNN-based gesture recognition system was evaluated in real-time with a group of 12 able-bodied users. A real-time test for 6 classes/grip modes resulted in mean and median positive predictive values (PPV) of 93.43% and 100%, respectively. Each gesture state is instantly accessible from any other state, with no mode switching required for increased responsiveness and natural seamless control. The system is able to output a correct prediction within less than 116 ms latency. 100% PPV has been attained in many trials and is realistically achievable consistently with user practice and/or employing a thresholded majority vote inference. Using transfer learning, these results are achievable after a sensor installation, data recording and network training/fine-tuning routine taking less than 10 min to complete, a reduction of 89.4% in the setup time of the traditional, non-transfer learning approach.

Download Full-text