TCM: Temporal Consistency Model for Head Detection in Complex Videos

Head detection in real-world videos is a classical research problem in computer vision. Head detection in videos is challenging than in a single image due to many nuisances that are commonly observed in natural videos, including arbitrary poses, appearances, and scales. Generally, head detection is treated as a particular case of object detection in a single image. However, the performance of object detectors deteriorates in unconstrained videos. In this paper, we propose a temporal consistency model (TCM) to enhance the performance of a generic object detector by integrating spatial-temporal information that exists among subsequent frames of a particular video. Generally, our model takes detection from a generic detector as input and improves mean average precision (mAP) by recovering missed detection and suppressing false positives. We compare and evaluate the proposed framework on four challenging datasets, i.e., HollywoodHeads, Casablanca, BOSS, and PAMELA. Experimental evaluation shows that the performance is improved by employing the proposed TCM model. We demonstrate both qualitatively and quantitatively that our proposed framework obtains significant improvements over other methods.

Download Full-text

Real-Time Object Detection Model

International Journal for Modern Trends in Science and Technology - RTT2020 ◽

10.46501/ijmtst061267 ◽

2020 ◽

Vol 6 (12) ◽

pp. 360-364

Author(s):

Akash Kumar, Dr. Amita Goel Prof. Vasudha Bahl and Prof. Nidhi Sengar

Keyword(s):

Neural Network ◽

Computer Vision ◽

Object Detection ◽

Real Time ◽

Real World ◽

The Real ◽

Detection Model ◽

Bounding Boxes ◽

The Given ◽

Single Neural Network

Object Detection is a study in the field of computer vision. An object detection model recognizes objects of the real world present either in a captured image or in real-time video where the object can belong to any class of objects namely humans, animals, objects, etc. This project is an implementation of an algorithm based on object detection called You Only Look Once (YOLO v3). The architecture of yolo model is extremely fast compared to all previous methods. Yolov3 model executes a single neural network to the given image and then divides the image into predetermined bounding boxes. These boxes are weighted by the predicted probabilities. After non max-suppression it gives the result of recognized objects together with bounding boxes. Yolo trains and directly executes object detection on full images.

Download Full-text

Improving Deep Object Detection Algorithms for Game Scenes

Electronics ◽

10.3390/electronics10202527 ◽

2021 ◽

Vol 10 (20) ◽

pp. 2527

Author(s):

Minji Jung ◽

Heekyung Yang ◽

Kyungha Min

Keyword(s):

Computer Vision ◽

Object Detection ◽

Real World ◽

Computer Games ◽

Recognition Performance ◽

Role Playing ◽

Scene Analysis ◽

Research Topics ◽

The Real ◽

Detection Algorithms

The advancement and popularity of computer games make game scene analysis one of the most interesting research topics in the computer vision society. Among the various computer vision techniques, we employ object detection algorithms for the analysis, since they can both recognize and localize objects in a scene. However, applying the existing object detection algorithms for analyzing game scenes does not guarantee a desired performance, since the algorithms are trained using datasets collected from the real world. In order to achieve a desired performance for analyzing game scenes, we built a dataset by collecting game scenes and retrained the object detection algorithms pre-trained with the datasets from the real world. We selected five object detection algorithms, namely YOLOv3, Faster R-CNN, SSD, FPN and EfficientDet, and eight games from various game genres including first-person shooting, role-playing, sports, and driving. PascalVOC and MS COCO were employed for the pre-training of the object detection algorithms. We proved the improvement in the performance that comes from our strategy in two aspects: recognition and localization. The improvement in recognition performance was measured using mean average precision (mAP) and the improvement in localization using intersection over union (IoU).

Download Full-text

Lightweight Object Detection Ensemble Framework for Autonomous Vehicles in Challenging Weather Conditions

Computational Intelligence and Neuroscience ◽

10.1155/2021/5278820 ◽

2021 ◽

Vol 2021 ◽

pp. 1-12

Author(s):

Rahee Walambe ◽

Aboli Marathe ◽

Ketan Kotecha ◽

George Ghinea

Keyword(s):

Computer Vision ◽

Object Detection ◽

Autonomous Vehicles ◽

Data Augmentation ◽

Weather Conditions ◽

Training Data ◽

Average Precision ◽

Ensemble Techniques ◽

Rainy Weather ◽

Increase In Accuracy

The computer vision systems driving autonomous vehicles are judged by their ability to detect objects and obstacles in the vicinity of the vehicle in diverse environments. Enhancing this ability of a self-driving car to distinguish between the elements of its environment under adverse conditions is an important challenge in computer vision. For example, poor weather conditions like fog and rain lead to image corruption which can cause a drastic drop in object detection (OD) performance. The primary navigation of autonomous vehicles depends on the effectiveness of the image processing techniques applied to the data collected from various visual sensors. Therefore, it is essential to develop the capability to detect objects like vehicles and pedestrians under challenging conditions such as like unpleasant weather. Ensembling multiple baseline deep learning models under different voting strategies for object detection and utilizing data augmentation to boost the models’ performance is proposed to solve this problem. The data augmentation technique is particularly useful and works with limited training data for OD applications. Furthermore, using the baseline models significantly speeds up the OD process as compared to the custom models due to transfer learning. Therefore, the ensembling approach can be highly effective in resource-constrained devices deployed for autonomous vehicles in uncertain weather conditions. The applied techniques demonstrated an increase in accuracy over the baseline models and were able to identify objects from the images captured in the adverse foggy and rainy weather conditions. The applied techniques demonstrated an increase in accuracy over the baseline models and reached 32.75% mean average precision (mAP) and 52.56% average precision (AP) in detecting cars in the adverse fog and rain weather conditions present in the dataset. The effectiveness of multiple voting strategies for bounding box predictions on the dataset is also demonstrated. These strategies help increase the explainability of object detection in autonomous systems and improve the performance of the ensemble techniques over the baseline models.

Download Full-text

ON METHODS OF OBJECT DETECTION IN VIDEO STREAMS

Computer systems and network ◽

10.23939/csn2020.01.080 ◽

2017 ◽

Vol 2 (1) ◽

pp. 80-87

Author(s):

Puyda V. ◽

◽

Stoian. A.

Keyword(s):

Computer Vision ◽

Object Detection ◽

Open Source ◽

Feature Detection ◽

Video Stream ◽

Object Identification ◽

Vision Systems ◽

Modern Computer ◽

Computer Vision Systems ◽

Open Source Hardware

Detecting objects in a video stream is a typical problem in modern computer vision systems that are used in multiple areas. Object detection can be done on both static images and on frames of a video stream. Essentially, object detection means finding color and intensity non-uniformities which can be treated as physical objects. Beside that, the operations of finding coordinates, size and other characteristics of these non-uniformities that can be used to solve other computer vision related problems like object identification can be executed. In this paper, we study three algorithms which can be used to detect objects of different nature and are based on different approaches: detection of color non-uniformities, frame difference and feature detection. As the input data, we use a video stream which is obtained from a video camera or from an mp4 video file. Simulations and testing of the algoritms were done on a universal computer based on an open-source hardware, built on the Broadcom BCM2711, quad-core Cortex-A72 (ARM v8) 64-bit SoC processor with frequency 1,5GHz. The software was created in Visual Studio 2019 using OpenCV 4 on Windows 10 and on a universal computer operated under Linux (Raspbian Buster OS) for an open-source hardware. In the paper, the methods under consideration are compared. The results of the paper can be used in research and development of modern computer vision systems used for different purposes. Keywords: object detection, feature points, keypoints, ORB detector, computer vision, motion detection, HSV model color

Download Full-text

Using Feature Alignment Can Improve Clean Average Precision And Adversarial Robustness In Object Detection

10.1109/icip42928.2021.9506689 ◽

2021 ◽

Author(s):

Weipeng Xu ◽

Hongcheng Huang ◽

Shaoyou Pan

Keyword(s):

Object Detection ◽

Average Precision ◽

Feature Alignment

Download Full-text

Hfinger: Malware HTTP Request Fingerprinting

Entropy ◽

10.3390/e23050507 ◽

2021 ◽

Vol 23 (5) ◽

pp. 507

Author(s):

Piotr Białczak ◽

Wojciech Mazurczyk

Keyword(s):

Real World ◽

Network Traffic ◽

Experimental Evaluation ◽

Data Sets ◽

Real World Data ◽

Malicious Software ◽

Default Mode ◽

World Data ◽

Effectiveness Analysis ◽

Http Protocol

Malicious software utilizes HTTP protocol for communication purposes, creating network traffic that is hard to identify as it blends into the traffic generated by benign applications. To this aim, fingerprinting tools have been developed to help track and identify such traffic by providing a short representation of malicious HTTP requests. However, currently existing tools do not analyze all information included in the HTTP message or analyze it insufficiently. To address these issues, we propose Hfinger, a novel malware HTTP request fingerprinting tool. It extracts information from the parts of the request such as URI, protocol information, headers, and payload, providing a concise request representation that preserves the extracted information in a form interpretable by a human analyst. For the developed solution, we have performed an extensive experimental evaluation using real-world data sets and we also compared Hfinger with the most related and popular existing tools such as FATT, Mercury, and p0f. The conducted effectiveness analysis reveals that on average only 1.85% of requests fingerprinted by Hfinger collide between malware families, what is 8–34 times lower than existing tools. Moreover, unlike these tools, in default mode, Hfinger does not introduce collisions between malware and benign applications and achieves it by increasing the number of fingerprints by at most 3 times. As a result, Hfinger can effectively track and hunt malware by providing more unique fingerprints than other standard tools.

Download Full-text

A computer vision-based object detection and counting for COVID-19 protocol compliance: a case study of Jakarta

2020 International Conference on ICT for Smart Society (ICISS) ◽

10.1109/iciss50791.2020.9307594 ◽

2020 ◽

Author(s):

Muhammad Lanang Afkaar Ar ◽

Sulthan Muzakki Adytia S ◽

Yudhistira Nugraha ◽

Farizah Rizka R ◽

Andy Ernesto ◽

...

Keyword(s):

Computer Vision ◽

Object Detection ◽

Detection And Counting ◽

Protocol Compliance

Download Full-text

Improving Real-world Object Detection Using Balanced Loss

2020 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB) ◽

10.1109/bmsb49480.2020.9379858 ◽

2020 ◽

Author(s):

Shengyang Shen ◽

Zexiang Liu ◽

Bingkun Zhao ◽

Li Chen ◽

Chongyang Zhang

Keyword(s):

Object Detection ◽

Real World

Download Full-text

Simplification and Detection of Outlying Trajectories from Batch and Streaming Data Recorded in Harsh Environments

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi8060272 ◽

2019 ◽

Vol 8 (6) ◽

pp. 272 ◽

Cited By ~ 1

Author(s):

Iq Reviessay Pulshashi ◽

Hyerim Bae ◽

Hyunsuk Choi ◽

Seunghwan Mun ◽

Riska Asriana Sutrisnowati

Keyword(s):

South Korea ◽

Real World ◽

Experimental Evaluation ◽

Streaming Data ◽

Harsh Environments ◽

Noise Filtering ◽

New Approach ◽

Detection Algorithms ◽

Outlying Point

Analysis of trajectory such as detection of an outlying trajectory can produce inaccurate results due to the existence of noise, an outlying point-locations that can change statistical properties of the trajectory. Some trajectories with noise are repairable by noise filtering or by trajectory-simplification. We herein propose the application of a trajectory-simplification approach in both batch and streaming environments, followed by benchmarking of various outlier-detection algorithms for detection of outlying trajectories from among simplified trajectories. Experimental evaluation in a case study using real-world trajectories from a shipyard in South Korea shows the benefit of the new approach.

Download Full-text