video processing Latest Research Papers

Super-linear speedup for real-time condition monitoring using image processing and drones

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v12i2.pp1548-1557 ◽

2022 ◽

Vol 12 (2) ◽

pp. 1548

Author(s):

Moath Alsafasfeh ◽

Bradely Bazuin ◽

Ikhlas Abdel-Qader

Keyword(s):

Image Processing ◽

Real Time ◽

Condition Monitoring ◽

Video Processing ◽

Execution Time ◽

Large Scale ◽

Solar Panels ◽

Position Information ◽

Time Condition ◽

Linear Speedup

Real-time inspections for the large-scale solar system may take a long time to get the hazard situations for any failures that may take place in the solar panels normal operations, where prior hazards detection is important. Reducing the execution time and improving the system’s performance are the ultimate goals of multiprocessing or multicore systems. Real-time video processing and analysis from two camcorders, thermal and charge-coupling devices (CCD), mounted on a drone compose the embedded system being proposed for solar panels inspection. The inspection method needs more time for capturing and processing the frames and detecting the faulty panels. The system can determine the longitude and latitude of the defect position information in real-time. In this work, we investigate parallel processing for the image processing operations which reduces the processing time for the inspection systems. The results show a super-linear speedup for real-time condition monitoring in large-scale solar systems. Using the multiprocessing module in Python, we execute fault detection algorithms using streamed frames from both video cameras. The experimental results show a super-linear speedup for thermal and CCD video processing, the execution time is efficiently reduced with an average of 3.1 times and 6.3 times using 2 processes and 4 processes respectively.

A Performance-Based Comparative Encryption and Decryption Technique for Image and Video for Mobile Computing

Journal of Cases on Information Technology ◽

10.4018/jcit.20220101.oa1 ◽

2022 ◽

Vol 24 (2) ◽

pp. 1-18

Author(s):

Raya Basil Alothman ◽

Imad Ibraheem Saada ◽

Basma Salim Bazel Al-Brge

Keyword(s):

Comparative Analysis ◽

Comparative Study ◽

Video Processing ◽

Communication System ◽

Data Exchange ◽

Electronic System ◽

Video Data ◽

Encryption Algorithm ◽

Encryption And Decryption ◽

The Comparative Study

When data exchange advances through the electronic system, the need for information security has become a must. Protection of images and videos is important in today's visual communication system. Confidential image / video data must be shielded from unauthorized uses. Detecting and identifying unauthorized users is a challenging task. Various researchers have suggested different techniques for securing the transfer of images. In this research, the comparative study of these current technologies also addressed the types of images / videos and the different techniques of image / video processing with the steps used to process the image or video. This research classifies the two types of Encryption Algorithm, Symmetric and Encryption Algorithm, and provides a comparative analysis of its types, such as AES, MAES, RSA, DES, 3DES and BLOWFISH.

Approximate Constant-Coefficient Multiplication Using Hybrid Binary-Unary Computing for FPGAs

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3494570 ◽

2022 ◽

Vol 15 (3) ◽

pp. 1-25

Author(s):

S. Rasoul Faraji ◽

Pierre Abillama ◽

Kia Bazargan

Keyword(s):

Discrete Cosine Transform ◽

Video Processing ◽

High Speed ◽

Low Cost ◽

Digital Signal ◽

Logic Gates ◽

Cosine Transform ◽

Real Time Processing ◽

Coefficient Multiplier ◽

Encoding Method

Multipliers are used in virtually all Digital Signal Processing (DSP) applications such as image and video processing. Multiplier efficiency has a direct impact on the overall performance of such applications, especially when real-time processing is needed, as in 4K video processing, or where hardware resources are limited, as in mobile and IoT devices. We propose a novel, low-cost, low energy, and high-speed approximate constant coefficient multiplier (CCM) using a hybrid binary-unary encoding method. The proposed method implements a CCM using simple routing networks with no logic gates in the unary domain, which results in more efficient multipliers compared to Xilinx LogiCORE IP CCMs and table-based KCM CCMs (Flopoco) on average. We evaluate the proposed multipliers on 2-D discrete cosine transform algorithm as a common DSP module. Post-routing FPGA results show that the proposed multipliers can improve the {area, area × delay, power consumption, and energy-delay product} of a 2-D discrete cosine transform on average by {30%, 33%, 30%, 31%}. Moreover, the throughput of the proposed 2-D discrete cosine transform is on average 5% more than that of the binary architecture implemented using table-based KCM CCMs. We will show that our method has fewer routability issues compared to binary implementations when implementing a DCT core.

Structural motion estimation via Hilbert transform enhanced phase-based video processing

Mechanical Systems and Signal Processing ◽

10.1016/j.ymssp.2021.108418 ◽

2022 ◽

Vol 166 ◽

pp. 108418

Author(s):

G. Liu ◽

M.Z. Li ◽

Z. Mao ◽

Q.S. Yang

Keyword(s):

Motion Estimation ◽

Hilbert Transform ◽

Video Processing ◽

Structural Motion

A PERFORMANCE BASED COMPARATIVE ENCRYPTION AND DECRYPTION TECHNIQUE FOR IMAGE AND VIDEO FOR MOBILE COMPUTING

Journal of Cases on Information Technology ◽

10.4018/jcit.20220401oa01 ◽

2022 ◽

Vol 24 (2) ◽

pp. 0-0

Keyword(s):

Comparative Analysis ◽

Comparative Study ◽

Video Processing ◽

Communication System ◽

Data Exchange ◽

Electronic System ◽

Video Data ◽

Encryption Algorithm ◽

Encryption And Decryption ◽

The Comparative Study

When data exchange advances through the electronic system, the need for information security has become a must. Protection of images and videos is important in today's visual communication system. Confidential image / video data must be shielded from unauthorized uses. Detecting and identifying unauthorized users is a challenging task. Various researchers have suggested different techniques for securing the transfer of images. In this research, the comparative study of these current technologies also addressed the types of images / videos and the different techniques of image / video processing with the steps used to process the image or video. This research classifies the two types of Encryption Algorithm, Symmetric and Encryption Algorithm, and provides a comparative analysis of its types, such as AES, MAES, RSA, DES, 3DES and BLOWFISH.

Transformers in Vision: A Survey

ACM Computing Surveys ◽

10.1145/3505244 ◽

2022 ◽

Author(s):

Salman Khan ◽

Muzammal Naseer ◽

Munawar Hayat ◽

Syed Waqas Zamir ◽

Fahad Shahbaz Khan ◽

...

Keyword(s):

Computer Vision ◽

Video Processing ◽

Architectural Design ◽

Large Scale ◽

Question Answering ◽

Short Term Memory ◽

Super Resolution ◽

Input Sequence ◽

3D Analysis ◽

Comprehensive Overview

Astounding results from Transformer models on natural language tasks have intrigued the vision community to study their application to computer vision problems. Among their salient benefits, Transformers enable modeling long dependencies between input sequence elements and support parallel processing of sequence as compared to recurrent networks e.g. , Long short-term memory (LSTM). Different from convolutional networks, Transformers require minimal inductive biases for their design and are naturally suited as set-functions. Furthermore, the straightforward design of Transformers allows processing multiple modalities ( e.g. , images, videos, text and speech) using similar processing blocks and demonstrates excellent scalability to very large capacity networks and huge datasets. These strengths have led to exciting progress on a number of vision tasks using Transformer networks. This survey aims to provide a comprehensive overview of the Transformer models in the computer vision discipline. We start with an introduction to fundamental concepts behind the success of Transformers i.e., self-attention, large-scale pre-training, and bidirectional feature encoding. We then cover extensive applications of transformers in vision including popular recognition tasks ( e.g. , image classification, object detection, action recognition, and segmentation), generative modeling, multi-modal tasks ( e.g. , visual-question answering, visual reasoning, and visual grounding), video processing ( e.g. , activity recognition, video forecasting), low-level vision ( e.g. , image super-resolution, image enhancement, and colorization) and 3D analysis ( e.g. , point cloud classification and segmentation). We compare the respective advantages and limitations of popular techniques both in terms of architectural design and their experimental value. Finally, we provide an analysis on open research directions and possible future works. We hope this effort will ignite further interest in the community to solve current challenges towards the application of transformer models in computer vision.

LIME-Based Data Selection Method for SAR Images Generation Using GAN

Remote Sensing ◽

10.3390/rs14010204 ◽

2022 ◽

Vol 14 (1) ◽

pp. 204

Author(s):

Mingzhe Zhu ◽

Bo Zang ◽

Linlin Ding ◽

Tao Lei ◽

Zhenpeng Feng ◽

...

Keyword(s):

Video Processing ◽

Generative Adversarial Networks ◽

Correct Prediction ◽

Positive Contribution ◽

Sar Image ◽

Sar Images ◽

Adversarial Networks ◽

Image And Video Processing ◽

Interpretable Model

Deep learning has obtained remarkable achievements in computer vision, especially image and video processing. However, in synthetic aperture radar (SAR) image recognition, the application of DNNs is usually restricted due to data insufficiency. To augment datasets, generative adversarial networks (GANs) are usually used to generate numerous photo-realistic SAR images. Although there are many pixel-level metrics to measure GAN’s performance from the quality of generated SAR images, there are few measurements to evaluate whether the generated SAR images include the most representative features of the target. In this case, the classifier probably categorizes a SAR image into the corresponding class based on “wrong” criterion, i.e., “Clever Hans”. In this paper, local interpretable model-agnostic explanation (LIME) is innovatively utilized to evaluate whether a generated SAR image possessed the most representative features of a specific kind of target. Firstly, LIME is used to visualize positive contributions of the input SAR image to the correct prediction of the classifier. Subsequently, these representative SAR images can be selected handily by evaluating how much the positive contribution region matches the target. Experimental results demonstrate that the proposed method can ally “Clever Hans” phenomenon greatly caused by the spurious relationship between generated SAR images and the corresponding classes.

A Novel Python Video Processing Program for Identifying and Quantification of Soil Cracks in Real Time

Proceedings of the 2nd International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications - Lecture Notes in Networks and Systems ◽

10.1007/978-981-16-6407-6_4 ◽

2022 ◽

pp. 39-46

Author(s):

Vaibhav Khandare Dilip ◽

He Huang ◽

Ankit Garg ◽

Xilong Huang ◽

Guoxiong Mei

Keyword(s):

Real Time ◽

Video Processing ◽

Processing Program ◽

Soil Cracks

Eye of Devil: Face Recognition in Real World Surveillance Video with Feature Extraction and Pattern Matching

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.39711 ◽

2021 ◽

Vol 9 (12) ◽

pp. 2334-2337

Author(s):

Maddimsetty Bullaiaha Tej

Keyword(s):

Feature Extraction ◽

Face Recognition ◽

Pattern Matching ◽

Video Processing ◽

Facial Features ◽

Surveillance Video ◽

Skin Texture ◽

Traditional Approaches ◽

Matching Techniques ◽

Unique Dataset

Abstract: People lost, people missing etc., these are the words we come across whenever there is any mass gathering events going on or in crowded areas. To solve this issue some traditional approaches like announcements are in use. One idea is to identify the person using face recognition and pattern matching techniques. There are several techniques to implement face recognition like extraction of facial features by using the position of eyes, nose, jawbone or skin texture analysis etc., By using these techniques a unique dataset can be created for each human. Here the photograph of the missing person can be used to extract these facial features. After getting the dataset of that individual, by using pattern matching techniques, there is a scope to find the person with same facial features in the crowd images or videos. Keywords: Face-Recognition, Image-Processing, Feature extraction, Video-Processing, Pattern-Matching.

Image and Video Processing Fundamentals

10.1007/978-1-4842-7462-0_3 ◽

2021 ◽

pp. 31-48

Author(s):

Ahmed Fathi Bekhit

Keyword(s):

Video Processing ◽

Image And Video Processing

video processing
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Super-linear speedup for real-time condition monitoring using image processing and drones

A Performance-Based Comparative Encryption and Decryption Technique for Image and Video for Mobile Computing

Approximate Constant-Coefficient Multiplication Using Hybrid Binary-Unary Computing for FPGAs

Structural motion estimation via Hilbert transform enhanced phase-based video processing

A PERFORMANCE BASED COMPARATIVE ENCRYPTION AND DECRYPTION TECHNIQUE FOR IMAGE AND VIDEO FOR MOBILE COMPUTING

Transformers in Vision: A Survey

LIME-Based Data Selection Method for SAR Images Generation Using GAN

A Novel Python Video Processing Program for Identifying and Quantification of Soil Cracks in Real Time

Eye of Devil: Face Recognition in Real World Surveillance Video with Feature Extraction and Pattern Matching

Image and Video Processing Fundamentals

Export Citation Format

video processingRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Super-linear speedup for real-time condition monitoring using image processing and drones

A Performance-Based Comparative Encryption and Decryption Technique for Image and Video for Mobile Computing

Approximate Constant-Coefficient Multiplication Using Hybrid Binary-Unary Computing for FPGAs

Structural motion estimation via Hilbert transform enhanced phase-based video processing

A PERFORMANCE BASED COMPARATIVE ENCRYPTION AND DECRYPTION TECHNIQUE FOR IMAGE AND VIDEO FOR MOBILE COMPUTING

Transformers in Vision: A Survey

LIME-Based Data Selection Method for SAR Images Generation Using GAN

A Novel Python Video Processing Program for Identifying and Quantification of Soil Cracks in Real Time

Eye of Devil: Face Recognition in Real World Surveillance Video with Feature Extraction and Pattern Matching

Image and Video Processing Fundamentals

video processing
Recently Published Documents