Comparison of Deep Learning and Conventional Demosaicing Algorithms for Mastcam Images

Electronics ◽  
2019 ◽  
Vol 8 (3) ◽  
pp. 308 ◽  
Author(s):  
Chiman Kwan ◽  
Bryan Chou ◽  
James Bell III

Bayer pattern filters have been used in many commercial digital cameras. In the National Aeronautics and Space Administration's (NASA) mast camera (Mastcam) imaging system, onboard the Mars Science Laboratory (MSL) rover Curiosity, a Bayer pattern filter is used to capture the RGB (red, green, and blue) color of scenes on Mars. The Mastcam has two cameras: left and right. The right camera has three times the resolution of the left. It is well known that demosaicing introduces color and zipper artifacts. Here, we present a comparative study of demosaicing results using conventional and deep learning algorithms. Sixteen left and fifteen right Mastcam images were used in our experiments. Due to a lack of ground truth images for Mastcam data from Mars, we compared the various algorithms using a blind image quality assessment model. It was observed that no single algorithm works best for all images. In particular, a deep learning-based algorithm worked best for the right Mastcam images, while a conventional algorithm achieved the best results for the left Mastcam images. Moreover, subjective evaluation of five demosaiced Mastcam images was also used to compare the various algorithms.
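To make the conventional baseline concrete, the following is a minimal sketch of bilinear demosaicing for an RGGB Bayer mosaic; the function name and kernel details are illustrative and are not taken from the paper.

```python
import numpy as np

def demosaic_bilinear(mosaic):
    """Bilinear demosaicing of an RGGB Bayer mosaic (H x W, even dims).

    Each output channel is filled by convolving that channel's sparse
    samples with a bilinear interpolation kernel.
    """
    h, w = mosaic.shape
    # Sampling masks for the RGGB layout: R at (0,0), G at (0,1)/(1,0), B at (1,1).
    r_mask = np.zeros((h, w)); r_mask[0::2, 0::2] = 1
    b_mask = np.zeros((h, w)); b_mask[1::2, 1::2] = 1
    g_mask = 1 - r_mask - b_mask

    # Bilinear kernels: one for the quincunx G channel, one for R/B.
    k_g = np.array([[0, 1, 0], [1, 4, 1], [0, 1, 0]]) / 4.0
    k_rb = np.array([[1, 2, 1], [2, 4, 2], [1, 2, 1]]) / 4.0

    def conv2(img, k):
        # 3x3 convolution with reflective border handling.
        pad = np.pad(img, 1, mode='reflect')
        out = np.zeros_like(img, dtype=float)
        for dy in range(3):
            for dx in range(3):
                out += k[dy, dx] * pad[dy:dy + h, dx:dx + w]
        return out

    r = conv2(mosaic * r_mask, k_rb)
    g = conv2(mosaic * g_mask, k_g)
    b = conv2(mosaic * b_mask, k_rb)
    return np.stack([r, g, b], axis=-1)
```

Bilinear interpolation is exactly the kind of conventional method that produces the color and zipper artifacts at edges mentioned in the abstract; more elaborate conventional and deep learning algorithms exist to suppress them.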

2020 ◽  
Vol 11 (5) ◽  
pp. 37-60
Author(s):  
Chiman Kwan ◽  
Jude Larkin

In modern digital cameras, the Bayer color filter array (CFA), also known as CFA 1.0, has been widely used. However, the Bayer pattern is inferior to the red-green-blue-white (RGBW) pattern, also known as CFA 2.0, in low lighting conditions where Poisson noise is present. It is well known that demosaicing algorithms cannot effectively deal with Poisson noise, and additional denoising is needed to improve image quality. In this paper, we evaluate various conventional and deep learning-based denoising algorithms for CFA 2.0 in low lighting conditions. We also investigate the impact of the location of denoising, that is, whether denoising is applied before or after the critical demosaicing step. Extensive experiments show that some denoising algorithms can indeed improve image quality in low lighting conditions. We also observed that the location of denoising plays an important role in overall demosaicing performance.
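One standard way to let Gaussian-noise denoisers operate on Poisson-corrupted low-light data, related to the mismatch the abstract describes, is a variance-stabilizing transform. The sketch below is our illustration, not necessarily the paper's method; it uses the classical Anscombe transform.

```python
import numpy as np

def anscombe(x):
    """Anscombe transform: maps Poisson counts to data with ~unit variance."""
    return 2.0 * np.sqrt(x + 3.0 / 8.0)

def inverse_anscombe(y):
    """Simple algebraic inverse (an exact unbiased inverse adds correction
    terms that matter at very low photon counts)."""
    return (y / 2.0) ** 2 - 3.0 / 8.0
```

After stabilization, any denoiser designed for additive Gaussian noise can be applied, with the inverse transform restoring the original intensity scale; the stabilization is accurate for counts above roughly 4.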


2021 ◽  
Vol 2021 (1) ◽  
pp. 21-26
Author(s):  
Abderrezzaq Sendjasni ◽  
Mohamed-Chaker Larabi ◽  
Faouzi Alaya Cheikh

360-degree image quality assessment (IQA) faces the major challenge of a lack of ground-truth databases. This problem is accentuated for deep learning-based approaches, whose performance is only as good as the available data. In this context, only two databases are used to train and validate deep learning-based IQA models. To compensate for this lack, a data-augmentation technique is investigated in this paper. We use visual scan-paths to increase the number of learning examples from existing training data. Multiple scan-paths are predicted to account for the diversity of human observers. These scan-paths are then used to select viewports from the spherical representation. Training with this data-augmentation scheme showed an improvement over training without it. We also address the question of using the MOS obtained for the 360-degree image as the quality anchor for the whole set of extracted viewports, in comparison to 2D blind quality metrics. The comparison showed the superiority of using the MOS when adopting patch-based learning.


2021 ◽  
Vol 136 (8) ◽  
Author(s):  
Mihir Durve ◽  
Fabio Bonaccorso ◽  
Andrea Montessori ◽  
Marco Lauricella ◽  
Adriano Tiribocchi ◽  
...  

The state-of-the-art deep learning-based object recognition algorithm YOLO and the object tracking algorithm DeepSORT are combined to analyze digital images from fluid dynamic simulations of multi-core emulsions and soft flowing crystals, and to track moving droplets within these complex flows. The YOLO network was trained to recognize the droplets with synthetically prepared data, thereby bypassing the labor-intensive data acquisition process. In both applications, the trained YOLO + DeepSORT procedure performs with high accuracy on the real data from the fluid simulations, with low error between the inferred droplet trajectories and independently computed ground truth. Moreover, using common desktop GPUs, the developed application is capable of analyzing data at speeds that exceed the typical image acquisition rates of digital cameras (30 fps), opening the interesting prospect of a low-cost and practical tool to study systems with many moving objects, mostly, but not exclusively, biological ones. Besides its practical applications, the procedure presented here marks the first step towards the automatic extraction of effective equations of motion of many-body soft flowing systems.
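YOLO and DeepSORT are off-the-shelf components; the heart of the tracking stage is frame-to-frame association of detections with existing tracks. As a simplified, hypothetical illustration of that step (DeepSORT additionally uses Kalman-filter motion prediction and appearance embeddings), a greedy intersection-over-union matcher can be sketched as:

```python
def iou(a, b):
    """Intersection-over-union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def associate(tracks, detections, threshold=0.3):
    """Greedily match each existing track to at most one new detection,
    taking candidate pairs in order of decreasing IoU."""
    pairs = sorted(
        ((iou(t, d), ti, di) for ti, t in enumerate(tracks)
         for di, d in enumerate(detections)),
        reverse=True)
    matches, used_t, used_d = [], set(), set()
    for score, ti, di in pairs:
        if score >= threshold and ti not in used_t and di not in used_d:
            matches.append((ti, di))
            used_t.add(ti); used_d.add(di)
    return matches
```

Unmatched detections would start new tracks and unmatched tracks would age out; production trackers replace the greedy loop with Hungarian assignment.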


2020 ◽  
pp. 1-12
Author(s):  
Hu Jingchao ◽  
Haiying Zhang

The difficulty in classroom student state recognition is how to make feature judgments based on students' facial expressions and movement states. At present, some intelligent models are not accurate in classroom student state recognition. In order to improve recognition performance, this study builds a two-level state detection framework based on deep learning and an HMM feature recognition algorithm, and extends it into a multi-level detection model through a reasonable state classification method. In addition, this study selects a continuous HMM or deep learning to reflect the dynamic generation characteristics of fatigue, and designs randomized human fatigue recognition experiments to collect and preprocess EEG data, facial video data, and subjective evaluation data of classroom students. The study then discretizes the feature indicators and builds a student state recognition model. Finally, the performance of the proposed algorithm is analyzed through experiments. The results show that the proposed algorithm has certain advantages over traditional algorithms in recognizing classroom student state features.


2020 ◽  
Vol 2020 (9) ◽  
pp. 323-1-323-8
Author(s):  
Litao Hu ◽  
Zhenhua Hu ◽  
Peter Bauer ◽  
Todd J. Harris ◽  
Jan P. Allebach

Image quality assessment has been a very active research area in the field of image processing, and numerous methods have been proposed. However, most existing methods focus on digital images that only or mainly contain pictures or photos taken by digital cameras. Traditional approaches evaluate an input image as a whole and try to estimate a quality score for the image, in order to give viewers an idea of how “good” the image looks. In this paper, we mainly focus on the quality evaluation of symbolic content such as text, barcodes, QR codes, lines, and handwriting in target images. Estimating a quality score for this kind of information can be based on whether or not it is readable by a human or recognizable by a decoder. Moreover, we mainly study the viewing quality of the scanned document of a printed image. For this purpose, we propose a novel image quality assessment algorithm that is able to determine the readability of a scanned document or regions within it. Experimental results on a set of test images demonstrate the effectiveness of our method.


Author(s):  
Kholilatul Wardani ◽  
Aditya Kurniawan

ROI (Region of Interest) Image Quality Assessment is an image quality assessment model based on the Structural Similarity Index (SSIM), applied to the specific image region to be assessed. The output assessment value used by this model is 1, meaning identical, or -1, meaning not identical. In this research, the ROI quality assessment model is used to measure image quality of Kinect sensor captures used in a Mobile HD Robot after applying a Multiple Localized Filtering Technique. The filter is applied to each depth capture from the Kinect sensor, with the aim of eliminating structural noise that occurs in the sensor. Assessment is done by comparing image quality in a given region before and after the filter is applied. The Kinect sensor is conditioned to capture a square black object measuring 10 cm x 10 cm, perpendicular to a homogeneous background (white, with RGB code 255, 255, 255). The Kinect sensor data are acquired through an EWRF 3022 by a Visual Basic 6.0 program, periodically, 10 times per session at a frequency of once per minute. The results of this trial show the same similarity index (value 1: identical) in the luminance, contrast, and structural components of the edge region of the specimen. This value indicates that, based on the ROI Image Quality Assessment model, the Multiple Localized Filtering Technique applied to the noise generated by the Kinect sensor has no effect on the image quality produced by the sensor.
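For reference, the SSIM index underlying the ROI model combines luminance, contrast, and structure terms and ranges over [-1, 1], with 1 meaning identical. The single-window sketch below is ours and omits the sliding-window averaging of the full SSIM; it can be applied directly to a cropped region of interest.

```python
import numpy as np

def ssim(x, y, data_range=1.0):
    """Single-window SSIM between two equally sized regions of interest.

    Uses the standard stabilizing constants C1 = (0.01 L)^2, C2 = (0.03 L)^2,
    where L is the dynamic range of the pixel values.
    """
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    # Luminance term x contrast/structure term.
    return ((2 * mx * my + c1) * (2 * cov + c2)) / \
           ((mx ** 2 + my ** 2 + c1) * (vx + vy + c2))
```

An ROI-based assessment simply crops both the pre-filter and post-filter images to the region under study (here, the specimen edges) before computing the index.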


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Christian Crouzet ◽  
Gwangjin Jeong ◽  
Rachel H. Chae ◽  
Krystal T. LoPresti ◽  
Cody E. Dunn ◽  
...  

Cerebral microhemorrhages (CMHs) are associated with cerebrovascular disease, cognitive impairment, and normal aging. One method to study CMHs is to analyze histological sections (5–40 μm) stained with Prussian blue. Currently, users manually and subjectively identify and quantify Prussian blue-stained regions of interest, which is prone to inter-individual variability and can lead to significant delays in data analysis. To improve this labor-intensive process, we developed and compared three digital pathology approaches to identify and quantify CMHs from Prussian blue-stained brain sections: (1) ratiometric analysis of RGB pixel values, (2) phasor analysis of RGB images, and (3) deep learning using a mask region-based convolutional neural network. We applied these approaches to a preclinical mouse model of inflammation-induced CMHs. One hundred CMHs were imaged using a 20× objective and an RGB color camera. To determine the ground truth, four users independently annotated Prussian blue-labeled CMHs. Compared to the ground truth, the deep learning and ratiometric approaches performed better than the phasor analysis approach. The deep learning approach had the highest precision of the three methods. The ratiometric approach was the most versatile and maintained accuracy, albeit with lower precision. Our data suggest that implementing these methods to analyze CMH images can drastically increase processing speed while maintaining precision and accuracy.
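A ratiometric RGB analysis of the kind described can be sketched as a per-pixel channel-ratio threshold that flags blue-dominant (Prussian blue-stained) pixels. The thresholds and function names below are hypothetical placeholders, not values from the study.

```python
import numpy as np

def prussian_blue_mask(rgb, ratio_thresh=1.4, min_blue=60):
    """Ratiometric segmentation: flag pixels whose blue channel dominates red.

    rgb: (H, W, 3) uint8 image. The ratio and intensity thresholds here are
    illustrative placeholders only.
    """
    rgb = rgb.astype(float)
    r, b = rgb[..., 0], rgb[..., 2]
    ratio = b / (r + 1e-6)        # small epsilon avoids division by zero
    return (ratio > ratio_thresh) & (b > min_blue)

def stain_area_fraction(rgb):
    """Fraction of the image area classified as stained."""
    return prussian_blue_mask(rgb).mean()
```

The appeal of the ratiometric form is its versatility: a channel ratio is largely insensitive to overall illumination, so the same thresholds transfer across sections with different exposure.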


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Danuta M. Sampson ◽  
David Alonso-Caneiro ◽  
Avenell L. Chew ◽  
Jonathan La ◽  
Danial Roshandel ◽  
...  

Adaptive optics flood illumination ophthalmoscopy (AO-FIO) is an established imaging tool in the investigation of retinal diseases. However, the clinical interpretation of AO-FIO images can be challenging due to varied image quality. Therefore, image quality assessment is essential before interpretation. An image assessment tool will also assist further work on improving image quality, either during acquisition or in post-processing. In this paper, we describe, validate, and compare two automated image quality assessment methods: the energy of the Laplacian focus operator (LAPE; not commonly used but easily implemented) and a convolutional neural network (CNN; an effective but more complex approach). We also evaluate the effects of subject age, axial length, refractive error, fixation stability, disease status, and retinal location on AO-FIO image quality. Based on analysis of 10,250 images of 50 × 50 μm size, at 41 retinal locations, from 50 subjects, we demonstrate that the CNN slightly outperforms LAPE in image quality assessment. The CNN achieves an accuracy of 89%, whereas the LAPE metric achieves 73% and 80% (for linear regression and random forest multiclass classifier methods, respectively) compared to ground truth. Furthermore, retinal location, age, and disease are factors that can influence the likelihood of poor image quality.
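The LAPE metric is indeed easy to implement: it is the energy (here, the mean of squares) of the image's Laplacian response, which is large for sharp, well-focused images and small for blurred ones. A minimal sketch, using our choice of the 4-neighbor Laplacian:

```python
import numpy as np

def lape(img):
    """Energy of Laplacian focus measure on a 2D grayscale image.

    Applies the 4-neighbor Laplacian to interior pixels and returns the
    mean squared response; higher values indicate sharper images.
    """
    lap = (-4.0 * img[1:-1, 1:-1]
           + img[:-2, 1:-1] + img[2:, 1:-1]
           + img[1:-1, :-2] + img[1:-1, 2:])
    return float((lap ** 2).mean())
```

Because LAPE is a single scalar per image, turning it into a quality class still requires a calibration step, which is where the linear regression and random forest classifiers mentioned in the abstract come in.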


2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Yi Sun ◽  
Jianfeng Wang ◽  
Jindou Shi ◽  
Stephen A. Boppart

Polarization-sensitive optical coherence tomography (PS-OCT) is a high-resolution, label-free optical biomedical imaging modality that is sensitive to the microstructural architecture in tissue that gives rise to form birefringence, such as collagen or muscle fibers. Enabling polarization sensitivity in an OCT system, however, requires additional hardware and complexity. We developed a deep-learning method to synthesize PS-OCT images by training a generative adversarial network (GAN) on OCT intensity and PS-OCT images. The synthesis accuracy was first evaluated by the structural similarity index (SSIM) between the synthetic and real PS-OCT images. Furthermore, the effectiveness of the computational PS-OCT images was validated by separately training two image classifiers, using the real and synthetic PS-OCT images, for cancer/normal classification. The similar classification results of the two trained classifiers demonstrate that the predicted PS-OCT images can potentially be used interchangeably in cancer diagnosis applications. In addition, we applied the trained GAN models to OCT images collected from a separate OCT imaging system, and the synthetic PS-OCT images correlate well with the real PS-OCT images collected from the same sample sites using the PS-OCT imaging system. This computational PS-OCT imaging method has the potential to reduce the cost, complexity, and need for hardware-based PS-OCT imaging systems.

