A Stochastic Model for Block Segmentation of Images Based on the Quadtree and the Bayes Code for It

In information theory, lossless compression of general data is based on an explicit assumption of a stochastic generative model on target data. However, in lossless image compression, researchers have mainly focused on the coding procedure that outputs the coded sequence from the input image, and the assumption of the stochastic generative model is implicit. In these studies, there is a difficulty in discussing the difference between the expected code length and the entropy of the stochastic generative model. We solve this difficulty for a class of images, in which they have non-stationarity among segments. In this paper, we propose a novel stochastic generative model of images by redefining the implicit stochastic generative model in a previous coding procedure. Our model is based on the quadtree so that it effectively represents the variable block size segmentation of images. Then, we construct the Bayes code optimal for the proposed stochastic generative model. It requires the summation of all possible quadtrees weighted by their posterior. In general, its computational cost increases exponentially for the image size. However, we introduce an efficient algorithm to calculate it in the polynomial order of the image size without loss of optimality. As a result, the derived algorithm has a better average coding rate than that of JBIG.

Download Full-text

Kinerja Metode CNN untuk Klasifikasi Pneumonia dengan Variasi Ukuran Citra Input

Jurnal Teknologi Informasi dan Ilmu Komputer ◽

10.25126/jtiik.2021834515 ◽

2021 ◽

Vol 8 (3) ◽

pp. 533

Author(s):

Budi Nugroho ◽

Eva Yulia Puspaningrum

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Extreme Learning Machine ◽

Classification Performance ◽

Input Image ◽

X Rays ◽

Image Size ◽

X Ray ◽

The Difference ◽

Learning Machine

Saat ini banyak dikembangkan proses pendeteksian pneumonia berdasarkan citra paru-paru dari hasil foto rontgen (x-ray), sebagaimana juga dilakukan pada penelitian ini. Metode yang digunakan adalah Convolutional Neural Network (CNN) dengan arsitektur yang berbeda dengan sejumlah penelitian sebelumnya. Selain itu, penelitian ini juga memodifikasi model CNN dimana metode Extreme Learning Machine (ELM) digunakan pada bagian klasifikasi, yang kemudian disebut CNN-ELM. Dataset untuk uji coba menggunakan kumpulan citra paru-paru hasil foto rontgen pada Kaggle yang terdiri atas 1.583 citra normal dan 4.237 citra pneumonia. Citra asal pada dataset kaggle ini bervariasi, tetapi hampir semua diatas ukuran 1000x1000 piksel. Ukuran citra yang besar ini dapat membuat pemrosesan klasifikasi kurang efektif, sehingga mesin CNN biasanya memodifikasi ukuran citra menjadi lebih kecil. Pada penelitian ini, pengujian dilakukan dengan variasi ukuran citra input, untuk mengetahui pengaruhnya terhadap kinerja mesin pengklasifikasi. Hasil uji coba menunjukkan bahwa ukuran citra input berpengaruh besar terhadap kinerja klasifikasi pneumonia, baik klasifikasi yang menggunakan metode CNN maupun CNN-ELM. Pada ukuran citra input 200x200, metode CNN dan CNN-ELM menunjukkan kinerja paling tinggi. Jika kinerja kedua metode itu dibandingkan, maka Metode CNN-ELM menunjukkan kinerja yang lebih baik daripada CNN pada semua skenario uji coba. Pada kondisi kinerja paling tinggi, selisih akurasi antara metode CNN-ELM dan CNN mencapai 8,81% dan selisih F1 Score mencapai 0,0729. Hasil penelitian ini memberikan informasi penting bahwa ukuran citra input memiliki pengaruh besar terhadap kinerja klasifikasi pneumonia, baik klasifikasi menggunakan metode CNN maupun CNN-ELM. Selain itu, pada semua ukuran citra input yang digunakan untuk proses klasifikasi, metode CNN-ELM menunjukkan kinerja yang lebih baik daripada metode CNN. AbstractThis research developed a pneumonia detection machine based on the lungs' images from X-rays (x-rays). The method used is the Convolutional Neural Network (CNN) with a different architecture from some previous research. Also, the CNN model is modified, where the classification process uses the Extreme Learning Machine (ELM), which is then called the CNN-ELM method. The empirical experiments dataset used a collection of lung x-ray images on Kaggle consisting of 1,583 normal images and 4,237 pneumonia images. The original image's size on the Kaggle dataset varies, but almost all of the images are more than 1000x1000 pixels. For classification processing to be more effective, CNN machines usually use reduced-size images. In this research, experiments were carried out with various input image sizes to determine the effect on the classifier's performance. The experimental results show that the input images' size has a significant effect on the classification performance of pneumonia, both the CNN and CNN-ELM classification methods. At the 200x200 input image size, the CNN and CNN-ELM methods showed the highest performance. If the two methods' performance is compared, then the CNN-ELM Method shows better performance than CNN in all test scenarios. The difference in accuracy between the CNN-ELM and CNN methods reaches 8.81% at the highest performance conditions, and the difference in F1-Score reaches 0.0729. This research provides important information that the size of the input image has a major influence on the classification performance of pneumonia, both classification using the CNN and CNN-ELM methods. Also, on all input image sizes used for the classification process, the CNN-ELM method shows better performance than the CNN method.

Download Full-text

Image Compression Using Vector Quantization with Variable Block Size Division

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.130.1431 ◽

2010 ◽

Vol 130 (8) ◽

pp. 1431-1439 ◽

Cited By ~ 1

Author(s):

Hiroki Matsumoto ◽

Fumito Kichikawa ◽

Kazuya Sasazaki ◽

Junji Maeda ◽

Yukinori Suzuki

Keyword(s):

Image Compression ◽

Vector Quantization ◽

Block Size ◽

Variable Block

Download Full-text

A Multiple Pattern Algorithm for Block Motion Estimation Based on Variable Block-size

ACTA AUTOMATICA SINICA ◽

10.3724/sp.j.1004.2009.00034 ◽

2009 ◽

Vol 35 (1) ◽

pp. 34-38

Author(s):

Wei WANG ◽

Wei LI

Keyword(s):

Motion Estimation ◽

Block Size ◽

Variable Block ◽

Block Motion ◽

Block Motion Estimation

Download Full-text

Concrete Crack Detection Based on Well-Known Feature Extractor Model and the YOLO_v2 Network

Applied Sciences ◽

10.3390/app11020813 ◽

2021 ◽

Vol 11 (2) ◽

pp. 813

Author(s):

Shuai Teng ◽

Zongchao Liu ◽

Gongfa Chen ◽

Li Cheng

Keyword(s):

Feature Extraction ◽

Crack Detection ◽

Computational Cost ◽

Concrete Structures ◽

Detection Algorithm ◽

Computational Time ◽

Image Size ◽

Important Indicator ◽

Feature Extractor ◽

Model Training

This paper compares the crack detection performance (in terms of precision and computational cost) of the YOLO_v2 using 11 feature extractors, which provides a base for realizing fast and accurate crack detection on concrete structures. Cracks on concrete structures are an important indicator for assessing their durability and safety, and real-time crack detection is an essential task in structural maintenance. The object detection algorithm, especially the YOLO series network, has significant potential in crack detection, while the feature extractor is the most important component of the YOLO_v2. Hence, this paper employs 11 well-known CNN models as the feature extractor of the YOLO_v2 for crack detection. The results confirm that a different feature extractor model of the YOLO_v2 network leads to a different detection result, among which the AP value is 0.89, 0, and 0 for ‘resnet18’, ‘alexnet’, and ‘vgg16’, respectively meanwhile, the ‘googlenet’ (AP = 0.84) and ‘mobilenetv2’ (AP = 0.87) also demonstrate comparable AP values. In terms of computing speed, the ‘alexnet’ takes the least computational time, the ‘squeezenet’ and ‘resnet18’ are ranked second and third respectively; therefore, the ‘resnet18’ is the best feature extractor model in terms of precision and computational cost. Additionally, through the parametric study (influence on detection results of the training epoch, feature extraction layer, and testing image size), the associated parameters indeed have an impact on the detection results. It is demonstrated that: excellent crack detection results can be achieved by the YOLO_v2 detector, in which an appropriate feature extractor model, training epoch, feature extraction layer, and testing image size play an important role.

Download Full-text

A Parallel Memory System for Variable Block-Size Motion Estimation Algorithms

IEEE Transactions on Circuits and Systems for Video Technology ◽

10.1109/tcsvt.2008.918273 ◽

2008 ◽

Vol 18 (4) ◽

pp. 538-543 ◽

Cited By ~ 14

Author(s):

J. Vanne ◽

E. Aho ◽

T.D. Hamalainen ◽

K. Kuusilinna

Keyword(s):

Motion Estimation ◽

Block Size ◽

Memory System ◽

Parallel Memory ◽

Estimation Algorithms ◽

Variable Block

Download Full-text

A Coarse Grained Reconfigurable Architecture for Variable Block Size Motion Estimation

2007 International Conference on Field-Programmable Technology ◽

10.1109/fpt.2007.4439235 ◽

2007 ◽

Cited By ~ 2

Author(s):

Ruchika Verma ◽

Ali Akoglu

Keyword(s):

Motion Estimation ◽

Block Size ◽

Reconfigurable Architecture ◽

Coarse Grained ◽

Variable Block

Download Full-text

An Object-Based Image Reducing Approach

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.1044-1045.1049 ◽

2014 ◽

Vol 1044-1045 ◽

pp. 1049-1052 ◽

Cited By ~ 1

Author(s):

Chin Chen Chang ◽

I Ta Lee ◽

Tsung Ta Ke ◽

Wen Kai Tai

Keyword(s):

Visual Saliency ◽

Saliency Map ◽

Input Image ◽

Image Size ◽

Target Image ◽

Feature Maps ◽

Object Based ◽

Wide Range

Common methods for reducing image size include scaling and cropping. However, these two approaches have some quality problems for reduced images. In this paper, we propose an image reducing algorithm by separating the main objects and the background. First, we extract two feature maps, namely, an enhanced visual saliency map and an improved gradient map from an input image. After that, we integrate these two feature maps to an importance map. Finally, we generate the target image using the importance map. The proposed approach can obtain desired results for a wide range of images.

Download Full-text