image characteristics
Recently Published Documents


TOTAL DOCUMENTS

429
(FIVE YEARS 118)

H-INDEX

21
(FIVE YEARS 4)

Author(s):  
Karanrat Thammarak ◽  
Prateep Kongkla ◽  
Yaowarat Sirisathitkul ◽  
Sarun Intakosum

Optical character recognition (OCR) is a technology to digitize a paper-based document to digital form. This research studies the extraction of the characters from a Thai vehicle registration certificate via a Google Cloud Vision API and a Tesseract OCR. The recognition performance of both OCR APIs is also examined. The 84 color image files comprised three image sizes/resolutions and five image characteristics. For suitable image type comparison, the greyscale and binary image are converted from color images. Furthermore, the three pre-processing techniques, sharpening, contrast adjustment, and brightness adjustment, are also applied to enhance the quality of image before applying the two OCR APIs. The recognition performance was evaluated in terms of accuracy and readability. The results showed that the Google Cloud Vision API works well for the Thai vehicle registration certificate with an accuracy of 84.43%, whereas the Tesseract OCR showed an accuracy of 47.02%. The highest accuracy came from the color image with 1024×768 px, 300dpi, and using sharpening and brightness adjustment as pre-processing techniques. In terms of readability, the Google Cloud Vision API has more readability than the Tesseract. The proposed conditions facilitate the possibility of the implementation for Thai vehicle registration certificate recognition system.


Author(s):  
Saorabh Kumar Mondal ◽  
Arpitam Chatterjee ◽  
Bipan Tudu

Image contrast enhancement (CE) is a frequent image enhancement requirement in diverse applications. Histogram equalization (HE), in its conventional and different further improved ways, is a popular technique to enhance the image contrast. The conventional as well as many of the later versions of HE algorithms often cause loss of original image characteristics particularly brightness distribution of original image that results artificial appearance and feature loss in the enhanced image. Discrete Cosine Transform (DCT) coefficient mapping is one of the recent methods to minimize such problems while enhancing the image contrast. Tuning of DCT parameters plays a crucial role towards avoiding the saturations of pixel values. Optimization can be a possible solution to address this problem and generate contrast enhanced image preserving the desired original image characteristics. Biological behavior-inspired optimization techniques have shown remarkable betterment over conventional optimization techniques in different complex engineering problems. Gray wolf optimization (GWO) is a comparatively new algorithm in this domain that has shown promising potential. The objective function has been formulated using different parameters to retain original image characteristics. The objective evaluation against CEF, PCQI, FSIM, BRISQUE and NIQE with test images from three standard databases, namely, SIPI, TID and CSIQ shows that the presented method can result in values up to 1.4, 1.4, 0.94, 19 and 4.18, respectively, for the stated metrics which are competitive to the reported conventional and improved techniques. This paper can be considered a first-time application of GWO towards DCT-based image CE.


Author(s):  
Nataliia Khymytsia ◽  
Kateryna Petryk

The purpose of the article is to analyze the methods, techniques, branding, and image strategies practiced by cafés in Lviv in the social network of Instagram in the context of SSM activities; determine the specifics of image tools in terms of communicative interaction with the target audience; identify the criteria for successful positioning in the social network Instagram for food establishments. Methodology. The analytical method, methods of description, comparison, and generalization were used to perform the research. Scientific novelty. The image characteristics of popular food establishments are investigated; the criteria of image positioning of cafés in Lviv are determined; the features of image and communication tools that are actively practiced interacting with the target audience are analyzed. Conclusions. Today, image and brand have become important criteria of trust in companies, giving them the opportunity to be one step ahead of competitors. The key goal of the image is to get a positive attitude of consumers to the company/brand. Important criteria for successful positioning in the social network of Instagram for food establishments are visual design, emotional characteristics, the well-chosen context of information content. In the image activity of the researched cafés in Lviv, in the Instagram network: techniques of creating a bright image (naming, branding) are practiced; techniques of forming a positive attitude; techniques to enhance the image and enhance the influence of the image. Key words: image, brand, image-building, corporate identity, positioning, social network, target audience, communication, communicative processes.


Author(s):  
Junfeng Li ◽  
Hao Wang

Abstract Aiming at the vehicle navigation light guide plate (LGP) image characteristics, such as complex and gradient texures, uneven brightness, and small defects, this paper proposes a visual inspection method for LGP defects based on an improved RetinaNet. First, we use ResNeXt50 with higher accuracy under the same parameters as the backbone network, and propose the lightweight module Ghost_module to replace the 1×1 convolution in the lower half of the ResNeXt_block. This can reduce the resource parameters and consumption, and speed up training and inference. Second, we propose and use an improved feature pyramid network (IFPN) module to improve the feature fusion network in RetinaNet. It can more effectively fuse the shallow semantic information and high-level semantic information in the backbone feature extraction network, and further improve the detection ability of small target defects. Finally, the defect detection dataset constructed based on the vehicle LGP images collected at a industrial site, and experiments are performed on the vehicle LGP dataset and Aluminum Profile Defect Identification dataset (Aluminum Profile DID). The experimental results show that the proposed method is both efficient and effective. It achieves a better average detection rate of 98.6% on the vehicle LGP dataset. The accuracy and real-time performance can meet the requirements of industrial detection.


2021 ◽  
Vol 2132 (1) ◽  
pp. 012018
Author(s):  
Cailing Wang ◽  
LeiChao Li ◽  
SuQiang He ◽  
Jing Zhang

Abstract As a simple, effective and non-parameter analysis method, knn is widely used in text classification, image recognition, etc. [1]. However, this method requires a lot of calculations in practical applications, and the uneven distribution of training samples will directly lead to a decrease in the accuracy of tumor image classification. To solve this problem, we propose a method based on dynamic weighted KNN to improve the accuracy of classification, which is used to solve the problem of automatic prediction and classification of medical tumor images based on image features and automatic abnormality detection. According to the classification of tumor image characteristics, it can be divided into two categories: benign and malignant. This method can assist doctors in making medical diagnosis and analysis more accurately. The experimental results show that this method has certain advantages compared with the traditional KNN algorithm.


2021 ◽  
Vol 6 (2) ◽  
pp. 35-42
Author(s):  
Sung Hun Kim

Automatic breast ultrasound (ABUS) has been developed to compensate for the shortcomings of hand-held ultrasound (HHUS) and is mainly used for breast cancer screening purposes in women with dense breasts. Since 2021, ABUS has been covered by the Korean National Health Insurance System. It is important to scan the entire breast on ABUS and to identify the poor-quality images requiring re-scanning. In addition, a general understanding of the unique ABUS display mode, distinguishing benign from malignant lesions, the diagnostic performance of breast cancer screening, and the application of computer-aided detection/diagnosis systems is necessary to use ABUS efficiently. This review explores the acquisition method, image quality, and image characteristics of ABUS to improve general understanding of this procedure and its advantages over HHUS, so that ABUS can be applied efficiently in clinical practice.


2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Liang Wang ◽  
Hui Song ◽  
Ming Wang ◽  
Hui Wang ◽  
Ran Ge ◽  
...  

The aim of this study was to evaluate the diagnostic value of artificial intelligence algorithm combined with ultrasound endoscopy in early esophageal cancer and precancerous lesions by comparing the examination of conventional endoscopy and artificial intelligence algorithm combined with ultrasound endoscopy, and by comparing the real-time diagnosis of endoscopy and the ultrasonic image characteristics of artificial intelligence algorithm combined with endoscopic detection and pathological results. 120 cases were selected. According to the inclusion and exclusion criteria, 80 patients who met the criteria were selected and randomly divided into two groups: endoscopic examination combined with ultrasound imaging based on intelligent algorithm processing (cascade region-convolutional neural network (Cascade RCNN) model algorithm group) and simple use of endoscopy group (control group). This study shows that the ultrasonic image of artificial intelligence algorithm is effective, and the detection performance is better than that of endoscopic detection. The results are close to the gold standard of doctor recognition, and the detection time is greatly shortened, and the recognition time is shortened by 71 frames per second. Compared with the traditional convolutional neural network (CNN) algorithm, the accuracy and recall of image analysis and segmentation using feature pyramid network are increased. The detection rates of CNN model, Cascade RCNN model, and endoscopic detection alone in early esophageal cancer and precancerous lesions are 56.3% (45/80), 88.8% (71/80), and 44.1% (35/80), respectively. The detection rate of Cascade RCNN model and CNN model was higher than that of endoscopy alone, and the difference was statistically significant ( P < 0.05 ). The sensitivity, specificity, positive predictive value, and negative predictive value of Cascade RCNN model were higher than those of CNN model, which was close to the gold standard for physician identification. This provided a reference basis for endoscopic ultrasound identification of early upper gastrointestinal cancer or other gastrointestinal cancers.


Author(s):  
Young Hyun Kim ◽  
Eun-Gyu Ha ◽  
Kug Jin Jeon ◽  
Chena Lee ◽  
Sang-Sun Han

Objectives: This study aimed to develop a fully automated human identification method based on a convolutional neural network (CNN) with a large-scale dental panoramic radiograph (DPR) dataset. Methods: In total, 2,760 DPRs from 746 subjects who had 2 to 17 DPRs with various changes in image characteristics due to various dental treatments (tooth extraction, oral surgery, prosthetics, orthodontics, or tooth development) were collected. The test dataset included the latest DPR of each subject (746 images) and the other DPRs (2,014 images) were used for model training. A modified VGG16 model with two fully connected layers was applied for human identification. The proposed model was evaluated with rank-1, –3, and −5 accuracies, running time, and gradient-weighted class activation mapping (Grad-CAM)–applied images. Results: This model had rank-1,–3, and −5 accuracies of 82.84%, 89.14%, and 92.23%, respectively. All rank-1 accuracy values of the proposed model were above 80% regardless of changes in image characteristics. The average running time to train the proposed model was 60.9 sec per epoch, and the prediction time for 746 test DPRs was short (3.2 sec/image). The Grad-CAM technique verified that the model automatically identified humans by focusing on identifiable dental information. Conclusion: The proposed model showed good performance in fully automatic human identification despite differing image characteristics of DPRs acquired from the same patients. Our model is expected to assist in the fast and accurate identification by experts by comparing large amounts of images and proposing identification candidates at high speed.


Sign in / Sign up

Export Citation Format

Share Document