image database
Recently Published Documents


TOTAL DOCUMENTS

1193
(FIVE YEARS 170)

H-INDEX

48
(FIVE YEARS 6)

2022 ◽  
Vol 2022 ◽  
pp. 1-8
Author(s):  
Xiaoyue Cui

Aiming at the problems of low image data retrieval accuracy and slow retrieval speed in the existing image database retrieval algorithms, this paper designs a clothing image database retrieval algorithm based on wavelet transform. Firstly, it represents the color consistency vector of clothing image, reflects the composition and distribution of image color through color histogram, quantifies the visual features of clothing image, aggregates them into a fixed size representation vector, and uses the Fair Value (FV) model to complete the collection of clothing image data. Then, the size of the clothing image is adjusted by using the size transformation technology, and the clothing pattern is divided into four moments with the same size. On this basis, the clothing image is discretized with the help of Hu invariant moment to complete the preprocessing of clothing image data. Finally, the generating function of wavelet transform is determined, and a cluster of functions is obtained through translation and expansion. The wavelet filter is decomposed into basic modules, and then, the wavelet transform is studied step by step. The clothing image data are regarded as a signal, split, predicted, and updated and input into the wavelet model, and the retrieval research of clothing image database is completed. The experimental results show that the design of the retrieval algorithm is reasonable, the retrieval data accuracy is high, and the retrieval speed is fast.


Sensors ◽  
2022 ◽  
Vol 22 (2) ◽  
pp. 434
Author(s):  
Marriam Nawaz ◽  
Tahira Nazir ◽  
Ali Javed ◽  
Usman Tariq ◽  
Hwan-Seung Yong ◽  
...  

Glaucoma is an eye disease initiated due to excessive intraocular pressure inside it and caused complete sightlessness at its progressed stage. Whereas timely glaucoma screening-based treatment can save the patient from complete vision loss. Accurate screening procedures are dependent on the availability of human experts who performs the manual analysis of retinal samples to identify the glaucomatous-affected regions. However, due to complex glaucoma screening procedures and shortage of human resources, we often face delays which can increase the vision loss ratio around the globe. To cope with the challenges of manual systems, there is an urgent demand for designing an effective automated framework that can accurately identify the Optic Disc (OD) and Optic Cup (OC) lesions at the earliest stage. Efficient and effective identification and classification of glaucomatous regions is a complicated job due to the wide variations in the mass, shade, orientation, and shapes of lesions. Furthermore, the extensive similarity between the lesion and eye color further complicates the classification process. To overcome the aforementioned challenges, we have presented a Deep Learning (DL)-based approach namely EfficientDet-D0 with EfficientNet-B0 as the backbone. The presented framework comprises three steps for glaucoma localization and classification. Initially, the deep features from the suspected samples are computed with the EfficientNet-B0 feature extractor. Then, the Bi-directional Feature Pyramid Network (BiFPN) module of EfficientDet-D0 takes the computed features from the EfficientNet-B0 and performs the top-down and bottom-up keypoints fusion several times. In the last step, the resultant localized area containing glaucoma lesion with associated class is predicted. We have confirmed the robustness of our work by evaluating it on a challenging dataset namely an online retinal fundus image database for glaucoma analysis (ORIGA). Furthermore, we have performed cross-dataset validation on the High-Resolution Fundus (HRF), and Retinal Image database for Optic Nerve Evaluation (RIM ONE DL) datasets to show the generalization ability of our work. Both the numeric and visual evaluations confirm that EfficientDet-D0 outperforms the newest frameworks and is more proficient in glaucoma classification.


2021 ◽  
Vol 38 (6) ◽  
pp. 1843-1851
Author(s):  
Ouarda Soltani ◽  
Souad Benabdelkader

The human color skin image database called SFA, specifically designed to assist research in the area of face recognition, constitutes a very important means particularly for the challenging task of skin detection. It has showed high performances comparing to other existing databases. SFA database provides multiple skin and non-skin samples, which in various combinations with each other allow creating new samples that could be useful and more effective. This particular aspect will be investigated, in the present paper, by creating four new representative skin samples according to the four rules of minimum, maximum, mean and median. The obtained samples will be exploited for the purpose of skin segmentation on the basis of the well-known Euclidean and Manhattan distance metrics. Thereafter, performances of the new representative skin samples versus performances of those skin samples, originally provided by SFA, will be illustrated. Simulation results in both SFA and UTD (University of Texas at Dallas) color face databases indicate that detection rates higher than 92% can be achieved with either measure.


Electronics ◽  
2021 ◽  
Vol 11 (1) ◽  
pp. 34
Author(s):  
Imran Nazir ◽  
Ihsan Ul Haq ◽  
Muhammad Mohsin Khan ◽  
Muhammad Bilal Qureshi ◽  
Hayat Ullah ◽  
...  

Over the last two decades, radiologists have been using multi-view images to detect tumors. Computer Tomography (CT) imaging is considered as one of the reliable imaging techniques. Many medical-image-processing techniques have been developed to diagnoses lung cancer at early or later stages through CT images; however, it is still a big challenge to improve the accuracy and sensitivity of the algorithms. In this paper, we propose an algorithm based on image fusion for lung segmentation to optimize lung cancer diagnosis. The image fusion technique was developed through Laplacian Pyramid (LP) decomposition along with Adaptive Sparse Representation (ASR). The suggested fusion technique fragments medical images into different sizes using the LP. After that, the LP is used to fuse the four decomposed layers. For the evaluation purposes of the proposed technique, the Lungs Image Database Consortium and Image Database Resource Initiative (LIDC-IDRI) was used. The results showed that the Dice Similarity Coefficient (DSC) index of our proposed method was 0.9929, which is better than recently published results. Furthermore, the values of other evaluation parameters such as the sensitivity, specificity, and accuracy were 89%, 98% and 99%, respectively, which are also competitive with the recently published results.


2021 ◽  
Vol 3 ◽  
Author(s):  
Makoto Segawa ◽  
Norio Iizuka ◽  
Hiroyuki Ogihara ◽  
Koichiro Tanaka ◽  
Hajime Nakae ◽  
...  

Tongue examination is an important diagnostic method for judging pathological conditions in Kampo (traditional Japanese medicine), but it is not easy for beginners to learn the diagnostic technique. One reason is that there are few objective diagnostic criteria for tongue examination findings, and the educational method for tongue examination is not standardized in Japan, warranting the need for a tongue image database for e-learning systems that could dramatically improve the efficiency of education. Therefore, we constructed a database comprising tongue images whose findings were determined on the basis of votes given by five Kampo medicine specialists (KMSs) and confirmed the educational usefulness of the database for tongue diagnosis e-learning systems. The study was conducted in the following five steps: development of a tongue imaging collection system, collection of tongue images, evaluation and annotation of tongue images, development of a tongue diagnosis e-learning system, and verification of the educational usefulness of this system. Five KMSs evaluated the tongue images obtained from 125 participants in the following eight aspects: (i) tongue body size, (ii) tongue body color, (iii) tongue body dryness and wetness, (iv) tooth marks on the edge of the tongue, (v) cracks on the surface of the tongue, (vi) thickness of tongue coating, (vii) color of tongue coating, and (viii) dryness and wetness of tongue coating. Medical students (MSs) were given a tongue diagnosis test using an e-learning system after a lecture on tongue diagnosis. The cumulative and individual match rates (%) (individual match rates of 100% (5/5), 80% (4/5), and 60% (3/5) are shown in parentheses, respectively) were as follows: (i) tongue body size: 92.8 (26.4/26.4/40.0); (ii) tongue body color: 83.2 (10.4/20.8/52.0); (iii) tongue body dryness and wetness: 88.8 (13.6/34.4/40.8); (iv) tooth marks on the edge of the tongue: 88.8 (6.4/35.2/47.2); (v) cracks on the surface of the tongue: 96.8 (24.0/35.2/37.6); (vi) thickness of tongue coating: 84.8 (7.2/21.6/56.0); (vii) color of tongue coating: 88.0 (15.2/37.6/35.2); and (viii) dryness and wetness of tongue coating: 74.4 (4.8/19.2/50.4). The test showed that the tongue diagnosis ability of MSs who attended a lecture on tongue diagnosis was almost the same as that of KMSs. We successfully constructed a tongue image database standardized for training specialists on tongue diagnosis and confirmed the educational usefulness of the e-learning system using a database. This database will contribute to the standardization and popularization of Kampo education.


Author(s):  
Zixin Zheng ◽  
Sijin Li ◽  
Licheng Mo ◽  
Weimao Chen ◽  
Dandan Zhang

AbstractHuman beings have a fundamental need to belong. Evaluating and dealing with social exclusion and social inclusion events, which represent negative and positive social interactions, respectively, are closely linked to our physical and mental health. In addition to traditional paradigms that simulate scenarios of social interaction, images are utilized as effective visual stimuli for research on socio-emotional processing and regulation. Since the current mainstream emotional image database lacks social stimuli based on a specific social context, we introduced an open-access image database of social inclusion/exclusion in young Asian adults (ISIEA). This database contains a set of 164 images depicting social interaction scenarios under three categories of social contexts (social exclusion, social neutral, and social inclusion). All images were normatively rated on valence, arousal, inclusion score, and vicarious feeling by 150 participants in Study 1. We additionally examined the relationships between image ratings and the potential factors influencing ratings. The importance of facial expression and social context in the image rating of ISIEA was examined in Study 2. We believe that this database allows researchers to select appropriate materials for socially related studies and to flexibly conduct experimental control.


Entropy ◽  
2021 ◽  
Vol 23 (12) ◽  
pp. 1614
Author(s):  
Justyna Golec ◽  
Tomasz Hachaj ◽  
Grzegorz Sokal

We propose an algorithm to generate graphical summarising of longer text passages using a set of illustrative pictures (TIPS). TIPS is an algorithm using a voting process that uses results of individual “weak” algorithms. The proposed method includes a summarising algorithm that generates a digest of the input document. Each sentence of the text summary is used as the input for further processing by the sentence transformer separately. A sentence transformer performs text embedding and a group of CLIP similarity-based algorithms trained on different image embedding finds semantic distances between images in the illustration image database and the input text. A voting process extracts the most matching images to the text. The TIPS algorithm allows the integration of the best (highest scored) results of the different recommendation algorithms by diminishing the influence of images that are a disjointed part of the recommendations of the component algorithms. TIPS returns a set of illustrative images that describe each sentence of the text summary. Three human judges found that the use of TIPS resulted in an increase in matching highly relevant images to text, ranging from 5% to 8% and images relevant to text ranging from 3% to 7% compared to the approach based on single-embedding schema.


2021 ◽  
pp. 371-379
Author(s):  
Yu Wu ◽  
Zun Li ◽  
Yang Guo ◽  
Songyang Zhang ◽  
Zhiguo Zhang ◽  
...  

Author(s):  
Preeti Aggarwal ◽  
H. K. Sardana ◽  
Renu Vig

In lung cancer computer-aided diagnosis (CAD) systems, having an accurate ground truth is critical and time consuming. Due to lack of ground truth and semantic information, lung CAD systems are not progressing in the manner these are supposed to. In this study, we have explored Lung Image Database Consortium (LIDC) database containing annotated pulmonary computed tomography (CT) scans, and we have used semantic and content-based image retrieval (CBIR) approach to exploit the limited amount of diagnostically labeled data in order to annotate unlabeled images with diagnoses. We evaluated the method by various combinations of lung nodule sets as queries and retrieves similar nodules from the diagnostically labeled dataset. In calculating the precision of this system Diagnosed dataset and computer-predicted malignancy data are used as ground truth for the undiagnosed query nodules. Our results indicate that CBIR expansion is an effective method for labeling undiagnosed images in order to improve the performance of CAD systems while tested on PGIMER data. Also a little knowledge of biopsy confirmed cases can also assist the physician’s as second opinion to mark the undiagnosed cases and avoid unnecessary biopsies


Diagnostics ◽  
2021 ◽  
Vol 11 (12) ◽  
pp. 2209
Author(s):  
Hafiz Abbad Ur Rehman ◽  
Chyi-Yeu Lin ◽  
Shun-Feng Su

Thyroid nodules are widespread in the United States and the rest of the world, with a prevalence ranging from 19 to 68%. The problem with nodules is whether they are malignant or benign. Ultrasonography is currently recommended as the initial modality for evaluating thyroid nodules. However, obtaining a good diagnosis from ultrasound imaging depends entirely on the radiologists levels of experience and other circumstances. There is a tremendous demand for automated and more reliable methods to screen ultrasound images more efficiently. This research proposes an efficient and quick detection deep learning approach for thyroid nodules. An open and publicly available dataset, Thyroid Digital Image Database (TDID), is used to determine the robustness of the suggested method. Each image is formatted into a pyramid tile-based data structure, which the proposed VGG-16 model evaluates to provide segmentation results for nodular detection. The proposed method adopts a top-down approach to hierarchically integrate high- and low-level features to distinguish nodules of varied sizes by employing fuse features effectively. The results demonstrated that the proposed method outperformed the U-Net model, achieving an accuracy of 99%, and was two times faster than the competitive model.


Sign in / Sign up

Export Citation Format

Share Document