Automatic Image Annotation Using Semantic Subspace Graph Spectral Clustering Algorithm

2011 ◽  
Vol 271-273 ◽  
pp. 1090-1095
Author(s):  
Yu Tang Guo ◽  
Chang Gang Han

Due to the existing of the semantic gap, images with the same or similar low level features are possibly different on semantic level. How to find the underlying relationship between the high-level semantic and low level features is one of the difficult problems for image annotation. In this paper, a new image annotation method based on graph spectral clustering with the consistency of semantics is proposed with detailed analysis on the advantages and disadvantages of the existed image annotation methods. The proposed method firstly cluster image into several semantic classes by semantic similarity measurement in the semantic subspace. Within each semantic class, images are re-clustered with visual features of region Then, the joint probability distribution of blobs and words was modeled by using Multiple-Bernoulli Relevance Model. We can annotate a unannotated image by using the joint distribution. Experimental results show the the effectiveness of the proposed approach in terms of quality of the image annotation. the consistency of high-level semantics and low level features is efficiently achieved.

2019 ◽  
Vol 1 (1) ◽  
pp. 31-39
Author(s):  
Ilham Safitra Damanik ◽  
Sundari Retno Andani ◽  
Dedi Sehendro

Milk is an important intake to meet nutritional needs. Both consumed by children, and adults. Indonesia has many producers of fresh milk, but it is not sufficient for national milk needs. Data mining is a science in the field of computers that is widely used in research. one of the data mining techniques is Clustering. Clustering is a method by grouping data. The Clustering method will be more optimal if you use a lot of data. Data to be used are provincial data in Indonesia from 2000 to 2017 obtained from the Central Statistics Agency. The results of this study are in Clusters based on 2 milk-producing groups, namely high-dairy producers and low-milk producing regions. From 27 data on fresh milk production in Indonesia, two high-level provinces can be obtained, namely: West Java and East Java. And 25 others were added in 7 provinces which did not follow the calculation of the K-Means Clustering Algorithm, including in the low level cluster.


2019 ◽  
Vol 92 (2) ◽  
pp. 213-221 ◽  
Author(s):  
S.W. Soh ◽  
Z.W. Zhong

Purpose Given the ever-growing air travel industry, there is an increasing strain on the systems that provide safe flights. Different methods have to be adopted to help to cope with the increasing demand, especially in Southeast Asia. The purpose of this study is to sectorise one existing airspace to better manage sector workloads. Design/methodology/approach Cambodia’s airspace was chosen for this study because it had only one sector and it was quickly approaching its limit. In this paper, after characterising the airspace, it was first bi-partitioned using the spectral clustering algorithm. The weights of the resulting subgraphs were then balanced through a weight-balancing algorithm. Also, a post-processing algorithm established the sector boundary to be drawn. The method was first carried out on one test airspace. Following the successful sectorisation of the test airspace, the actual Cambodian airspace was sectorised. The resulting two new sectors were then calculated to be able to last for approximately five years before they would reach their capacity. Hence, a further sectorisation was carried out. This resulted in four sectors, which were projected to last more than 10 years. Findings The method produced satisfactory results. The methodology presented is proven to be effective in achieving the sectorisation. The workloads of the new sectors obtained are balanced, and the sector boundaries are at least 15 km away from the air routes and nodes. The methodology is also general and can be applied to different scenarios. This means that applications to other airspace in the region are possible, which can further help to increase the safety, efficiency and capacity of the air traffic movement in this region. Originality/value This paper presents one of the approaches for airspace sector designs. The problems are clearly presented with references. The authors discuss the advantages and disadvantages of subdividing airspace and the need to sectorise Cambodia’s airspace, and present a method to solve the sectorisation problem. It is very precious to apply methodologies and algorithms to real cases. The presented method offers significant advantages such as the ease of implementation and efficiency. The problems can easily be solved using standard linear algebra algorithms. Instead of looking at the airspace as a whole, and generating new sector boundaries, our algorithm uses current sector boundaries and bisects them. Moreover, only sectors that require sectorisation would be affected. This algorithm has the advantage of maintaining the current sector boundaries to prevent radical changes to daily operations. The Voronoi diagram used in this work does not generate polygonal cells. It instead calculates the area based on pixels. The advantage of doing this is that it offers higher flexibility. Also, the sector boundary is generated based on straight lines calculated by joining the midpoints of links. This is simple and ensures that sections of the sector boundary are made up of straight, distinct lines. The authors also discuss the problems of the method and presented solutions to them.


2012 ◽  
Vol 2012 ◽  
pp. 1-19 ◽  
Author(s):  
Chih-Fong Tsai

Content-based image retrieval (CBIR) systems require users to query images by their low-level visual content; this not only makes it hard for users to formulate queries, but also can lead to unsatisfied retrieval results. To this end, image annotation was proposed. The aim of image annotation is to automatically assign keywords to images, so image retrieval users are able to query images by keywords. Image annotation can be regarded as the image classification problem: that images are represented by some low-level features and some supervised learning techniques are used to learn the mapping between low-level features and high-level concepts (i.e., class labels). One of the most widely used feature representation methods is bag-of-words (BoW). This paper reviews related works based on the issues of improving and/or applying BoW for image annotation. Moreover, many recent works (from 2006 to 2012) are compared in terms of the methodology of BoW feature generation and experimental design. In addition, several different issues in using BoW are discussed, and some important issues for future research are discussed.


2013 ◽  
Vol 411-414 ◽  
pp. 1372-1376
Author(s):  
Wei Tin Lin ◽  
Shyi Chyi Cheng ◽  
Chih Lang Lin ◽  
Chen Kuei Yang

An approach to improve the regions of interesting (ROIs) selection accuracy automatically for medical images is proposed. The aim of the study is to select the most interesting regions of image features that good for diffuse objects detection or classification. We use the AHP (Analytic Hierarchy Process) to obtain physicians high-level diagnosis vectors and are clustered using the well-known K-Means clustering algorithm. The system also automatically extracts low-level image features for improving to detect liver diseases from ultrasound images. The weights of low-level features are adaptively updated according the feature variances in the class. Finally, the high-level diagnosis decision is made based on the high-level diagnosis vectors for the top K near neighbors from the medical experts classified database. Experimental results show the effectiveness of the system.


Author(s):  
Nanda Erlangga ◽  
Solikhun Solikhun ◽  
Irawan Irawan

Corn needs are currently experiencing a fairly rapid development can be seen in terms of the domestic market, here researchers want to increase the productivity and quality of corn production. The data that will be used is the data from the Central Statistics Agency. The method in this study is the K-means clustering algorithm and the application used is Rapidminer which will be grouped into 2 clustering, namely high and low. The results of this study are 2 high level cluster provinces, 32 low level cluster provincesKeywords: Corn, Data mining, K-means Clustering c


2021 ◽  
Vol 6 (2) ◽  
pp. 161-167
Author(s):  
Eduard Yakubchykt ◽  
◽  
Iryna Yurchak

Finding similar images on a visual sample is a difficult AI task, to solve which many works are devoted. The problem is to determine the essential properties of images of low and higher semantic level. Based on them, a vector of features is built, which will be used in the future to compare pairs of images. Each pair always includes an image from the collection and a sample image that the user is looking for. The result of the comparison is a quantity called the visual relativity of the images. Image properties are called features and are evaluated by calculation algorithms. Image features can be divided into low-level and high-level. Low-level features include basic colors, textures, shapes, significant elements of the whole image. These features are used as part of more complex recognition tasks. The main progress is in the definition of high-level features, which is associated with understanding the content of images. In this paper, research of modern algorithms is done for finding similar images in large multimedia databases. The main problems of determining high-level image features, algorithms of overcoming them and application of effective algorithms are described. The algorithms used to quickly determine the semantic content and improve the search accuracy of similar images are presented. The aim: The purpose of work is to conduct comparative analysis of modern image retrieval algorithms and retrieve its weakness and strength.


2021 ◽  
Author(s):  
Rui Zhang

This thesis is primarily focused on the information combination at different levels of a statistical pattern classification framework for image annotation and retrieval. Based on the previous study within the fields of image annotation and retrieval, it has been well-recognized that the low-level visual features, such as color and texture, and high-level features, such as textual description and context, are distinct yet complementary in terms of their distributions and the corresponding discriminative powers of dealing with machine-based recognition and retrieval tasks. Therefore, effective feature combination for image annotation and retrieval has become a desirable and promising perspective from which the semantic gap can be further bridged. Motivated by this fact, the combination of the visual and context modalities and that of different features in the visual domain are tackled by developing two statistical patterns classification approaches considering that the features of the visual modality and those across different modalities exhibit different degrees of heterogeneities, and thus, should be treated differently. Regarding the cross-modality feature combination, a Bayesian framework is proposed to integrate visual content and context, which has been applied to various image annotation and retrieval frameworks. In terms of the combination of different low-level features in the visual domain, the problem is tackled with a novel method that combines texture and color features via a mixture model of their joint distribution. To evaluate the proposed frameworks, many different datasets are employed in the experiments, including the COREL database for image retrieval and the MSRC, LabelMe, PASCAL VOC2009, and an animal image database collected by ourselves for image annotation. Using various evaluation criteria, the first framework is shown to be more effective than the methods purely based on the low-level features or high-level context. As for the second, the experimental results demonstrate not only its superior performance to other feature combination methods but also its ability to discover visual clusters using texture and color simultaneously. Moreover, a demo search engine based on the Bayesian framework is implemented and available online.


Author(s):  
Kalaivani Anbarasan ◽  
Chitrakala S.

The content based image retrieval system retrieves relevant images based on image features. The lack of performance in the content based image retrieval system is due to the semantic gap. Image annotation is a solution to bridge the semantic gap between low-level content features and high-level semantic concepts Image annotation is defined as tagging images with a single or multiple keywords based on low-level image features. The major issue in building an effective annotation framework is the integration of both low level visual features and high-level textual information into an annotation model. This chapter focus on new statistical-based image annotation model towards semantic based image retrieval system. A multi-label image annotation with multi-level tagging system is introduced to annotate image regions with class labels and extract color, location and topological tags of segmented image regions. The proposed method produced encouraging results and the experimental results outperformed state-of-the-art methods


2021 ◽  
Author(s):  
Rui Zhang

This thesis is primarily focused on the information combination at different levels of a statistical pattern classification framework for image annotation and retrieval. Based on the previous study within the fields of image annotation and retrieval, it has been well-recognized that the low-level visual features, such as color and texture, and high-level features, such as textual description and context, are distinct yet complementary in terms of their distributions and the corresponding discriminative powers of dealing with machine-based recognition and retrieval tasks. Therefore, effective feature combination for image annotation and retrieval has become a desirable and promising perspective from which the semantic gap can be further bridged. Motivated by this fact, the combination of the visual and context modalities and that of different features in the visual domain are tackled by developing two statistical patterns classification approaches considering that the features of the visual modality and those across different modalities exhibit different degrees of heterogeneities, and thus, should be treated differently. Regarding the cross-modality feature combination, a Bayesian framework is proposed to integrate visual content and context, which has been applied to various image annotation and retrieval frameworks. In terms of the combination of different low-level features in the visual domain, the problem is tackled with a novel method that combines texture and color features via a mixture model of their joint distribution. To evaluate the proposed frameworks, many different datasets are employed in the experiments, including the COREL database for image retrieval and the MSRC, LabelMe, PASCAL VOC2009, and an animal image database collected by ourselves for image annotation. Using various evaluation criteria, the first framework is shown to be more effective than the methods purely based on the low-level features or high-level context. As for the second, the experimental results demonstrate not only its superior performance to other feature combination methods but also its ability to discover visual clusters using texture and color simultaneously. Moreover, a demo search engine based on the Bayesian framework is implemented and available online.


2012 ◽  
Vol 20 (3) ◽  
pp. 505-545
Author(s):  
Kavita E. Thomas ◽  
Elena Andonova

In this paper we investigate the effect of level of understanding revealed by feedback in the form of clarification requests from a route follower on a route giver’s spatial perspective choice in their response in route instruction dialogues. In an experiment varying the level of understanding displayed by route follower clarification requests (the independent variable), route giver perspective switching in response to this feedback is investigated. Three levels of understanding displayed by feedback are investigated: (1) low-level clarification requests indicating that the instruction was not processed, (2) semantic-level clarification requests indicating that the spatial direction given in the instruction could not be resolved as the speaker of the clarification request could not interpret which perspective was intended, and (3) high-level feedback which indicates that the route giver’s instruction was understood but which note an obstacle to following the instruction. Results show that perspective choice, which is a conceptual feature of language use, is sensitive to perceived level of addressee understanding. We found that route givers consistently switch perspectives in responding to semantic-level clarification requests but not in response to low-level ones, and also that switching occurs more for high-level feedback than for low-level feedback. We address how dialogue systems can take advantage of these findings by modelling our results in an Information State model of dialogue, presenting update rules for response generation which account for our findings and also update rules which enable generation of the feedback themselves.


Sign in / Sign up

Export Citation Format

Share Document