Spatial orientations of visual word pairs to improve Bag-of-Visual-Words model

KLASIFIKASI EKSPRESI WAJAH MENGGUNAKAN BAG OF VISUAL WORDS

JOURNAL OF INFORMATICS AND TELECOMMUNICATION ENGINEERING ◽

10.31289/jite.v1i2.1426 ◽

2018 ◽

Vol 1 (2) ◽

pp. 73 ◽

Cited By ~ 4

Author(s):

Muhathir Muhathir

Keyword(s):

Visual Word ◽

Interest Point ◽

Bag Of Visual Words ◽

Visual Words ◽

Speed Up ◽

Speed Up Robust Feature

<div><p class="Abstract">Pada hakikatnya, manusia dapat membedakan pola terhadap suatu objek berdasarkan bentuk visual yang mengandung keadaan emosional. Seperti membedakan ekspresi wajah seseorang pada suatu citra. Manusia dapat membedakan ekspresi pada citra tersebut secara kasat mata. Namun komputer yang tidak dapat mengenali ekspresi wajah tersebut. Bag of visual words merupakan suatu skema untuk mengklasifikasikan citra berdasarkan nilai-nilai pixel pada citra. Dengan menggunakan deteksi interest point dan ekstraksi interest point, bag of visual words mengambil ciri unik pada citra sehingga dapat membedakan pola-pola yang terdapat pada suatu citra. Bag of visual word dengan nilai K 500 mampu mengklasifikasi pola ekspresi wajah dengan tingkat akurasi 69%,</p></div>Kata kunci<strong>: </strong><em>Wajah, Klasifikasi, Speed-up Robust Feature, Bag of visual words, Ekspresi</em>

Download Full-text

Segmentation Free Word Spotting for Handwritten Documents Using Bag of Visual Words Based on Co-HOG Descriptor

International Journal of Information Retrieval Research ◽

10.4018/ijirr.2019040105 ◽

2019 ◽

Vol 9 (2) ◽

pp. 49-65

Author(s):

Thontadari C. ◽

Prabhakar C. J.

Keyword(s):

Visual Information ◽

Spatial Information ◽

Spatial Location ◽

Visual Word ◽

Bag Of Visual Words ◽

Word Spotting ◽

Handwritten Documents ◽

Visual Words ◽

Handwritten Document ◽

Free Word

In this article, the authors propose a segmentation-free word spotting in handwritten document images using a Bag of Visual Words (BoVW) framework based on the co-occurrence histogram of oriented gradient (Co-HOG) descriptor. Initially, the handwritten document is represented using visual word vectors which are obtained based on the frequency of occurrence of Co-HOG descriptor within local patches of the document. The visual word representation vector does not consider their spatial location and spatial information helps to determine a location exclusively with visual information when the different location can be perceived as the same. Hence, to add spatial distribution information of visual words into the unstructured BoVW framework, the authors adopted spatial pyramid matching (SPM) technique. The performance of the proposed method evaluated using popular datasets and it is confirmed that the authors' method outperforms existing segmentation free word spotting techniques.

Download Full-text

Dynamic Spatio-Temporal Bag of Expressions (D-STBoE) Model for Human Action Recognition

Sensors ◽

10.3390/s19122790 ◽

2019 ◽

Vol 19 (12) ◽

pp. 2790 ◽

Cited By ~ 5

Author(s):

Saima Nazir ◽

Muhammad Haroon Yousaf ◽

Jean-Christophe Nebel ◽

Sergio A. Velastin

Keyword(s):

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Fixed Number ◽

Visual Word ◽

Support Vector ◽

Bag Of Visual Words ◽

Visual Words ◽

Visual Expression ◽

Spatio Temporal

Human action recognition (HAR) has emerged as a core research domain for video understanding and analysis, thus attracting many researchers. Although significant results have been achieved in simple scenarios, HAR is still a challenging task due to issues associated with view independence, occlusion and inter-class variation observed in realistic scenarios. In previous research efforts, the classical bag of visual words approach along with its variations has been widely used. In this paper, we propose a Dynamic Spatio-Temporal Bag of Expressions (D-STBoE) model for human action recognition without compromising the strengths of the classical bag of visual words approach. Expressions are formed based on the density of a spatio-temporal cube of a visual word. To handle inter-class variation, we use class-specific visual word representation for visual expression generation. In contrast to the Bag of Expressions (BoE) model, the formation of visual expressions is based on the density of spatio-temporal cubes built around each visual word, as constructing neighborhoods with a fixed number of neighbors could include non-relevant information making a visual expression less discriminative in scenarios with occlusion and changing viewpoints. Thus, the proposed approach makes the model more robust to occlusion and changing viewpoint challenges present in realistic scenarios. Furthermore, we train a multi-class Support Vector Machine (SVM) for classifying bag of expressions into action classes. Comprehensive experiments on four publicly available datasets: KTH, UCF Sports, UCF11 and UCF50 show that the proposed model outperforms existing state-of-the-art human action recognition methods in term of accuracy to 99.21%, 98.60%, 96.94 and 94.10%, respectively.

Download Full-text

Pooling region learning of visual word for image classification using bag-of-visual-words model

PLoS ONE ◽

10.1371/journal.pone.0234144 ◽

2020 ◽

Vol 15 (6) ◽

pp. e0234144

Author(s):

Ye Xu ◽

Xiaodong Yu ◽

Tian Wang ◽

Zezhong Xu

Keyword(s):

Image Classification ◽

Visual Word ◽

Bag Of Visual Words ◽

Visual Words

Download Full-text

KLASIFIKASI EKSPRESI WAJAH MENGGUNAKAN BAG OF VISUAL WORDS

InfoTekJar (Jurnal Nasional Informatika dan Teknologi Jaringan) ◽

10.30743/infotekjar.v2i2.181 ◽

2018 ◽

Vol 2 (2) ◽

pp. 118-123

Author(s):

Muhathir .

Keyword(s):

Visual Word ◽

Interest Point ◽

Bag Of Visual Words ◽

Visual Words ◽

Speed Up ◽

Speed Up Robust Feature

Pada hakikatnya, manusia dapat membedakan pola terhadap suatu objek berdasarkan bentuk visual yang mengandung keadaan emosional. Seperti membedakan ekspresi wajah seseorang pada suatu citra. Manusia dapat membedakan ekspresi pada citra tersebut secara kasat mata. Namun komputer yang tidak dapat mengenali ekspresi wajah tersebut. Bag of visual words merupakan suatu skema untuk mengklasifikasikan citra berdasarkan nilai-nilai pixel pada citra. Dengan menggunakan deteksi interest point dan ekstraksi interest point, bag of visual words mengambil ciri unik pada citra sehingga dapat membedakan pola-pola yang terdapat pada suatu citra. Bag of visual word dengan nilai K 500 mampu mengklasifikasi pola ekspresi wajah dengan tingkat akurasi 69%,Kata kunci: Wajah, Klasifikasi, Speed-up Robust Feature, Bag of visual words, Ekspresi

Download Full-text

Balinese Mask Characters Classification using Bag of Visual Words Model

Journal of Electrical Electronics and Informatics ◽

10.24843/jeei.2021.v05.i01.p05 ◽

2021 ◽

Vol 5 (1) ◽

pp. 25

Author(s):

Komang Budiarta ◽

Dewa Made Wiharta ◽

Komang Oka Saputra

Keyword(s):

Cultural Heritage ◽

Feature Detection ◽

Visual Word ◽

Bag Of Visual Words ◽

Visual Words ◽

Accuracy Level

Mask, often known by Balinese as “Tapel”, is made of pule wood. It depicts the representation of characters in the “badbad” or legend. Bali has many types of mask dances that are often performed, which makes tourists interested in visiting Bali. Unfortunately, many tourists do not know the information contained in Balinese masks. The most important information contained in the character of the Balinese masks. The characters of each mask are different even though they have the same type. As mask art is also a cultural heritage from generation to generation, it needs to be preserved. It is necessary to have information in the form of technology that can distinguish the characters from Balinese masks. In this study, bag of visual word method in the classification process of Balinese mask characters is used, where in this method, there are several algorithms used, namely SURF as feature detection, K-Means as a clustering process to get the value of feature quantization, and SVM as a classification of Balinese mask character. The result of the accuracy level obtained from the testing process is 80%.

Download Full-text