A weighting approach for KNN classifier

Within the Pattern Recognition field, two representations are generally considered for encoding the data: statistical codifications, which describe elements as feature vectors, and structural representations, which encode elements as high-level symbolic data structures such as strings, trees or graphs. While the vast majority of classifiers can handle statistical spaces, only a few methods are suitable for structural representations. The kNN classifier is one of the few algorithms capable of tackling both statistical and structural spaces. This method is based on computing the dissimilarity between all the samples of the set, which is the main reason for its high versatility but also for its low efficiency. Prototype Generation is one way of alleviating this issue. These mechanisms generate a reduced version of the initial dataset by performing data transformation and aggregation processes on the initial collection. Nevertheless, these generation processes depend heavily on the data representation considered and are generally not well defined for structural data. In this work we present the adaptation of the generation-based reduction algorithm Reduction through Homogeneous Clusters to the case of string data. This algorithm performs the reduction by partitioning the space into class-homogeneous clusters and then generating a representative prototype as the median value of each group. Thus, the main issue to tackle is the retrieval of the median element of a set of strings. Our comprehensive experimentation comparatively assesses the performance of this algorithm in both the statistical and the string-based spaces. Results support the relevance of our approach by showing a competitive compromise between classification rate and data reduction.
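The median of a set of strings can be illustrated with a small sketch. It assumes the "set median" (the member of the cluster with the smallest total edit distance to the other members) and unit-cost Levenshtein distance; the abstract does not say which median computation or string distance the authors use, so the function names and the example cluster below are hypothetical.

```python
from typing import Sequence


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance with unit costs (an assumed dissimilarity measure)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def set_median(strings: Sequence[str]) -> str:
    """Return the member of `strings` with the smallest summed distance to all members."""
    return min(strings, key=lambda s: sum(edit_distance(s, t) for t in strings))


# Hypothetical class-homogeneous cluster; its prototype is the set median.
cluster = ["kitten", "sitten", "sitting", "mitten"]
prototype = set_median(cluster)
print(prototype)
```

Restricting the prototype to an existing member keeps the search quadratic in the cluster size; a true generalized median string could lie outside the set and is considerably harder to compute.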


Video event classification using KNN classifier with hybrid features

Materials Today Proceedings ◽

10.1016/j.matpr.2021.03.154 ◽

2021 ◽

Author(s):

Susmitha Alamuru ◽

Sanjay Jain

Keyword(s):

Hybrid Features ◽

Event Classification ◽

Video Event ◽

Knn Classifier


On the activity detection with incomplete acceleration data using iterative KNN classifier

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/smc.2016.7844779 ◽

2016 ◽

Author(s):

Gamze Uslu ◽

Sebnem Baydere

Keyword(s):

Activity Detection ◽

Knn Classifier ◽

Acceleration Data


Decision fusion-based approach for content-based image classification

International Journal of Intelligent Computing and Cybernetics ◽

10.1108/ijicc-07-2016-0025 ◽

2017 ◽

Vol 10 (3) ◽

pp. 310-331 ◽

Cited By ~ 4

Author(s):

Sudeep Thepade ◽

Rik Das ◽

Saurav Ghosh

Keyword(s):

Feature Extraction ◽

Image Classification ◽

Image Recognition ◽

Extraction Techniques ◽

Data Set ◽

Content Type ◽

Knowledge Fusion ◽

Knn Classifier ◽

Query Classification ◽

First Time

Purpose Current practices in data classification and retrieval have experienced a surge in the use of multimedia content. Identifying the desired information in huge image databases has made the design of an efficient feature extraction process increasingly complex. Conventional approaches to image classification with text-based image annotation have faced assorted limitations due to erroneous interpretation of vocabulary and the large amount of time consumed by manual annotation. Content-based image recognition has emerged as an alternative to combat the aforesaid limitations. However, exploring the rich feature content of an image with a single technique has a lower probability of extracting meaningful signatures than multi-technique feature extraction. Therefore, the purpose of this paper is to explore the possibilities of enhanced content-based image recognition by fusion of classification decisions obtained using diverse feature extraction techniques.

Design/methodology/approach Three novel techniques of feature extraction have been introduced in this paper and have been tested with four different classifiers individually. The four classifiers used for performance testing were the K nearest neighbor (KNN) classifier, the RIDOR classifier, an artificial neural network classifier and a support vector machine classifier. Thereafter, the classification decisions obtained using the KNN classifier for the different feature extraction techniques have been integrated by Z-score normalization and feature scaling to create a fusion-based framework for image recognition. This is followed by the introduction of a fusion-based retrieval model to validate the retrieval performance with a classified query. Earlier works on content-based image identification have adopted a fusion-based approach. However, to the best of the authors’ knowledge, fusion-based query classification has been addressed for the first time as a precursor of retrieval in this work.

Findings The proposed fusion techniques have successfully outclassed the state-of-the-art techniques in classification and retrieval performance. Four public data sets, namely, the Wang data set, the Oliva and Torralba (OT-scene) data set, the Corel data set and the Caltech data set, comprising 22,615 images in total, are used for the evaluation.

Originality/value To the best of the authors’ knowledge, fusion-based query classification has been addressed for the first time as a precursor of retrieval in this work. The novel idea of exploring rich image features by fusion of multiple feature extraction techniques has also encouraged further research on dimensionality reduction of feature vectors for enhanced classification results.
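One way Z-score normalization can make KNN evidence from different feature extraction techniques comparable is sketched below. This is an assumption-laden illustration, not the authors' framework: it assumes that each technique yields a list of query-to-training distances, that the normalized distances are simply summed, and that the fused distances feed a majority vote. The function names, the two distance lists and the labels are all hypothetical.

```python
from collections import Counter
from statistics import mean, pstdev


def z_scores(values):
    """Rescale a list of distances to zero mean and unit standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def fused_knn_label(distance_lists, labels, k=3):
    """distance_lists[t][i] is the query-to-item-i distance under technique t (assumed layout)."""
    normalized = [z_scores(d) for d in distance_lists]
    fused = [sum(column) for column in zip(*normalized)]  # assumed fusion rule: plain sum
    nearest = sorted(range(len(labels)), key=lambda i: fused[i])[:k]
    return Counter(labels[i] for i in nearest).most_common(1)[0][0]


# Hypothetical distances from one query image to four training images.
labels = ["beach", "beach", "bus", "bus"]
color_dist = [0.2, 0.3, 0.9, 0.8]        # small numeric range
texture_dist = [12.0, 40.0, 35.0, 50.0]  # much larger numeric range
print(fused_knn_label([color_dist, texture_dist], labels, k=3))
```

Without the normalization step the texture distances would dominate the sum purely because of their scale, which is the usual motivation for Z-scoring before fusion.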


DS-kNN

International Journal of Information Security and Privacy ◽

10.4018/ijisp.2021040107 ◽

2021 ◽

Vol 15 (2) ◽

pp. 131-144

Author(s):

Redha Taguelmimt ◽

Rachid Beghdad

Keyword(s):

Intrusion Detection ◽

False Positive ◽

Detection Rate ◽

Nearest Neighbors ◽

The Other ◽

Intrusion Detection Systems ◽

K Nearest Neighbors ◽

Detection Systems ◽

Knn Classifier ◽

Better Than

On the one hand, many intrusion detection systems (IDSs) have been proposed in the literature. On the other hand, many studies try to identify the features that best detect attacks. This paper presents a new, easy-to-implement approach to intrusion detection, named distance sum-based k-nearest neighbors (DS-kNN), which is an improved version of the k-NN classifier. Given a data sample to classify, DS-kNN computes the distance sum of the k-nearest neighbors of the data sample in each of the possible classes of the dataset. Then, the data sample is assigned to the class having the smallest sum. The experimental results show that the DS-kNN classifier performs better than the original k-NN algorithm in terms of accuracy, detection rate, false positive rate, and attack classification. The authors mainly compare DS-kNN to CANN, but also to SVM, S-NDAE, and DBN. The obtained results also show that the approach is very competitive.
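The decision rule described in this abstract can be sketched in a few lines. The sketch assumes Euclidean distance and plain numeric feature tuples; the abstract does not specify the distance measure or any preprocessing, and the function names and toy data are hypothetical.

```python
import math
from collections import defaultdict


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric tuples (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def ds_knn_predict(train_X, train_y, query, k=3):
    """Assign `query` to the class whose k nearest members have the smallest distance sum."""
    distances_by_class = defaultdict(list)
    for x, label in zip(train_X, train_y):
        distances_by_class[label].append(euclidean(x, query))
    # Note: a class with fewer than k samples sums fewer terms, which favors it.
    sums = {label: sum(sorted(d)[:k]) for label, d in distances_by_class.items()}
    return min(sums, key=sums.get)


# Hypothetical two-class toy data.
train_X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
train_y = ["normal", "normal", "normal", "attack", "attack", "attack"]
print(ds_knn_predict(train_X, train_y, (1, 1), k=2))
```

Unlike majority-vote k-NN, this rule always looks at k neighbors per class, so a class is never ignored just because none of its members fall among the k globally nearest samples.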


A Framework for Efficient Recognition and Classification of Acute Lymphoblastic Leukemia with a Novel Customized-KNN Classifier

Journal of Computing and Information Technology ◽

10.20532/cit.2018.1004123 ◽

2018 ◽

pp. 131-140 ◽

Cited By ~ 3

Author(s):

Duraiswamy Umamaheswari ◽

Shanmugam Geetha

Keyword(s):

Acute Lymphoblastic Leukemia ◽

Lymphoblastic Leukemia ◽

Knn Classifier ◽

Efficient Recognition


Handwritten English Character Recognition and translate English to Devnagari Words

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit19528 ◽

2019 ◽

pp. 142-151

Author(s):

Shivali Parkhedkar ◽

Shaveri Vairagade ◽

Vishakha Sakharkar ◽

Bharti Khurpe ◽

Arpita Pikalmunde ◽

...

Keyword(s):

Pattern Recognition ◽

Feature Extraction ◽

Character Recognition ◽

Computational Time ◽

Feature Vectors ◽

Form Processing ◽

Knn Classifier ◽

Gabor Feature ◽

Handwritten Document ◽

The Individual

In our proposed work we take on the challenge of recognizing handwritten words. The handwritten document is scanned using a scanner. The image of the scanned document is processed using the program. Each character in the word is isolated. Then each isolated character is subjected to “Feature Extraction” by the Gabor Feature. The extracted features are passed through a KNN classifier. Finally, we get the recognized word. Character recognition is a process by which a computer recognizes handwritten characters and turns them into a format that a user can understand. Computer-based pattern recognition is a process that involves many sub-processes. Today, character recognition has gained a lot of attention within the field of pattern recognition. Handwritten character recognition is useful for cheque processing in banks, form processing systems and many more. Character recognition is one of the popular and challenging areas of research. In future, character recognition can help create a paperless environment. The novelty of this approach is to achieve better accuracy and reduced computational time for the recognition of handwritten characters. The proposed method extracts the geometric features of the character contour. These features are based on the basic line types that form the character skeleton. The system offers a feature vector as its output. The feature vectors generated from a training set were then used to train a pattern recognition engine based on Neural Networks so that the system can be benchmarked. The proposed algorithm concentrates on the same. It extracts the different line types that form a particular character. It also concentrates on the point features of the same. The feature extraction technique explained was tested using a Neural Network which was trained with the feature vectors obtained from the proposed method.


A Simple and Effective Approach Based on a Multi-Level Feature Selection for Automated Parkinson’s Disease Detection

Journal of Personalized Medicine ◽

10.3390/jpm12010055 ◽

2022 ◽

Vol 12 (1) ◽

pp. 55

Author(s):

Fatih Demir ◽

Kamran Siddique ◽

Mohammed Alswaitti ◽

Kursat Demir ◽

Abdulkadir Sengur

Keyword(s):

Parkinson’s Disease ◽

Feature Selection ◽

Early Diagnosis ◽

Neurodegenerative Disorder ◽

Bayesian Optimization ◽

L1 Norm ◽

Bayesian Optimization Algorithm ◽

Knn Classifier ◽

Multi Level

Parkinson’s disease (PD), which is a slowly progressing neurodegenerative disorder, negatively affects people’s daily lives. Early diagnosis is of great importance to minimize the effects of PD. One of the most important symptoms in the early diagnosis of PD is the monotony and distortion of speech. Artificial intelligence-based approaches can help specialists and physicians to automatically detect these disorders. In this study, a new and powerful approach based on multi-level feature selection was proposed to detect PD from features extracted from voice recordings of already-diagnosed cases. At the first level, feature selection was performed with the Chi-square and L1-Norm SVM algorithms (CLS). Then, the features selected by these algorithms were combined to increase the representation power of the samples. At the last level, those samples that were highly distinctive from the combined feature set were selected with feature importance weights using the ReliefF algorithm. In the classification stage, popular classifiers such as KNN, SVM, and DT were used for machine learning, and the best performance was achieved with the KNN classifier. Moreover, the hyperparameters of the KNN classifier were selected with the Bayesian optimization algorithm, and the performance of the proposed approach was further improved. The proposed approach was evaluated using a 10-fold cross-validation technique on a dataset containing PD and normal classes, and a classification accuracy of 95.4% was achieved.
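The evaluation protocol mentioned at the end of this abstract, 10-fold cross-validation of a KNN classifier, can be sketched as follows. The sketch covers only the evaluation loop: the feature selection stages and the Bayesian hyperparameter search are not reproduced, folds are formed by simple interleaving without shuffling or stratification, and the data are synthetic. All names and values are hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Majority vote among the k training samples closest to `query` (Euclidean)."""
    order = sorted(range(len(train_X)), key=lambda i: math.dist(train_X[i], query))
    return Counter(train_y[i] for i in order[:k]).most_common(1)[0][0]


def k_fold_accuracy(X, y, n_folds=10, k=3):
    """Average accuracy over n_folds interleaved folds (no shuffling or stratification)."""
    n = len(X)
    correct = 0
    for fold in range(n_folds):
        test_idx = set(range(fold, n, n_folds))  # every n_folds-th sample is held out
        train_X = [X[i] for i in range(n) if i not in test_idx]
        train_y = [y[i] for i in range(n) if i not in test_idx]
        for i in test_idx:
            if knn_predict(train_X, train_y, X[i], k) == y[i]:
                correct += 1
    return correct / n


# Synthetic, well-separated two-class data standing in for selected voice features.
X = [(i * 0.1, 0.0) for i in range(20)] + [(10 + i * 0.1, 0.0) for i in range(20)]
y = ["normal"] * 20 + ["PD"] * 20
print(k_fold_accuracy(X, y))
```

Each sample is used for testing exactly once and for training in the other nine folds, so the reported figure is an average over held-out predictions rather than a single train/test split.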


Optimization of deep learning features for age-invariant face recognition

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v10i2.pp1833-1841 ◽

2020 ◽

Vol 10 (2) ◽

pp. 1833

Author(s):

Amal A. Moustafa ◽

Ahmed Elnakib ◽

Nihal F. F. Areed

Keyword(s):

Deep Learning ◽

Face Recognition ◽

Nearest Neighbor ◽

Recognition Rate ◽

Manhattan Distance ◽

Distance Metrics ◽

K Nearest Neighbor ◽

Face Images ◽

Knn Classifier ◽

Deep Learning Features

This paper presents a methodology for Age-Invariant Face Recognition (AIFR), based on the optimization of deep learning features. The proposed method uses transfer learning to extract deep learning features from the unprocessed face images. To optimize the extracted features, a Genetic Algorithm (GA) procedure is designed to select the features most relevant to the problem of identifying a person based on his/her facial images over different ages. For classification, K-Nearest Neighbor (KNN) classifiers with different distance metrics are investigated, i.e., Correlation, Euclidean, Cosine, and Manhattan distance metrics. In the experiments, a Manhattan-distance KNN classifier achieves the best Rank-1 recognition rates of 86.2% and 96% on the standard FGNET and MORPH datasets, respectively. Compared to the state-of-the-art methods, our proposed method needs no preprocessing stages. In addition, the experiments show its superiority over other related methods.
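The four distance metrics named in this abstract can be compared in a small nearest-neighbor sketch. It assumes the standard textbook definitions (cosine and correlation distance as one minus the respective similarity) and treats Rank-1 recognition as a 1-nearest-neighbor lookup; the feature vectors and identity labels are hypothetical, and inputs are assumed to be non-zero, non-constant vectors.

```python
import math


def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine(a, b):
    """One minus cosine similarity; assumes neither vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def correlation(a, b):
    """Cosine distance of the mean-centered vectors; assumes non-constant vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine([x - mean_a for x in a], [y - mean_b for y in b])


METRICS = {"manhattan": manhattan, "euclidean": euclidean,
           "cosine": cosine, "correlation": correlation}


def rank1_identity(gallery, identities, probe, metric):
    """Return the identity of the gallery vector closest to `probe` under `metric`."""
    best = min(range(len(gallery)), key=lambda i: metric(gallery[i], probe))
    return identities[best]


# Hypothetical 3-dimensional "deep features" for two enrolled identities.
gallery = [(1.0, 2.0, 3.0), (3.0, 1.0, 0.5)]
identities = ["person_a", "person_b"]
probe = (1.1, 2.1, 2.7)
for name, metric in METRICS.items():
    print(name, rank1_identity(gallery, identities, probe, metric))
```

On real feature vectors the metrics can disagree, since Manhattan and Euclidean distances are sensitive to vector magnitude while cosine and correlation distances depend only on direction.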
