Comparison and Evaluation of Different Methods for the Feature Extraction from Educational Contents

This paper analyses the capabilities of different techniques to build a semantic representation of educational digital resources. Educational digital resources are modeled using the Learning Object Metadata (LOM) standard, and these semantic representations can be obtained from different LOM fields, like the title, description, among others, in order to extract the features/characteristics from the digital resources. The feature extraction methods used in this paper are the Best Matching 25 (BM25), the Latent Semantic Analysis (LSA), Doc2Vec, and the Latent Dirichlet allocation (LDA). The utilization of the features/descriptors generated by them are tested in three types of educational digital resources (scientific publications, learning objects, patents), a paraphrase corpus and two use cases: in an information retrieval context and in an educational recommendation system. For this analysis are used unsupervised metrics to determine the feature quality proposed by each one, which are two similarity functions and the entropy. In addition, the paper presents tests of the techniques for the classification of paraphrases. The experiments show that according to the type of content and metric, the performance of the feature extraction methods is very different; in some cases are better than the others, and in other cases is the inverse.

Download Full-text

Analysis of PCA Based Feature Extraction Methods for Classification of Hyperspectral Image

2019 2nd International Conference on Innovation in Engineering and Technology (ICIET) ◽

10.1109/iciet48527.2019.9290629 ◽

2019 ◽

Author(s):

U. A. Md. Ehsan Ali ◽

Md. Ali Hossain ◽

Md. Rashedul Islam

Keyword(s):

Feature Extraction ◽

Hyperspectral Image ◽

Extraction Methods

Download Full-text

Feature Extraction and Classification of Citrus Juice by Using an Enhanced L-KSVD on Data Obtained from Electronic Nose

Sensors ◽

10.3390/s19040916 ◽

2019 ◽

Vol 19 (4) ◽

pp. 916 ◽

Cited By ~ 2

Author(s):

Wen Cao ◽

Chunmei Liu ◽

Pengfei Jia

Keyword(s):

Feature Extraction ◽

Kernel Function ◽

Electronic Nose ◽

Classification Accuracy ◽

Extraction Methods ◽

Object Function ◽

Optimal Value ◽

Processed Products

Aroma plays a significant role in the quality of citrus fruits and processed products. The detection and analysis of citrus volatiles can be measured by an electronic nose (E-nose); in this paper, an E-nose is employed to classify the juice which is stored for different days. Feature extraction and classification are two important requirements for an E-nose. During the training process, a classifier can optimize its own parameters to achieve a better classification accuracy but cannot decide its input data which is treated by feature extraction methods, so the classification result is not always ideal. Label consistent KSVD (L-KSVD) is a novel technique which can extract the feature and classify the data at the same time, and such an operation can improve the classification accuracy. We propose an enhanced L-KSVD called E-LCKSVD for E-nose in this paper. During E-LCKSVD, we introduce a kernel function to the traditional L-KSVD and present a new initialization technique of its dictionary; finally, the weighted coefficients of different parts of its object function is studied, and enhanced quantum-behaved particle swarm optimization (EQPSO) is employed to optimize these coefficients. During the experimental section, we firstly find the classification accuracy of KSVD, and L-KSVD is improved with the help of the kernel function; this can prove that their ability of dealing nonlinear data is improved. Then, we compare the results of different dictionary initialization techniques and prove our proposed method is better. Finally, we find the optimal value of the weighted coefficients of the object function of E-LCKSVD that can make E-nose reach a better performance.

Download Full-text

Machine learning, waveform preprocessing and feature extraction methods for classification of acoustic startle waveforms

MethodsX ◽

10.1016/j.mex.2020.101166 ◽

2021 ◽

Vol 8 ◽

pp. 101166

Author(s):

Timothy J. Fawcett ◽

Chad S. Cooper ◽

Ryan J. Longenecker ◽

Joseph P. Walton

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Acoustic Startle ◽

Extraction Methods

Download Full-text

ECG Beats Classification Using Mixture of Features

International Scholarly Research Notices ◽

10.1155/2014/178436 ◽

2014 ◽

Vol 2014 ◽

pp. 1-12 ◽

Cited By ~ 21

Author(s):

Manab Kumar Das ◽

Samit Ari

Keyword(s):

Feature Extraction ◽

Extraction Methods ◽

Ectopic Beat ◽

Extraction Techniques ◽

Efficient System ◽

Ecg Signals ◽

Temporal Features ◽

S Transform ◽

Electrocardiogram Ecg

Classification of electrocardiogram (ECG) signals plays an important role in clinical diagnosis of heart disease. This paper proposes the design of an efficient system for classification of the normal beat (N), ventricular ectopic beat (V), supraventricular ectopic beat (S), fusion beat (F), and unknown beat (Q) using a mixture of features. In this paper, two different feature extraction methods are proposed for classification of ECG beats: (i) S-transform based features along with temporal features and (ii) mixture of ST and WT based features along with temporal features. The extracted feature set is independently classified using multilayer perceptron neural network (MLPNN). The performances are evaluated on several normal and abnormal ECG signals from 44 recordings of the MIT-BIH arrhythmia database. In this work, the performances of three feature extraction techniques with MLP-NN classifier are compared using five classes of ECG beat recommended by AAMI (Association for the Advancement of Medical Instrumentation) standards. The average sensitivity performances of the proposed feature extraction technique for N, S, F, V, and Q are 95.70%, 78.05%, 49.60%, 89.68%, and 33.89%, respectively. The experimental results demonstrate that the proposed feature extraction techniques show better performances compared to other existing features extraction techniques.

Download Full-text

A Comparison of Feature Extraction Methods for the Classification of Dynamic Activities From Accelerometer Data

IEEE Transactions on Biomedical Engineering ◽

10.1109/tbme.2008.2006190 ◽

2009 ◽

Vol 56 (3) ◽

pp. 871-879 ◽

Cited By ~ 304

Author(s):

Stephen J. Preece ◽

John Yannis Goulermas ◽

Laurence P. J. Kenney ◽

David Howard

Keyword(s):

Feature Extraction ◽

Extraction Methods ◽

Accelerometer Data

Download Full-text

Superpixel-Based Minimum Noise Fraction Feature Extraction for Classification of Hyperspectral Images

Traitement du signal ◽

10.18280/ts.370514 ◽

2020 ◽

Vol 37 (5) ◽

pp. 812-822

Author(s):

Behnam Asghari Beirami ◽

Mehdi Mokhtarzade

Keyword(s):

Feature Extraction ◽

Extraction Methods ◽

Hyperspectral Images ◽

Support Vector ◽

Minimum Noise Fraction ◽

Vector Machines ◽

Noise Covariance ◽

Noise Fraction ◽

Minimum Noise

In this paper, a novel feature extraction technique called SuperMNF is proposed, which is an extension of the minimum noise fraction (MNF) transformation. In SuperMNF, each superpixel has its own transformation matrix and MNF transformation is performed on each superpixel individually. The basic idea behind the SuperMNF is that each superpixel contains its specific signal and noise covariance matrices which are different from the adjacent superpixels. The extracted features, owning spatial-spectral content and provided in the lower dimension, are classified by maximum likelihood classifier and support vector machines. Experiments that are conducted on two real hyperspectral images, named Indian Pines and Pavia University, demonstrate the efficiency of SuperMNF since it yielded more promising results than some other feature extraction methods (MNF, PCA, SuperPCA, KPCA, and MMP).

Download Full-text

A comparative study of feature extraction methods in defect classification of mangoes using neural network

2016 Second International Conference on Cognitive Computing and Information Processing (CCIP) ◽

10.1109/ccip.2016.7802873 ◽

2016 ◽

Cited By ~ 7

Author(s):

Vani Ashok ◽

D.S. Vinod

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Comparative Study ◽

Extraction Methods ◽

Defect Classification

Download Full-text

Performance Measures of Respiratory Disease Classification using Deep Learning

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.35109 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 884-892

Author(s):

G. Rama Janani

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Deep Learning ◽

Performance Measures ◽

Respiratory Infections ◽

Respiratory Illness ◽

Extraction Methods ◽

Disease Classification ◽

Deep Cnn

The paper is based on classification of respiratory illness like covid 19 and pneumonia by using deep learning. The symptoms of COVID-19 and pneumonia are similar. Due to this, it is often difficult to identify what is causing your condition without being tested for COVID-19 or other respiratory infections. To find out how COVID-19 and pneumonia differs from one another, this paper presents that a novel Convolutional Neural Network in Tensor Flow and Keras based Covid-19 pneumonia classification. The proposed system supported implements CNN using Pneumonia images to classify the Covid-19, normal, pneumonia. The knowledge from these studies can potentially help in diagnosis of the concerned disease. It is predicted that the success of the anticipated results will increase if the CNN method is supported by adding extra feature extraction methods for classifying covid-19 and pneumonia successfully thereby improving the efficacy and potential of using deep CNN to pictures.

Download Full-text

An Automatic Text Document Classification using Modified Weight and Semantic Method

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.k2123.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 2608-2622

Keyword(s):

Feature Extraction ◽

Text Mining ◽

Crime Rate ◽

Semantic Analysis ◽

Extraction Methods ◽

Support Vector ◽

Text Document ◽

Use Of Resources ◽

Benchmark Datasets ◽

Text Document Classification

Text mining is the process of transformation of useful information from the structured or unstructured sources. In text mining, feature extraction is one of the vital parts. This paper analyses some of the feature extraction methods and proposed the enhanced method for feature extraction. Term Frequency-Inverse Document Frequency(TF-IDF) method only assigned weight to the term based on the occurrence of the term. Now, it is enlarged to increases the weight of the most important words and decreases the weight of the less important words. This enlarged method is called as M-TF-IDF. This method does not consider the semantic similarity between the terms. Hence, Latent Semantic Analysis(LSA) method is used for feature extraction and dimensionality reduction. To analyze the performance of the proposed feature extraction methods, two benchmark datasets like Reuter-21578-R8 and 20 news group and two real time datasets like descriptive type answer dataset and crime news dataset are used. This paper used this proposed method for descriptive type answer evaluation. Manual evaluation of descriptive type paper may lead to discrepancy in the mark. It is eliminated by using this type of evaluation. The proposed method has been tested with answers written by learners of our department. It allows more accurate assessment and more effective evaluation of the learning process. This method has a lot of benefits such as reduced time and effort, efficient use of resources, reduced burden on the faculty and increased reliability of results. This proposed method also used to analyze the documents which contain the details about in and around Madurai city. Madurai is a sensitive place in the southern area of Tamilnadu in India. It has been collected from the Hindu archives. This news document has been classified like crime or not. It is also used to check in which month most crime rate occurs. This analysis used to reduce the crime rate in future. The classification algorithm Support Vector Machine(SVM) used to classify the dataset. The experimental analysis and results show that the performances of the proposed feature extraction methods are outperforming the existing feature extraction methods.

Download Full-text

Analyzing the Effectiveness of the Brain–Computer Interface for Task Discerning Based on Machine Learning

Sensors ◽

10.3390/s20082403 ◽

2020 ◽

Vol 20 (8) ◽

pp. 2403

Author(s):

Jakub Browarczyk ◽

Adam Kurowski ◽

Bozena Kostek

Keyword(s):

Feature Extraction ◽

Principal Component ◽

Component Analysis ◽

Mental States ◽

Extraction Methods ◽

Support Vector ◽

Discrete Wavelet ◽

K Nearest Neighbors ◽

Vector Machines

The aim of the study is to compare electroencephalographic (EEG) signal feature extraction methods in the context of the effectiveness of the classification of brain activities. For classification, electroencephalographic signals were obtained using an EEG device from 17 subjects in three mental states (relaxation, excitation, and solving logical task). Blind source separation employing independent component analysis (ICA) was performed on obtained signals. Welch’s method, autoregressive modeling, and discrete wavelet transform were used for feature extraction. Principal component analysis (PCA) was performed in order to reduce the dimensionality of feature vectors. k-Nearest Neighbors (kNN), Support Vector Machines (SVM), and Neural Networks (NN) were employed for classification. Precision, recall, F1 score, as well as a discussion based on statistical analysis, were shown. The paper also contains code utilized in preprocessing and the main part of experiments.

Download Full-text