Neural Decoding of EEG Signals with Machine Learning: A Systematic Review

Electroencephalography (EEG) is a non-invasive technique used to record the brain’s evoked and induced electrical activity from the scalp. Artificial intelligence, particularly machine learning (ML) and deep learning (DL) algorithms, is increasingly being applied to EEG data for pattern analysis, group membership classification, and brain-computer interface purposes. This study aimed to systematically review recent advances in supervised ML and DL models for decoding and classifying EEG signals. Moreover, this article provides a comprehensive review of the state-of-the-art techniques used for EEG signal preprocessing and feature extraction. To this end, several academic databases were searched for relevant studies from the year 2000 to the present. Our results showed that the application of ML and DL in both mental workload and motor imagery tasks has received substantial attention in recent years. A total of 75% of DL studies applied convolutional neural networks with various learning algorithms, and 36% of ML studies achieved competitive accuracy by using a support vector machine algorithm. Wavelet transform was found to be the most common feature extraction method used for all types of tasks. We further examined the specific feature extraction methods and end classifier recommendations discovered in this systematic review.

A Structural Analysis Based Feature Extraction Method for OCR System For Myanmar Printed Document Images

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2012010102 ◽

2012 ◽

Vol 2 (1) ◽

pp. 16-41 ◽

Cited By ~ 1

Author(s):

Htwe Pa Pa Win ◽

Phyo Thu Thu Khine ◽

Khin Nwe Ni Tun

Keyword(s):

Feature Extraction ◽

Structural Analysis ◽

Character Recognition ◽

Optical Character Recognition ◽

Extraction Method ◽

Recognition Performance ◽

Extraction Methods ◽

Support Vector ◽

Svm Classifier ◽

Feature Extraction Method

This paper proposes a new feature extraction method for off-line recognition of Myanmar printed documents. One of the most important factors in achieving high recognition performance in an Optical Character Recognition (OCR) system is the selection of the feature extraction method. Existing OCR systems use various feature extraction methods because of the diversity of the scripts’ natures. One major contribution of this work is the design of logically rigorous coding-based features. To show the effectiveness of the proposed method, the paper assumes that the documents have been successfully segmented into characters and extracts features from these isolated Myanmar characters. The features are extracted using structural analysis of the Myanmar script. The experiments were carried out using a Support Vector Machine (SVM) classifier, and the results are compared with a previously proposed feature extraction method.

Headnote Prediction Using Machine Learning

The International Arab Journal of Information Technology ◽

10.34028/iajit/18/5/7 ◽

2021 ◽

Vol 18 (5) ◽

Author(s):

Sarmad Mahar ◽

Sahar Zafar ◽

Kamran Nishat

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Active Learning ◽

Text Classification ◽

Extraction Methods ◽

Text Summarization ◽

Training Data ◽

Second Step ◽

Support Vector ◽

Classification Algorithms

Headnotes are precise explanations and summaries of the legal points in an issued judgment. Law journals hire experienced lawyers to write these headnotes, which help the reader quickly determine the issue discussed in the case. A headnote comprises two parts: the first part gives the topic discussed in the judgment, and the second contains a summary of that judgment. In this thesis, we design, develop and evaluate headnote prediction using machine learning, without human involvement. We divided this task into a two-step process. In the first step, we predict the law points used in the judgment using text classification algorithms. The second step generates a summary of the judgment using text summarization techniques. To achieve this task, we created a databank by extracting data from different law sources in Pakistan, and we labelled training data generated from Pakistani law websites. We tested different feature extraction methods on judiciary data to improve our system. Using these feature extraction methods, we developed a dictionary of terminology for ease of reference and utility. Our approach achieves 65% accuracy using Linear Support Vector Classification with tri-grams and without a stemmer. Using active learning, our system can continuously improve its accuracy as users of the system provide more labelled examples.
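
As a rough illustration of the tri-gram features mentioned above, the sketch below builds a bag of word tri-grams from a sentence. The abstract does not say whether the tri-grams are word-level or character-level, so word-level is an assumption here, and the function name `word_ngrams` and the sample sentence are hypothetical. No stemming is applied, in line with the "without stemmer" setting.

```python
from collections import Counter

def word_ngrams(text, n=3):
    """Return the list of word n-grams in text (lower-cased, whitespace tokens)."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

# Hypothetical sentence, not taken from the paper's databank.
sentence = "The appeal is hereby dismissed"
trigrams = word_ngrams(sentence)

# A bag-of-tri-grams vector is just the count of each tri-gram.
bag = Counter(trigrams)
print(trigrams)
```

Counts like these would typically be fed to a linear classifier; the classifier itself is left out to keep the sketch dependency-free.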

How to Utilize My App Reviews? A Novel Topics Extraction Machine Learning Schema for Strategic Business Purposes

Entropy ◽

10.3390/e22111310 ◽

2020 ◽

Vol 22 (11) ◽

pp. 1310

Author(s):

Ioannis Triantafyllou ◽

Ioannis C. Drivas ◽

Georgios Giannakopoulos

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Software Maintenance ◽

Extraction Methods ◽

Machine Learning Algorithms ◽

Classification Problems ◽

Case Scenario ◽

Feature Extraction Method ◽

Term Extraction ◽

One Step

Acquiring knowledge about users’ opinions and what they say regarding specific features within an app constitutes a solid stepping stone for understanding their needs and concerns. App review utilization helps project management teams to identify threats and opportunities for app software maintenance, optimization and strategic marketing purposes. Nevertheless, classifying app user reviews to identify valuable gems of information for app software improvement is a complex and multidimensional issue. It requires foresight and multiple combinations of sophisticated text pre-processing, feature extraction and machine learning methods to efficiently classify app reviews into specific topics. Against this backdrop, we propose a novel feature engineering classification schema that is capable of identifying, more efficiently and earlier, terms within reviews that could be classified into specific topics. To this end, we present a novel feature extraction method, DEVMAX.DF, combined with different machine learning algorithms to propose a solution to app review classification problems. One step further, a simulation of a real case scenario takes place to validate the effectiveness of the proposed classification schema across different apps. After multiple experiments, results indicate that the proposed schema outperforms other term extraction methods such as TF.IDF and χ2 in classifying app reviews into topics. The paper thus contributes to the knowledge of researchers and practitioners, with the purpose of reinforcing their decision-making process within the realm of app review utilization.
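
For context, the TF.IDF baseline that the schema is compared against can be sketched in a few lines. This is not DEVMAX.DF, whose definition the abstract does not give. The weighting used here (relative term frequency times log(N / df)) is one common variant chosen as an assumption, and the function name `tf_idf` and the sample reviews are made up.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: score} dict per document.

    Score = (count / document length) * log(N / document frequency).
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of reviews that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        scores.append(
            {t: (c / len(tokens)) * math.log(n_docs / df[t]) for t, c in tf.items()}
        )
    return scores

# Made-up reviews for illustration only.
reviews = [
    "app crashes on login",
    "love the new login screen",
    "app crashes after update",
]
scores = tf_idf(reviews)
print(sorted(scores[0].items(), key=lambda kv: -kv[1]))
```

Terms that occur in fewer reviews receive higher weights, which is why "on" outranks "app" in the first review.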

Performance Evaluation of MadBoost on Face Detection

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.892.200 ◽

2019 ◽

Vol 892 ◽

pp. 200-209

Author(s):

Rayner Pailus ◽

Rayner Alfred

Keyword(s):

Feature Extraction ◽

Face Detection ◽

Recognition Performance ◽

Principal Component ◽

Haar Wavelet ◽

Extraction Methods ◽

Face Image ◽

Support Vector ◽

Feature Extraction Method ◽

Face Images

The Adaboost Viola-Jones method is a landmark in face detection, mainly because it is fast, lightweight and one of the easiest face detection methods to use. Viola-Jones uses Haar wavelet filters to detect face images and achieves almost 80% face detection accuracy. This paper discusses a proposed methodology and algorithms that use a larger library of filters to create more discriminative features among the images by processing 15 proposed Haar rectangular features (an extension of the 4 Haar wavelet filters of Viola-Jones) and using them in a multiple adaptive ensemble process for detecting face images. After facial detection, the process continues with normalization, applying feature extraction such as PCA combined with LDA or LPP to extract our weak learners’ wavelets for more classification features. After feature extraction, feature selection is proposed to index the extracted data. The extracted vectors are used to train and create MADBoost (Multiple Adaptive Diversified Boost), an improvement on Adaboost that uses multiple feature extraction methods combined with multiple classifiers and is able to capture, recognize and distinguish face images faster. MADBoost applies the ensemble approach with better weights for classification to produce better face recognition results. Three experiments were conducted to compare the performance of the proposed MADBoost with three other classifiers, Neural Network (NN), Support Vector Machine (SVM) and Adaboost, using Principal Component Analysis (PCA) as the feature extraction method. These experiments were tested against the obstacles of POIES (Pose, Obstruction, Illumination, Expression, Sizes). Based on the results obtained, MADBoost is found to improve recognition performance in terms of matching failures, incorrect matching, matching success percentages and acceptable time taken to perform the classification task.
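
To make the Haar rectangular features concrete, the sketch below evaluates a single two-rectangle feature with an integral image, the standard trick that makes Viola-Jones fast. It does not reproduce the paper's 15 features or MADBoost. The function names, the sign convention (left half minus right half) and the toy image are assumptions made for illustration.

```python
def integral_image(img):
    """Summed-area table with one row and one column of zero padding."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for r in range(h):
        row_sum = 0
        for c in range(w):
            row_sum += img[r][c]
            ii[r + 1][c + 1] = ii[r][c + 1] + row_sum
    return ii

def rect_sum(ii, top, left, height, width):
    """Sum of pixels in a rectangle, using four table lookups."""
    bottom, right = top + height, left + width
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]

def two_rect_horizontal(ii, top, left, height, width):
    """Left half minus right half of the window (width must be even)."""
    half = width // 2
    return (rect_sum(ii, top, left, height, half)
            - rect_sum(ii, top, left + half, height, half))

# Toy 4x4 image with a bright left half and a dark right half.
img = [
    [9, 9, 1, 1],
    [9, 9, 1, 1],
    [9, 9, 1, 1],
    [9, 9, 1, 1],
]
ii = integral_image(img)
feature = two_rect_horizontal(ii, 0, 0, 4, 4)
print(feature)
```

A boosted detector thresholds many such feature values, each acting as a weak learner.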

Classification of Multi-spectral Landsat Satellite Imagery Dataset using Feature Extraction Methods

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.d1589.029420 ◽

2020 ◽

Vol 9 (4) ◽

Keyword(s):

Land Use ◽

Feature Extraction ◽

Satellite Imagery ◽

Extraction Method ◽

Spatial Information ◽

Extraction Methods ◽

Support Vector ◽

Feature Extraction Method ◽

Landsat Satellite ◽

Spatial Maps

Feature extraction from multi-spectral Landsat satellite imagery datasets is essential for vegetation monitoring, urban planning, change assessment, and other land-use applications. The spatial information provided by remote sensing satellite imagery is helpful for planning and decision-making policies. The present study classifies the features of a multi-spectral Landsat satellite imagery dataset from different periods using feature extraction methods and produces spatial maps of the study area. The study analyzes which feature extraction method is appropriate for classifying orchards, vegetation, rangeland, agricultural land, wetland, water bodies, and urban land using a multi-temporal satellite dataset. Three feature extraction methods, support vector machine (SVM), minimum distance (MD), and maximum likelihood classifier (MLC), are used for supervised pixel-based classification of the medium-resolution (30 m) satellite dataset. MLC achieved accuracies of 86.29% and 93% for 2003 and 2017, respectively, and SVM achieved 86.37% and 90%. The results show that the MLC and SVM classifiers perform similarly, and both perform better than the MD classifier, for land-use/cover feature classification. The classified spatial maps provide essential spatial information on the land-use changes that occurred during the last 15 years (2003 to 2017).
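
Of the three methods named above, minimum distance is the simplest: each pixel is assigned to the class whose mean spectral vector is closest. The sketch below assumes Euclidean distance and uses two made-up bands with made-up reflectance values; the function names are hypothetical and nothing here comes from the study's Landsat data.

```python
import math

def fit_centroids(samples, labels):
    """Mean feature vector per class."""
    sums, counts = {}, {}
    for x, y in zip(samples, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}

def predict(centroids, x):
    """Label of the nearest class centroid (Euclidean distance)."""
    return min(centroids, key=lambda y: math.dist(centroids[y], x))

# Made-up training pixels: (red, near-infrared) reflectance.
train_pixels = [(0.05, 0.02), (0.07, 0.03), (0.04, 0.50), (0.06, 0.45)]
train_labels = ["water", "water", "vegetation", "vegetation"]

centroids = fit_centroids(train_pixels, train_labels)
label = predict(centroids, (0.05, 0.40))
print(label)
```

Unlike MLC, this rule ignores class variance and covariance, which is one plausible reason for its weaker results.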

A Comparative Analysis of Time-frequency Feature Extraction Techniques for Large Scale Electroencephalogram Data

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2021/031012021 ◽

2021 ◽

Vol 10 (1) ◽

pp. 14-24

Keyword(s):

Feature Extraction ◽

Large Scale ◽

Extraction Methods ◽

Kernel Functions ◽

The Body ◽

Support Vector ◽

Discrete Wavelet ◽

Eeg Signals ◽

Time Frequency ◽

The Impact

Recognition of human emotions is a fascinating research field that motivates many researchers to use various approaches, such as facial expression, speech or body gesture. Electroencephalogram (EEG) is another approach to recognizing human emotion through brain signals and has offered promising findings. Although EEG signals provide detailed information on human emotional states, the analysis of the non-linear and chaotic characteristics of EEG signals is a substantial problem. The main challenge remains analyzing EEG signals to extract relevant features in order to achieve optimum classification performance. Researchers have developed various feature extraction methods, which can mainly be categorized as time, frequency or time-frequency based. Yet there are numerous settings that could affect the performance of any model. In this paper, we investigated the performance of the Discrete Wavelet Transform (DWT) and Discrete Wavelet Packet Transform (DWPT), which are time-frequency domain methods, using Support Vector Machine (SVM) and k-Nearest Neighbor (KNN) classification techniques. Different SVM kernel functions and KNN distance metrics are tested in this study using subject-dependent and subject-independent approaches. The experiment is implemented using the publicly available DEAP dataset. The experimental results show that DWT is mostly suitable with a weighted KNN classifier, while DWPT reported better results when tested using a linear SVM classifier to accurately classify the EEG signals in the subject-dependent approach. Consistent results are observed for DWT-KNN in the subject-independent approach; however, SVM works better with quadratic kernel functions. These results indicate that further investigation is warranted to examine the impact of different method settings in analyzing large-scale EEG data.
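
A minimal sketch of DWT-based feature extraction follows. The abstract does not name the mother wavelet or the exact features, so the Haar wavelet and sub-band energies are assumptions, and the function names and the short sample signal are hypothetical. In practice a wavelet library would normally be used instead of this hand-rolled version.

```python
import math

def haar_dwt(signal):
    """One level of the orthonormal Haar DWT: returns (approximation, detail)."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail

def subband_energies(signal, levels):
    """Energy of each detail band (finest first), then of the final approximation."""
    energies = []
    current = list(signal)
    for _ in range(levels):
        current, detail = haar_dwt(current)
        energies.append(sum(d * d for d in detail))
    energies.append(sum(a * a for a in current))
    return energies

# Made-up 8-sample epoch standing in for one EEG channel.
epoch = [4.0, 2.0, 5.0, 5.0, 1.0, 3.0, 0.0, 2.0]
features = subband_energies(epoch, levels=2)
print(features)
```

Because the transform is orthonormal, the band energies sum to the energy of the original signal, which is a handy sanity check. A feature vector like this, computed per channel, is the kind of input an SVM or KNN classifier would receive.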

Introduction to Motor Imagery-Based Brain-Computer Interface

Biomedical and Clinical Engineering for Healthcare Advancement - Advances in Bioinformatics and Biomedical Engineering ◽

10.4018/978-1-7998-0326-3.ch009 ◽

2020 ◽

pp. 168-197

Author(s):

Nitesh Singh Malan ◽

Shiru Sharma

Keyword(s):

Feature Extraction ◽

Motor Imagery ◽

Brain Computer Interface ◽

Extraction Methods ◽

Computer Interface ◽

Support Vector ◽

Svm Classifier ◽

Complex Wavelet Transform ◽

Feature Extraction Method ◽

Time Frequency

In this chapter, the motor imagery (MI)-based brain-computer interface (BCI) is introduced, together with an explanation of the key components required to design a practical BCI device. Its application to the medical and nonmedical sectors is discussed in detail. In the experimental study, a feature extraction method using time, frequency, and phase analysis of motor imagery EEG is presented. For the classification of MI tasks, EEG signals are decomposed using a dual-tree complex wavelet transform (DTCWT), and then time, frequency, and phase features are extracted. The proposed method is validated using BCI Competition IV dataset 2b. A support vector machine (SVM) classifier is used to perform the classification task. The performance of the proposed method is compared with that of standard feature extraction methods. The proposed scheme achieved an average classification accuracy of 82.81%, which is higher than that obtained by the other methods.

Twitter Sentiment Analysis Using Different Machine Learning and Feature Extraction Techniques

Al-Nahrain Journal of Science ◽

10.22401/anjs.24.3.08 ◽

2021 ◽

Vol 24 (3) ◽

pp. 50-54

Author(s):

Mohammad W. Habib ◽

Zainab N. Sultani

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Logistic Regression ◽

Sentiment Analysis ◽

Extraction Methods ◽

Supervised Machine Learning ◽

Support Vector ◽

Extraction Techniques ◽

Machine Learning Approach ◽

Negative Sentiment

Twitter is considered a significant source for exchanging information and opinion in today's business. Analysis of this data is critical and complex due to the size of the dataset. Sentiment analysis is adopted to understand and analyze the sentiment of such data. In this paper, a machine learning approach is employed to classify the data as positive or negative sentiment (opinion). Different arrangements of preprocessing techniques are applied to clean the tweets, and various feature extraction methods are used to extract and reduce the dimension of the tweets' feature vector. The Sentiment140 dataset is used; because it consists of sentiment labels and tweets, supervised machine learning models are used, specifically Logistic Regression, Naive Bayes, and Support Vector Machine. According to the experimental results, Logistic Regression outperformed the other models with all feature extraction techniques.

Comparison of different feature extraction methods for applicable automated ICD coding

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-022-01753-5 ◽

2022 ◽

Vol 22 (1) ◽

Author(s):

Zhao Shuai ◽

Diao Xiaolin ◽

Yuan Jing ◽

Huo Yanni ◽

Cui Meng ◽

...

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Extraction Methods ◽

Fine Tuning ◽

Medical Texts ◽

Optimal Method ◽

Feature Extraction Method ◽

Icd Codes ◽

Shed Light ◽

Icd Coding

Background: Automated ICD coding of medical texts via machine learning has been a hot topic. Related studies from the medical field rely heavily on conventional bag-of-words (BoW) as the feature extraction method and do not commonly use more complicated methods, such as word2vec (W2V) and large pretrained models like BERT. This study aimed to uncover the most effective feature extraction methods for coding models by comparing BoW, W2V and BERT variants. Methods: We experimented with a Chinese dataset from Fuwai Hospital, which contains 6947 records and 1532 unique ICD codes, and a public Spanish dataset, which contains 1000 records and 2557 unique ICD codes. We designed coding tasks with different code frequency thresholds (denoted as $f_s$), with a lower threshold indicating a more complex task. Using traditional classifiers, we compared BoW, W2V and BERT variants on these coding tasks. Results: When $f_s$ was equal to or greater than 140 for the Fuwai dataset and 60 for the Spanish dataset, the BERT variants with the whole network fine-tuned were the best method, leading to a Micro-F1 of 93.9% for the Fuwai data when $f_s=200$ and a Micro-F1 of 85.41% for the Spanish dataset when $f_s=180$. When $f_s$ fell below 140 for the Fuwai dataset and 60 for the Spanish dataset, BoW turned out to be the best, leading to a Micro-F1 of 83% for the Fuwai dataset when $f_s=20$ and a Micro-F1 of 39.1% for the Spanish dataset when $f_s=20$. Our experiments also showed that both the BERT variants and BoW possessed good interpretability, which is important for medical applications of coding models. Conclusions: This study sheds light on building promising machine learning models for automated ICD coding by revealing the most effective feature extraction methods. Concretely, our results indicate that fine-tuning the whole network of the BERT variants was the optimal method for tasks covering only frequent codes, especially codes that represented unspecified diseases, while BoW was the best for tasks involving both frequent and infrequent codes. The frequency threshold at which the best-performing method changed differed between datasets due to factors like language and codeset.
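
The Micro-F1 figures reported above pool true positives, false positives and false negatives over all records and codes before computing F1. A small sketch of that calculation follows, assuming each record's labels are given as a set of codes. The function name `micro_f1` is hypothetical, and the codes and predictions are invented for illustration rather than drawn from either dataset.

```python
def micro_f1(true_sets, pred_sets):
    """Micro-averaged F1 over multi-label predictions given as sets of codes."""
    tp = fp = fn = 0
    for true, pred in zip(true_sets, pred_sets):
        tp += len(true & pred)   # codes correctly predicted
        fp += len(pred - true)   # codes predicted but not assigned
        fn += len(true - pred)   # codes assigned but missed
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

# Invented gold labels and predictions for three records.
gold = [{"I10", "E11.9"}, {"I25.1"}, {"I10"}]
pred = [{"I10"}, {"I25.1", "I10"}, {"I10"}]
score = micro_f1(gold, pred)
print(score)
```

Because counts are pooled, frequent codes dominate the score, which is worth keeping in mind when comparing tasks with different frequency thresholds.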

Improving Motor Imagery EEG Signals Classification Accuracy with CSP by Available Machine Learning Approach

Journal of Engineering Science ◽

10.3329/jes.v12i2.54632 ◽

2021 ◽

Vol 12 (2) ◽

pp. 67-77

Author(s):

Umme Farhana ◽

Mst Jannatul Ferdous

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Motor Imagery ◽

Classification Accuracy ◽

Support Vector ◽

Band Pass Filter ◽

K Nearest Neighbor ◽

Eeg Signals ◽

Pass Filter ◽

Linear Discriminant

In brain-computer interface (BCI) systems, electroencephalography (EEG) signals give a motor-disabled person a pathway to communicate with the outside world using brain signals and a computer. EEG signals of different motor imagery (MI) movements can be differentiated using an effective classification technique to aid a motor-disabled patient. The purpose of this paper is to accurately classify the EEG signals of two different MI movement tasks: movement of the left hand and movement of the right foot. For this purpose we used a publicly available dataset. Since feature extraction is an important step in classification, we used the popular common spatial pattern (CSP) method for spatial feature extraction. Two machine learning classifiers, support vector machine (SVM) and K-nearest neighbor (KNN), were used to verify the proposed method. The highest average results for classification accuracy, sensitivity, and specificity were 95.55%, 98.73% and 92.38%, respectively, with SVM and 93.5%, 98.73% and 90.15% with KNN, obtained when the signals were filtered with a Butterworth band-pass filter with a [10–30] Hz passband. By comparison, earlier work on this dataset using CSP for feature extraction and Fisher linear discriminant analysis (FLDA) for classification reached an accuracy of 89.4% in the [10–30] Hz band. Journal of Engineering Science 12(2), 2021, 67-77
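
The three reported metrics all derive from the binary confusion matrix. The sketch below shows the arithmetic on invented labels. Which of the two MI classes counts as "positive" is not stated in the abstract, so treating the left-hand class as positive is an assumption, and the function name `binary_metrics` is hypothetical.

```python
def binary_metrics(y_true, y_pred, positive):
    """Accuracy, sensitivity and specificity; assumes both classes occur in y_true."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),   # true positive rate
        "specificity": tn / (tn + fp),   # true negative rate
    }

# Invented trial labels: five left-hand trials and three right-foot trials.
y_true = ["hand", "hand", "hand", "hand", "hand", "foot", "foot", "foot"]
y_pred = ["hand", "hand", "hand", "hand", "foot", "foot", "foot", "hand"]
metrics = binary_metrics(y_true, y_pred, positive="hand")
print(metrics)
```

Swapping the positive class swaps sensitivity and specificity while leaving accuracy unchanged.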