scholarly journals Analysis of EEG signals using Machine Learning for the Detection and Diagnosis of Epilepsy

Electroencephalogram (EEG) is one of the most commonly used tools for epilepsy detection. In this paper we have presented two methods for the diagnosis of epilepsy using machine learning techniques.EEG waveforms have five different kinds of frequency bands. Out of which only two namely theta and gamma bands carry epileptic seizure information. Our model determines the statistical features like mean, variance, maximum, minimum, kurtosis, and skewness from the raw data set. This reduces the mathematical complexities and time consumption of the feature extraction method. It then uses a Logistic regression model and decision tree model to classify whether a person is epileptic or not. After the implementation of the machine learning models, parameters like accuracy, sensitivity, and recall have been found. The results for the same are analyzed in detail in this paper. Epileptic seizures cause severe damage to the brain which affects the health of a person. Our key objective from this paper is to help in the early prediction and detection of epilepsy so that preventive interventions can be provided and precautionary measures are taken to prevent the patient from suffering any severe damage

2020 ◽  
Author(s):  
Rebecca O'Donovan ◽  
Emre Sezgin ◽  
Sven Bambach ◽  
Eric Butter ◽  
Simon Lin

BACKGROUND Qualitative self- or parent-reports used in assessing children’s behavioral disorders are often inconvenient to collect and can be misleading due to missing information, rater biases, and limited validity. A data-driven approach to quantify behavioral disorders could alleviate these concerns. This study proposes a machine learning approach to identify screams in voice recordings that avoids the need to gather large amounts of clinical data for model training. OBJECTIVE The goal of this study is to evaluate if a machine learning model trained only on publicly available audio data sets could be used to detect screaming sounds in audio streams captured in an at-home setting. METHODS Two sets of audio samples were prepared to evaluate the model: a subset of the publicly available AudioSet data set and a set of audio data extracted from the TV show Supernanny, which was chosen for its similarity to clinical data. Scream events were manually annotated for the Supernanny data, and existing annotations were refined for the AudioSet data. Audio feature extraction was performed with a convolutional neural network pretrained on AudioSet. A gradient-boosted tree model was trained and cross-validated for scream classification on the AudioSet data and then validated independently on the Supernanny audio. RESULTS On the held-out AudioSet clips, the model achieved a receiver operating characteristic (ROC)–area under the curve (AUC) of 0.86. The same model applied to three full episodes of Supernanny audio achieved an ROC-AUC of 0.95 and an average precision (positive predictive value) of 42% despite screams only making up 1.3% (n=92/7166 seconds) of the total run time. CONCLUSIONS These results suggest that a scream-detection model trained with publicly available data could be valuable for monitoring clinical recordings and identifying tantrums as opposed to depending on collecting costly privacy-protected clinical data for model training.


2021 ◽  
Author(s):  
Debarati Dey Roy ◽  
Debashis De

Abstract Cardio vascular disease or alternatively heart disease is the primitive cause of death all around the world. Last few decades, it was observed that maximum death cases occurred due to heart failure. The heart failure death cases are associated with many risk factors for example high blood pressure, cholesterol level, sugar level etcetera. Therefore, it is advisable that regular and early diagnosis of these factors may reduce the risk of heart failure and hence achieve prompt disease management service. A commonly used technique to process these enormous medical data is called data mining, which help the researchers in health care domain. Several machine learning algorithms are used to analyses these data and help to design the best-fit model for early detection of heart diseases. This research paper contributes various attributes related to heart mal functioning and build the best-fit model using supervised learning algorithm such as various tree (fine tree, medium tree etc), Gaussian Naïve Bayes, Coarse KNN, Medium Gaussian SVM algorithms. In this paper, we used the data set from Kaggle.com. These data set comprises with total 732 instances along with 5 attributes. All these 5 attributes are to be considered for testing purpose and also to find out the best fit model for prediction of heart disease. In this research article we also compare the various classification models based on supervised learning algorithms. Based on the performance and accuracy rate we therefore, choose ‘Medium Tree’ model as the best-fit model. Maximum accuracy is obtained for ‘Medium Tree’ model. The confusion matrix for each model are calculated and analyze.


2022 ◽  
Vol 2161 (1) ◽  
pp. 012020
Author(s):  
Sohit Kummar ◽  
Asutosh Mohanty ◽  
Jyotsna ◽  
Sudeshna Chakraborty

Abstract Coronavirus (Covid-19) pandemic has impacted the whole world and has forced health emergencies internationally. The contact of this pandemic has been fallen over almost all the development sectors. A lot of precautionary measures have been taken to control the Covid-19 spread, where wearing a face mask is an essential precaution. Wearing a face mask correctly has been essential in controlling the Covid-19 transmission. Moreover, this research aims to detect the face mask with fine-grained wearing states: face with the correct mask and face without mask. Our work has two challenging tasks due to two main reasons firstly the presence of augmented data set available in the online market and the training of large datasets. This paper represents a mobile application for face mask detection. The fully automated Machine Learning Cloud service known as Google Cloud ML API is used for training the model in TensorFlow file format. This paper highlights the efficiency of the ML model. Additionally, this paper examines the advancement of the cloud technology used for machine learning over the traditional coding methods.


Mathematics ◽  
2020 ◽  
Vol 8 (8) ◽  
pp. 1311
Author(s):  
Qiyi He ◽  
Xiaolin Meng ◽  
Rong Qu ◽  
Ruijie Xi

Connected and Autonomous Vehicle (CAV)-related initiatives have become some of the fastest expanding in recent years, and have started to affect the daily lives of people. More and more companies and research organizations have announced their initiatives, and some have started CAV road trials. Governments around the world have also introduced policies to support and accelerate the deployments of CAVs. Along these, issues such as CAV cyber security have become predominant, forming an essential part of the complications of CAV deployment. There is, however, no universally agreed upon or recognized framework for CAV cyber security. In this paper, following the UK CAV cyber security principles, we propose a UML (Unified Modeling Language)-based CAV cyber security framework, and based on which we classify the potential vulnerabilities of CAV systems. With this framework, a new CAV communication cyber-attack data set (named CAV-KDD) is generated based on the widely tested benchmark data set KDD99. This data set focuses on the communication-based CAV cyber-attacks. Two classification models are developed, using two machine learning algorithms, namely Decision Tree and Naive Bayes, based on the CAV-KDD training data set. The accuracy, precision and runtime of these two models when identifying each type of communication-based attacks are compared and analysed. It is found that the Decision Tree model requires a shorter runtime, and is more appropriate for CAV communication attack detection.


2017 ◽  
Author(s):  
Rishav Kumar ◽  
Rishi Raj Singh Jhelumi ◽  
Achintye Madhav Singh ◽  
Prasoon Kumar

AbstractEpilepsy is one of the major neurological disorders affecting nearly 1 percentage of the global population. The major blunt is born by under developed and developing countries due to expensive treatment of epileptic conditions. Further, the lack of proper forecasting methods for an occurrence of epileptic seizures in epileptic-drug resistant patients or patients not amenable for surgery affects their psychological behaviour and restricts their daily activities. The forecasting is usually performed by human experts that leave a wide gap for human-bias and human error. Therefore, in the current work, we have evaluated the efficiency of several machine learning algorithms to automatically identify the preictal patterns corresponding to epileptic seizures from intracranial EEG signals. The robustness of the machine learning algorithms were tested after the data set was pre-processed using carefully chosen feature engineering strategies viz. denoised Fourier transforms as well as cross-correlation across electrodes in time and frequency domain. Extensive experimentations were carried out to determine the best combination of feature engineering techniques and machine learning algorithms. The best combination of feature engineering techniques and machine learning algorithm resulted in 0.7685 AUC (Area under the Receiver Operating Characteristic curve) on the random test samples. The suggested approach was fairly good at prediction of epilepsy in random samples and therefore, it can be used in epileptic seizure forecasting in patients where medication/surgery is ineffective. Eventually, our strategy reveals a robust method for brain disorders forecasting from EEGs.


10.2196/18279 ◽  
2020 ◽  
Vol 4 (6) ◽  
pp. e18279 ◽  
Author(s):  
Rebecca O'Donovan ◽  
Emre Sezgin ◽  
Sven Bambach ◽  
Eric Butter ◽  
Simon Lin

Background Qualitative self- or parent-reports used in assessing children’s behavioral disorders are often inconvenient to collect and can be misleading due to missing information, rater biases, and limited validity. A data-driven approach to quantify behavioral disorders could alleviate these concerns. This study proposes a machine learning approach to identify screams in voice recordings that avoids the need to gather large amounts of clinical data for model training. Objective The goal of this study is to evaluate if a machine learning model trained only on publicly available audio data sets could be used to detect screaming sounds in audio streams captured in an at-home setting. Methods Two sets of audio samples were prepared to evaluate the model: a subset of the publicly available AudioSet data set and a set of audio data extracted from the TV show Supernanny, which was chosen for its similarity to clinical data. Scream events were manually annotated for the Supernanny data, and existing annotations were refined for the AudioSet data. Audio feature extraction was performed with a convolutional neural network pretrained on AudioSet. A gradient-boosted tree model was trained and cross-validated for scream classification on the AudioSet data and then validated independently on the Supernanny audio. Results On the held-out AudioSet clips, the model achieved a receiver operating characteristic (ROC)–area under the curve (AUC) of 0.86. The same model applied to three full episodes of Supernanny audio achieved an ROC-AUC of 0.95 and an average precision (positive predictive value) of 42% despite screams only making up 1.3% (n=92/7166 seconds) of the total run time. Conclusions These results suggest that a scream-detection model trained with publicly available data could be valuable for monitoring clinical recordings and identifying tantrums as opposed to depending on collecting costly privacy-protected clinical data for model training.


Author(s):  
. Anika ◽  
Navpreet Kaur

The paper exhibits a formal audit on early detection of heart disease which are the major cause of death. Computational science has potential to detect disease in prior stages automatically. With this review paper we describe machine learning for disease detection. Machine learning is a method of data analysis that automates analytical model building.Various techniques develop to predict cardiac disease based on cases through MRI was developed. Automated classification using machine learning. Feature extraction method using Cell Profiler and GLCM. Cell Profiler a public domain software, freely available is flourished by the Broad Institute's Imaging Platform and Glcm is a statistical method of examining texture .Various techniques to detect cardio vascular diseases.


2020 ◽  
Author(s):  
Marc Philipp Bahlke ◽  
Natnael Mogos ◽  
Jonny Proppe ◽  
Carmen Herrmann

Heisenberg exchange spin coupling between metal centers is essential for describing and understanding the electronic structure of many molecular catalysts, metalloenzymes, and molecular magnets for potential application in information technology. We explore the machine-learnability of exchange spin coupling, which has not been studied yet. We employ Gaussian process regression since it can potentially deal with small training sets (as likely associated with the rather complex molecular structures required for exploring spin coupling) and since it provides uncertainty estimates (“error bars”) along with predicted values. We compare a range of descriptors and kernels for 257 small dicopper complexes and find that a simple descriptor based on chemical intuition, consisting only of copper-bridge angles and copper-copper distances, clearly outperforms several more sophisticated descriptors when it comes to extrapolating towards larger experimentally relevant complexes. Exchange spin coupling is similarly easy to learn as the polarizability, while learning dipole moments is much harder. The strength of the sophisticated descriptors lies in their ability to linearize structure-property relationships, to the point that a simple linear ridge regression performs just as well as the kernel-based machine-learning model for our small dicopper data set. The superior extrapolation performance of the simple descriptor is unique to exchange spin coupling, reinforcing the crucial role of choosing a suitable descriptor, and highlighting the interesting question of the role of chemical intuition vs. systematic or automated selection of features for machine learning in chemistry and material science.


Author(s):  
Jun Pei ◽  
Zheng Zheng ◽  
Hyunji Kim ◽  
Lin Song ◽  
Sarah Walworth ◽  
...  

An accurate scoring function is expected to correctly select the most stable structure from a set of pose candidates. One can hypothesize that a scoring function’s ability to identify the most stable structure might be improved by emphasizing the most relevant atom pairwise interactions. However, it is hard to evaluate the relevant importance for each atom pair using traditional means. With the introduction of machine learning methods, it has become possible to determine the relative importance for each atom pair present in a scoring function. In this work, we use the Random Forest (RF) method to refine a pair potential developed by our laboratory (GARF6) by identifying relevant atom pairs that optimize the performance of the potential on our given task. Our goal is to construct a machine learning (ML) model that can accurately differentiate the native ligand binding pose from candidate poses using a potential refined by RF optimization. We successfully constructed RF models on an unbalanced data set with the ‘comparison’ concept and, the resultant RF models were tested on CASF-2013.5 In a comparison of the performance of our RF models against 29 scoring functions, we found our models outperformed the other scoring functions in predicting the native pose. In addition, we used two artificial designed potential models to address the importance of the GARF potential in the RF models: (1) a scrambled probability function set, which was obtained by mixing up atom pairs and probability functions in GARF, and (2) a uniform probability function set, which share the same peak positions with GARF but have fixed peak heights. The results of accuracy comparison from RF models based on the scrambled, uniform, and original GARF potential clearly showed that the peak positions in the GARF potential are important while the well depths are not. <br>


2020 ◽  
Vol 27 (4) ◽  
pp. 313-320 ◽  
Author(s):  
Xuan Xiao ◽  
Wei-Jie Chen ◽  
Wang-Ren Qiu

Background: The information of quaternary structure attributes of proteins is very important because it is closely related to the biological functions of proteins. With the rapid development of new generation sequencing technology, we are facing a challenge: how to automatically identify the four-level attributes of new polypeptide chains according to their sequence information (i.e., whether they are formed as just as a monomer, or as a hetero-oligomer, or a homo-oligomer). Objective: In this article, our goal is to find a new way to represent protein sequences, thereby improving the prediction rate of protein quaternary structure. Methods: In this article, we developed a prediction system for protein quaternary structural type in which a protein sequence was expressed by combining the Pfam functional-domain and gene ontology. turn protein features into digital sequences, and complete the prediction of quaternary structure through specific machine learning algorithms and verification algorithm. Results: Our data set contains 5495 protein samples. Through the method provided in this paper, we classify proteins into monomer, or as a hetero-oligomer, or a homo-oligomer, and the prediction rate is 74.38%, which is 3.24% higher than that of previous studies. Through this new feature extraction method, we can further classify the four-level structure of proteins, and the results are also correspondingly improved. Conclusion: After the applying the new prediction system, compared with the previous results, we have successfully improved the prediction rate. We have reason to believe that the feature extraction method in this paper has better practicability and can be used as a reference for other protein classification problems.


Sign in / Sign up

Export Citation Format

Share Document