NIMG-37. PREDICTING SEIZURE IN GLIOMA PATIENTS USING A RANDOM FOREST CLASSIFIER TRAINED ON SEX-SPECIFIC AND MIXED COHORTS
Abstract PURPOSE Brain tumor related epilepsy (BTE) is a major co-morbidity in patients with glioma. It is difficult to determine whether the use of anti-epileptic drugs is necessary. We attempted to build a machine-learning model to predict the probability of seizure presentation (SP) with glioma. METHODS We trained a random forest classifier using the following variables: volumetric data of pre-treatment MR images (T1Gd and T2-FLAIR sequences), patient demographics (age; sex), and measurements of tumor proliferation (log(ρ)), invasiveness (log(D)) and their relative ratio (log(ρ/D)). Our cohort consisted of 221 patients total. Using an 80-20 ratio, we used 176 patients (76 SP, 100 nSP) for training and the remaining 45 patients (19 SP, 26 nSP) were used for testing. We also trained on male-only and female-only cohorts to evaluate any sex differences in prediction. For training, 108 males (53 SP, 55 nSP) were used and 28 for testing (14 SP, 14 nSP). We used 72 females (21 SP, 49 nSP) for training and 15 (7 SP, 8 nSP) for testing. We corrected for class imbalance in the female cohort before training. Using 10-fold cross-validation and a separate testing set, we measured performance by ROC curve (AUC), accuracy, sensitivity, and specificity of predictions (average of folds in cross validation). RESULTS The female model achieved the highest AUC (0.853) followed by the mixed model (0.726) and the male model (0.651). In the validation set, the accuracy/sensitivity/specificity of the three cohorts were as follows: mixed (0.726/0.696/0.750), female (0.853/0.830/0.875), and male (0.651/0.577/0.722). The performance of the testing set, in terms of accuracy/sensitivity/specificity were: mixed (0.733/0.74/0.73), female (0.8/0.57/1), and male (0.714/0.64/0.79). CONCLUSION We found a negative correlation between seizure probability and size and invasiveness of tumors. Our model shows promising performance on testing set data. Further cohort studies and training is warranted.