Deep learning-based tumor microenvironment segmentation is predictive of tumor mutations and patient survival in non-small-cell lung cancer
Despite the fact that tumor microenvironment (TME) and gene mutations are the main determinants of progression of the deadliest cancer in the world - lung cancer - their interrelations are not well understood. Digital pathology data provide a unique insight into the spatial composition of the TME. Here, we generated 23,199 image patches from 55 hematoxylin-and-eosin (H&E)-stained lung cancer tissue sections and annotated them into 9 different tissue classes. Using this dataset, we trained a deep neural network and used it to segment 467 lung cancer H&E images downloaded from The Cancer Genome Atlas (TCGA) database. We used the segmented images to compute human interpretable features reflecting the heterogeneous composition of the TME, and successfully utilized them to predict patient survival (c-index 0.723) and cancer gene mutations (largest AUC 73.5% for PDGFRB). Our approach can be generalized to different cancer types to inform precision medicine strategies.