Machine learning for prediction of immunotherapy efficacy in non-small cell lung cancer from simple clinical and biological data

ABSTRACTBackgroundImmune checkpoint inhibitors (ICIs) are now a therapeutic standard in advanced non-small cell lung cancer (NSCLC), but strong predictive markers for ICIs efficacy are still lacking. We evaluated machine learning models built on simple clinical and biological data to individually predict response to ICIs.MethodsPatients with metastatic NSCLC who received ICI in second line or later were included. We collected clinical and hematological data and studied the association of this data with disease control rate (DCR), progression free survival (PFS) and overall survival (OS). Multiple machine learning (ML) algorithms were assessed for their ability to predict response.ResultsOverall, 298 patients were enrolled. The overall response rate and DCR were 15.3 % and 53%, respectively. Median PFS and OS were 3.3 and 11.4 months, respectively. In multivariable analysis, DCR was significantly associated with performance status (PS) and hemoglobin level (OR 0.58, p<0.0001; OR 1.8, p<0.001). These variables were also associated with PFS and OS and ranked top in random forest-based feature importance. Neutrophils-to-lymphocytes ratio was also associated with DCR, PFS and OS. The best ML algorithm was a random forest. It could predict DCR with satisfactory efficacy based on these three variables. Ten-fold cross-validated performances were: accuracy 0.68 ± 0.04, sensitivity 0.58 ± 0.08; specificity 0.78 ± 0.06; positive predictive value 0.70 ± 0.08; negative predictive value 0.68 ± 0.06; AUC 0.74 ± 0.03.ConclusionCombination of simple clinical and biological data could accurately predict disease control rate at the individual level.Highlights-Machine learning applied to a large set of NSCLC patients could predict efficacy of immunotherapy with a 69% accuracy using simple routine data-Hemoglobin levels and performance status were the strongest predictors and significantly associated with DCR, PFS and OS-Neutrophils-to-lymphocyte ratio was also associated with outcome-Benchmark of 8 machine learning models

Download Full-text

Machine Learning for Prediction of Immunotherapy Efficacy in Non-Small Cell Lung Cancer from Simple Clinical and Biological Data

Cancers ◽

10.3390/cancers13246210 ◽

2021 ◽

Vol 13 (24) ◽

pp. 6210

Author(s):

Sébastien Benzekry ◽

Mathieu Grangeon ◽

Mélanie Karlsen ◽

Maria Alexa ◽

Isabella Bicalho-Frazeto ◽

...

Keyword(s):

Machine Learning ◽

Lung Cancer ◽

Random Forest ◽

Small Cell Lung Cancer ◽

Disease Control ◽

Cell Lung Cancer ◽

Predictive Value ◽

Small Cell ◽

Biological Data ◽

Small Cell Lung

Background: Immune checkpoint inhibitors (ICIs) are now a therapeutic standard in advanced non-small cell lung cancer (NSCLC), but strong predictive markers for ICIs efficacy are still lacking. We evaluated machine learning models built on simple clinical and biological data to individually predict response to ICIs. Methods: Patients with metastatic NSCLC who received ICI in second line or later were included. We collected clinical and hematological data and studied the association of this data with disease control rate (DCR), progression free survival (PFS) and overall survival (OS). Multiple machine learning (ML) algorithms were assessed for their ability to predict response. Results: Overall, 298 patients were enrolled. The overall response rate and DCR were 15.3% and 53%, respectively. Median PFS and OS were 3.3 and 11.4 months, respectively. In multivariable analysis, DCR was significantly associated with performance status (PS) and hemoglobin level (OR 0.58, p < 0.0001; OR 1.8, p < 0.001). These variables were also associated with PFS and OS and ranked top in random forest-based feature importance. Neutrophil-to-lymphocyte ratio was also associated with DCR, PFS and OS. The best ML algorithm was a random forest. It could predict DCR with satisfactory efficacy based on these three variables. Ten-fold cross-validated performances were: accuracy 0.68 ± 0.04, sensitivity 0.58 ± 0.08; specificity 0.78 ± 0.06; positive predictive value 0.70 ± 0.08; negative predictive value 0.68 ± 0.06; AUC 0.74 ± 0.03. Conclusion: Combination of simple clinical and biological data could accurately predict disease control rate at the individual level.

Download Full-text

PO-0956 Non Invasive Grading of Non-Small Cell Lung Cancer Using Machine Learning Models Based on Radiomics

Radiotherapy and Oncology ◽

10.1016/s0167-8140(19)31376-3 ◽

2019 ◽

Vol 133 ◽

pp. S518

Author(s):

S. Aouadi ◽

R. Hammoud ◽

T. Torfeh ◽

N. Al-Hammadi

Keyword(s):

Machine Learning ◽

Lung Cancer ◽

Small Cell Lung Cancer ◽

Cell Lung Cancer ◽

Small Cell ◽

Learning Models ◽

Small Cell Lung ◽

Non Invasive ◽

Machine Learning Models

Download Full-text

Histological classification of non-small cell lung cancer with RNA-seq data using machine learning models

10.1145/3459930.3471168 ◽

2021 ◽

Author(s):

Robert B. Eshun ◽

Md Khurram Monir Rabby ◽

A. K. M. Kamrul Islam ◽

Marwan U. Bikdash

Keyword(s):

Machine Learning ◽

Lung Cancer ◽

Small Cell Lung Cancer ◽

Small Cell ◽

Histological Classification ◽

Rna Seq ◽

Learning Models ◽

Small Cell Lung ◽

Machine Learning Models

Download Full-text

Using Nomogram and Machine Learning Models to Predict Non-Small Cell Lung Cancer Prognosis

10.21203/rs.3.rs-616177/v1 ◽

2021 ◽

Author(s):

Haike Lei ◽

Chun Liu ◽

Zheng Xu ◽

Na Hong ◽

Xiaosheng Li ◽

...

Keyword(s):

Machine Learning ◽

Lung Cancer ◽

Cox Regression ◽

Small Cell ◽

Seer Database ◽

Learning Models ◽

Small Cell Lung ◽

Vital Status ◽

Nsclc Patients ◽

Machine Learning Models

Abstract BackgroundPatients with non-small cell lung cancer (NSCLC) often have a poor prognosis. Overall survival (OS) prediction through the early diagnosis of cancer has many benefits, such as allowing providers to design the best treatment plan for patients. In this study, we aimed to evaluate the prognostic factors in NSCLC patients, construct a nomogram, and develop machine learning models to predict the OS. We also conducted feature importance analysis to understand how relevant factors of NSCLC patients impact their OS.ResultsMultiple machine learning models were adopted in a retrospective cohort of patients from 2010 to 2015 in the Surveillance, Epidemiology, and End Results (SEER) database. Independent prognostic factors for NSCLC were determined using Cox proportional hazards regression analysis. We modeled OS and vital status as the outcomes and constructed and validated a nomogram to predict the OS of NSCLC. Furthermore, we applied logistic regression, random forest, XGBoost, decision tree, multilayer perceptron, and LightGBM to predict the patients’ vital status. We tested the prediction ability of the models and evaluated their performances using accuracy, sensitivity, specificity, precision, and the area under the receiver operating characteristic curve. A total of 34,567 patients selected from the SEER database that met our criteria were included in this study. The nomogram visualized the OS prediction results of the Cox regression model. Among the classifiers, XGBoost had the best prediction performance, with an area under the curve of 0.733.ConclusionsThe results demonstrated that machine learning-based classifier models are capable of predicting the outcomes of patients with NSCLC. And Cox regression model-based nomogram interpreted the results well and supports potential medical applications.

Download Full-text

Clinical outcomes and predictive value of programmed cell death-ligand 1 expression in response to anti-programmed cell death 1/ligand 1 antibodies in non-small cell lung cancer patients with performance status 2 or greater

International Journal of Clinical Oncology ◽

10.1007/s10147-020-01789-5 ◽

2020 ◽

Vol 26 (1) ◽

pp. 78-86

Author(s):

Takayuki Jodai ◽

Koichi Saruwatari ◽

Tokunori Ikeda ◽

Eiji Moriyama ◽

Kosuke Kashiwabara ◽

...

Keyword(s):

Lung Cancer ◽

Cell Death ◽

Programmed Cell Death ◽

Cancer Patients ◽

Predictive Value ◽

Performance Status ◽

Small Cell ◽

Small Cell Lung ◽

Lung Cancer Patients ◽

Programmed Cell Death 1

Download Full-text

Predictive value of serum VEGF levels for elderly patients or for patients with poor performance status receiving anti-PD-1 antibody therapy for advanced non-small-cell lung cancer

Cancer Immunology Immunotherapy ◽

10.1007/s00262-020-02539-2 ◽

2020 ◽

Vol 69 (7) ◽

pp. 1229-1236 ◽

Cited By ~ 2

Author(s):

Ryota Shibaki ◽

Shuji Murakami ◽

Yuki Shinno ◽

Yuji Matsumoto ◽

Tatsuya Yoshida ◽

...

Keyword(s):

Lung Cancer ◽

Elderly Patients ◽

Small Cell Lung Cancer ◽

Predictive Value ◽

Performance Status ◽

Antibody Therapy ◽

Poor Performance ◽

Small Cell ◽

Poor Performance Status ◽

Small Cell Lung

Download Full-text

Pilot Study of Performance Status 2 vs. Performance Status 0-1 Non-small Cell Lung Cancer Patients Treated With Chemo/Immunotherapy

Case Medical Research ◽

10.31525/ct1-nct04253964 ◽

2020 ◽

Author(s):

Keyword(s):

Lung Cancer ◽

Pilot Study ◽

Small Cell Lung Cancer ◽

Cancer Patients ◽

Cell Lung Cancer ◽

Performance Status ◽

Small Cell ◽

Small Cell Lung ◽

Lung Cancer Patients

Download Full-text

Faculty Opinions recommendation of Palliative thoracic radiotherapy for patients with advanced non-small cell lung cancer and poor performance status.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.725268190.793503674 ◽

2015 ◽

Author(s):

Mellar Davis

Keyword(s):

Lung Cancer ◽

Small Cell Lung Cancer ◽

Cell Lung Cancer ◽

Performance Status ◽

Poor Performance ◽

Small Cell ◽

Poor Performance Status ◽

Thoracic Radiotherapy ◽

Small Cell Lung

Download Full-text

Document Preprocessing with TF-IDF to Improve the Polarity Classification Performance of Unstructured Sentiment Analysis

Kinetik Game Technology Information System Computer Network Computing Electronics and Control ◽

10.22219/kinetik.v5i3.1066 ◽

2020 ◽

pp. 235-242

Author(s):

Farrikh Alzami ◽

Erika Devi Udayanti ◽

Dwi Puji Prabowo ◽

Rama Aria Megantara

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Random Forest ◽

Sentiment Analysis ◽

Classification Performance ◽

Document Preparation ◽

Learning Models ◽

Polarity Classification ◽

Negative Sentiment ◽

Machine Learning Models

Sentiment analysis in terms of polarity classification is very important in everyday life, with the existence of polarity, many people can find out whether the respected document has positive or negative sentiment so that it can help in choosing and making decisions. Sentiment analysis usually done manually. Therefore, an automatic sentiment analysis classification process is needed. However, it is rare to find studies that discuss extraction features and which learning models are suitable for unstructured sentiment analysis types with the Amazon food review case. This research explores some extraction features such as Word Bags, TF-IDF, Word2Vector, as well as a combination of TF-IDF and Word2Vector with several machine learning models such as Random Forest, SVM, KNN and Naïve Bayes to find out a combination of feature extraction and learning models that can help add variety to the analysis of polarity sentiments. By assisting with document preparation such as html tags and punctuation and special characters, using snowball stemming, TF-IDF results obtained with SVM are suitable for obtaining a polarity classification in unstructured sentiment analysis for the case of Amazon food review with a performance result of 87,3 percent.

Download Full-text

Phase II study of docetaxel and cisplatin in advanced non-small-cell lung cancer.

Journal of Clinical Oncology ◽

10.1200/jco.1998.16.5.1948 ◽

1998 ◽

Vol 16 (5) ◽

pp. 1948-1953 ◽

Cited By ~ 60

Author(s):

J Zalcberg ◽

M Millward ◽

J Bishop ◽

M McKeage ◽

A Zimet ◽

...

Keyword(s):

Lung Cancer ◽

Phase Ii ◽

Advanced Nsclc ◽

Response Rate ◽

Phase Ii Study ◽

Performance Status ◽

Objective Response ◽

Small Cell ◽

Small Cell Lung ◽

Grade 3

PURPOSE Docetaxel (Taxotere, Rhone-Poulenc Rorer, Antony, France) and cisplatin are two of the most active single agents used in the treatment of non-small-cell lung cancer (NSCLC). A recently reported phase I study of the combination of docetaxel and cisplatin recommended a dose of 75 mg/m2 of both drugs every 3 weeks for subsequent phase II study. PATIENTS AND METHODS Eligible patients were aged 18 to 75 years with a World Health Organization (WHO) performance status < or = 2 and life expectancy > or = 12 weeks, with metastatic and/or locally advanced NSCLC proven histologically or cytologically. Patients were not permitted to have received prior chemotherapy, extensive radiotherapy, or any radiotherapy to the target lesion and must have had measurable disease. Concurrent treatment with colony-stimulating factors (CSFs) or prophylactic antibiotics was not permitted. Docetaxel (75 mg/m2) in 250 mL 5% dextrose was given intravenously (i.v.) over 1 hour immediately before cisplatin (75 mg/m2) in 500 mL normal saline given i.v. over 1 hour in 3-week cycles. Premedication included ondansetron, dexamethasone, promethazine, and standard hyperhydration with magnesium supplementation. RESULTS A total of 47 patients, two thirds of whom had metastatic disease, were entered onto this phase II study. The majority of patients were male (72%) and of good (WHO 0 to 1) performance status (85%). All 47 patients were assessable for toxicity and 36 were for response. Three patients were ineligible and eight (17%) discontinued treatment because of significant toxicity. In assessable patients, the overall objective response rate was 38.9% (95% confidence limits [CL], 23.1% to 56.5%), 36.1% had stable disease, and 25% progressive disease. On an intention-to-treat analysis, the objective response rate was 29.8%. Median survival was 9.6 months and estimated 1-year survival was 33%. Significant (grade 3/4) toxicities included nausea (26%), hypotension (15%), diarrhea (13%), and dyspnea mainly related to chest infection (13%). One patient experienced National Cancer Institute (NCI) grade 3 neurosensory toxicity after eight cycles. Grade 3/4 neutropenia was common and occurred in 87% of patients, but thrombocytopenia > or = grade 3 was rare (one patient). Significant (grade 3/4) abnormalities of magnesium levels were common (24%). Febrile neutropenia occurred in 13% of patients and neutropenic infection in 11%, contributing to two treatment-related deaths. No neutropenic enterocolitis or severe fluid retention was reported. CONCLUSION Compared with other active regimens used in this setting, the combination of docetaxel and cisplatin in advanced NSCLC is an active regimen with a similar toxicity profile to other combination regimens.

Download Full-text