Development and validation of a novel ten-gene signature predicting prognosis in Hepatocellular carcinoma
Abstract Background: Hepatocellular carcinoma (HCC) is the most common type of liver cancer with relatively poor prognosis. Thus, we aimed to identify novel molecular biomarkers to effectively predict the prognosis of HCC patients and eventually guide treatment. Methods: Prognosis-associated genes were determined by Kaplan-Meier and multivariate Cox regression analyses using the expression and clinical data of 373 HCC patients from The Cancer Genome Atlas (TCGA) database and validated in an independent Gene Expression Omnibus (GEO) dataset. The classification of AML was performed by unsupervised hierarchical clustering of ten gene expression levels. A prognostic risk score was established based on a linear combination of ten gene expression levels using the regression coefficients derived from the multivariate Cox regression models. Results: A total of 183 genes were significantly associated with prognosis in HCC. SLC25A15, RAB8A, GOT2, SORBS2, IL18RAP were top five protective genes, while FHL3, AMD1, DCAF13, UBE2E1, PTDSS2 were top five risk genes in HCC. SLC25A15, GOT2, IL18RAP were significantly down-regulated and DCAF13, PTDSS2 and SORBS2 were significantly up-regulated in the HCC samples and these genes exhibited high accuracy in differentiating HCC tissues from normal liver tissues. Hierarchical clustering analysis of the ten genes discovered three clusters of HCC patients. HCC tumors of cluster1 and 2 were significantly associated with more favourable OS than those of cluster3, cluster2 tumors showed higher pathologic stage than cluster3 tumors. The risk score was predictive of increased mortality rate in HCC patients. Conclusions: The ten-gene signature and the risk score may turn out to be novel molecular biomarkers and stratification of HCC patients to considerably ameliorate the prognostic prediction.