Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features

Nat Commun. 2016 Aug 16;7:12474. doi: 10.1038/ncomms12474.

Abstract

Lung cancer is the most prevalent cancer worldwide, and histopathological assessment is indispensable for its diagnosis. However, human evaluation of pathology slides cannot accurately predict patients' prognoses. In this study, we obtain 2,186 haematoxylin and eosin stained histopathology whole-slide images of lung adenocarcinoma and squamous cell carcinoma patients from The Cancer Genome Atlas (TCGA), and 294 additional images from the Stanford Tissue Microarray (TMA) Database. We extract 9,879 quantitative image features and use regularized machine-learning methods to select the top features and to distinguish shorter-term survivors from longer-term survivors with stage I adenocarcinoma (P<0.003) or squamous cell carcinoma (P=0.023) in the TCGA data set. We validate the survival prediction framework with the TMA cohort (P<0.036 for both tumour types). Our results suggest that automatically derived image features can predict the prognosis of lung cancer patients and thereby contribute to precision oncology. Our methods are extensible to histopathology images of other organs.
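
As an illustration of how a shorter-term and a longer-term survivor group might be compared once a classifier has assigned patients to them, the following minimal sketch computes Kaplan-Meier survival estimates for two groups. It is not the authors' code: the function name `kaplan_meier`, the group labels, and all follow-up times are assumptions invented for this example, and the abstract does not state which software or statistical test produced the reported P values.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time where a death was observed.

    times  -- follow-up time for each patient (any consistent unit)
    events -- 1 if death was observed at that time, 0 if censored
    """
    deaths = Counter(t for t, e in zip(times, events) if e)
    exits = Counter(times)      # every patient leaves the risk set at their own time
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d:
            # Product-limit step: multiply by the fraction surviving past t.
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]     # remove both deaths and censored patients
    return curve


# Toy follow-up data in months (invented for illustration, not from the study).
high_risk = kaplan_meier([5, 8, 12, 20, 30], [1, 1, 1, 0, 1])
low_risk = kaplan_meier([15, 32, 40, 55, 60], [1, 0, 1, 0, 0])

for label, curve in (("predicted shorter-term", high_risk),
                     ("predicted longer-term", low_risk)):
    print(label, [(t, round(s, 3)) for t, s in curve])
```

In practice the two curves would then be compared with a significance test; that step, and the upstream feature extraction and regularized feature selection, are outside the scope of this sketch.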

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma / diagnosis*
  • Aged
  • Carcinoma, Squamous Cell / diagnosis*
  • Female
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Kaplan-Meier Estimate
  • Lung / pathology*
  • Lung Neoplasms / diagnosis*
  • Machine Learning
  • Male
  • Middle Aged
  • Pathology, Clinical / methods
  • Prognosis