Integrated FDG-PET/CT does not make invasive staging of the intrathoracic lymph nodes in non-small cell lung cancer redundant: a prospective study
K G Tournoy (1), S Maddens (1), R Gosselin (2), G Van Maele (3), J P van Meerbeeck (1), A Kelles (4)

  1. Department of Respiratory Medicine, Ghent University Hospital, Ghent, Belgium
  2. Department of Radiology, Ghent University Hospital, Ghent, Belgium
  3. Department of Medical Statistics, Ghent University Hospital, Ghent, Belgium
  4. Department of Nuclear Medicine, Ghent University Hospital, Ghent, Belgium

Correspondence to: Dr Kurt G Tournoy, Department of Respiratory Medicine, Ghent University Hospital, De Pintelaan 185, 9000 Ghent, Belgium; kurt.tournoy{at}ugent.be

Abstract

Background: Staging of non-small cell lung cancer (NSCLC) is important for determining choice of treatment and prognosis. The accuracy of FDG-PET scans for staging of lymph nodes is too low to replace invasive nodal staging. It is unknown whether the accuracy of integrated FDG-PET/CT scanning makes invasive staging redundant.

Methods: In a prospective study, the mediastinal and/or hilar lymph nodes in patients with proven NSCLC were investigated with integrated FDG-PET/CT scanning. Pathological confirmation of all suspect lymph nodes was obtained to calculate the accuracy of the fusion images. In addition, the use of the standardised uptake value (SUV) in the staging of intrathoracic lymph nodes was analysed.

Results: 105 intrathoracic lymph node stations from 52 patients with NSCLC were characterised. The prevalence of malignancy in the lymph nodes was 36%. The sensitivity of the integrated FDG-PET/CT scan to detect malignant lymph nodes was 84% and its specificity was 85% (positive likelihood ratio 5.64, negative likelihood ratio 0.19). SUVmax, SUVmean and the SUVmax/SUVliver ratio were all significantly higher in malignant than in benign lymph nodes. The area under the receiver operating characteristic (ROC) curve did not differ between these three quantitative variables, but the highest accuracy was found with the SUVmax/SUVliver ratio. At a cut-off value of 1.5 for the SUVmax/SUVliver ratio, the sensitivity and specificity to detect malignant lymph node invasion were 82% and 93%, respectively.

Conclusion: The accuracy of integrated FDG-PET/CT scanning is too low to replace invasive intrathoracic lymph node staging in patients with NSCLC. The visual interpretation of the fusion images of the integrated FDG-PET/CT scan can be replaced by the quantitative variable SUVmax/SUVliver without loss of accuracy for intrathoracic lymph node staging.

  • CT, computed tomography
  • EBUS-TBNA, endobronchial endoscopic ultrasound with real-time guided transbronchial needle aspiration
  • EUS-FNA, endoscopic ultrasound with real-time guided fine needle aspiration
  • FDG, 18-fluoro-2-deoxy-D-glucose
  • NSCLC, non-small cell lung cancer
  • PET, positron emission tomography
  • ROC, receiver operating characteristic
  • SUV, standardised uptake value

Staging non-small cell lung cancer (NSCLC) is an important part of the diagnostic course in patients with lung cancer since it guides the treatment modalities and predicts survival.1 While computed tomography (CT) has an important role in the initial staging by providing excellent anatomical information on the extent of the primary tumour (T denominator), the CT scan has limited ability to differentiate between benign and malignant lymph nodes (N denominator). Whole body positron emission tomography (PET) with 18-fluoro-2-deoxy-D-glucose (FDG) has a higher accuracy for detecting intrathoracic lymph node metastasis,2–4 and demonstrates occult distant metastasis in approximately 10% of patients.2 According to other reports, CT and FDG-PET perform similarly in mediastinal staging.5 Because of an unacceptable rate of false positive and false negative findings, FDG-PET has been shown to be insufficiently accurate to replace invasive lymph node staging on tissue specimens.6–8 In addition, the visual localisation of intrathoracic lymph nodes with FDG-PET is not always unequivocal because of the low spatial resolution of the PET images.

Integrated FDG-PET/CT scans theoretically overcome this problem because of the co-acquisition of CT and FDG-PET images resulting in so-called fusion images. Lardinois et al9 showed that, in 50 patients with NSCLC, the integrated FDG-PET/CT scan was more accurate than FDG-PET alone for nodal staging; however, no difference in accuracy was noted when integrated FDG-PET/CT scans were compared with CT scans alone. Several investigators have indicated that the maximum standardised uptake value (SUVmax) of the primary tumour is a prognostic factor in patients with NSCLC.10,11 With the integrated FDG-PET/CT scan, the application of this quantitative technique at the level of the lymph node becomes more precise since the anatomical borders of the lymph nodes can be exactly identified for determining the region of FDG uptake.12

From a clinical point of view, an accuracy or a negative/positive predictive value of at least 90–95% for the integrated FDG-PET/CT scan is required to make invasive staging redundant. To evaluate whether tissue-confirmed lymph node staging by surgery or echo-endoscopy can be avoided, we prospectively assessed the accuracy of the integrated FDG-PET/CT scan in the nodal staging of NSCLC. In addition, we investigated whether an objective measure of FDG uptake based on SUV values could substitute for the subjective interpretation of the fusion images.

METHODS

Patients

Consecutive patients with suspected or pathologically proven primary NSCLC were eligible if a tissue specimen from at least one of the intrathoracic lymph nodes was available and if they underwent an integrated FDG-PET/CT scan. All investigations were done before the start of any treatment, and both the integrated PET/CT scan and examination of lymph node tissue were performed within 14 days of each other. The study was approved by the ethics committee of Ghent University Hospital.

Pathology of primary lung tumours and of intrathoracic lymph nodes

A tissue specimen of the primary tumour was obtained for pathological examination by either bronchoscopy, CT-guided transthoracic puncture or a surgical procedure (thoracotomy or video-assisted thoracoscopy). For intrathoracic lymph nodes, a tissue sample was obtained either by mediastinoscopy, surgical resection or by linear endoscopic ultrasound. The latter consisted of either oesophageal endoscopic ultrasound with real-time guided fine needle aspiration (EUS-FNA) or endobronchial endoscopic ultrasound with real-time guided transbronchial needle aspiration (EBUS-TBNA). Because the negative predictive values of EUS-FNA and EBUS-TBNA are considered too low,13,14 surgical confirmation was always done in case no malignant lymph node invasion could be demonstrated by either of these endoscopic techniques.

Staging procedures with CT and FDG-PET/CT

Patients fasted for at least 6 hours, after which blood glucose levels were determined to ascertain a level of <200 mg/dl. Patients then received 4 MBq/kg FDG intravenously followed by 250 ml sodium chloride and 20 mg furosemide. For muscle relaxation and to minimise background staining, diazepam 5 mg was given orally. Image acquisition started 60 min after injection of FDG in a relaxed supine position with the arms alongside the body using an integrated FDG-PET/CT scanner (Philips Gemini FDG-PET/CT, Philips Medical Systems, Cleveland, Ohio, USA). First, a total body low-dose CT scan for calculation of the attenuation correction was performed (120 kV, effective tube current-time product maximum 30 mAs, pitch 0.9, collimation 16×1.5 mm, rotation time 0.5 s, reconstructed contiguous slices of 5 mm, scan field from head up to the upper thighs). Second, a contrast-enhanced CT scan was performed with a dual head injector (175 mAs, otherwise the same scan parameters) after intravenous injection of 120 ml contrast medium with an iodine concentration of 300 mg/ml at a flow rate of 1.8 ml/s followed by a saline flush. No oral contrast was administered. Next, the FDG-PET scan from the orbitomeatal region up to the upper thighs (consisting of 8–9 bed positions of 3 min per table position) was performed. Patients were instructed to breathe normally during the acquisition of the CT and FDG-PET/CT images. PET image data sets were reconstructed iteratively using a row action maximum likelihood algorithm with segmented correction for attenuation with use of the CT data. Co-registered images were displayed by means of SYNTEGRA software (Philips Medical Systems).

The CT and integrated FDG-PET/CT scans represented a single procedure of data acquisition but were read separately. For the CT analysis the radiologist was blinded to the FDG-PET data. All intrathoracic lymph nodes were noted and the short and long axes were measured (mm). A lymph node with a short axis of at least 10 mm was considered suspect. The FDG-PET/CT scan was interpreted based on both CT and FDG-PET images, which were read by a nuclear medicine physician and a radiologist.

Determination of FDG-PET/CT SUV variables

The maximum and mean SUV values were determined by drawing regions of interest on the attenuation-corrected FDG-PET fusion images around the primary tumour or the involved lymph node. The variables SUVmax and SUVmean were then calculated as the maximum and mean SUV values, respectively, within the region of interest. Quantitative evaluation based on the SUVmax/SUVliver ratio was calculated as the ratio of the SUVmax over the mean SUV value obtained from the homogenous distribution of radioactivity in the liver.15
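As a minimal sketch of this calculation (not the software used in the study), the three variables can be derived from the SUVs of the voxels inside a region of interest. The function name, its inputs and the voxel values below are hypothetical and serve only as an illustration.

```python
from statistics import mean


def suv_variables(roi_suvs, liver_suvs):
    """Return (SUVmax, SUVmean, SUVmax/SUVliver) for one region of interest.

    roi_suvs   -- SUVs of the voxels inside the lesion region (hypothetical input)
    liver_suvs -- SUVs of voxels from a liver reference region (hypothetical input)
    """
    if not roi_suvs or not liver_suvs:
        raise ValueError("both regions must contain at least one voxel")
    suv_max = max(roi_suvs)
    suv_mean = mean(roi_suvs)
    suv_liver = mean(liver_suvs)  # mean liver SUV used as the background reference
    if suv_liver <= 0:
        raise ValueError("mean liver SUV must be positive")
    return suv_max, suv_mean, suv_max / suv_liver


# Illustrative values only; these are not data from the study.
node_voxels = [2.0, 3.0, 4.0]
liver_voxels = [1.5, 2.0, 2.5]
suv_max, suv_mean, ratio = suv_variables(node_voxels, liver_voxels)
print(suv_max, suv_mean, ratio)
```

Dividing by the mean liver SUV gives a unitless tumour-to-background ratio computed within the same scan.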

Statistical analysis

A clinical research form was completed for each patient and integrated in an electronic database for analysis with SPSS 14.0 (SPSS Inc, Chicago, Illinois, USA). Except for the demographic data, calculations were primarily performed at the single lymph node level. The test characteristics of the CT scan and integrated FDG-PET/CT scan were also calculated at the individual patient level. Values are expressed as mean (SD). Continuous variables were compared with the Mann-Whitney U test and Kruskal-Wallis test. Comparison of proportions (accuracy analysis) was done with a Z-test. A two-sided p value of <0.05 was considered statistically significant. The receiver operating characteristic (ROC) curves and the ROC areas, the latter being a parameter to measure how well a certain variable can distinguish between benign and malignant lymph nodes, were analysed with SPSS 14.0 and compared with MedCalc Version 9.1.0.1.
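The exact form of the Z-test is not specified here. The sketch below assumes a pooled two-sided z-test for two proportions, which is one common choice. The function name is hypothetical and the counts are invented for illustration (two proportions of roughly 85% and 61% in groups of 67); they are not taken from the study tables.

```python
import math


def two_proportion_z_test(successes_a, n_a, successes_b, n_b):
    """Pooled two-sided z-test for the difference between two proportions."""
    p_a = successes_a / n_a
    p_b = successes_b / n_b
    pooled = (successes_a + successes_b) / (n_a + n_b)
    if pooled in (0.0, 1.0):
        raise ValueError("pooled proportion of 0 or 1 gives a zero standard error")
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided tail area of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts, for illustration only.
z, p_value = two_proportion_z_test(57, 67, 41, 67)
print(f"z = {z:.2f}, two-sided p = {p_value:.4f}")
```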

RESULTS

Characteristics of patients and procedures

The study included 52 patients with NSCLC whose characteristics are shown in table 1. Patients were recruited between November 2005 and April 2006. A diagnosis of NSCLC was obtained in the primary tumour specimen in 67% of cases. For the others, the diagnosis was extrapolated from the malignant cells found in the lymph node specimen. 52% of the patients had at least one malignant intrathoracic lymph node. Integrated FDG-PET/CT analysis and tissue specimens were obtained for 105 intrathoracic lymph nodes from 50 patients (an average of 2.01 lymph nodes per patient). In two patients the integrated FDG-PET/CT scan was useful for evaluating the primary tumour only as its central location precluded a confident discrimination of mediastinal lymph nodes. In 10% of the patients a confirmatory surgical staging technique was necessary because of a negative endoscopic ultrasound examination.

Table 1

 Characteristics of patients and investigations

Standardised uptake values for the primary tumour

Table 2 shows the mean values of the different SUV variables for the primary tumours. There was no difference in SUVmax, SUVmean or in the ratio SUVmax/SUVliver between the different histological subtypes of NSCLC (Kruskal-Wallis test), nor between squamous and non-squamous carcinomas. Both standard deviation (table 2) and variance (data not shown) of the SUV variables indicate that the spread of the SUVmax was considerably larger than for SUVmean or the SUVmax/SUVliver ratio. In addition, lung tumours <3 cm had a statistically significantly lower SUVmax, SUVmean and SUVmax/SUVliver than tumours >3 cm. We did not find a significant correlation between the four T stages and the respective SUV values (data not shown).

Table 2

 Mean (SD) PET/CT data for the primary tumour in patients with non-small cell lung cancer (NSCLC)

Accuracy of CT and integrated FDG-PET/CT for staging lymph nodes

The accuracy of the CT findings and of the integrated FDG-PET/CT fusion images for staging intrathoracic lymph nodes is shown in table 3. The prevalence of malignant lymph nodes was 36%. Whereas the sensitivity of CT and integrated FDG-PET/CT scans to detect malignant lymph node invasion was equal (both 84%), integrated FDG-PET/CT scanning had a specificity of 85% compared with 61% for the CT scan. The accuracy for detecting malignant intrathoracic lymph nodes was 69% and 85%, respectively, for CT scans and integrated FDG-PET/CT scans. These figures do not differ significantly from the accuracy of the CT scan (74%) and integrated FDG-PET/CT scan (84%) calculated at the level of the individual patient. The negative predictive value of small lymph nodes (short axis <10 mm) without uptake of FDG was 91% (95% CI 77% to 97%; n = 43); the positive predictive value of enlarged lymph nodes (short axis ≥10 mm) with FDG uptake was 79% (95% CI 62% to 90%; n = 38); the negative predictive value of enlarged lymph nodes without uptake of FDG was 90% (95% CI 60% to 98%; n = 20); and the positive predictive value of small lymph nodes with FDG uptake was 50% (95% CI 6% to 93%; n = 4).
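The relation between these test characteristics can be sketched with the standard definitions of likelihood ratios and predictive values. The function names are hypothetical. The inputs are the rounded percentages reported for the integrated FDG-PET/CT scan (sensitivity 84%, specificity 85%, prevalence 36%), so the results only approximate figures calculated from the unrounded counts.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (positive LR, negative LR) from sensitivity and specificity."""
    lr_pos = sensitivity / (1 - specificity)
    lr_neg = (1 - sensitivity) / specificity
    return lr_pos, lr_neg


def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) via Bayes' rule for a given prevalence."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    true_neg = specificity * (1 - prevalence)
    false_neg = (1 - sensitivity) * prevalence
    return true_pos / (true_pos + false_pos), true_neg / (true_neg + false_neg)


# Rounded values as reported for the integrated FDG-PET/CT scan.
lr_pos, lr_neg = likelihood_ratios(0.84, 0.85)
ppv, npv = predictive_values(0.84, 0.85, 0.36)
print(f"LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}, PPV = {ppv:.2f}, NPV = {npv:.2f}")
```

With these rounded inputs the positive likelihood ratio comes out near 5.6 and the negative likelihood ratio near 0.19. Both predictive values depend on the prevalence, so they would shift in a population with a different proportion of malignant nodes.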

Table 3

 Accuracy of interpretation of computed tomography (CT) scans versus integrated whole body positron emission tomography with 18-fluoro-2-deoxy-D-glucose/CT (FDG-PET/CT) fusion images for staging of intrathoracic lymph nodes in patients with non-small cell lung cancer (NSCLC)

Standardised uptake values for intrathoracic lymph nodes and ROC curves

Figure 1 shows the SUV-based quantitative integrated FDG-PET/CT characteristics for intrathoracic lymph node staging. A statistically significant difference was observed between benign and malignant lymph nodes for SUVmax, SUVmean and for the SUVmax/SUVliver ratio. This difference was found in the enlarged lymph nodes but not in the smaller ones (data not shown). The ROC curves, ROC area and cut-off values are shown in fig 2. No statistically significant difference in ROC area was found between these three variables. For SUVmax the cut-off value for the highest accuracy was 2.9, which corresponds to a sensitivity of 76% and a specificity of 90%. For SUVmean the highest accuracy was reached at a cut-off point of 2.3, yielding a sensitivity of 68% and a specificity of 93%. The SUVmax/SUVliver ratio reached its highest accuracy at a cut-off value of 1.5, with a sensitivity of 82% and a specificity of 93%.
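Selecting the cut-off with the highest accuracy can be sketched as a simple search over candidate thresholds. This is an illustration of the idea rather than the SPSS or MedCalc procedure used in the study; the function name is hypothetical and the data are synthetic.

```python
def best_accuracy_cutoff(values, is_malignant):
    """Return (cutoff, accuracy, sensitivity, specificity) for the threshold
    with the highest accuracy; a node is called positive when value >= cutoff.
    If several cut-offs tie, the lowest one is returned."""
    n_pos = sum(is_malignant)
    n_neg = len(is_malignant) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both benign and malignant nodes are required")
    best = None
    for cutoff in sorted(set(values)):
        true_pos = sum(v >= cutoff and m for v, m in zip(values, is_malignant))
        true_neg = sum(v < cutoff and not m for v, m in zip(values, is_malignant))
        accuracy = (true_pos + true_neg) / len(values)
        if best is None or accuracy > best[1]:
            best = (cutoff, accuracy, true_pos / n_pos, true_neg / n_neg)
    return best


# Synthetic ratios for five benign and five malignant nodes (not study data).
ratios = [0.8, 1.0, 1.2, 1.4, 1.6, 1.3, 1.7, 2.0, 2.5, 3.1]
malignant = [False] * 5 + [True] * 5
cutoff, accuracy, sensitivity, specificity = best_accuracy_cutoff(ratios, malignant)
print(cutoff, accuracy, sensitivity, specificity)
```

A cut-off chosen this way is tuned to the sample it was derived from, so it would need confirmation in independent data before being applied elsewhere.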

Figure 1

 Distribution of the different standardised uptake values (SUV) according to the pathological state of the intrathoracic lymph node (LN).

Figure 2

 Receiver operating characteristic (ROC) curves for intrathoracic lymph node staging with PET/CT variables. SUV, standardised uptake value.

DISCUSSION

In this prospective study we have analysed the value of the integrated FDG-PET/CT scan for assessing intrathoracic lymph nodes in patients with NSCLC. Its accuracy for predicting malignant lymph node invasion was found to be higher than with a CT scan. We also found that the SUVmax/SUVliver ratio, a quantitative measure of FDG uptake, predicts malignant invasion in the intrathoracic lymph node with an accuracy comparable to that obtained with the integrated FDG-PET/CT fusion images. However, neither the interpretation of the integrated FDG-PET/CT fusion images nor the calculated cut-offs enabled us to discontinue using tissue-based lymph node staging in NSCLC because all the test characteristics were below the threshold of 95% at which malignant lymph node invasion can be confidently ruled in or out. Therefore, neither a positive nor a negative integrated FDG-PET/CT scan allows the clinician to confidently predict whether or not an intrathoracic lymph node is malignant.

The advantage of the integrated FDG-PET/CT scan over the CT scan as well as the FDG-PET scan in thoracic oncology is the fact that the spatial resolution of the integrated scan is much higher than with the FDG-PET scan. Because of co-acquisition and fusion, FDG uptake can be exactly localised even in relatively small focal abnormalities such as intrathoracic lymph nodes. This allows a detailed analysis of the value of SUV-based variables within a certain area of interest defined by the anatomical borders. We chose SUVmax, SUVmean and a tumour-to-background ratio SUVmax/SUVliver and applied these parameters not only to primary tumours but also to intrathoracic lymph nodes.

It has been shown in several retrospective series that the SUVmax obtained with FDG-PET scanning of the primary tumour predicts both stage and survival of the patient with NSCLC.10,11 A correlation between T stage and SUVmax was found by Cerfolio et al11 for T1 and T2 tumours but not for T3 and T4 tumours. The same authors also suggested that squamous cell carcinomas have a higher SUVmax than other pathological subtypes.11 SUVmax was proposed as a variable because it was suggested that its reproducibility is better than for SUVmean.16 However, others have suggested that SUVmean as well as the SUVmax/SUVliver ratio give more stable and reliable results.15

In our prospective series we analysed these SUV-based characteristics of the primary tumour with integrated FDG-PET/CT scanning and found that the clustering of SUVmean and the SUVmax/SUVliver ratio was better than that of SUVmax. This suggests that the former two might be more reliable and stable than SUVmax. Although our data also confirm that T1 tumours had a lower SUVmax than T2–4 tumours,11 we did not observe a difference between the different pathological subtypes.

For lymph node staging, we found that the integrated FDG-PET/CT scan performed better than the CT scan in terms of specificity (85% vs 61%, p = 0.002) but not sensitivity (84% for both). It is already known that FDG-PET scans are better than CT scans for staging of intrathoracic lymph nodes.6–8,17 However, Shim et al18 directly compared the accuracy of CT and integrated FDG-PET/CT scanning and suggested that the integrated FDG-PET/CT scan is superior for lymph node staging. Our data confirm this finding and further suggest that a visual interpretation of the fusion images is not sufficiently accurate to avoid invasive tissue specimen-based lymph node staging. Because the implication of a false positive staging evaluation is a missed opportunity for surgical care, we confirm that, as for FDG-PET, a positive integrated FDG-PET/CT result does not confidently rule in lymph node invasion. de Langen et al19 found that lymph nodes measuring 10–15 mm without FDG uptake had a post-test probability for malignant invasion of only 5%, so they suggested that these patients do not need further mediastinal investigation. Since the absolute number of false negative lymph nodes in our series was low, we are cautious about drawing firm conclusions on the negative predictive value of integrated FDG-PET/CT scans. Nevertheless, we calculated a specificity of 85% and a negative predictive value of 90%, which compares with the findings of Shim et al.18

We also determined whether the accuracy of intrathoracic lymph node staging could be improved by calculating SUV variables on the fusion images. As expected, we found that enlarged malignant lymph nodes have significantly higher SUV values than benign ones. With an FDG-PET scan it was proposed that a SUVmax cut-off of 4.4 was the best threshold to discriminate between benign and malignant lymph nodes.17 With the integrated FDG-PET/CT scan, we calculated ROCs for SUVmax and also for SUVmean and the SUVmax/SUVliver ratio for lymph node staging and found that optimal cut-off points for all three variables could be identified. For SUVmax we found that the optimal threshold for discriminating malignancy was 2.9. The discrepancy between published data17 and our findings could be partly attributed to the fact that FDG-PET techniques have evolved over the past 10 years resulting in more sensitive cameras, and also to the fact that different technology was used for the attenuation correction and for the calculation of the SUV data. For an FDG-PET examination, an external cesium source is used for calculating the attenuation correction while, for an FDG-PET/CT camera, the CT data are used to calculate the attenuation correction. This is the reason why we also included a ratio as variable (SUVmax/SUVliver) so that camera-specific variables could be omitted. In addition, the lower SUVmax threshold in our study could also be related to the fact that we were able to delineate exactly the nodal focus on the integrated FDG-PET/CT fusion images.

More importantly, our results indicate that a thoracic oncology clinic working with FDG-PET or with integrated FDG-PET/CT should not automatically rely on published SUV-based cut-off values but should determine their own machine-specific cut-off points. Although at a cut-off point of 1.5 the accuracy of the SUVmax/SUVliver ratio was comparable to the one obtained with the fusion images, calculated SUV values also do not replace lymph node tissue diagnosis.

This study has a number of limitations. A number of methods encompassing echo-endoscopy and surgical staging procedures were used to obtain lymph node pathology. Although all negative echo-endoscopies were confirmed with surgery, it is clear that patients who had proven N2/N3 disease after echo-endoscopy did not have full pathological mapping of the mediastinum. In addition, although over 100 lymph nodes were characterised, it is inevitable that—for the assessment of test variables—false negative and false positive findings are calculated on relatively small numbers. As discussed above, this means that the interpretation of negative predictive values of a small lymph node which is FDG-PET/CT negative should be done with caution. Furthermore, the data were not corrected for possible intrasubject correlation between nodes. Although re-calculation of the test characteristics at the patient level did not show a difference from the data obtained at the single lymph node level, this must be taken into account when interpreting the data.

In conclusion, integrated FDG-PET/CT scanning has an overall accuracy which is too low to replace invasive intrathoracic lymph node staging in patients with NSCLC. The visual interpretation of the fusion images of the integrated FDG-PET/CT scan can be replaced by the quantitative variable SUVmax/SUVliver without loss of accuracy for staging of intrathoracic lymph nodes.

Acknowledgments

This is an investigator-initiated study.

Footnotes

  • Funding: None.

  • Competing interests: None.
