Article Text

PDF

Screening for lung cancer: a systematic review and meta-analysis of controlled trials
  1. R L Manser1,
  2. L B Irving2,
  3. G Byrnes3,
  4. M J Abramson4,
  5. C A Stone5,
  6. D A Campbell6
  1. 1Clinical Epidemiology and Health Service Evaluation Unit, Royal Melbourne Hospital, Parkville, Victoria, Australia and Department of Respiratory Medicine, St Vincent’s Hospital, Fitzroy, Victoria, Australia
  2. 2Department of Respiratory Medicine, Royal Melbourne Hospital, Parkville, Victoria, Australia
  3. 3Department of Mathematics and Statistics, University of Melbourne, Victoria, Australia
  4. 4Department of Epidemiology and Preventive Medicine, Monash Medical School, The Alfred Hospital, Prahran, Victoria, Australia
  5. 5Public Health Division, Department of Human Services, Melbourne, Victoria, Australia
  6. 6Clinical Epidemiology and Health Service Evaluation Unit, Royal Melbourne Hospital, Parkville, Victoria, Australia
  1. Correspondence to:
    Dr R Manser, Clinical Epidemiology and Health Service Evaluation Unit, Ground Floor, Charles Connibere Building, Royal Melbourne Hospital, Grattan Street, Parkville 3050, Victoria, Australia;
    ManserRL{at}mh.org.au

Abstract

Background: Lung cancer is a substantial public health problem in western countries. Previous studies have examined different screening strategies for lung cancer but there have been no published systematic reviews.

Methods: A systematic review of controlled trials was conducted to determine whether screening for lung cancer using regular sputum examinations or chest radiography or computed tomography (CT) reduces lung cancer mortality. The primary outcome was lung cancer mortality; secondary outcomes were lung cancer survival and all cause mortality.

Results: One non-randomised controlled trial and six randomised controlled trials with a total of 245 610 subjects were included in the review. In all studies the control group received some type of screening. More frequent screening with chest radiography was associated with an 11% relative increase in mortality from lung cancer compared with less frequent screening (RR 1.11, 95% CI 1.00 to 1.23). A non-statistically significant trend to reduced mortality from lung cancer was observed when screening with chest radiography and sputum cytological examination was compared with chest radiography alone (RR 0.88, 95% CI 0.74 to 1.03). Several of the included studies had potential methodological weaknesses. Controlled studies of spiral CT scanning have not been reported.

Conclusions: The current evidence does not support screening for lung cancer with chest radiography or sputum cytological examination. Frequent chest radiography might be harmful. Further methodologically rigorous trials are required before any new screening methods are introduced into clinical practice.

  • meta-analysis
  • lung cancer
  • screening

Statistics from Altmetric.com

Lung cancer is the commonest cause of cancer death in the western world.1 It currently accounts for approximately 5% of all deaths in most developed countries and, as such, constitutes a major public health problem.1 Current therapeutic interventions have had little impact on the epidemic proportions of the disease, and the case fatality rate remains at 85–90%.2 The overwhelming majority of cases of lung cancer are attributable to cigarette smoking and thus primary prevention should continue to be a major focus of public health campaigns. However, such measures are likely to have only a limited impact on mortality in the short term because of a lag phase in the order of 20 years.

Most cases of lung cancer present at an advanced stage and therefore previous studies have investigated the role of screening for the detection of preclinical disease.3,4 Following a series of lung cancer screening trials conducted in the 1970s, it has generally been felt that early detection of lung cancer with chest radiographic or sputum cytology screening does not improve outcome, particularly disease specific mortality.5 However, recent narrative reviews have drawn conflicting conclusions.6,7 Furthermore, newer screening methods have now been proposed such as spiral computed tomographic (CT) scanning.8 We therefore conducted a systematic review in order to assess the evidence for various screening methods to reduce lung cancer mortality and to evaluate the morbidity and harms associated with screening. We chose the primary outcome measure of disease specific mortality. This outcome is not influenced by the screening biases that may affect survival data—in particular, lead time and overdiagnosis bias. Lead time bias may occur if screening advances the time of diagnosis but does not actually alter the time of death, hence survival appears longer but mortality is unchanged. Overdiagnosis bias may occur if a screening programme detects cases of cancer that would not have led to death in that individual’s lifetime. This may occur if indolent cancers are detected, or if cancers are detected in individuals who would have died from co-morbid disease before the cancer became clinically apparent. If overdiagnosis bias occurs there may be an apparent improvement in stage distribution, resection rates, and survival without an improvement in disease specific mortality. Because the most appropriate outcome measure for screening trials is debatable, we have also considered the secondary outcomes of survival and all cause mortality.

METHODS

Identification of studies

Medline (1966–2000), Premedline (up to April 2001), Embase, and the Cochrane Controlled Trials Register were searched. The full search strategy is outlined elsewhere.10 We also contacted experts, examined bibliographies, and hand searched the journal Lung Cancer (1985–2000).

Selection of studies

Randomised or non-randomised controlled trials that examined the impact of screening for lung cancer in adult populations on lung cancer mortality were included. All screening interventions were considered including chest radiography, sputum cytological examination, and CT scanning. Studies that compared different screening modalities or different screening frequencies were included.

Two reviewers independently assessed the titles and abstracts from the electronic searches and relevant articles were selected for full text review. Studies were selected for inclusion in the review after both reviewers assessed the full text articles. Disagreements were resolved by consensus. When assessing the eligibility and quality of studies, the reviewers were aware of the authorship and source of publication of the studies.

Study quality

Study quality was evaluated independently by two reviewers and disagreements were resolved by consensus. For randomised controlled studies quality was assessed by noting the method of randomisation and whether allocation was concealed. For all studies we assessed whether there was blinding of the outcome assessment and whether there was an adequate description of withdrawals and dropouts. The criteria used to assess the adequacy of allocation concealment are described in the Cochrane Handbook.10 For the remaining quality criteria we used the descriptions outlined by Jadad et al.11

Data extraction

One reviewer extracted the data, and a second study member extracted the data for the main results. Authors of included studies were asked to confirm the data extracted. Some results were extracted from graphs.

Outcome measures

The primary outcome was lung cancer mortality. Secondary outcomes included lung cancer survival, all cause mortality, and morbidity associated with screening.

Statistical analysis

Outcomes from included trials were combined using the Review Manager (Version 4.1, Update Software, Oxford).12 For dichotomous outcomes, relative risks (RR) are reported with 95% confidence intervals (CI). Homogeneity of effect sizes between studies being pooled was tested using p<0.10 as the cut off level for significance. For those outcomes where there was significant statistical heterogeneity, relative risks were reported using the random effects model, but for other outcomes the fixed effects model was used. The fixed effects model considers only the within study variability in the calculation of the common effect whereas the random effects model takes into account both between study and within study variability in the calculation of the common effect. Random effects models will usually produce wider confidence intervals. Data were analysed on an intention to screen basis. The level of agreement between reviewers evaluating studies for inclusion and undertaking quality assessments was assessed using simple kappa and weighted kappa statistics.

Survival data were summarised using relative risks. We were unable to calculate hazards ratios for the survival analysis because the primary studies did not include sufficient information. None of the primary studies included life tables, Kaplan-Meier survival curves, or hazards ratios. Some of the studies did include graphs of cumulative event rates which could be extracted and used to approximate the hazards ratio.13 However, this method ignores censoring and effectively will approximate the ratio of cumulative mortality—that is, the risk ratio.

RESULTS

Literature search and inclusion in study

A total of 1869 citations were identified by the Medline search, 119 of which were selected for full text review (kappa=0.54; moderate agreement). Following the full text review, six studies (all with multiple citations) were selected for inclusion in the review (kappa=0.9; very good agreement).3,4,14–17 A further study was selected for inclusion after bibliographies of review articles were searched.18 Searches of Embase, Premedline, hand searching of Lung Cancer, and contact with primary authors and experts in the field did not reveal any further relevant studies that had not been identified by the Medline search. One ongoing study was also identified for which no results are yet available.19

Missing data and contact with authors

We attempted to contact authors of all the studies in the review. Authors of two of the studies have responded to our requests for further information,4,15 but data extraction has been confirmed for only one of the studies to date.4 For lung cancer mortality the results of the Erfurt County study could not be included in the pooled analysis due to insufficient data.3 All cause mortality was not reported in two of the studies14,17 and survival was not reported in one of the studies.18

Study characteristics

Of the seven controlled trials included in the review, one was non-randomised,3 one was a cluster randomised study,14 and the remaining five studies were randomised.4,15–18 In all these studies participants in the control groups underwent variable degrees of screening. Five studies effectively compared more frequent chest radiographic screening with less frequent screening.3,4,14,16,18 The details of these studies are outlined in table 1. A further two studies were designed to assess whether sputum cytological examination at 4 monthly intervals would reduce lung cancer mortality when added to screening with annual chest radiographs.15,17 The two studies had an almost identical study design.20 Both enrolled male heavy smokers over the age of 45. The intervention groups were offered an annual chest radiograph and 4 monthly sputum cytological examinations while the control groups were offered an annual chest radiograph. Participants in these studies were recruited from the Baltimore17 and New York metropolitan areas.15

Table 1

Design of screening studies comparing different frequencies of chest radiographic screening

Quality of included studies

Concealment was inadequate in one of the randomised studies18 and was not described in the remaining five studies.4,14–17 After contacting one of the study authors, however, concealment of allocation was classified as adequate in the Memorial Sloan-Kettering study,15 and in the Mayo Lung Project the randomisation book was open and therefore allocation concealment was inadequate (confirmed by contacting study author). Appropriate methods were used to generate random sequences in all but one of the studies.18 In four of the studies the investigators, who were masked to the screening status of subjects, assessed the cause of death.4,15,17,18 Withdrawals and dropouts were adequately described in four studies.3,4,14,15 In the Erfurt County study, however, losses to follow up were significantly greater in the control group (4.9% v 3.6%, p=0.0001).3 Follow up was poor in the Kaiser Permanente study18 and was not adequately reported in the Czech study.16 In the Johns Hopkins study 1.3% of participants were lost to follow up but no further details were provided.16 Extended follow up has recently been reported for two of the studies.21,22 In the Mayo Lung Project vital status was ascertained by searching the National Death Index,22 but the methods of follow up were not described in the Czech study.21

Compliance with screening

In the Mayo Lung Project compliance with scheduled screening averaged 75% in the intervention group and 73% of the control group received non-study chest radiographs during the final 2 years of the study.4 In the North London study14 63.2% of workers in the intervention group and 62.7% of workers in the control group attended for the final radiograph at the end of 3 years. In the Erfurt County Study3 compliance with scheduled screening was not described in detail. In the Czechoslovakian study16 92% of the intervention group and 95% of the control group attended screening that took place at the end of the first 3 years. In the Kaiser Permanente study18 the mean number of multiphasic health check ups (MHCs) per person during the study period was 6.8 (maximum 18, median 6) in the intervention group and 2.8 (maximum 17, median 1) in the control group. In the Memorial Sloan-Kettering study participants were considered compliant if they had their last radiograph in 1982, or more than 5 years after enrolment, or within 1 year of death.15 Of the intervention group, 63% were compliant, as were 65% of the control group. In the Johns Hopkins study17 19% of participants withdrew from active screening, but the proportion in each group was not described.

Lung cancer mortality

For studies which effectively compared more frequent chest radiographic screening with less frequent screening, the relative risk (RR) was 1.11 (95% CI 0.95 to 1.31) and there was no significant statistical heterogeneity between the results of different studies (p=0.67). The analysis was also conducted using data from the extended follow up reported for two of the studies (table 2).21,22 Lung cancer mortality was significantly greater in the group undergoing more frequent chest radiographic screening than in those receiving less frequent screening (RR 1.11 (95% CI 1.00 to 1.23), p=0.05). These results were not altered by the method of meta-analysis (random effects v fixed effects analysis). The results of the study excluded from the pooled analysis were similar to those included, with a slight trend to increased mortality in the intervention group.3

Table 2

Relative risk of death from lung cancer: studies comparing frequent chest radiographic screening with less frequent screening.

The results from the two trials comparing annual chest radiographic screening with annual chest radiography plus 4 monthly sputum cytological examinations were pooled (table 3) giving a combined RR estimate of 0.88 (95% CI 0.74 to 1.03; p=0.11). There was no significant statistical heterogeneity (p=0.25).

Table 3

Relative risk of death from lung cancer: studies comparing annual chest radiography with annual chest radiography plus 4 monthly sputum cytological examination

Lung cancer survival

Survival was examined by comparing the proportion of patients diagnosed with lung cancer who were alive 5 years after diagnosis. For studies which compared more frequent chest radiographic screening with less frequent screening, survival was better in the intervention group with a combined RR of death from lung cancer of 0.91 (95% CI 0.84 to 0.99; p=0.02). There was significant statistical heterogeneity between the results of studies being pooled (p=0.023). For studies that compared annual chest radiographic screening plus 4 monthly sputum cytological examination with annual chest radiographic screening, survival was better in the group receiving regular sputum cytological examinations (RR 0.83 (95% CI 0.75 to 0.92; p=0.0003)).

All cause mortality

A pooled analysis (table 4) for all cause mortality was conducted on those studies which effectively compared more frequent chest radiographic screening with less frequent screening (RR 1.01 (95% CI 0.94 to 1.08)). There was significant statistical heterogeneity between the results of the studies (p=0.08). Visual inspection of the graph for overlap of the 95% confidence intervals suggested that the results of the Czech study16 differed from the other studies. When the Czech study was excluded from the analysis the RR was 0.97 (95% CI 0.94 to 1.01) and there was no significant statistical heterogeneity (p=0.47). All cause mortality was reported in only one of the studies comparing annual chest radiography with annual chest radiography plus 4 monthly sputum cytological examinations (RR 1.03 (95% CI 0.9–1.17)).15

Table 4

Relative risk of death (all causes): studies comparing more frequent chest radiographic screening with less frequent screening

Harms and morbidity associated with screening

In general, harms associated with screening were poorly reported. Postoperative deaths were not reported in three of the studies.14,17,18 In the remaining studies the number of postoperative deaths was small and did not differ significantly between intervention and control groups; for most of the studies they appear to have been included as lung cancer deaths.3,4,15,16 Morbidity associated with diagnostic work ups and surgical procedures was not well described—for example, no details were provided on the proportion of subjects with positive screening results who required further invasive tests. The details of the therapeutic interventions received by participants—such as extent of surgery—were poorly described.

DISCUSSION

This is the first systematic review of lung cancer screening studies reported in the literature. A previously reported meta-analysis was not undertaken in the context of a systematic review.5 The results of the present meta-analysis suggest that, overall, more frequent chest radiographic screening does not result in reduced lung cancer mortality compared with less frequent screening. In fact, when data from the prolonged periods of follow up recently reported for two of the primary studies are included in the analysis, more frequent chest radiographic screening is associated with an 11% relative increase in lung cancer mortality compared with less frequent screening. Screening biases, such as lead time bias and overdiagnosis bias, will influence survival as an outcome in screening studies. The finding in this review of a significant increase in survival from lung cancer in association with an increase in disease specific mortality emphasises the unreliability of survival as an outcome measure in screening trials. These outcomes differ in that survival is usually measured from the point of diagnosis and the denominator is restricted to those who are diagnosed with cancer. In contrast, mortality is assessed by evaluating the number of deaths among all those assigned to a particular intervention group during a defined time period. As highlighted in the introduction, mortality (either disease specific or all cause) is not affected by screening biases. A more detailed description of these biases is outlined elsewhere.7

Screening with 4 monthly sputum cytological examination in addition to annual chest radiography was not associated with a reduction in lung cancer mortality compared with annual chest radiographic screening alone. However, the 95% confidence intervals were relatively wide and included a range of potentially clinically significant values—for example, the true effect might lie between a 26% relative reduction in lung cancer mortality and a 3% relative increase in lung cancer mortality. None of the studies in the review addressed the question of whether there might be a benefit from less frequent chest radiographic screening (with or without sputum cytological examination) compared with no screening. However, a multi-screening study which is currently underway has been designed to assess whether screening for lung cancer with annual chest radiography can reduce disease specific mortality compared with no screening.19

We identified potentially important methodological weaknesses in most of the included studies. Not all the methods used to assess the quality of studies included in this review have been validated for screening studies. The adequacy of allocation concealment has been shown to be an important source of bias in obstetric treatment studies, but has not been evaluated extensively in other fields.23 Recent reports suggest that it may be an important source of bias in screening studies.24 It should be noted that, although inadequate concealment will usually favour the intervention arm, bias can occur in either direction. Inadequate randomisation could therefore give rise to imbalances in prognostic variables between intervention and control groups at baseline and this could account for the finding of increased mortality from lung cancer in the group undergoing more frequent chest radiographic screening. It is noteworthy, however, that adjustment for baseline prognostic factors in one study did not alter the results substantially.25 Of course, if randomisation was inadequate, there is little assurance that potential confounders which were not measured or not known were balanced at baseline. The results of this review should therefore be interpreted with caution. In particular, the finding of an increase in lung cancer mortality in the group receiving more frequent chest radiographic screening may be the result of systematic errors in the primary studies.

Bias may also have occurred as a result of the misclassification of lung cancer deaths. Deaths due to other causes might be more likely to be attributed to lung cancer in the group undergoing more frequent screening.26 This has been termed “sticking diagnosis bias”. It is also possible that the treatment of “pseudodisease” could have led to a relative increase in the number of treatment related deaths.26 In addition to the above limitations, it could be argued that studies conducted several decades ago cannot be generalised to current practice. In particular, women were excluded from most of the studies and the incidence of different histological subtypes of lung cancer has changed since the 1970s.27

We certainly need to consider the possibility that screening with frequent chest radiographs might be harmful. Based on current understanding of the effects of medical radiation, it seems unlikely that the increased incidence and mortality of lung cancer is related to radiation exposure.28 Another possible explanation is that, in the absence of a benefit from early detection and treatment, early diagnosis adversely influences outcome. When cancer is diagnosed in patients who would otherwise have remained asymptomatic for some time, or until death from another cause, the prognosis may be influenced by the fact that they are now aware of the diagnosis. The diagnosis of cancer is frequently associated with adjustment difficulties or depression. Higher levels of psychological stress in patients treated for cancer have been shown to inhibit cellular immune responses,29 and coping and emotional distress have been shown to be independent predictors of survival in lung cancer.30

This review was limited to a small number of controlled trials conducted over two decades ago. We adopted the Cochrane methodology and therefore uncontrolled trials were excluded. Although such studies represent a lower level in the evidence hierarchy, they can be an important source of knowledge and some might argue that their exclusion is a limitation of reviews such as this. For example, we have not included uncontrolled studies of spiral CT scanning, although preliminary studies indicate that spiral CT scanning is a more sensitive screening tool than chest radiography.8 Randomised controlled trials of spiral CT scanning are now being carefully planned which draw on past experience with lung cancer screening trials.31 Strategies are also being developed to evaluate other approaches to prevention such as early detection with sputum immunohistochemical techniques coupled with chemoprevention.32 The weaknesses of the primary studies discussed in this review should be viewed in an appropriate historical context. We have highlighted some of the problems in relation to the reporting of these studies. In particular, there were limitations with the reporting of harms associated with screening and follow up details were not well described in some studies. Contemporary studies now have the benefit of greater collective experience and guidelines for the reporting of randomised controlled trials have been published.33

In conclusion, there is currently insufficient evidence to support screening for lung cancer with any screening modality. Our results suggest that there may be a role for sputum cytology as an adjunct to other screening methods, but this warrants further evaluation. Some experts have suggested that current public policies that discourage routine chest radiographic screening should be reconsidered,34 and others have proposed non-comparative study designs for the evaluation of cancer screening tools.35 However, we cannot ignore the potential harms associated with screening asymptomatic individuals in the community. Frequent chest radiographic screening clearly does not significantly reduce lung cancer mortality compared with less frequent screening and, given the risk of false positive test results, the overall impact of such screening might be detrimental. Emerging screening technologies need to be evaluated in well designed studies before mass screening programmes are adopted.

Acknowledgments

The authors thank the Iberoamerican Cochrane Center for assistance with database searches and Dr Consol Serra (coordinator of the Cochrane Lung Cancer Group) for assistance with protocol development and editing of the review. They acknowledge the help provided by authors of primary studies who have responded to our correspondence and provided additional information (Dr Robert Fontana, Dr Myron Melamed) and are grateful to Carol Roberts who assisted with the literature search and the retrieval of studies relevant to the review.

REFERENCES

View Abstract

Footnotes

  • Sources of support: RLM is supported by a National Health and Medical Research Council scholarship (no 201713).

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.