Article Text

Original research
Liverpool Lung Project lung cancer risk stratification model: calibration and prospective validation
Free
  1. John K Field1,
  2. Daniel Vulkan2,
  3. Michael P A Davies1,
  4. Stephen W Duffy2,
  5. Rhian Gabe2
  1. 1 Molecular and Clinical Cancer Medicine, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, UK
  2. 2 Centre for Cancer Prevention, Wolfson Institute of Preventive Medicine, Queen Mary University of London, London, UK
  1. Correspondence to Dr John K Field, Molecular and Clinical Cancer Medicine, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L69 3BX, UK; J.K.Field{at}liverpool.ac.uk

Abstract

Background Early detection of lung cancer saves lives, as demonstrated by the two largest published low-dose CT screening trials. Optimal implementation depends on our ability to identify those most at risk.

Methods Version 2 of the Liverpool Lung Project risk score (LLPv2) was developed from case-control data in Liverpool and further adapted when applied for selection of subjects for the UK Lung Screening Trial. The objective was to produce version 3 (LLPv3) of the model, by calibration to national figures for 2017. We validated both LLPv2 and LLPv3 using questionnaire data from 75 958 individuals, followed up for lung cancer over 5 years. We validated both discrimination, using receiver operating characteristic (ROC) analysis, and absolute incidence, by comparing deciles of predicted incidence with observed incidence. We calculated proportionate difference as the percentage excess or deficit of observed cancers compared with those predicted. We also carried out Hosmer-Lemeshow tests.

Results There were 599 lung cancers diagnosed over 5 years. The discrimination of both LLPv2 and LLPv3 was significant with an area under the ROC curve of 0.81 (95% CI 0.79 to 0.82). However, LLPv2 overestimated absolute risk in the population. The proportionate difference was −58.3% (95% CI −61.6% to −54.8%), that is, the actual number of cancers was only 42% of the number predicted.

In LLPv3, calibrated to national 2017 figures, the proportionate difference was −22.0% (95% CI −28.1% to −15.5%).

Conclusions While LLPv2 and LLPv3 have the same discriminatory power, LLPv3 improves the absolute lung cancer risk prediction and should be considered for use in further UK implementation studies.

  • imaging/CT MRI etc
  • lung cancer

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Key messages

What is the key question?

  • The Liverpool Lung Project risk model (LLPv2) has been developed to identify those at high risk of lung cancer for intervention (eg, by low-dose CT in the UK Lung Screening Trial), here we assess the discriminatory and predictive power of the risk model and propose an updated version (LLPv3) more suitable for a contemporary country-wide population.

What is the bottom line?

  • The LLPv2 risk model was validated with a significant discriminatory power, but overestimates risk compared with the improved LLPv3.

Why read on?

  • We demonstrate an improved risk prediction model for lung cancer, by relatively simple a priori calibration to updated cancer incidence data.

Introduction

Evidence from randomised trials estimates a significant 20% reduction in lung cancer mortality with invitation to low-dose CT screening.1 2 In the USA, lung cancer screening has been recommended by the United States Preventive Services Task Force (USPSTF) and in other countries there are a growing number of implementation studies aimed at addressing outstanding pragmatic issues of wider roll out for national programmes. An issue of fundamental importance is identification of a population at sufficient risk of disease to optimise detection in those who would benefit from screening for lung cancer, minimise unnecessary investigations for those at low risk and provide a cost-effective programme. While there is ample evidence on the discriminatory power of various risk prediction tools, there is a need to validate these tools on prospectively collected data to estimate their predictive power, as recommended by Toll et al 3 and as performed by Ten Haaf et al.4

The Liverpool Lung Project risk model (LLPv2) was developed on a case-control study in Liverpool and provides a percentage risk score for an individual over the next 5 years, based on questionnaire data.5 It has been validated in three international data sets and had an area under the curve (AUC) of 0.82 in the UK Cohort.6 It was later amended (LLPv2) due to observations in the UK Lung Screening (UKLS) Trial7 8 in which it was used to select subjects for low-dose CT screening. In this paper, we report on further amendments to the model (LLPv3) calibrated to the whole of England and to more recently available lung cancer incidence data, with prospective validation in a large cohort covering two regions of England.

Methods

Development and calibration

The LLP risk model was derived from data relating to 579 individuals with lung cancer and 1157 controls, recruited between 1998 and 2005 in an area of North West England. Variables included in the model were age, sex, cigarette smoking history, prior diagnosis of pneumonia, occupational exposure to asbestos, prior diagnosis of malignant tumour and family history of lung cancer. The model development process and internal validation have been published elsewhere.5

A second version of the model (LLPv2) was subsequently developed, following observations in the UKLS that resulted in changes to two variables: previous diagnoses of other lung diseases were considered alongside pneumonia, and cigar and pipe smoking were included alongside cigarettes8 in the UKLS Questionnaire (online supplemental material 1). The change was prompted by the observation in UKLS that those included on the basis of duration of use of other smoking materials had at least the prevalence of lung cancer as those included with the same duration of cigarette smoking.7 The numerical parameters of the model remained unchanged. Model formulation is given in online supplemental material 2. The LLP risk models only use smoking duration, as in common with others,9–11 we found this to be the strongest predictor of the various smoking metrics, and other aspects such as amount smoked were not significant when adjusting for duration.4

Supplemental material

The age-standardised incidence ratio for lung cancer in Liverpool compared with England as a whole for the years 2012–2016 was 1.86.12 The age-standardised rates of newly diagnosed cases of lung cancer for England were 101.4 per 100 000 of population for men in 2006 and 57.9 for women; the corresponding rates in 2017 (the most recent year for which these data were available) were 86.8 and 67.0, respectively.13 In order to produce an updated version of the model (LLPv3) for current use in the UK, the age-related risk factors in the model were each adjusted to approximate the effect of reducing the calculated risk score by a factor of 1.86×101.4/86.8 for men and 1.86×57.9/67.0 for women, in the absence of any other risk factors (see online supplemental material 2).

Validation

Participants for the UKLS were recruited from people aged 50–75 (at the time of initial approach) living in the areas covered by six Primary Care Trusts in two regions of the UK (North West England and East of England), by means of a questionnaire sent to just under a quarter of a million people. Recruitment methods and participant rates for UKLS have been reported elsewhere.8

Of the 75 958 people who expressed an interest in participating in the study (by completing and returning the questionnaire between September 2011 and March 2013), 8729 were assessed to have a high-risk score (estimated 5% risk or higher over 5 years) for lung cancer. After exclusions, 4061 gave informed consent for low-dose CT randomisation and biosample provision. Of these, 2028 were randomised to the CT screening arm (intervention) and 2027 to the non-screening arm (control); the remaining 6 were not randomised.

The questionnaire sent out as part of the initial approach for recruitment to the trial used a standardised format (see online supplemental material 1) and covered personal history of non-malignant lung diseases, previous malignancy, exposure to asbestos, family history of lung and other cancers, and smoking history. For all those who had ever smoked regularly, questions covered the type of smoking material, quantity and the age of starting and (if applicable) stopping smoking. For validation of the amended model (LLPv3), we used prospective data for the 75 958 people who returned the initial questionnaire. Coding and interpretation details are given in online supplemental material 3.

Cancer and death registration data between 2011 and 2018 were obtained from Public Health England (PHE) and linked through NHS number by a third party to the research cohort (PHE Office of Data Release study 1819_074). This provided the minimum of 5 years of follow-up required for testing the LLP model. Only diagnoses within the first 5 years were taken into account.

A risk score representing cancer risk over 5 years was calculated for each individual from the questionnaire data using both LLPv2 and LLPv3. The input variables were derived according to the algorithms shown in the online supplemental material.

A receiver operating characteristic (ROC) curve was plotted to assess the ability of the model to successfully distinguish between those persons who went on to develop lung cancer and those who did not. The ROC curve is a plot of the true positive rate (or sensitivity) against the false positive rate (1−specificity) for different risk scores. The AUC was calculated, along with the corresponding 95% CI.

Individuals allocated to the intervention arm of the UKLS trial, nested within the validation cohort, may have potentially received additional diagnostic procedures in the form of low-dose CT scans, compared with the rest of the study population. In order to investigate the possible impact of this, sensitivity analysis was carried out, excluding this subgroup.

In order to assess the calibration of the predictive model, we ranked the LLPv3 risk scores, assigned decile groups and carried out a linear regression of the log-odds (Embedded Image ) of the rate of observed cancers in each group against the log-odds of the mean of the risk scores in the group. The proportionate difference (the number of cancers observed in excess or deficit of those that would have been expected based on the risk scores, divided by that expected number) was also calculated. In addition, we carried out Hosmer-Lemeshow tests, with the population classified by predicted risk deciles.14 However, interpretation of these has to be qualified by the test’s sensitivity to categorisation, sample size and ties.15–17

Statistical analyses were performed using STATA, V.15.1 (StataCorp).

Results

Table 1 shows the basic demographic and risk profile of the study population, and of the subgroup developing lung cancer. Overall, almost half the population were men, 65% were aged 60 years or over, 18% were in the most deprived index of multiple deprivation (IMD) quintile and more than half the population had smoked at some time in their lives. As expected, the lung cancer cases were older, had higher rates of smoking and were more deprived than the overall population.

Table 1

Baseline factors and lung cancers diagnosed

Lung cancers were diagnosed in a total of 599 people over the period of follow-up (5 years from the questionnaire completion date), that is, 0.8% of the study population. Table 2 shows the number of cancers that were potentially detectable, that is, occurring in subjects who qualify for surveillance and those occurring in subjects who do not qualify using the amended LLP model (assuming 100% sensitivity of CT screening) for selected referral thresholds. The table also shows the corresponding sensitivity and specificity of exceeding the threshold. Results are given for men and women separately, and overall. Under LLPv2, a threshold of 3.7% would have yielded 10 864 subjects qualifying. The equivalent threshold under LLPv3 would be 2.0%. The numbers needed to screen (with the risk model) per cancer potentially detectable, fall from 62 with a 0.5% LLPv3 threshold to 21 with a 4.5% threshold. The positive likelihood ratios increased from 2.0 at the LLPv3 0.5% threshold to 6.3 at the 4.5% threshold. The corresponding ranges for men only were 1.9–5.4 and for women 2.3–7.8.

Table 2

Sensitivity and specificity by LLPv3 5-year risk threshold

The ROC curve for LLPv3 is given in figure 1. The curves for LLPv2 and LLPv3 are almost identical, as would be expected, since the coefficients pertaining to the risk factors are the same, and the ranking of most individuals is unchanged from version 2 to version 3. As a result, the discrimination is very similar. The AUC in each case was 0.81 (95% CI 0.79 to 0.82). After excluding individuals allocated to the UKLS trial intervention arm, the AUC and 95% CI were also unchanged.

Figure 1

Receiver operating characteristic (ROC) curve for Version 3 of the Liverpool Lung Project risk score (LLPv3).

The numbers of cancers detected by LLP decile groups and the expected numbers of cancers in each group are given in table 3, for both LLPv2 and LLPv3. Corresponding figures separately for men and women are given in online supplemental material 4. The increasing actual risk with predicted risk can be seen from the increasing percentage of cancers in successive risk deciles. The 10% of subjects in the second decile of predicted risk by LLPv3 contribute 2% (12/599) of the cancers whereas the 10% in the sixth decile contribute 4% (24/599) and the 10% in the highest decile contribute 45% (269/599).

Table 3

Risk score ranges and observed and expected lung cancers by risk decile for Version 2 of the Liverpool Lung Project risk score (LLPv2) and Version 3 of the Liverpool Lung Project risk score (LLPv3)

Figure 2 shows the log odds of the observed rate of cancers (with 95% CIs) plotted against the log odds of the mean risk score for both LLPv2 and LLPv3.

Figure 2

Observed versus predicted log odds of cancer for Version 2 of the Liverpool Lung Project risk score (LLPv2) and Version 3 of the Liverpool Lung Project risk score (LLPv3).

For LLPv2, the slope of the calibration line was 0.88 (95% CI 0.74 to 1.02), and the intercept at the origin (corresponding to a risk score of 50%) was −1.31 (95% CI −1.96 to −0.66). The orange line represents perfect agreement between observed and expected cancers, indicating that the LLPv2 model overestimated the risk. The proportionate difference was (599−1436)/1436 or −58.3% (95% CI −61.6% to −54.8%).

For LLPv3, the slope was largely unchanged at 0.90 (95% CI 0.75 to 1.05), the intercept at the origin was −0.66 (95% CI −1.46 to 0.15), and the proximity to the orange line indicates reduced discrepancy between observed and expected cancers. There was still a tendency to overestimate the number of cancers, especially at the highest risk levels, but the proportionate difference was considerably lower, at −22.0%.

Figure 3 shows the corresponding graphs for LLPv3 for men and women separately. The slope for men was 0.89 (95% CI 0.79 to 0.99) and the intercept was −0.74 (95% CI −1.28 to −0.21). For women, there was stronger agreement between predicted and observed numbers, with a slope of 0.93 (95% CI 0.72 to 1.14) and an intercept of 0.47 (95% CI −1.6 to +0.65).

Figure 3

Observed versus predicted log odds of cancer for Version 3 of the Liverpool Lung Project risk score (LLPv3) (A) men; (B) women.

The corresponding Hosmer-Lemeshow tests yielded a χ2 of 530.06 (p<0.001) for LLPv2 overall, with 437.78 (p<0.001) for males and 110.18 (p<0.001) for women. For LLPv3, a considerably improved fit was shown, with the Hosmer-Lemeshow statistics reduced by a factor of 10, with χ2 statistics of 50.87 (p<0.001) overall, 43.82 (p<0.001) for men and 15.19 (p=0.06) for women. In the figures for LLPv3, overall and men only, 65% and 73%, respectively, of the χ2 figures were contributed by the highest decile. This can also be seen in table 3 and online supplemental material 4.

Discussion

We present an updated version of the LLP model (LLPv3) designed to assess risk of lung cancer in the next 5 years for the contemporaneous English population. Its primary intended use is to identify people who might benefit from lung cancer screening. We recalibrated the LLPv2 model used in the UKLS trial of lung cancer screening taking account of changes to the age-specific and sex-specific lung cancer risk profile of England and the difference in risk between the Liverpool area and England as a whole. We prospectively validated the LLP model by using baseline risk factors and follow-up of lung cancer diagnoses in a large cohort (>75 000) of individuals expressing interest in the UKLS study. This demonstrated excellent discrimination based on an AUC measure of 0.81 (95% CI 0.79 to 0.82) for both LLPv2 and the updated LLPv3. The newer model also provided a very good estimation of future risk through comparison of observed and expected lung cancer outcomes.

However, without the calibration to the current England population in terms of age-specific and sex-specific lung cancer risk, there would have been considerable overestimation of risk. This is consistent with the findings of Katki et al,18 who concluded that previous versions of the LLP model overestimated future incidence.

Risk-based lung cancer screening strategies prevent significantly more lung cancer deaths than the current the USPSTF recommendations on lung cancer screening,19 which is currently the only internationally approved government stratification system and is also supported by the USA medical care agency (Medicare).20–23 Two lung cancer risk prediction models have been previously used in lung cancer CT screening clinical trials, the PanCan risk model (precursor to the PLCOM2012 model), in the PanCan cohort CT trial24 and the LLPv2 in the UKLS trial.8 The PanCan model did show good overall discrimination; however, it underestimated the observed risk of lung cancer by 30%. The authors tested both the PanCan and the PLCOm2012 in the PLCO trial ‘ever smokers’ and found small differences in the overall prediction, discrimination and calibration.24 A number of other lung cancer risk models have been developed and although some have been tested in previously recruited National Lung Screening Trial (NLST) subjects and a subgroup of the PLCO population, none have been used as the basis for eligibility in the clinical trial environment.4

The UKLS was a pilot CT screening trial and a pragmatic decision was made to select individuals with a 5% predicted risk of developing lung cancer over a 5-year period, in order to ensure the trial had a large number of lung cancers. This objective was successfully achieved, as 2.2% of subjects with lung cancer were identified in the baseline screen, which was significantly higher than either the NLST1 or NELSON2 trials. It was acknowledged that the risk cut-off might be lower in a large national screening programme.25

The discriminatory power of the LLPv2 was significant with an AUC of 0.81; however, the model was found to overestimate the absolute risk approximately twofold. The LLP risk model was based on a very high-risk population recruited between 1997 and 2005 in the NW of England and the incidence rates for lung cancer included in the model were based on 2007 data. On calibrating the age-specific and gender-specific intercepts to the current English data,12 13 a recalibrated LLP risk model is now provided here (LLPv3 risk model). As the calibration was independent of the data available from the UKLS study, we were able to use that to validate the model. The recalibrated LLPv3 model demonstrated a significantly improved absolute prediction with the expected rates of cancer in the population, as demonstrated by the correlation between the risk score and the observed cancer rates.

While the prediction as measured by the Hosmer-Lemeshow test is not perfect, we note the reservations about this test specified above. We also note that the LLPv3 clearly predicts better for women than for men, and that the only substantial departure of observed from predicted numbers is in one extreme category, the highest decile of risk, and in males only.

We acknowledge there are a number of limitations; however, many are shared by similar risk model studies. First, the study population, although large at 75 958, represents a self-selecting group from 247 354 individuals (30.7%) approached for the UKLS trial. Potentially, there is over-representation of lower risk, better educated, ‘worried well’ individuals (often associated with higher socioeconomic groups).26 On examining the UKLS IMD quintiles, the two least deprived (51.2%) outnumber the two most deprived (31.2%); however, it should be acknowledged that the LLP risk score is predictive across all IMD quintiles and a similar IMD profile would be expected on the introduction of national screening. There is a recognition that more should be done to target the ‘hard to reach’ individuals in lower socioeconomic groups, who have higher rates of lung cancer (as confirmed here: most deprived quintile 15.4 per 1000 in 5 years, least deprived 5.1).

Second, the UKLS study population represents a limited geographical subpopulation of the UK (both in the North West and the East of England), and one of the areas overlaps considerably with the area supplying the original estimation dataset. However, no individual appears in both estimation and validation sets, and the validation set appears to be broadly representative as demonstrated by the fit achieved for risk calibrated to data for England as a whole. This highlights that basing a risk score on a defined region (ie, Liverpool) one may overestimate lung cancer risk if that region has a higher incidence than the rest of the country. When extrapolating risk models from smaller studies in defined regions, we therefore recommend that this is taken into account.

In terms of discrimination, previous retrospective validation exercises on LLPv2 have shown a range of discriminatory capacities, with AUCs ranging from 0.67 to 0.82.6 However, the lower figure pertains to a cohort undergoing surgery, which would not necessarily be representative of lung cancer cases as a whole.

Primary care data are usually not sufficient to supply the risk factors in this model, or in most other lung cancer risk models.4 18 The paradigm for selection of individuals for additional lung cancer surveillance or prevention is to use a crude risk criterion such as ever smoking to select candidates from primary care, then carry out a formal risk assessment as part of a lung health check on those candidates using a tool such as one of the LLP or PLCO models. This has been used in the Lung Screen Uptake Trial,27 the Yorkshire Lung Screening Trial,28 and the Manchester29 and Liverpool30 demonstration projects.

We note here that the LLPv2 and LLPv3 are essentially aimed at risk of lung cancer in the next 5 years. They are not diagnostic tests for lung cancer. As such, the positive likelihood ratios are low in comparison with some diagnostic tests. The clinical utility of our preferred risk tool LLPv3 will depend on what intervention might be triggered by a given risk level. The most obvious intervention would be surveillance with low-dose CT, but this need not be the only response to lung cancer risk. In this paper, therefore, we have concentrated on accuracy of the prediction, and have not introduced further dimensions such as clinical or public preference. Detailed clinical implications of different risk thresholds using LLPv3 will be the subject of a future paper. This will likely include Decision Curve Analysis and possibly other tools such as Mean Risk Stratification and Incremental Net Benefit.31–33

The LLPv2 and PLCOm2012 have been used and initial data published in three UK lung cancer early detection/CT screening implementation projects, the Liverpool Healthy Lung Programme (LLPv2),30 the Lung Screen Uptake Trial,27 and the West London Lung Cancer Screening pilot (WLLCS).34 All three of these projects successfully selecting high-risk patients and identified early stage lung cancers. The Yorkshire Lung Screening trial28 has also used the LLPv2 and PLCOm2012 risk models; however, the results have not as yet been published.

The WLLCS pilot has published its baseline data on both the LLPv2 and PLCOm2012 risk models. The authors reported that 1159 participants were eligible for a CT scan, of which 451/1159 (38.9 %) had a LLPv2 ≥2.0% ; only 71/1159 (6.1 %) had a PLCO M2012 ≥1.5 %, lung cancer was detected in 29/1145 (2.5 %) participants scanned 5/29 participants with lung cancer did not meet a PLCO M2012 threshold of ≥1.51%; however, all had a LLPv2 ≥2.0%.

The Manchester Health Lung Check (MLHC) selected patients for their study with the PLCOm2012 risk model. They have recently analysed the performance of both the LLPv2 and PLCOm2012 models in their data set,35 using ≥2.5% LLPv2 risk model and >1.51% risk calculated from the PLCOm2012 model. In the MLHC 1430 dataset, the authors calculated that 93.5% (58 of the 62 identified) of the lung cancer cases would have been identified by the LLPv2 ≥2.5% risk; with a further 272 participants who would have been eligible with LLPv2 but ineligible with PLCOm2012, however, the outcome of these individuals was unknown.

An international comprehensive analysis of seven lung cancer risk models, (including LLPv1) has been undertaken using data from the PLCO and NLST data sets.4 However, a number of the LLPv1 parameters were not available; asbestos exposure, nor history of pneumonia, nor the age of diagnosis of the first degree relatives’ family history of lung cancer in the PLCO and NLST datasets. Ten Haaf and coauthors reported that all the seven risk models outperformed the NLST trial eligibility criteria over a wide range of risk thresholds in decision curve analysis, with a higher sensitivity for all models and a slightly higher specificity for a number of the models. The PLCOm2012 outperformed all of the models in their analysis and both the LLPv1 and the authors’ simplified LLP risk model did not perform as well, overestimating the risk. However, the validation exercise used the LLP v1 model and three of the risk parameters were not available in their test sets.

Given that a likely application for lung cancer risk models such as LLPv3 is determination of eligibility for low-dose CT screening for lung cancer, it is worth remarking that there is a trade-off between potential increases in cost effectiveness with higher risk thresholds and likely comorbidities, which may either impair eligibility for potentially curative surgery in those detected with lung cancer, or otherwise shorten expected life and thus diminish the potential benefits of screening. See Rivera et al, for a thorough exegesis of this issue.36

This current analysis provides evidence that the LLPv2 and LLPv3 have the same discriminatory power, but the LLPv3 is now recommended for future UK lung cancer CT screening programmes, as it better estimates the absolute lung cancer risk. The NHS England ‘Targeted Screening for Lung Cancer with Low Radiation Dose Computed Tomography’ protocol37 has included the use of the PLCOm2012 (at a 1.5% risk cut-off) and the LLPv2 (at 2.5% risk cut-off). It is now recommended that NHS England use the LLPv3, which is calibrated to sex-specific and age-specific incidence for the whole of England in 2017; a LLPv2 cut-off of 2.5% equates to a risk cut-off of 1.33% over 5 years for LLPv3, so it is in line with the PLCOm2012 of >1.5% over 6 years. To ensure a substantially enhanced risk group, it might be prudent to use a 2.5% threshold using LLPv3 (see table 2).

In conclusion, discrimination of LLPv2 and LLPv3 was excellent. LLPv3, which was calibrated to contemporary, English incidence, achieved more accurate prediction of absolute incidence, and would be more effective in selecting a high-risk group for surveillance in England today.

Acknowledgments

The authors wish to thank all the individuals who participated in the Pilot UKLS trial. Furthermore, they thank all the individuals involved in the UKLS data management, especially Beverley Green and Nial Hodge. The authors wish to thank Tariq Malik and his colleagues at Public Health England (PHE) for their assistance in providing an update of the UKLS dataset. The role of Public Health England. This project involves the data derived from patient-level information collected by the NHS, as part of the care and support of cancer patients. The data are collated, maintained and quality assured by the National Cancer Registration and Analysis Service, which is part of Public Health England Access to the data was facilitated by the PHE Office for Data Release.

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • JKF and DV are joint first authors.

  • Contributors JF: trial conception and design, data interpretation and manuscript review. SWD: trial design, statistical analysis, data interpretation, manuscript review. DV: risk modelling, statistical analysis, data interpretation, manuscript review. MPAD and RG: data interpretation, manuscript review.

  • Funding Initial development of the LLP was funded by the Roy Castle Lung Cancer Foundation. The UKLS is funded by the Health Technology Assessment programme of the National Institute for Health Research. Michael Davies is a Roy Castle Lung Cancer Foundation Senior Research Fellow. Daniel Vulkan’s and Stephen Duffy’s contribution to this research was funded by the National Institute for Health Research (NIHR) Policy Research Programme, conducted through the Policy Research Unit in Cancer Awareness, Screening and Early Diagnosis, PR-PRU-1217-21601.

  • Disclaimer The funding source had no role in the design of our analyses, its interpretation, or the decision to submit the manuscript for publication. The views expressed are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care.

  • Competing interests JF: Speaker’s Bureau: AstraZenecaAdvisory Board: Epigenomics; NUCLEIX Ltd. AstraZeneca, iDNA. Grant Support: Janssen Research & Development, LLC. No competing interests from all other coauthors.

  • Patient consent for publication Not required.

  • Ethics approval Ethical approval for the study was given by Liverpool Central Research Ethics Committee in December 2010 (reference number 10/H1005/74).

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data availability statement Data are available upon reasonable request. We are committed, in principle, to data sharing with fellow researchers, and are currently drawing upoperating procedures for this. We anticipate that the data will be stored securely in Liverpool and reasonable requests for data for further research will be accommodated, subject to compliance with regulations, maintaining the integrity of information governance, and ensuring no loss of confidentiality on the part of the participants of the study. Requests that require considerable data manipulation and management on our part will need to be resourced by those requesting data.