Defining community acquired pneumonia severity on presentation to hospital: an international derivation and validation study
- W S Lim1,
- M M van der Eerden2,
- R Laing3,
- W G Boersma2,
- N Karalus4,
- G I Town3,
- S A Lewis5,
- J T Macfarlane1
- 1Respiratory Infection Research Group, Respiratory Medicine, Nottingham City Hospital, Nottingham NG5 1PB, UK
- 2Medical Centre Alkmaar, PO Box 501, 1800 AM Alkmaar, The Netherlands
- 3Christchurch School of Medicine and Health Sciences, University of Otago, Christchurch Hospital, Christchurch, New Zealand
- 4Waikato Respiratory Department, Health Waikato, Hamilton, New Zealand
- 5Division of Respiratory Medicine, Nottingham City Hospital, Nottingham NG5 1PB, UK
- Correspondence to:
Dr J T Macfarlane, Respiratory Infection Research Group, Respiratory Medicine, Nottingham City Hospital, Nottingham NG5 1PB, UK;
- Accepted 13 December 2002
- Revised 2 December 2002
Background: In the assessment of severity in community acquired pneumonia (CAP), the modified British Thoracic Society (mBTS) rule identifies patients with severe pneumonia but not patients who might be suitable for home management. A multicentre study was conducted to derive and validate a practical severity assessment model for stratifying adults hospitalised with CAP into different management groups.
Methods: Data from three prospective studies of CAP conducted in the UK, New Zealand, and the Netherlands were combined. A derivation cohort comprising 80% of the data was used to develop the model. Prognostic variables were identified using multiple logistic regression with 30 day mortality as the outcome measure. The final model was tested against the validation cohort.
Results: 1068 patients were studied (mean age 64 years, 51.5% male, 30 day mortality 9%). Age ⩾65 years (OR 3.5, 95% CI 1.6 to 8.0) and albumin <30 g/dl (OR 4.7, 95% CI 2.5 to 8.7) were independently associated with mortality over and above the mBTS rule (OR 5.2, 95% CI 2.7 to 10). A six point score, one point for each of Confusion, Urea >7 mmol/l, Respiratory rate ⩾30/min, low systolic(<90 mm Hg) or diastolic (⩽60 mm Hg) Blood pressure), age ⩾65 years (CURB-65 score) based on information available at initial hospital assessment, enabled patients to be stratified according to increasing risk of mortality: score 0, 0.7%; score 1, 3.2%; score 2, 3%; score 3, 17%; score 4, 41.5% and score 5, 57%. The validation cohort confirmed a similar pattern.
Conclusions: A simple six point score based on confusion, urea, respiratory rate, blood pressure, and age can be used to stratify patients with CAP into different management groups.
In the assessment and management of community acquired pneumonia (CAP), disease severity assessment is crucial, guiding therapeutic options such as the need for hospital or intensive care (ICU) admission, suitability for discharge home, the extent of investigation, and choice and route of antimicrobial agent.1,2
The Pneumonia Severity Index (PSI) developed by Fine et al in the USA provides a means of stratifying groups of patients according to their risk of mortality and features in recently published North American guidelines.1–3 Unfortunately it is complicated to use, requiring computation of a score based on 20 variables, and hence may not be practical for routine application in busy hospital emergency departments or the primary care setting. In addition, it is best validated for assessing patients with a low mortality risk who may be suitable for home management rather than those with severe CAP at the time of hospital admission.
An alternative severity assessment tool proposed by the British Thoracic Society (BTS) and modified by Neill et al (mBTS)4 which relies on four easily measurable clinical features was developed mainly as a means of identifying patients with severe CAP at high risk of mortality. The presence of two or more of the following features—mental confusion, respiratory rate ⩾30/min, diastolic blood pressure ⩽60 mm Hg, and blood urea >7 mmol/l—predicted mortality with an overall sensitivity and specificity of about 80%.5–7 However, this tool also has limitations; by stratifying patients into only two groups (severe or non-severe) it does not identify patients at low risk of mortality who might be suitable for early hospital discharge or home management. The aim of this study was to develop an assessment tool to enable stratification of patients presenting to hospital with CAP into mortality risk groups that might be suitable for different management options.
Study institutions and subjects
Data from three large prospective studies of CAP were retrospectively combined into a single dataset.6,8 Adults admitted as medical emergencies with CAP to (1) Nottingham City Hospital (UK) over an 18 month period from 4 October 1998, (2) Christchurch and Waikato hospitals (New Zealand) over a 12 month period from 26 July 1999, and (3) Medical Centre Alkmaar (the Netherlands) over a 2 year period from December 1998 were eligible. All four participating study hospitals are teaching centres that admit unselected adults for acute medical care. All subjects gave written informed consent and ethical approval was obtained from local hospital ethics committees.
In all three studies CAP was defined as an acute respiratory tract illness associated with radiographic shadowing on an admission chest radiograph consistent with infection which was neither pre-existing nor of any other known cause. The following exclusion criteria applied: (1) pneumonia was (a) not the primary cause for hospital admission, (b) an expected terminal event, or (c) distal to bronchial obstruction; (2) patients with tuberculosis, bronchiectasis, solid organ and haematological malignancies or human immunodeficiency virus (HIV) infection; (3) patients who had been in hospital within the previous 14 days, were immunocompromised, or had previously been entered in the study; and (4) nursing home residents. In New Zealand and the Netherlands co-morbid illness was defined along the lines of Fine et al.3 Co-morbid illness was defined similarly in Nottingham as the presence of any of the following conditions for which the patient was under active medical supervision or was receiving treatment at the time of hospital admission: chronic lung disease, cardiac disease (ischaemic heart disease, cardiac failure, hypertension, atrial fibrillation), cerebrovascular disease (including previous transient ischaemic attacks), cognitive impairment, diabetes mellitus, chronic liver disease, chronic renal disease, and inflammatory rheumatological disorders (excluding osteoarthritis). Mental confusion was defined as a mental test score of 8 or less (UK, NZ)9 or disorientation in person, place or time (the Netherlands).
Patients were seen within 24 hours of admission by a study investigator and then regularly in hospital and after discharge until their clinical and radiological features had stabilised. The main outcome measure was 30 day mortality.
Data were analysed using SPSS Version 9.0 for Windows, Chicago, USA. The combined dataset was randomly divided into two to create a derivation cohort comprising 80% of the data and a validation cohort with the remaining 20%. The derivation cohort was used to develop the severity assessment model. The size of the derivation cohort required was calculated to enable multiple logistic regression identifying seven independent prognostic features with no fewer than 10 outcome events per independent variable, thus avoiding the potential problem of overfitting of data.10 Based on an estimate of 80% sensitivity and specificity, this sample size would also allow the sensitivity and specificity of the mBTS rule to be determined with confidence intervals of ±3% in the derivation cohort and ±6% in the validation cohort.
Continuously distributed variables were re-categorised into binary factors using threshold values implicit in one or more of the existing severity prediction rules. Based on the mBTS assessment tool, a “CURB” severity score (Confusion, Urea >7 mmol/l, Respiratory rate ⩾30/min, and low Blood pressure (diastolic blood pressure (DBP) ⩽60 mm Hg or systolic blood pressure (SBP) <90 mm Hg)) was calculated and one point given for each feature present (range 0–4 points).6 The association between the CURB score and 30 day mortality was examined and formed the basis for the development of a new severity assessment tool for stratifying patients into distinct risk groups.
The univariate association between 30 day mortality and each potential predictor variable, including the CURB score and each of the components of the CURB score, was analysed using a χ2 test. To avoid spurious associations arising from multiple statistical tests, 12 predictor variables selected from the current literature as most consistently important in predicting prognosis in CAP were examined.3–5,7,11–19 This established that each of the components of the CURB score is a predictor of mortality. Independent predictors over and above the CURB score were identified by entering the CURB score as a single variable plus each of the other potential predictors into a backward logistic regression. The CURB score was categorised as <2 or ⩾2 in this model, a score of ⩾2 being taken to identify patients with severe CAP. Following backward regression, each of the excluded variables was entered back into the suggested model to ensure they had no effect on the final model.
A clinical prediction rule based on the CURB score and the newly identified independent prognostic features was subsequently derived. In order to ensure that the final prediction rule remained simple to use and practical, prognostic features not usually available at the time of initial assessment on hospital admission were excluded from the model. All results were tested against the validation cohort. The results are expressed as odds ratios (OR) and p values with 95% confidence intervals (CI) and using receiver operating curves (ROC) to plot sensitivity against specificity.
One thousand and sixty eight patients, 550 (51.5 %) male, were studied. The mean (median, range) age of the patients was 64.1 (69, 17–100) years and 391 (37%) were aged 75 years and above. Baseline characteristics of patients from the three study centres were very similar (table 1).
All patients were treated with empirical antimicrobial agents according to local hospital guidelines. This usually comprised a β-lactamase stable β-lactam in combination with a macrolide. Dutch patients were randomised within 2 hours of admission to receive either empirical treatment according to the ATS guidelines or pathogen directed treatment based on the results of rapid microbiological tests (Gram stain or antigen detection).20 The intravenous route was used in 920 patients (86%); 55 (5%) were admitted to the ICU and 44 (4%) required mechanical ventilation. The 30 day mortality was 9%.
Associations with 30 day mortality
There were 821 patients in the derivation cohort. Of the 29 patients with an SBP <90 mm Hg, 28 (97%) had a DBP of 60 mm Hg or less. These two features were therefore combined as the presence of one or both and analysed as a single variable (low BP). Of the 11 features examined for an association with mortality on univariate analysis, all except multilobar shadowing on chest radiography and a pulse rate of ⩾125/min were found to be significant (table 2). Increasing age cohorts were associated with increasing mortality.
Serum albumin <30 g/dl and age ⩾65 years were both independently associated with 30 day mortality after adjusting for CURB score (table 3). Although an association between temperature <37°C and mortality was found on multivariate analysis, this only just achieved statistical significance (OR 1.9, 95% CI 1.01 to 3.6, p<0.047). Entering each of the other variables in turn back into the resulting model did not alter the results.
Development of practical clinical prediction rule
The sensitivity and specificity of the mBTS rule—that is, a CURB score of 2 or more—in the derivation cohort was 75% (95% CI 72 to 78) and 69% (95% CI 66 to 72), respectively. Corresponding values in the validation cohort were 74% (95% CI 68 to 80) and 73% (95% CI 67 to 79).
Based on the results of the multivariate analysis, age ⩾65 years was added as another adverse prognostic feature to the CURB score resulting in a six point score (CURB-65, range 0–5) which allowed patients to be stratified according to increasing risk of mortality ranging from 0.7% (score 0) to 40% (score 4; table 4) The numbers with a score of 5 (highest) were small; of the seven patients, only one (14%) died. However, three (43%) others required mechanical ventilation in an intensive care setting and survived. A further model based only on clinical features available from a clinical assessment without laboratory results (confusion, respiratory rate, blood pressure, and age; CRB-65 score) was also tested and found to correlate well with risk of mortality and need for mechanical ventilation. These models performed equally well when applied to the validation cohort (table 4). The sensitivities, specificities, and predictive values of the different rules for predicting 30 day mortality are given in table 5 and fig 1.
This is the largest cohort study of severity assessment in CAP conducted outside the USA. The strength of this study lies in the large sample size, the prospective and systematic collection of relevant clinical information, the focusing of statistical tests of association on selected clinical features based on the current evidence base (thus avoiding spurious associations), and the inclusion of unselected adults of all ages. Furthermore, the study cohort comprised patients admitted as unselected emergencies into public (that is, government funded) hospitals in three countries with similar primary and secondary healthcare systems. This is supported by the observation that direct GP referrals comprised 62% and 70% of the admissions in the NZ and UK cohorts, respectively (the source of referral was not recorded in the Netherlands but it is estimated that ~60% of the admissions were referred by a GP). Also, emergency medical admissions to private healthcare facilities rarely happen in these countries. This should allow the results to be generalised to other countries which share such healthcare systems, which differ from those in the USA.
Although the profiles of the three cohorts are very similar (table 1), there may be other potential differences between the cohorts which may not be obvious. This represents a limitation of the study.
The importance of mental confusion, low blood pressure, raised respiratory rate, and raised urea as “core” adverse prognostic features in patients admitted to hospital with CAP is underlined. This study confirmed our previous finding in the UK subset of patients that a simple severity scoring system based on these four adverse prognostic features (CURB score) correlates well with mortality.6
In the development of a clinical prediction rule suitable for use in busy casualty departments or admission units, we sought to include clinical features of prognostic importance which were easily measurable at the time of initial assessment. Thus, although both age ⩾65 years and a low serum albumin were identified as independent prognostic variables in addition to the CURB score, the latter was not included in the final model as it is not a routinely available test in many hospitals, particularly out of hours. In contrast, age is easy to determine and has been consistently found to be very strongly associated with prognosis in most studies of severity assessment in adults with CAP.21 Indeed, the determination of the score in the PSI developed by Fine et al is heavily influenced by age.
The overall sensitivity and specificity of the CURB-65 score is similar to the mBTS severity assessment tool as seen in the receiver operating curves (ROC) in fig 1. The advantage of the CURB-65 score is that it provides a bigger range of sensitivities for specificity, thus enabling patients to be stratified as possibly suitable for three different management options. Patients with scores 0 and 1 are at low risk of mortality (<2%) and might be suitable for management as hospital outpatients or by their general practitioner (fig 2). Patients with a score of 2 are at intermediate risk of mortality (9%) and should be considered for hospital supervised treatment. Patients with a score of >2 are at high risk of mortality (>19%) and correspond to those patients who meet the mBTS criteria for severe CAP. They should be treated in hospital. Initial care in an intensive care or high dependency unit may be appropriate in those with the highest scores.
These three groupings are similar in terms of risk of mortality to the treatment groups identified by the PSI when applied to this study cohort (risk class I–III, mortality <3.2%; risk class IV, mortality 12%; and risk class V, mortality 32%). However, as discussed in the Canadian and American Thoracic Society (ATS) guidelines, the PSI has practical limitations related to its complexity which may restrict its widespread adoption. Indeed, the ATS guidelines do not offer any algorithm for the clinical assessment of disease severity.22 The advantage of the CURB-65 severity assessment tool is that it is simple to use, relying on only five (rather than 20) easily measurable features, with all but urea being clinical observations which could be made by healthcare workers in primary and secondary care. It may also be useful in the community for general practitioners when deciding on the need for hospital referral for patients with CAP, even if the blood urea is unavailable (fig 3). In all instances overall clinical judgement is crucial. Social factors and patients’ wishes will also influence where to manage a patient. Whether the proposed CURB-65 score can be applied in conjunction with different management strategies to improve clinical outcomes and health service utilisation requires further study.
This study offers a simple clinical prediction rule based on the five clinical features of age, confusion, urea, respiratory rate, and blood pressure (CURB-65 score) as a practical means of stratifying patients with CAP into low, intermediate, and high mortality risk groups. The size of the dataset and the consistency of the findings in derivation and validation cohorts suggest that this rule could be applied with some confidence in practice while undergoing further prospective evaluations.
The authors acknowledge the following for their help with the study: Associate Professor Stephen Chambers, infectious diseases physician; Dr Chris Frampton, biostatistician; Dr Rodger Jackson, radiologist; Dr David Murdoch, clinical microbiologist; Dr Lance Jennings, virologist; and Dr Jenny McWha, research nurse (all in New Zealand); all the physicians at Nottingham City Hospital who allowed the study of their patients; Dr Don Rose, radiologist; and Melanie Caine, research nurse.
Funding: Nottingham: local trust fund and educational grant from Hoechst Marion Roussel; New Zealand: Health Research Council of New Zealand; The Netherlands: partial funding by grants from Astra Zeneca BV Netherlands and Pfizer BV Netherlands.