Geo-social gradients in predicted COVID-19 prevalence in Great Britain: results from 1 960 242 users of the COVID-19 Symptoms Study app

Ruth C E Bowyer; Thomas Varsavsky; Ellen J Thompson; Carole H Sudre; Benjamin A K Murray; Maxim B Freidin; Darioush Yarand; Sajaysurya Ganesh; Joan Capdevila; Elco Bakker; M Jorge Cardoso; Richard Davies; Jonathan Wolf; Tim D Spector; Sebastien Ourselin; Claire J Steves; Cristina Menni

doi:10.1136/thoraxjnl-2020-215119

Brief communication

Ruth C E Bowyer1,
Thomas Varsavsky2,
Ellen J Thompson1,
Carole H Sudre2,3,
Benjamin A K Murray2,
Maxim B Freidin1,
Darioush Yarand1,
Sajaysurya Ganesh4,
Joan Capdevila4,
Elco Bakker4,
M Jorge Cardoso2,
Richard Davies4 (http://orcid.org/0000-0003-2050-3994),
Jonathan Wolf4,
Tim D Spector1,
Sebastien Ourselin2,
Claire J Steves1,
Cristina Menni1 (http://orcid.org/0000-0001-9790-0571)

¹ Twin Research, King's College London, London, UK
² School of Biomedical Engineering & Imaging Sciences, King's College London, London, UK
³ MRC Unit for Lifelong Health and Ageing, University College London, London, UK
⁴ Zoe Global Limited, London, UK

Correspondence to Dr Cristina Menni, Twin Research, King's College London, London, UK; cristina.menni{at}kcl.ac.uk

Abstract

Understanding the geographical distribution of COVID-19 through the general population is key to the provision of adequate healthcare services. Using self-reported data from 1 960 242 unique users in Great Britain (GB) of the COVID-19 Symptom Study app, we estimated that, concurrent with the GB government sanctioning lockdown, COVID-19 was distributed across GB, with evidence of ‘urban hotspots’. We found a geo-social gradient associated with predicted disease prevalence, suggesting that urban areas and areas of higher deprivation are most affected. Our results demonstrate the use of self-reported symptom data to focus attention on geographical areas with identified risk factors.

clinical epidemiology
infection control

This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.

The COVID-19 epidemic has led to large-scale closures and lockdown measures worldwide with the British government sanctioning lockdown from 23 March 2020 (https://www.gov.uk/government/speeches/pm-address-to-the-nation-on-coronavirus-23-march-2020).

Early in the pandemic, case distribution was not evenly spread across countries, with dense urban centres being the most affected.1 Individuals in deprived areas have lower life expectancy,2 are more likely to have multiple underlying comorbidities, have a higher level of influenza-associated hospitalisation3 and therefore could be more susceptible to COVID-19.2

Based on the known socioeconomic health gradient, we hypothesised that individuals in deprived areas were at greater risk of contracting COVID-19. Understanding the geographical distribution of the virus in a socioeconomic context is key to assist adequate healthcare resourcing, particularly intensive care beds.4

Here we investigated the geographical distribution of COVID-19 in Great Britain (GB) and its association with area-level deprivation using self-reported data from almost 2 million users of the COVID-19 Symptom Study app.5

We studied 1 960 242 unique GB app users (20–69 years old) reporting on COVID-19 symptoms, hospitalisation, reverse-transcription PCR (RT-PCR) test outcomes, demographic information and pre-existing medical conditions (online supplemental methods) over 23 days (29 March–19 April) of major social distancing measures (‘lockdown’). We computed a proxy for having contracted COVID-19 based on reported symptoms6 (positive predictive value=0.69 (0.66 to 0.71); online supplemental methods). We then calculated predicted prevalence as the proportion of app users predicted to have COVID-19 within each area (online supplemental figure S1).
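The prevalence calculation described above can be illustrated with a minimal sketch. The function name, the record layout and the toy data are assumptions made for illustration only; they are not taken from the study's code.

```python
from collections import defaultdict


def predicted_prevalence(records):
    """Return {area: share of users predicted positive}.

    records: iterable of (area_code, predicted_positive) pairs, one per app user.
    """
    users = defaultdict(int)
    positives = defaultdict(int)
    for area, predicted_positive in records:
        users[area] += 1
        if predicted_positive:
            positives[area] += 1
    return {area: positives[area] / users[area] for area in users}


# Hypothetical toy data: one (area code, symptom-model prediction) pair per user
records = [
    ("A", True), ("A", False), ("A", False), ("A", False),
    ("B", True), ("B", True), ("B", False), ("B", False),
]
prevalence = predicted_prevalence(records)
```

The denominator here is the number of app users in the area rather than the resident population, which matches the definition given in the text.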

Following aggregation of variables to local authority district level (LAD/geographic unit representing ~17 000 individuals), we tested the geographical distribution of predicted prevalence at eight different time points spanning 23 days. We used Local Moran’s I tests, which assess for non-random spatial distribution and clustering of a feature and can be used to identify disease hotspots and cold spots relative to the mean GB predicted prevalence7 (online supplemental methods).
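As a rough illustration of the statistic, the sketch below computes Local Moran's I for each area using row-standardised binary contiguity weights. The weighting scheme, the neighbour lists and the toy values are assumptions; the weights actually used and the significance testing are described in the online supplemental methods and are not reproduced here.

```python
def local_morans_i(values, neighbours):
    """Local Moran's I per area with row-standardised binary weights.

    values: area-level predicted prevalences
    neighbours: neighbours[i] lists the indices of areas adjacent to area i
    """
    n = len(values)
    mean = sum(values) / n
    z = [v - mean for v in values]        # deviations from the overall mean
    m2 = sum(d * d for d in z) / n        # second moment of the deviations
    scores = []
    for i in range(n):
        if not neighbours[i]:
            scores.append(0.0)            # an isolated area has no spatial lag
            continue
        lag = sum(z[j] for j in neighbours[i]) / len(neighbours[i])
        scores.append(z[i] * lag / m2)
    return scores


# Hypothetical chain of five adjacent areas
values = [0.08, 0.07, 0.03, 0.01, 0.01]
neighbours = [[1], [0, 2], [1, 3], [2, 4], [3]]
scores = local_morans_i(values, neighbours)
```

A positive score for an area above the mean indicates a high value surrounded by high values (a candidate hotspot), whereas a positive score for an area below the mean indicates a candidate cold spot. Whether a cluster is statistically significant is a separate step that this sketch omits.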

Next, using data from the eight time points, we fitted multivariable mixed-effects models to investigate the association between predicted area-level prevalence (at middle super output area (MSOA) level) and deprivation (as captured by the Index of Multiple Deprivation), adjusting for geo-social mediators and confounders (air pollution, general practitioners per MSOA, household density and urbanicity), area-level aggregates of obesity and comorbidities, area-level adjusted mean age and sex, and spatial autocorrelation8 (online supplemental methods).
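One way to read this specification is as a random-intercept model over areas and time points. The form below is an illustrative assumption, not the authors' stated model: the random-effects structure, the symbols and the omission of the spatial autocorrelation term are all simplifications, and the fitted specification is given in the online supplemental methods.

```latex
% Illustrative sketch only: y_{it} is predicted prevalence in MSOA i at time point t,
% IMD_i is the area's deprivation measure, x_i collects the area-level covariates,
% and u_i is an assumed area-level random intercept.
y_{it} = \beta_0 + \beta_1\,\mathrm{IMD}_i + \boldsymbol{\gamma}^{\top}\mathbf{x}_i + u_i + \varepsilon_{it},
\qquad u_i \sim \mathcal{N}(0, \sigma_u^2), \quad \varepsilon_{it} \sim \mathcal{N}(0, \sigma_\varepsilon^2)
```

Under this reading, the reported beta for deprivation corresponds to the coefficient on the deprivation term.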

Demographic characteristics of the study population at the eight time points are presented in table 1 and online supplemental table S1. The number of predicted COVID-19 positive individuals ranged between 15 991 and 79 378.

Table 1

Demographic characteristics of the study population at eight time points

Local Moran’s I showed that, after adjustment for multiple testing, predicted COVID-19 prevalence clustered in urban areas across GB when considered as a proportion of the population per LAD7 (figure 1 and online supplemental figure S2). Predicted prevalence decreased over time, consistent with ‘lockdown’ (figure 1 and online supplemental figure S2; pairwise Wilcoxon rank-sum tests, prevalence: p<0.001 for all pairs of time points except T2:T3 and T1:T4), but some hotspots remained.

Figure 1

Geographical distribution of predicted COVID-19 prevalence across four time points. Prevalence is presented as proportional to the responders per local authority district (LAD). Analyses are adjusted for multiple testing using Benjamini-Hochberg false discovery rate correction (p<0.05). Inset highlights London, where LAD areas are smaller. Hot and cold spots are defined relative to their neighbours and the mean GB predicted prevalence. Red/blue coloured perimeter lines around each LAD denote hotspot/coldspot.
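The Benjamini-Hochberg step-up procedure named in the caption can be sketched as follows. This is a generic implementation of the standard procedure with hypothetical p values; it is not the authors' code, and the function name is an assumption.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return booleans marking which hypotheses are rejected at FDR level alpha."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k whose sorted p value satisfies p_(k) <= (k / m) * alpha
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank
    # Reject every hypothesis ranked at or below that cutoff
    rejected = [False] * m
    for idx in order[:cutoff_rank]:
        rejected[idx] = True
    return rejected


# Hypothetical p values, for example one per LAD
p_values = [0.001, 0.008, 0.039, 0.041, 0.30]
flags = benjamini_hochberg(p_values)
```

In this toy example only the two smallest p values survive correction, although four of the five are below 0.05 unadjusted.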

In the MSOA-level analysis, area-level deprivation was significantly associated with predicted area-level prevalence in all models (M1–M6, see online supplemental table S2), including the full model (M6) adjusting for all geo-social covariates and comorbidities (M6: beta (95% CI)=−0.15 (−0.17 to −0.13), p<0.001). This suggests that people in deprived areas were at higher risk.

Predicted COVID-19 prevalence was higher in urban areas than in rural areas and higher in more deprived areas than in less deprived areas. This could reflect the likelihood of individuals in more deprived areas working or living with people whose vocations mean they are unable to work from home and are thus more likely to be exposed to circulating COVID-19. Accumulation of socioenvironmental exposures across the life course is known to contribute to a greater health deficit and disease burden2; our results suggest that COVID-19 is no exception.

Moreover, our study illustrates how app data could be used to successfully monitor COVID-19 over time and identify hotspots as the viral pandemic progresses and social distancing measures are implemented or eased. Using this method, we detected a geo-social gradient associated with prevalence in the context of COVID-19, suggesting the focus of resources should be on deprived urban areas.

Our study has some limitations and assumptions. First, we used self-reported symptom data, which can lead to bias. For example, should users in deprived areas report more symptoms due to a facet of the socioeconomic environment (eg, higher air pollution), this could lead to an incorrectly higher predicted prevalence in deprived areas. Second, app users are a self-selected group, not representative of the general population. Our approach of adjusting for age and sex differences at MSOA level is unlikely to sufficiently overcome selection and collider bias.9 Third, our predicted COVID-19 prevalence is not from confirmed tests via RT-PCR, but rather based on self-reported symptoms. Additionally, we assume that people who have symptoms or have been exposed to COVID-19 are equally likely to use the app as those who do not. We performed a sensitivity analysis by rerunning the pooled analysis on individuals who were self-reportedly healthy at sign up and found the observed associations remained (online supplemental table S3), suggesting selection bias associated with being unhealthy at sign up is not influencing the observed associations of COVID-19 and deprivation. We also assume that people report symptoms in the same way and that their drop-out patterns do not differ by space, time and symptom reports. Finally, we aggregated data at MSOA level, which could lead to ecological bias. We also cannot conclude that deprivation increased COVID-19 prevalence, as there could be unmeasured confounders or other factors.

Future work should check our assumptions and seek to integrate these data with data on area-level morbidity, extended pollution data, ethnicity and disease severity. Indeed, higher mortality has been observed among minority ethnic groups,10 and disentangling the environmental and biological factors contributing to greater disease burden in both deprived areas and among ethnic minorities is an essential focus of future work to ensure resources and intervention are better assigned.

Ethics statements

Ethics approval

The app has been approved by the King’s College London Ethics Committee (REMAS ID 18210, review reference LRS-19/20-18210), and all users provided consent for non-commercial use. An informal consultation with TwinsUK members over email and social media before the app was launched found that they were overwhelmingly supportive of the project.

Acknowledgments

We express our sincere thanks to all the participants of the COVID Symptom Study app. We would like to thank the staff of Zoe Global Limited and the Department of Twin Research for their tireless work in contributing to the running of the study and data collection. Finally, we would like to thank Professor Kate Tilling of the University of Bristol for her invaluable insight and help in refining the manuscript.

References

1. Stier A, Berman M, Bettencourt L. COVID-19 attack rate increases with city size, 2020. Available: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3564464
2. Marmot M. Health equity in England: the Marmot review 10 years on. BMJ 2020;368:m693. doi:10.1136/bmj.m693
3. Hungerford D, Ibarz-Pavon A, Cleary P, et al. Influenza-associated hospitalisation, vaccine uptake and socioeconomic deprivation in an English city region: an ecological study. BMJ Open 2018;8:e023275. doi:10.1136/bmjopen-2018-023275
4. Blumenshine P, Reingold A, Egerter S, et al. Pandemic influenza planning in the United States from a health disparities perspective. Emerg Infect Dis 2008;14:709–15. doi:10.3201/eid1405.071301
5. Drew DA, Nguyen LH, Steves CJ, et al. Rapid implementation of mobile technology for real-time epidemiology of COVID-19. Science 2020;368:1362–7. doi:10.1126/science.abc0473
6. Menni C, Valdes AM, Freidin MB, et al. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat Med 2020;26:1037–40. doi:10.1038/s41591-020-0916-2 pmid:http://www.ncbi.nlm.nih.gov/pubmed/32393804
7. Zhang C, Luo L, Xu W, et al. Use of local Moran’s I and GIS to identify pollution hotspots of Pb in urban soils of Galway, Ireland. Sci Total Environ 2008;398:212–21. doi:10.1016/j.scitotenv.2008.03.011
8. Anselin L, Griffith DA. Do spatial effects really matter in regression analysis? Papers - Regional Science Association 1988.
9. Griffith GJ, Morris TT, Tudball MJ, et al. Collider bias undermines our understanding of COVID-19 disease risk and severity. Nat Commun 2020;11:5749. doi:10.1038/s41467-020-19478-2 pmid:http://www.ncbi.nlm.nih.gov/pubmed/33184277
10. Khunti K, Singh AK, Pareek M, et al. Is ethnicity linked to incidence or outcomes of covid-19? BMJ 2020;369:m1548.

Supplementary materials

Supplementary Data

This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Data supplement 1
Data supplement 2
Data supplement 3
Data supplement 4

Footnotes

Twitter @mjorgecardoso
RCEB and TV contributed equally.
CJS and CM contributed equally.
Contributors Conceived and designed the experiments: CJS, TDS, SO and CM; analysed the data: RCB and TV; contributed reagents/materials/analysis tools: MF, CHS, BM, DY, SG, JC, ET, EB, MJC, RD and JW; wrote the manuscript: RCB, TV and CM; revised the manuscript: all.
Funding Zoe provided in-kind support for all aspects of building, running and supporting the app and service to all users worldwide. The Department of Twin Research is funded by the Wellcome Trust, Medical Research Council, European Union, Chronic Disease Research Foundation (CDRF), Zoe Global Ltd and the National Institute for Health Research (NIHR)-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust in partnership with King’s College London. CM is funded by the Chronic Disease Research Foundation and by the MRC Aim-Hy project grant. CHS is supported by an Alzheimer’s Society Junior Fellowship (AS-JF-17-011); SO and MJC are funded by the Wellcome/EPSRC Centre for Medical Engineering (WT203148/Z/16/Z) and the Wellcome Flagship Programme (WT213038/Z/18/Z).
Map disclaimer The depiction of boundaries on this map does not imply the expression of any opinion whatsoever on the part of BMJ (or any member of its group) concerning the legal status of any country, territory, jurisdiction or area or of its authorities. This map is provided without any warranty of any kind, either express or implied.
Competing interests TDS is a consultant to Zoe Global Ltd ('Zoe'). SG, JC, EB, RD and JW are or have been employees of Zoe Global Limited. Other authors have no conflict of interest to declare.
Provenance and peer review Not commissioned; externally peer reviewed.
