On the misuse of the reproduction number in the COVID‐19 surveillance system in Italy

Abstract

The Italian Ministry of Health, jointly with the Department of Infectious Diseases of the Italian National Institute of Health (Istituto Superiore di Sanità [ISS]), promptly built the integrated surveillance system for COVID‐19. To evaluate the severity of the virus spread, the reproduction number Rt, defined as the average number of cases generated by an infected individual in a population where everyone is susceptible to infection, is estimated. Unfortunately, in Italy, Rt is used not only to provide a picture of the epidemic spread but also as a decision tool to plan and organize nonpharmaceutical interventions, by imposing a priori thresholds that define different levels of risk, to which daily‐life restrictions are tied. We believe this is a misuse of Rt, one that is dangerous and rests on highly uncertain estimates. For this reason, it is important that the use of this parameter as an indicator for restriction measures be managed by experts in the field. Some practical and statistically relevant considerations are given in Gostic et al.¹ Nature (https://www.nature.com/articles/d41586-020-02009-w) had already discussed, in July, the potential bias in using Rt beyond its real meaning.

Here, we discuss the main limits of the Italian approach to estimating and using Rt, showing that the restrictions imposed on the population are based on an unreliable estimate of the reproduction number. The reference model is the one proposed by Cori et al.² The reproduction number is estimated in a Bayesian framework, which requires the a priori definition or estimation of some fundamental quantities. The authors correctly discussed the limitations of their approach, but it seems that the Italian authorities neglect them. The main issues concern the time window chosen to estimate Rt and the distributions assumed for the number of new cases and for the generation time.
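To make the role of the time window concrete, the following is a minimal sketch of a Cori‐type Bayesian estimator, not the code used by ISS. It assumes Poisson‐distributed new cases, a Gamma prior on Rt, and a discrete serial interval. The incidence series, the serial interval weights, and the prior parameters (shape 1, scale 5) are hypothetical values chosen only for illustration.

```python
import math

# Hypothetical daily incidence and discrete serial-interval weights (w_1..w_6, summing to 1).
INCIDENCE = [12, 9, 15, 11, 18, 14, 21, 16, 25, 19, 28, 22,
             33, 26, 38, 30, 44, 35, 50, 41, 58, 47, 66, 54]
WEIGHTS = [0.1, 0.2, 0.3, 0.2, 0.1, 0.1]


def total_infectiousness(incidence, weights, t):
    """Lambda_t = sum over k >= 1 of I_{t-k} * w_k (uses only cases before day t)."""
    return sum(incidence[t - k] * w
               for k, w in enumerate(weights, start=1) if t - k >= 0)


def rt_posterior(incidence, weights, t, window, prior_shape=1.0, prior_scale=5.0):
    """Posterior mean and coefficient of variation of Rt over days t-window+1..t.

    With a Gamma(prior_shape, prior_scale) prior and Poisson incidence, the
    posterior is Gamma with shape a + sum(I_s) and scale 1 / (1/b + sum(Lambda_s)).
    """
    if t - window + 1 < 1:
        raise ValueError("window reaches back before the second day of data")
    days = range(t - window + 1, t + 1)
    cases = sum(incidence[s] for s in days)
    infectiousness = sum(total_infectiousness(incidence, weights, s) for s in days)
    shape = prior_shape + cases
    scale = 1.0 / (1.0 / prior_scale + infectiousness)
    # For a Gamma distribution: mean = shape * scale, CV = 1 / sqrt(shape).
    return shape * scale, 1.0 / math.sqrt(shape)


if __name__ == "__main__":
    last_day = len(INCIDENCE) - 1
    for window in (3, 7, 14):
        mean, cv = rt_posterior(INCIDENCE, WEIGHTS, last_day, window)
        print(f"window={window:2d} days  posterior mean={mean:.2f}  CV={cv:.3f}")
```

On the same data, a shorter window pools fewer cases and therefore yields a larger posterior coefficient of variation, while a longer window smooths the estimate; the point estimate itself also shifts with the window.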
The first risk is clearly stated in the Introduction of Cori et al.²: “When the data aggregation time step is small (e.g., daily data), estimates of Rt can vary considerably over short time periods, producing substantial negative autocorrelation.” In other words, the estimates of Rt depend on the choice of the time window size. What are the risks of choosing an inappropriate time window? Small values lead to more rapid detection of changes in transmission but also to more statistical noise; large values lead to more smoothing and less statistical noise. Cori et al.² suggest an approach to choose the optimal time window based on the coefficient of variation. How this is dealt with in Italy is swept under the carpet.

Moreover, Cori et al.² assumed that the distribution of infectiousness through time after infection is independent of calendar time and that transmission follows a Poisson process, that is, overdispersion is not accounted for. This is a rather restrictive assumption that must be carefully checked on the real data. It is well known that Poisson‐based estimates are biased if overdispersion arises in the data. We believe these points are already enough to conclude that the estimates of Rt should be used with caution, but the assumptions that most strongly affect the estimates of Rt have not yet been discussed.

Indeed, in Cori et al.² we further read, “Estimates of the reproduction number are highly dependent on the choice of the infectiousness profile. This can be approximated by the distribution of the generation time (i.e., time from the infection of a primary case to infection of the cases he/she generates). However, times of infection are rarely observed and the generation time distribution is therefore difficult to measure. On the other hand, the timing of symptoms onset is usually known and such data collected in closed settings where transmission can reliably be ascertained (e.g., households) can be used to estimate the distribution of the serial interval (time between symptoms onset of a case and symptoms onset of his/her secondary cases).” In other words, different estimates of the serial interval lead to different estimates of Rt: a reliable estimate of the serial interval is mandatory because it drives the estimate of Rt, and its misspecification is the major source of bias.

In Italy, the reference serial interval used to estimate the official Rt is taken from Cerada et al.³ It is based on 90 pairs of cases in Lombardy in February, for which the authors found an infector–infectee relationship and had the dates of symptom onset of both cases. Results are displayed in figure S8 in Cerada et al.³ and refer to a Gamma‐distributed estimated serial interval with parameters shape = 1.87 and scale = 0.28. Bearing in mind that the Gamma is a continuous distribution and in this context is used to fit a discrete process, figure S8 in Cerada et al.³ clearly shows multimodality, and the Gamma distribution does not fit the data well. In other words, the serial interval is poorly estimated. Moreover, this estimate is taken for granted for all the other Italian regions, that is, the same serial interval is assumed for all the regions and is never updated. A crucial input of the adopted model is thus poorly estimated, wrongly applied to very heterogeneous contexts, and not checked again after the early phase of the first outbreak. We are puzzled by this, as the model by Cori et al.² accepts any parametric or empirical discrete distribution with support on positive values to approximate the serial interval and the generation time, and not only estimated values from a Gamma distribution. Gostic et al.¹ illustrate the consequences of misspecifying the form and the variance of the serial interval distribution. Moreover, Ganyani et al.⁴ report country‐specific estimates for the generation time, remarking that estimating Rt in heterogeneous regions requires different estimates of the generation time.

In addition, the delay between the date on which the result of a test is received and the date on which it is recorded in the data set also plays a crucial role. Cori et al.² use the instantaneous reproductive number and consider incident cases observed before time point t; therefore, data may be affected by underreporting due to the delay between tests and reports: the larger the delay, the less accurate the estimate of Rt, because incident cases that are not yet recorded are missing. Furthermore, the underreporting rate is not constant; it mostly affects the cases observed at the most recent time points, those closest to t, which introduces bias into the estimation. As a critical consequence, when the delay between test and report is large, the estimates of Rt may be biased and may lag significantly behind the current evolution of the epidemic process. Available epidemiological data are not ideal, and this reinforces even more the idea that statistical adjustments are needed to obtain accurate estimates of Rt.

As a result of neglecting all these issues, uncertain estimates of Rt are obtained. Just to provide an example, we focus on the Rt estimates reported in the ISS weekly report (see e.g., figure 8 at https://www.epicentro.iss.it/coronavirus/bollettino/Bollettino-sorveglianza-integrata-COVID-19_20-gennaio-2021.pdf). All credible intervals are rather wide, and even huge for some regions. The high uncertainty surrounding these estimates is a clear indication that the use of Rt must be limited to describing the trend of the epidemic spread, and any further use must be avoided.
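The sensitivity of the reproduction number to the serial interval can be illustrated with a short sketch. It does not use the Cori et al. estimator. Instead it uses the textbook exponential‐growth relation R = 1/M(−r), where M is the moment generating function of the interval distribution and r is the daily growth rate; for a Gamma interval this gives R = (1 + r/rate)^shape. The growth rate and the alternative mean intervals are hypothetical. The sketch also assumes that the value 0.28 quoted above acts as a rate, which gives a mean of about 6.7 days; read literally as a scale it would give a mean of about half a day.

```python
SHAPE = 1.87   # shape of the Gamma serial interval quoted in the text
VALUE = 0.28   # second Gamma parameter quoted in the text
GROWTH = 0.05  # hypothetical daily exponential growth rate of incidence


def gamma_mean(shape, rate):
    """Mean of a Gamma distribution parameterised by shape and rate."""
    return shape / rate


def r_from_growth(growth, shape, rate):
    """R = 1 / M(-r) for a Gamma(shape, rate) interval: (1 + r / rate) ** shape."""
    return (1.0 + growth / rate) ** shape


# Two readings of the quoted parameter (the rate reading is an assumption).
mean_if_rate = gamma_mean(SHAPE, VALUE)          # 0.28 read as a rate
mean_if_scale = gamma_mean(SHAPE, 1.0 / VALUE)   # 0.28 read as a scale

if __name__ == "__main__":
    print(f"mean if 0.28 is a rate : {mean_if_rate:.2f} days")
    print(f"mean if 0.28 is a scale: {mean_if_scale:.2f} days")
    # Same growth rate, same shape, different (hypothetical) mean serial intervals.
    for mean_days in (4.0, mean_if_rate, 9.0):
        rate = SHAPE / mean_days
        print(f"mean interval {mean_days:5.2f} days -> R = "
              f"{r_from_growth(GROWTH, SHAPE, rate):.2f}")
```

For a fixed positive growth rate, a longer assumed serial interval gives a larger reproduction number, so applying one region's interval to another region shifts the estimate even when the observed incidence is identical.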
Annunziato and Asikainen⁵ compare different methods to estimate Rt and show that point estimates vary across methods, though they share a similar trend. In Italy, instead, through a priori specified levels of the reproductive number, Rt estimates are used to assign the administrative regions to classes of risk (called scenarios in the main ISS report, see e.g., http://www.salute.gov.it/imgs/C_17_monitoraggi_13_0_fileNazionale.pdf), with the respective restrictions. No gold standard method exists for estimating Rt. The work by Cori et al.² is a milestone in epidemiological research. Nevertheless, like many other models, it is based on assumptions that must be checked and fulfilled to avoid misleading inference. In Italy, not only are these assumptions neglected, but the estimates of Rt are used well beyond their reliable interpretation. In the end, Rt seems a dancer, dancing to music that depends on whoever is conducting the orchestra at the time.

Most cited references 3

A New Framework and Software to Estimate Time-Varying Reproduction Numbers During Epidemics

Anne Cori, Neil M Ferguson, Christophe Fraser … (2014)

Abstract The quantification of transmissibility during epidemics is essential to designing and adjusting public health responses. Transmissibility can be measured by the reproduction number R, the average number of secondary cases caused by an infected individual. Several methods have been proposed to estimate R over the course of an epidemic; however, they are usually difficult to implement for people without a strong background in statistical modeling. Here, we present a ready-to-use tool for estimating R from incidence time series, which is implemented in popular software including Microsoft Excel (Microsoft Corporation, Redmond, Washington). This tool produces novel, statistically robust analytical estimates of R and incorporates uncertainty in the distribution of the serial interval (the time between the onset of symptoms in a primary case and the onset of symptoms in secondary cases). We applied the method to 5 historical outbreaks; the resulting estimates of R are consistent with those presented in the literature. This tool should help epidemiologists quantify temporal changes in the transmission intensity of future epidemics by using surveillance data.

Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020

Tapiwa Ganyani, Cécile Kremer, Dongxuan Chen … (2020)

Background Estimating key infectious disease parameters from the coronavirus disease (COVID-19) outbreak is essential for modelling studies and guiding intervention strategies. Aim We estimate the generation interval, serial interval, proportion of pre-symptomatic transmission and effective reproduction number of COVID-19. We illustrate that reproduction numbers calculated based on serial interval estimates can be biased. Methods We used outbreak data from clusters in Singapore and Tianjin, China to estimate the generation interval from symptom onset data while acknowledging uncertainty about the incubation period distribution and the underlying transmission network. From those estimates, we obtained the serial interval, proportions of pre-symptomatic transmission and reproduction numbers. Results The mean generation interval was 5.20 days (95% credible interval (CrI): 3.78–6.78) for Singapore and 3.95 days (95% CrI: 3.01–4.91) for Tianjin. The proportion of pre-symptomatic transmission was 48% (95% CrI: 32–67) for Singapore and 62% (95% CrI: 50–76) for Tianjin. Reproduction number estimates based on the generation interval distribution were slightly higher than those based on the serial interval distribution. Sensitivity analyses showed that estimating these quantities from outbreak data requires detailed contact tracing information. Conclusion High estimates of the proportion of pre-symptomatic transmission imply that case finding and contact tracing need to be supplemented by physical distancing measures in order to control the COVID-19 outbreak. Notably, quarantine and other containment measures were already in place at the time of data collection, which may inflate the proportion of infections from pre-symptomatic individuals.

Practical considerations for measuring the effective reproductive number, Rt

Katelyn M Gostic, Lauren McGough, Edward B Baskerville … (2020)

Estimation of the effective reproductive number Rt is important for detecting changes in disease transmission over time. During the Coronavirus Disease 2019 (COVID-19) pandemic, policy makers and public health officials are using Rt to assess the effectiveness of interventions and to inform policy. However, estimation of Rt from available data presents several challenges, with critical implications for the interpretation of the course of the pandemic. The purpose of this document is to summarize these challenges, illustrate them with examples from synthetic data, and, where possible, make recommendations. For near real-time estimation of Rt, we recommend the approach of Cori and colleagues, which uses data from before time t and empirical estimates of the distribution of time between infections. Methods that require data from after time t, such as Wallinga and Teunis, are conceptually and methodologically less suited for near real-time estimation, but may be appropriate for retrospective analyses of how individuals infected at different time points contributed to the spread. We advise caution when using methods derived from the approach of Bettencourt and Ribeiro, as the resulting Rt estimates may be biased if the underlying structural assumptions are not met. Two key challenges common to all approaches are accurate specification of the generation interval and reconstruction of the time series of new infections from observations occurring long after the moment of transmission. Naive approaches for dealing with observation delays, such as subtracting delays sampled from a distribution, can introduce bias. We provide suggestions for how to mitigate this and other technical challenges and highlight open problems in Rt estimation.

Author and article information

Contributors

Antonello Maruotti:

ORCID: https://orcid.org/0000-0001-8377-9950

a.maruotti@lumsa.it , antonello.maruotti@uib.no

Journal

Journal ID (nlm-ta): J Med Virol

Journal ID (iso-abbrev): J Med Virol

Journal ID (doi): 10.1002/(ISSN)1096-9071

Journal ID (publisher-id): JMV

Title: Journal of Medical Virology

Publisher: John Wiley and Sons Inc. (Hoboken)

ISSN (Print): 0146-6615

ISSN (Electronic): 1096-9071

Publication date (Electronic): 19 February 2021

Electronic Location Identifier: 10.1002/jmv.26881

Affiliations

[ ¹ ] Department GEPLI Libera Università Maria SS Assunta Rome Italy

[ ² ] Department of Mathematics University of Bergen Bergen Norway

[ ³ ] Unit of Clinical Pathology and Microbiology University Campus Bio‐Medico of Rome Rome Italy

[ ⁴ ] Laboratory of Biostatistics and Computational Epidemiology, Department of Biosciences University of Molise Campobasso Italy

Author notes

[*] Correspondence: Antonello Maruotti, Department GEPLI, Libera Università Maria Ss Assunta, Rome 00192, Italy.

Email: a.maruotti@lumsa.it, antonello.maruotti@uib.no

Author information

Antonello Maruotti https://orcid.org/0000-0001-8377-9950

Massimo Ciccozzi http://orcid.org/0000-0003-3866-9239

Fabio Divino https://orcid.org/0000-0003-4107-3727

Article

Publisher ID: JMV26881

DOI: 10.1002/jmv.26881

PMC ID: 8014213

PubMed ID: 33590895

SO-VID: 8aa3677b-7c86-47ea-ab2c-1835cd9c2b18

License:

This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.

History

Date revision received : 11 February 2021

Date received : 04 February 2021

Date accepted : 12 February 2021

Page count

Figures: 0, Tables: 0, Pages: 2, Words: 1598

Custom metadata

source-schema-version-number 2.0

edited-state corrected-proof

details-of-publishers-convertor Converter:WILEY_ML3GV2_TO_JATSPMC version:6.0.1 mode:remove_FC converted:01.04.2021

ScienceOpen disciplines: Microbiology & Virology
