The Italian Ministry of Health, jointly with the Department of Infectious Diseases
of the Italian National Institute of Health (Istituto Superiore di Sanità [ISS]),
promptly built the integrated surveillance system for COVID‐19. To evaluate the severity
of the virus spread, the reproduction number R
t
, defined as the average number of cases generated by an infected individual in a
population where everyone is susceptible to infection, is estimated. Unfortunately,
in Italy, R
t
is not only used to provide a picture of the epidemic spread but rather as a decision
tool to plan and organize nonpharmaceutical interventions by imposing a priori thresholds
to define different levels of risks, on which daily‐life restrictions apply.
We believe this is a misuse of R
t
, which is dangerous and widely uncertain. For this reason, it is important that this
parameter as an indicator for restriction measures must be managed by an expert in
the field. Some practical and statistically relevant considerations are given in Gostic
et al.
1
Nature (https://www.nature.com/articles/d41586-020-02009-w), in July, already discussed
the potential bias in using R
t
over its real meaning. Here, we discuss the main limits of the Italian approach to
estimate and use R
t
, showing that the restrictions imposed on the population are based on an unreliable
estimate of the reproduction number.
The reference model is proposed by Cori et al.
2
The reproduction number is estimated in a Bayesian framework and requires the a priori
definition/estimation of some fundamental quantities needed to estimate R
t
. The authors correctly discussed the limitations of their approach, but it seems
that the Italian authorities neglect them. The main issues are related to
the time window defined to estimate R
t
;
the distributions assumed to model the number of new cases and the generation time.
The first risk is clearly stated in the Introduction, “When the data aggregation time
step is small (e.g., daily data), estimates of R
t
can vary considerably over short time periods, producing substantial negative autocorrelation.” In
other words, the obtained estimates of R
t
depend on the choice of the time window size. Which are the risks to consider an
inappropriate time window? Small values lead to more rapid detection of changes in
transmission but also more statistical noise; large values lead to more smoothing and
reductions in statistical noise. Cori et al.
2
suggest an approach to detect the optimal time window based on the coefficient of
variation. How this is dealt with in Italy is swept under the carpet.
Moreover, Cori et al.
2
assumed that the distribution of infectiousness through time after infection is independent
of calendar time and follows a Poisson process, that is, overdispersion is not accounted
for. This is a rather restrictive assumption that must be carefully checked on the
real data. It is well‐known that Poisson‐based estimates are biased if overdispersion
arises in the data.
We believe these points are already enough to conclude that the estimates of R
t
should be used with caution, but the most relevant assumptions strongly affecting
the estimates of R
t
are not still discussed. Indeed, in Cori et al.
2
we further read, “Estimates of the reproduction number are highly dependent on the
choice of the infectiousness profile. This can be approximated by the distribution
of the generation time (i.e., time from the infection of a primary case to infection
of the cases he/she generates). However, times of infection are rarely observed and
the generation time distribution is therefore difficult to measure. On the other hand,
the timing of symptoms onset is usually known and such data collected in closed settings
where transmission can reliably be ascertained (e.g., households) can be used to estimate
the distribution of the serial interval (time between symptoms onset of a case and
symptoms onset of his/her secondary cases).” In other words, different estimates of
the serial interval lead to different estimates of R
t
, that is, a reliable estimate of the serial interval is mandatory because it drives
the estimate of R
t
, and its misspecification is the major source of bias. In Italy, the reference serial
interval to estimate the official R
t
is taken from Cerada et al.,
3
and it is based on 90 pairs of cases in Lombardy in February, where the authors found
an infector–infectee relationship and have the dates of symptom onset of both cases.
Results are displayed in figure S8 in Cerada et al.
3
and refer to a Gamma‐distributed estimated serial interval with parameters shape = 1.87
and scale = 0.28. Bearing in mind that the Gamma is a continuous distribution and
in this context is used to fit a discrete process, figure S8 in Cerada et al.
3
clearly shows multimodality and the Gamma distribution does not fit the data too well.
In other words, the serial interval is poorly estimated. Moreover, this estimate is
taken for granted for all the other Italian regions, that is, the same serial interval
is assumed for all the regions and never updated. A crucial assumption for the adopted
model is poorly estimated, wrongly applied to very heterogeneous contexts, and not
checked again after the early phase of the first outbreak. We are puzzled about it,
as the model by Cori et al.
2
accepts any parametric or empirical discrete distribution with support on positive
values to approximate the serial interval and the generation time, and not only estimated
values from a Gamma distribution. Gostic et al.
1
illustrate the consequences of misspecifying the form and the variance on the serial
interval distribution. Moreover, Ganyani et al.
4
report country‐specific estimates for the generation time, remarking that estimating
R
t
in different heterogeneous regions requires different estimates of the generation
time.
In addition, the delay between the date in which the result of the test was received
and the date of the recording in the data set also plays a crucial role. Cori et al.
2
uses the instantaneous reproductive number and considers incidence cases observed
before time point t; therefore, data may be affected by underreporting due to the
delay between tests and reports: larger the delay, less accurate the estimation of
R
t
due to missing information concerning incidence cases that are not yet recorded.
Furthermore, the underreporting rate is not constant; it mostly affects the cases
observed at the previous time points and closer to t, introducing a bias effect in
the estimation. As a critical consequence, when the delay between test and report
is large, the estimates of R
t
may be biased and in significant delay with respect to the current evolution of the
epidemic process.
Available epidemiological data are not ideal, and this reinforces, even more, the
idea that statistical adjustments are needed to obtain accurate estimates of R
t
. As a result of neglecting all these issues, uncertain estimates of R
t
are obtained. Just to provide an example, we focus on the R
t
estimates reported in the ISS weekly report (see e.g., figure 8 at https://www.epicentro.iss.it/coronavirus/bollettino/Bollettino-sorveglianza-integrata-COVID-19_20-gennaio-2021.pdf).
All credible intervals are rather wide, and even huge for some regions. The high uncertainty
surrounding these estimates is a clear indication that the use of R
t
must be limited to provide a trend in the epidemic spread, but it must be avoided
any further use. Annunziato and Asikainen
5
compare different methods to estimate Rt
and show that point estimates vary across methods, though they share a similar trend.
In Italy, instead, through a priori specified levels of the reproductive number, R
t
estimates are used to label the administrative regions in classes of risks (called
scenario in the main ISS report, see e.g., http://www.salute.gov.it/imgs/C_17_monitoraggi_13_0_fileNazionale.pdf),
with the respective restrictions.
For estimating R
t
no golden standard methods exist. The work by Cori et al.
2
is a milestone in epidemiology research. Nevertheless, like many other models, it
is based on assumptions that must be checked and fulfilled to avoid misleading inference.
In Italy, not only are these assumptions neglected but the estimates of R
t
are used widely over their reliable interpretation. At the end of the games the R
t
seems a dancer, dancing music depending on the actual director of the orchestra who
performs it.