The COVID-19 media bulletins of the State of Karnataka, from 7th March to 26th June, provided detailed information on the tested positive patients. In particular there was data on how each one of them contracted the virus (either due to travel history or by being a contact of someone who has already tested positive for COVID-19) or what led to them being tested (either as a Severe Acute Respiratory Infection patient or someone with Influenza like symptoms).
Clusters
We first classify the tested positive cases into clusters based on the source of infection, for example “From Europe” or “Pharmaceutical Company Nanjangud”. Then in each cluster we place all the patients who contracted the virus independently from the place of origin, and then recursively add the patients to whom they passed the infection.
Before Phase 1 (25th March - 14th April) of the lockdown began, almost all the COVID-19 cases that were confirmed in Karnataka were either individuals who had some form of international travel history (from Middle East, USA, South America, United Kingdom and the rest of Europe) or those who were contacts of such individuals. Phase 1 and Phase 2 (15th April - 3rd May) of the lockdown in Karnataka saw heavy restrictions on travel and nearly all services and factories were suspended.
During Phase 1 and Phase 2 of the lockdown, a Pharmaceutical company in Nanjangud, Mysore, saw a sudden increase in the COVID-19 cases. Although the exact reason for the infection to have reached the company is unknown, the first patient to be infected (35 year old male, was confirmed to be infected on 26th March) came in contact with health care workers treating COVID-19 patients. Another cluster that began during this period was the “TJ Congregation”, which contained those who attended the Tablighi Jamaat Congregation from 13th to 18th March in Delhi. The first patient in this cluster was confirmed as a COVID-19 case on 2nd April. Both these clusters were very well contained and the last patients to be attributed to these clusters tested positive on 29th April and 21st May respectively. No more patients were attributed to these clusters since then. Phase 3 and 4 of the lockdown loosened restrictions on Domestic Travel and many infected individuals had some domestic travel history. The state saw a large influx of infected individuals from states like Maharashtra, Gujarat, Rajasthan and the Southern States (Tamil Nadu, Telangana and Andhra Pradesh). There were also patients whose source of infection was listed as inter-district travel in Karnataka, travel to foreign countries or other states, healthcare workers and policemen on COVID-19 duty and their contacts. The cases due to these reasons were too few to form separate clusters. We placed all these patients in a cluster called “Others”.
Testing strategy in India is governed by ICMR guidelines. The guidelines on 20th March mandated that all Severe Acute Respiratory Illness patients (i.e., patients with fever AND cough and/or shortness of breath) should be tested for COVID-19, while the guidelines on 4th April mandated the same for all symptomatic patients with Influenza like Illness (fever, cough, sore throat, runny nose). Thus two other clusters that began during Phase 1 and Phase 2 of the lockdown were the Severe Acute Respiratory Infection (“SARI”) (first infection 7th April) and Influenza Like Illness (“ILI”) (first infection 15th April) clusters. These clusters contain those patients who have a history of SARI(and ILI), and those who can be traced back as contacts of such patients. It should be noted that only the first generation of the patients in this cluster are those with a history of SARI (and ILI), but the subsequent contacts of these patients need not be. In the media bulletins, patients whose contact tracing was incomplete were mentioned as ‘Contact Under Tracing’. We have assumed that these patients did not fall under SARI or ILI and placed them in a cluster called “Unknown”, along with their contacts who tested positive. An initiative taken by the government was to create Containment Zones in certain regions. The guidelines for these zones were clearly specified. The first case in contact with a containment zone was reported on 24th April. Since then a large fraction of the increase in this cluster occurred during Phase 3 (4th May–17th May) and Phase 4 (18th May–31st May) of the lockdown. For all these clusters, there was no information provided on the source of infection for the ‘parents’.
Our consolidated list of clusters are then given by
$$\begin{aligned}&{\text{From Middle East, From USA, From United Kingdom, From Rest of Europe,}} \\&{\text {From South America, From Maharashtra, From Rajasthan, From Southern States,}} \\&{\text{From Gujarat, Influenza like illness(ILI), Severe Acute Respiratory Infections(SARI),}} \\&{\text{Unknown, Pharmaceutical Company-Nanjangud, T.J. Congregation in Delhi,}} \\&{\text{Containment Zones, Others. }} \end{aligned}$$
(2.1)
Reproduction number and Dispersion
In epidemiology, the “basic reproduction number” of an infection, denoted by \(R_0\), can be thought of as the expected number of cases to have contracted the infection directly from one case. Thus on an average, each infected person passes on the infection to \(R_0\) many healthy individuals. As mentioned earlier, in Karnataka during the period 9th March - 26th June we have observed the COVID-19 infection spread in a controlled environment. So whenever we calculate basic reproduction numbers we are actually calculating the short term effective reproduction number of the disease during this period. To be cognizant of this we shall use the notation \(R_{\hbox {eff}}\) to denote the basic reproduction number for a cluster instead of the usual notation \(R_0\).
We will examine Reproduction number and dispersion for “The 8 clusters” in this section, namely:
$$\begin{aligned}&\hbox {From Southern States, Influenza like illness, Severe Acute Respiratory Infections,}\nonumber \\&\hbox {Containment Zones, Unknown, Others, TJ Congregation in Delhi, and Pharmaceutical Company Nanjangud. }\end{aligned}$$
(2.2)
These began before 3rd May 2020 and have more than 50 individuals. There are ten clusters that satisfy these criteria from (2.1). We have omitted two clusters from analysis which satisfy these criteria, namely: “From Maharashtra” and “From Middle East”. We will analyse them in a later section. In Fig. 3 we present a summary distribution of parents, children, grandchildren, and great grandchildren in each of “The 8 clusters”.
For each individual i in the cluster we will denote the number of children (or the number of tested positive cases) assigned to patient i by \(y_{i}\). This means that there were \(y_i\) many positive infections whom the media bulletins listed as ‘Contact of Patient-i’. The mean of \(y_i\) is the basic reproduction number \(R_{\hbox {eff}}\). In Table 1 we present a comparison of the summary distribution parameters (Maximum, Zeroes, Size, etc.) across clusters and we see that the variance does not match the mean. Further, as noted in Fig. 1, heterogeneity in the infectiousness of each individual implies that \(R_{\hbox {eff}}\) by itself is not a good measure of the infection spread. To account for the large variance, we now consider the standard method of mixture of Poisson distributions to model the data set. For each cluster, using the Negative Binomial with mean \(R_{\hbox {eff}}\) and dispersion k (see Lloyd-Smith James et al. 2005 and Section A for details) as the offspring distribution, we will use the Maximum Likelihood method for estimating \(R_{\hbox {eff}}\) and k (see Section B for details). Using the methods developed in Saha and Paul Sudhir (2005) we provide 95% confidence interval for k and conditional on the estimates we perform the \(\chi ^2\)-goodness of fit test. The details of the above can be found in Sect. B, C, and D of the Appendix.
Table 1 This table considers the different generations of infections as seen in Karnataka for “The 8 clusters”. For each generation, the table contains the number of individuals in that generation, the number of patients causing zero secondary infections, the maximum number of infections caused by an individual in that generation and the mean number of infections caused by an individual in that generation Cases due to Migration in Phase 3,4 and Unlockdown 1.0
As mentioned earlier we had omitted two clusters from analysis, namely: From Maharashtra and From Middle East. Phase 3 and 4 of the lockdown, along with Unlockdown 1.0 in June loosened restrictions on Domestic Travel and International travel. The state saw a large influx of infected individuals from within India and abroad. During Phase-3 of the lockdown, the “From Maharashtra” cluster saw the most growth and dominated the test positive counts by a significant margin. The “From Maharashtra” cluster accounted for approximately \(52.5\%\) of cases in the stipulated period. The “From Middle East” cluster seems to have two phases. The first occurred before the lockdown was enforced during which international travel was suspended. The second, more recent, was due to the repatriation flights from the region. We provide the Maximum Likelihood estimators for \(R_{\hbox {eff}}\) and k, along with their summary in Table 1. During this period domestic and international travellers were quarantined/tested on arrival. To make any meaningful inferences using reproduction numbers and dispersion one would have take into account a more detailed tracing history procedure from their origin of travel.
To understand cases due to Migration (6871 out of 10391) in this period we reorganized our clusters from (2.1) into four groups. Namely
$$\begin{aligned}&{\text{Inter-District Travel: consisting of 429 patients who belong to}} \,{\rm{``}}{\text{Others'' cluster whose testing positive is attributed to inter-district travel within Karnataka;}} \\ & {\text{Inter-State Travel group: consisting of 582 patients who belong to}}\, {\rm{``}} {\text{From Gujarat'',}}\,{\rm{``}}{\text{From Rajasthan'',}}\, {\rm{``}}{\text{From the Southern States'\,'(Kerala, Tamil Nadu, Telangana and Andhra Pradesh) and}} {\rm{``}}{\text{Others'' cluster who had traveled to Delhi.;}}\\ & {\text{Foreign group: consisting of 379 patients who belong to the}} {\rm{``}}{\text{From Middle East'',}}\,{\rm{``}}{\text{From United Kingdom'' and}}\, {\rm{``}}{\text{From the rest of Europe'' cluster as well as a few cases which originated from Nepal, Indonesia, Philippines and Malaysia; and}} \\ &{\text{From Maharashtra cluster: consisting of 5481 patients in that cluster as in (2.1).}} \end{aligned}$$
(2.3)
Data
We have sourced all our data from the Daily Media Bulletins of Government of Karnataka: https://karnataka.gov.in/common-10/en (till 27th April, 2020) and https://covid19.karnataka.gov.in/govt_bulletin/en (post 27th April, 2020). The media bulletins were very detailed and contained the following information till 21st, July 2020. We have converted them from their pdf format into usable CSV format and made them publicly available for use at our Data Repository at https://www.isibang.ac.in/~athreya/incovid19/.