Skip to contents

Data from which serial interval and generation time estimates were performed by Ganyani et al, 2020

Usage

data("ganyani_clusters")

Format

An object of class tbl_df (inherits from tbl, data.frame) with 196 rows and 6 columns.

Details

Original article licensed under Creative Commons 4.0. Data was cleansed and formatted for R.

ganyani_clusters dataframe with 196 rows and 6 columns

id (dbl)

a unique id for a person (unique within the source)

contacts (list dbl)

list of known contacts in the cluster

cluster_id (dbl)

id of a cluster (unique within the source)

symptom_onset (date)

symptom onset date

known_primary_case (lgl)

flag if this person is know to be the primary case in the cluster

source (chr)

geographical source of the data

References

Ganyani T, Kremer C, Chen D, Torneri A, Faes C, Wallinga J, Hens N. Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020. Euro Surveill. 2020 Apr;25(17):2000257. doi: 10.2807/1560-7917.ES.2020.25.17.2000257. PMID: 32372755; PMCID: PMC7201952.

Examples

dplyr::glimpse(ganyani_clusters)
#> Rows: 196
#> Columns: 6
#> $ id                 <dbl> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 
#> $ contacts           <list> <3, 4, 9, 22, 25, 7, 11, 13, 14, 17, 18, 24, 28, 3…
#> $ cluster_id         <dbl> 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 
#> $ symptom_onset      <date> 2020-01-22, 2020-01-24, 2020-01-25, 2020-01-21, 20…
#> $ known_primary_case <lgl> FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FALSE, FA…
#> $ source             <chr> "tianjin", "tianjin", "tianjin", "tianjin", "tianji…