@@ -11,7 +11,7 @@ grand_parent: COVIDcast Epidata API
11
11
* ** Earliest issue available:** November 30, 2020
12
12
* ** Number of data revisions since May 19, 2020:** 0
13
13
* ** Date of last change:** Never
14
- * ** Available for:** county, MSA, HRR, state (see [ geography coding docs] ( ../covidcast_geography.md ) )
14
+ * ** Available for:** county, MSA, HRR, state, HHS, nation (see [ geography coding docs] ( ../covidcast_geography.md ) )
15
15
* ** Time type:** day (see [ date format docs] ( ../covidcast_times.md ) )
16
16
* ** License:** To download or use the data, you must agree to the Google [ Terms of Service] ( https://policies.google.com/terms )
17
17
@@ -57,13 +57,22 @@ The state-level and county-level `raw_search` signals for specific symptoms such
57
57
as _ anosmia_ and _ ageusia_ are taken directly from the [ COVID-19 Search Trends
58
58
symptoms
59
59
dataset] ( https://github.com/google-research/open-covid-19-data/tree/master/data/exports/search_trends_symptoms_dataset )
60
- without changes. We aggregate the county-level data to the MSA and HRR levels
61
- using the population-weighted average. For MSAs/HRRs that include counties that
62
- have no data provided due to quality or privacy issues for a certain day, we
63
- simply assume the values to be 0 during aggregation. The values for MSAs/HRRs
64
- with no counties having non-NaN values will not be reported. Thus, the resulting
65
- MSA/HRR level data does not fully match the _ actual_ MSA/HRR level data (which
66
- we are not provided).
60
+ without changes.
61
+
62
+ We aggregate county and state data to other geographic levels using
63
+ population-weighted averaging.
64
+
65
+ | Source level | Aggregated level |
66
+ | ------------ | ---------------- |
67
+ | county | MSA, HRR |
68
+ | state | HHS, nation |
69
+
70
+ For aggregation purposes only, we assign a value of 0 to source regions that
71
+ have no data provided due to quality or privacy issues for a certain day (see
72
+ Limitations for details). We do not report aggregated regions if none of their
73
+ source regions have data. Because of this censoring behavior, the resulting data
74
+ for aggregated regions does not fully match the _ actual_ search volume for these
75
+ regions (which is not provided to us).
67
76
68
77
## Lag and Backfill
69
78
Google does not currently update the search data daily, but usually twice a week.
0 commit comments