A. DATASET DESCRIPTION
This dataset contains COVID-19 positive confirmed cases aggregated by several different geographic areas and by day. COVID-19 cases are mapped to the residence of the individual and shown on the date the positive test was collected. In addition, 2019 American Community Survey (ACS) 5-year population estimates are included to calculate the cumulative rate per 10,000 residents.
This dataset covers cases going back to March 18, 2020, when the first person in Marin County tested positive for COVID-19. Data may not be immediately available for recently reported cases, and will change as new information becomes available. The data are updated daily.
COVID-19 case data undergo quality assurance and other data verification processes and are continually updated to maximize completeness and accuracy of information. This means data may change for previous days as information is updated.
Geographic areas summarized are:
1. City, Town, or Community Area
2. Census Tracts
3. Census ZIP Code Tabulation Areas (ZCTAs)
B. HOW THE DATASET IS CREATED
Addresses from the COVID-19 case data are geocoded by Marin County HHS. Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area for a given date.
The 2019 ACS population estimates provided by the Census are used to create a cumulative rate, equal to ([cumulative count up to that date] / [acs_population]) * 10000, which represents the total number of cases per 10,000 residents (as of the specified date).
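The rate formula above can be sketched in a few lines of Python. This is only an illustration: the function name `cumulative_rate_per_10k` and the example numbers are assumptions made for this sketch, not part of the Marin HHS script.

```python
def cumulative_rate_per_10k(cumulative_count: int, acs_population: int) -> float:
    """Cumulative cases per 10,000 residents as of a given date.

    Hypothetical helper: mirrors the formula
    ([cumulative count up to that date] / [acs_population]) * 10000.
    """
    if acs_population <= 0:
        raise ValueError("acs_population must be positive")
    return (cumulative_count / acs_population) * 10_000


# Made-up example: 25 cumulative cases in an area with 5,000 residents.
example_rate = cumulative_rate_per_10k(25, 5000)
print(f"{example_rate:.1f} cases per 10,000 residents")
```

With these made-up inputs, the rate works out to roughly 50 cases per 10,000 residents.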
C. UPDATE PROCESS
Geographic analysis is scripted by Marin HHS staff and synced to this dataset each day.
D. HOW TO USE THIS DATASET
This dataset can be used to track the spread of COVID-19 throughout Marin County in a variety of geographic areas. Note that the new cases column in the data represents the number of new cases confirmed in a certain area on the specified day, while the cumulative cases column is the cumulative total of cases in a certain area as of the specified date.
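The relationship between the two columns can be illustrated with a short Python sketch. The daily counts below are made up, and the variable names are assumptions rather than the dataset's actual column names.

```python
from itertools import accumulate

# Hypothetical new-case counts for one area on five consecutive days.
new_cases = [0, 2, 1, 0, 4]

# The cumulative count on each day is the running total of new cases
# up to and including that day.
cumulative_cases = list(accumulate(new_cases))

for day, (new, total) in enumerate(zip(new_cases, cumulative_cases), start=1):
    print(f"day {day}: new={new}, cumulative={total}")
```

Because rows are withheld for days when an area's cumulative count was below 10 (see the privacy rules for this dataset), summing the published new cases for an area will not necessarily reproduce its published cumulative count.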
Privacy rules in effect
To protect privacy, certain rules are in effect:
1. Any area with a cumulative case count less than 10 is dropped for all days on which the cumulative count was less than 10. These will be null values. For example, if a ZIP code did not reach 10 cumulative cases until June 1, 2020, that location will not be included in the dataset until June 1.
2. Once an area has a cumulative case count of 10 or greater, that area will have a new row of case data every day following.
3. Cases are dropped altogether for areas where acs_population < 1000. Some adjacent geographic areas may be combined until the ACS population exceeds 1,000 so that information can still be provided for these regions.
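As a rough sketch of how these rules might be applied to the rows for a single area, consider the Python below. The function name, the dictionary keys (`date`, `cumulative_cases`), and the sample rows are all hypothetical; the actual Marin HHS script is not shown in this description, and the combining of adjacent small areas is not modeled here.

```python
from typing import Dict, List

MIN_CUMULATIVE_CASES = 10   # rules 1 and 2: threshold for publishing a row
MIN_ACS_POPULATION = 1000   # rule 3: areas below this population are dropped


def apply_privacy_rules(rows: List[Dict], acs_population: int) -> List[Dict]:
    """Return only the rows that would be published for one area.

    `rows` is assumed to be sorted by date, with a non-decreasing
    'cumulative_cases' value in each row (hypothetical key names).
    """
    # Rule 3: drop the area entirely if its population is too small.
    if acs_population < MIN_ACS_POPULATION:
        return []
    # Rules 1 and 2: keep a row only once the cumulative count has reached
    # the threshold; since the count never decreases, every later day is kept.
    return [r for r in rows if r["cumulative_cases"] >= MIN_CUMULATIVE_CASES]


# Made-up rows for one area.
sample_rows = [
    {"date": "2020-05-30", "cumulative_cases": 4},
    {"date": "2020-05-31", "cumulative_cases": 9},
    {"date": "2020-06-01", "cumulative_cases": 10},
    {"date": "2020-06-02", "cumulative_cases": 12},
]

published = apply_privacy_rules(sample_rows, acs_population=4500)
print([r["date"] for r in published])
```

In this made-up case the area first appears on June 1, the day its cumulative count reaches 10, matching the example given in rule 1.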
Note: 14-day or 30-day case rates based on counts lower than 20 may be unstable. We advise caution in interpreting rates at these small numbers.
A note on Census ZIP Code Tabulation Areas (ZCTAs)
ZIP Code Tabulation Areas are special boundaries created by the U.S. Census based on ZIP Codes developed by the USPS. They are not, however, the same thing: USPS ZIP Codes are collections of mail delivery routes, while ZCTAs are areal representations of those routes.