Manhattan, New York City, 2020 Traffic Time Series + R Code for Analysis
Data files
Jun 07, 2021 version files 298.60 KB
-
CCC_COVID19.txt
246.54 KB
-
README.md
2.73 KB
-
stay-at-home_fatigue_modeling.R
49.32 KB
Abstract
This dataset includes (1) a .txt file of processed time-series with four traffic congestion levels for the borough of Manhattan, NYC, averaged every 3 hours for the duration of 2020, and (2) an R script for completing analysis of the traffic time series to determine patterns in traffic over the year 2020, and to evaluate the impact of stay-at-home orders implemented in response to the COIVD-19 pandemic.
Methods
Raw (pre-processed) data was collected by automatically downloading tiles of Google Traffic maps. These images were then processed to select colors used by Google Traffic that correspond to traffic congestion level on their maps (green = free-flowing traffic, orange = some traffic delays, red = traffic congestion, dark red / maroon = severely congested traffic) and to determine the percent of the map area covered by each color. Each row in the data represents the average for a 3 hour period. Traffic time series were then analyzed to determine traffic patterns over the course of 2020 and to assess adherence to social distancing interventions put in place during the pandemic. Details about raw data collection, processing, and analysis can be found in other sources:
Hilpert M, Shearston JA, Cole J, Chillrud SN, Martinez ME. Acquisition and analysis of crowd-sourced traffic data. arXiv. 2021:2105.12235. https://arxiv.org/abs/2105.12235
Jenni A. Shearston, Micaela E. Martinez, Yanelli Nunez, Markus Hilpert. Social-distancing Fatigue: Evidence from Real-time Crowd-sourced Traffic Data. medRxiv 2021.03.04.21252917; doi:https://doi.org/10.1101/2021.03.04.21252917
Usage notes
The README file contains a data dictionary, also copied below:
Each row corresponds to one 3-hour time window.
Data Dictionary:
DateNum (Matlab) - Matlab datetime format
DateString - Datetime converted into a character/string format
Maroon - Percent of map area covered by maroon (dark red) color on Google Traffic map, indicating severe traffic congestion (3-hour average)
Red - Percent of map area covered by red color on Google Traffic map, indicating traffic congestion (3-hour average)
Orange - Percent of map area covered by orange color on Google Traffic map, indicating traffic delays (3-hour average)
Green - Percent of map area covered by green color on Google Traffic map, indicating free-flowing traffic (3-hour average)
Gray - Percent of map area covered by gray color on Google Traffic map, indicating not enough vehicles sending data to Google to determine traffic color (3-hour average)