Skip to main content
Dryad

Simulated cannabis days-of-use data

Data files

Nov 20, 2022 version files 43.74 KB

Abstract

Background:

  • The numbers of days that people consume alcohol and other drugs over a fixed time interval, such as 28 days, are often collected in surveys for research in the addictions field.
  • The presence of an upper bound on these variables can result in response distributions with "ceiling effects".
  • Also, if some peoples’ substance use behaviors are characterized by various weekly patterns of use, summaries of substance days-of-use over longer periods can exhibit multiple modes. Multiple modes can also result from "heaping" of responses when respondents are unsure about the precise value.
  • These characteristics of substance days-of-use data mean that models assuming common parametric response distributions will not always provide a good fit.

Repository contents:

Simulate longitudinal cannabis days-of-use over 28-day intervals intended to reproduce characteristics of data reported by respondents to an Australian survey of illicit drug users run over 4 waves during the COVID-19 pandemic in Australia in 2020–21. The dataset includes generated subject_id and survey_wave and iso explanatory variables, where iso is a dummy variable indicating subjects that were in quarantine or isolation at the time of the 28-day interval.

R-code to fit proportional-odds and continuation-ratio ordinal models as well as binomial, beta-binomial, negative binomial and hurdle negative binomial models to these data are available at a linked companion website.