Data from: Are replication rates the same across academic fields? community forecasts from the DARPA SCORE program
Data files
Jul 21, 2020 version files 1.74 MB
-
codebook.xlsx
9.82 KB
-
final_demography_data.csv
886.82 KB
-
final_market_data.csv
187.77 KB
-
Final_Script.R
25.24 KB
-
final_survey_data.csv
403.77 KB
-
SCORE_PREREG_META_FINAL.pdf
220.40 KB
-
ssr_data.xlsx
9.17 KB
Abstract
This dataset was collected via online platform over the course two weeks in August 2019. The participants of the surveys and prediction markets were recruited through blog posts, twitter and emailing lists, primarily aimed at academics, however anyone was free to join. The data was exported from the online platforms and anonymised before processing and analysis.
The datatables were processed into formats more suited to analysis, and columns name changed to be more intuitive. New columns were also appended which provided information, for instance survey question wording was included as well as question code. All ‘test’ surveys (or surveys completed by admins) were removed before analysis.
In prediction markets, we used the logarithmic scoring rule (Hanson 2005) with base 2 and a liquidity parameter of b = 100. Participants received an initial endowment of 100 points to trade with.
Within the demography survey, empty values are indicated with NA values. The fields of interest question asks about ‘sociology and criminology separately however for the other surveys and prediction markets these fields are combined. Therefore combing these fields of interest will make them consistent with the rest of the project.
The user_id’s provided in the prediction market and survey data are consistent across all datasets.
A codebook is provided for explanations of column names