This repository (described in detail at https://datadryad.org/stash/dataset/doi:10.25338/B80K9R) contains the following files:

1. eis_corpus_2013-2020.rds
   This is a data.table object (https://cran.r-project.org/web/packages/data.table/) saved in compressed RDS format, in which each row is one page of a document. The data.table has three columns, showing the File, text, and page number.

2. eis_documents_record.csv
   This CSV file lists the documents in the e-nepa database (https://cdxnodengn.epa.gov/cdx-enepa-public/action/eis/search), one per row, and provides metadata for each document. Projects and documents are linked by EIS Number (an 8-digit identifier in which the first four digits show the year of publication). Every file (document) is saved under a name made up of the EIS Number, followed by an underscore, and then the original file name as downloaded from the EPA website.

3. eis_record_detail.csv
   This CSV file lists the projects in the e-nepa database (https://cdxnodengn.epa.gov/cdx-enepa-public/action/eis/search), one per row, and provides metadata for each project.

4. extra_docs.csv
   This CSV file describes additional documents that were collected by hand where documentation was not found on the EPA website. These can also be linked to the project record CSV file by EIS Number.
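
The file naming convention described above (EIS Number, underscore, original file name) can be taken apart with a few lines of code. The following Python sketch is illustrative only and is not part of the repository: the helper names `parse_eis_file_name` and `group_by_project` and the sample file names are hypothetical. It assumes that the EIS Number is always exactly eight digits and that the first underscore after those digits separates it from the original file name.

```python
import re
from typing import Dict, Iterable, List, NamedTuple

# Assumed pattern: 8-digit EIS Number, one underscore, then the original file name.
_EIS_FILE_RE = re.compile(r"^([0-9]{8})_(.+)$")


class EisFile(NamedTuple):
    eis_number: str      # kept as a string so it can be matched against the CSV files as text
    year: int            # first four digits of the EIS Number
    original_name: str   # file name as downloaded from the EPA website


def parse_eis_file_name(file_name: str) -> EisFile:
    """Split a stored file name into EIS Number, publication year, and original name."""
    match = _EIS_FILE_RE.match(file_name)
    if match is None:
        raise ValueError(f"not in <EIS Number>_<original name> form: {file_name!r}")
    eis_number, original_name = match.groups()
    return EisFile(eis_number, int(eis_number[:4]), original_name)


def group_by_project(file_names: Iterable[str]) -> Dict[str, List[str]]:
    """Group original file names under the EIS Number of the project they belong to."""
    groups: Dict[str, List[str]] = {}
    for name in file_names:
        parsed = parse_eis_file_name(name)
        groups.setdefault(parsed.eis_number, []).append(parsed.original_name)
    return groups


if __name__ == "__main__":
    # Hypothetical file names, made up for illustration.
    sample = ["20130001_Final_EIS.pdf", "20130001_Appendix_A.pdf", "20200042_Draft.pdf"]
    for eis_number, names in group_by_project(sample).items():
        print(eis_number, names)
```

When linking the parsed values to eis_documents_record.csv, eis_record_detail.csv, or extra_docs.csv, it is probably safest to read the EIS Number column as text in each file so that both sides of the join have the same type.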