This ZIP file contains a deduplicated set of spreadsheets retrieved in 2024 from the Insys collection of the UCSF-JHU Opioid Industry Documents Archive ( https://www.industrydocuments.ucsf.edu/opioids/ ). New documents (including spreadsheets) may be added to the Insys collection in the future, at which point the OIDA team will need to regenerate this dataset to keep it complete.

Initially the team retrieved only .xls and .xlsx files, as noted in these two files:

* download.log
* download_file_ids.csv

Later, the team added .xlsm files, as noted in:

* metadata-xlsm-insys.csv

SearchMyFiles ( https://www.nirsoft.net/utils/search_my_files.html ) was used to deduplicate files with .xls, .xlsx, .xlsm, and .csv extensions. The first occurrence of each duplicate spreadsheet was preserved, and the remaining copies were deleted. A second "Duplicate Search" was then run to confirm that no duplicates remained. The deduplication method was also validated by testing it on a sample of spreadsheets.
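
The deduplication described above was done with SearchMyFiles, not with a script. For readers who want to re-check the dataset or approximate the step after new documents are added, the following is a minimal Python sketch of one way to do it. It rests on several assumptions: duplicates are treated as byte-identical files (compared by SHA-256 hash), the "first occurrence" is taken to be the first file in sorted path order, and the function names (file_digest, find_duplicates, deduplicate) are hypothetical. It may not match the comparison SearchMyFiles performs.

```python
import hashlib
from pathlib import Path

# Extensions named in this README; matching is case-insensitive.
SPREADSHEET_EXTENSIONS = {".xls", ".xlsx", ".xlsm", ".csv"}


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Split spreadsheets under root into (kept, duplicates).

    Assumption: two files are duplicates only if their bytes are identical.
    The first file in sorted path order is kept; later matches are duplicates.
    """
    seen = {}
    kept, duplicates = [], []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        if path.suffix.lower() not in SPREADSHEET_EXTENSIONS:
            continue
        digest = file_digest(path)
        if digest in seen:
            duplicates.append(path)
        else:
            seen[digest] = path
            kept.append(path)
    return kept, duplicates


def deduplicate(root, delete=False):
    """Report duplicates under root; delete them only if delete=True."""
    kept, duplicates = find_duplicates(root)
    if delete:
        for path in duplicates:
            path.unlink()
    return kept, duplicates
```

Calling deduplicate on the extracted ZIP directory with the default delete=False only reports what it finds. An empty duplicates list plays the same role as the second "Duplicate Search" pass: it suggests that no byte-identical spreadsheets remain.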