This ZIP file contains a deduplicated set of spreadsheets retrieved from the McKinsey collection of the UCSF-JHU Opioid Industry Documents Archive ( https://www.industrydocuments.ucsf.edu/opioids/ ) in 2024. Initially the team retrieved just .xls, .xlsx, and .csv files, as noted in these two files: * download.log * download_file_ids.csv Later, we added .xlsm files. SearchMyFiles ( https://www.nirsoft.net/utils/search_my_files.html ) was used to deduplicate files with .xls, .xlsx, .xlsm, and .csv extensions. The first occurrence of each duplicate spreadsheet was preserved, and the remainder deleted. Another "Duplicate Search" was performed to ensure no duplicates were found. Testing on a sample of spreadsheets was used to validate the robustness of the deduplication methods.