Dartmouth Breast Cancer Recurrence Risk Dataset
This dataset comprises 990 hematoxylin and eosin (H&E)-stained, formalin-fixed paraffin-embedded (FFPE) whole-slide images (WSIs) and corresponding recurrence risk and clinicopathologic information, including Oncotype DX Breast Recurrence Score®, patient age, tumor size, tumor grade, histologic type, ER status, PR status, and HER2 status for breast cancer cases from the Department of Pathology and Laboratory Medicine at Dartmouth Health. The dataset has been de-identified and released with permission from the Dartmouth Health Institutional Review Board (IRB). These slides were used to develop and evaluate a multi-model approach that integrates whole-slide imaging and clinicopathologic data to predict low- and high-risk categories of breast cancer recurrence based on the Oncotype DX Breast Recurrence Score®. For more details, please refer to “A multi-model approach integrating whole-slide imaging and clinicopathologic features to predict breast cancer recurrence risk”.
Breast Cancer Recurrence Risk
Breast cancer recurrence risk based on the Oncotype DX Breast Recurrence Score® plays an important role in guiding treatment decisions for breast cancer patients. You can read more about this risk score here.
Dataset Description
The dataset includes:
- Metadata.csv
- DHMC_ZIP_001.zip - (9.85 GB)
- DHMC_ZIP_002.zip - (9.75 GB)
- DHMC_ZIP_003.zip - (9.75 GB)
- DHMC_ZIP_004.zip - (9.99 GB)
- DHMC_ZIP_005.zip - (9.86 GB)
- DHMC_ZIP_006.zip - (9.73 GB)
- DHMC_ZIP_007.zip - (9.89 GB)
- DHMC_ZIP_008.zip - (9.85 GB)
- DHMC_ZIP_009.zip - (9.58 GB)
- DHMC_ZIP_010.zip - (9.95 GB)
- DHMC_ZIP_011.zip - (10.00 GB)
- DHMC_ZIP_012.zip - (9.91 GB)
- DHMC_ZIP_013.zip - (9.92 GB)
- DHMC_ZIP_014.zip - (9.97 GB)
- DHMC_ZIP_015.zip - (9.69 GB)
- DHMC_ZIP_016.zip - (9.58 GB)
- DHMC_ZIP_017.zip - (9.98 GB)
- DHMC_ZIP_018.zip - (9.79 GB)
- DHMC_ZIP_019.zip - (9.95 GB)
- DHMC_ZIP_020.zip - (9.73 GB)
- DHMC_ZIP_021.zip - (9.89 GB)
- DHMC_ZIP_022.zip - (9.98 GB)
- DHMC_ZIP_023.zip - (9.98 GB)
- DHMC_ZIP_024.zip - (1.63 GB)
WSI Images
Includes 990 H&E-stained FFPE WSIs. Slides were digitized using Leica Aperio AT2 and CS2 scanners.
Sample Whole-Slide Images
Meta Data
- Oncotype DX Breast Recurrence Score®
- Patient age
- Tumor size
- Tumor grade
- Histologic type
- ER status
- PR status
- HER2 status
Accessing the Dataset
Please fill out the form below to request access to the dataset by email.
Citation
If you use this dataset, please cite the corresponding paper:
Manu Goyal, Jonathan D. Marotti, Adrienne A. Workman, Graham M. Tooker, Seth K. Ramin, Elaine P. Kuhn, Mary D. Chamberlin, Roberta M. diFlorio-Alexander, Saeed Hassanpour, "A multi-model approach integrating whole-slide imaging and clinicopathologic features to predict breast cancer recurrence risk", npj Breast Cancer 10, 93 (2024).
title={A multi-model approach integrating whole-slide imaging and clinicopathologic features to predict breast cancer recurrence risk},
author={Goyal, Manu and Marotti, Jonathan D and Workman, Adrienne A and Tooker, Graham M and Ramin, Seth K and Kuhn, Elaine P and Chamberlin, Mary D and diFlorio-Alexander, Roberta M and Hassanpour, Saeed},
journal={npj Breast Cancer},
volume={10},
number={1},
pages={93},
year={2024},
publisher={Nature Publishing Group},
doi={10.1038/s41523-024-00700-z}
}
FAQ
“I have not received any email after submitting the form.”
Please check your Junk/Spam email folder just in case the email was delivered there instead of your inbox. If you still could not find an email, please wait for a few hours and submit the form again.
“I received an email, but a download link has expired.”
By default, the download links will be expired after 4 hours. Please submit the form again to receive new links and download data before the links expire.
For inquiries, please contact us at BMIRDS.
If you are interested in histology image analysis, please check out other datasets from our group.