BMIRDS Datasets

Dartmouth Breast Cancer Recurrence Risk Dataset

This dataset comprises 990 hematoxylin and eosin (H&E)-stained, formalin-fixed paraffin-embedded (FFPE) whole-slide images (WSIs) and corresponding recurrence risk and clinicopathologic information, including Oncotype DX Breast Recurrence Score®, patient age, tumor size, tumor grade, histologic type, ER status, PR status, and HER2 status for breast cancer cases from the Department of Pathology and Laboratory Medicine at Dartmouth Health. The dataset has been de-identified and released with permission from the Dartmouth Health Institutional Review Board (IRB). These slides were used to develop and evaluate a multi-model approach that integrates whole-slide imaging and clinicopathologic data to predict low- and high-risk categories of breast cancer recurrence based on the Oncotype DX Breast Recurrence Score®. For more details, please refer to “A multi-model approach integrating whole-slide imaging and clinicopathologic features to predict breast cancer recurrence risk”.

Breast Cancer Recurrence Risk

Breast cancer recurrence risk based on the Oncotype DX Breast Recurrence Score® plays an important role in guiding treatment decisions for breast cancer patients. You can read more about this risk score here.

Dataset Description

The dataset includes:

  • Metadata.csv
  • DHMC_ZIP_001.zip - (9.85 GB)
  • DHMC_ZIP_002.zip - (9.75 GB)
  • DHMC_ZIP_003.zip - (9.75 GB)
  • DHMC_ZIP_004.zip - (9.99 GB)
  • DHMC_ZIP_005.zip - (9.86 GB)
  • DHMC_ZIP_006.zip - (9.73 GB)
  • DHMC_ZIP_007.zip - (9.89 GB)
  • DHMC_ZIP_008.zip - (9.85 GB)
  • DHMC_ZIP_009.zip - (9.58 GB)
  • DHMC_ZIP_010.zip - (9.95 GB)
  • DHMC_ZIP_011.zip - (10.00 GB)
  • DHMC_ZIP_012.zip - (9.91 GB)
  • DHMC_ZIP_013.zip - (9.92 GB)
  • DHMC_ZIP_014.zip - (9.97 GB)
  • DHMC_ZIP_015.zip - (9.69 GB)
  • DHMC_ZIP_016.zip - (9.58 GB)
  • DHMC_ZIP_017.zip - (9.98 GB)
  • DHMC_ZIP_018.zip - (9.79 GB)
  • DHMC_ZIP_019.zip - (9.95 GB)
  • DHMC_ZIP_020.zip - (9.73 GB)
  • DHMC_ZIP_021.zip - (9.89 GB)
  • DHMC_ZIP_022.zip - (9.98 GB)
  • DHMC_ZIP_023.zip - (9.98 GB)
  • DHMC_ZIP_024.zip - (1.63 GB)

WSI Images

Includes 990 H&E-stained FFPE WSIs. Slides were digitized using Leica Aperio AT2 and CS2 scanners.

Sample Whole-Slide Images

sample whole-slide images (breast cancer examples)

Meta Data

  • Oncotype DX Breast Recurrence Score®
  • Patient age
  • Tumor size
  • Tumor grade
  • Histologic type
  • ER status
  • PR status
  • HER2 status

Accessing the Dataset

Please fill out the form below to request access to the dataset by email.

Citation

If you use this dataset, please cite the corresponding paper:

Manu Goyal, Jonathan D. Marotti, Adrienne A. Workman, Graham M. Tooker, Seth K. Ramin, Elaine P. Kuhn, Mary D. Chamberlin, Roberta M. diFlorio-Alexander, Saeed Hassanpour, "A multi-model approach integrating whole-slide imaging and clinicopathologic features to predict breast cancer recurrence risk", npj Breast Cancer 10, 93 (2024).

FAQ

“I have not received any email after submitting the form.”

Please check your Junk/Spam email folder just in case the email was delivered there instead of your inbox. If you still could not find an email, please wait for a few hours and submit the form again.

By default, the download links will be expired after 4 hours. Please submit the form again to receive new links and download data before the links expire.




For inquiries, please contact us at :mailbox:BMIRDS.

If you are interested in histology image analysis, please check out other datasets from our group.