IDC Sept 2024 release v19: Focus on digital pathology

IDC data release v19 is out! :sparkler:

Over the past months, we put significant effort into curating and harmonizing several new digital pathology collections. The wait is over: more than 25,000 new pathology images are now available in IDC for you to explore, visualize, download, analyze and reuse as you like under the CC-BY license!

  • GTEx: 25,503 slides, 8.5 TB. The Genotype-Tissue Expression (GTEx) Project established a data resource and tissue bank to study the relationship between genetic variants and gene expression in multiple human tissues and across individuals. [data descriptor page] [open in IDC Portal]
  • Cancer Moonshot Biobank: 590 slides, 1.4 TB. The Cancer Moonshot Biobank (CMB) is a National Cancer Institute initiative to support current and future investigations into drug resistance and sensitivity and other NCI-sponsored cancer research initiatives, with an aim of improving researchers’ understanding of cancer and how to intervene in cancer initiation and progression. During the course of this study, biospecimens (blood and tissue removed during medical procedures) and associated data will be collected longitudinally from at least 1000 patients across at least 10 cancer types who represent the demographic diversity of the U.S. and are receiving standard-of-care cancer treatment at multiple NCI Community Oncology Research Program (NCORP) sites. [data descriptor page] [open in IDC Portal]
  • Molecular Characterization Initiative of the National Cancer Institute’s Childhood Cancer Data Initiative: 464 slides, 0.5 TB. The Molecular Characterization Initiative (MCI) is a component of the National Cancer Institute’s (NCI) Childhood Cancer Data Initiative (CCDI). It offers state-of-the-art molecular testing at no cost to newly diagnosed children, adolescents, and young adults (AYAs) with central nervous system (CNS) tumors, soft tissue sarcomas (STS), certain rare childhood cancers (RAR), and certain neuroblastomas (NBL) treated at a Children’s Oncology Group (COG)–affiliated hospital. The goal of MCI is to enhance the understanding of genetic factors in pediatric cancers and to provide timely, clinically relevant findings to doctors and families to aid in treatment decisions and determine eligibility for certain planned COG clinical trials. [data descriptor] [open in IDC Portal]

See a quick demo of how to navigate and download these collections in the following video. :arrow_heading_down:

All of our slide microscopy images are available in a uniform standard format as DICOM Whole Slide Microscopy Image (DICOM SM) objects. Never heard of those before? DICOM SM is a format that is gaining adoption both among manufacturers of clinical slide scanners and in open source tools. OpenSlide, Bio-Formats, QuPath, and wsidicom are among the popular tools that support it natively.
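If DICOM SM is new to you, it may help to see how a slide is laid out. Each resolution level is typically stored as a grid of fixed-size tiles (frames). Below is a minimal sketch of that geometry, assuming the common `TILED_FULL` organization with tiles stored left to right, then top to bottom, and ignoring multiple focal planes or optical paths. `PyramidLevel` and its methods are hypothetical names made up for illustration; the comments name the DICOM attributes the numbers would come from.

```python
from __future__ import annotations

from dataclasses import dataclass


def _ceil_div(a: int, b: int) -> int:
    """Integer ceiling division (avoids float rounding on large slides)."""
    return -(-a // b)


@dataclass(frozen=True)
class PyramidLevel:
    """Geometry of one resolution level (hypothetical helper, not a library API)."""

    total_columns: int  # DICOM TotalPixelMatrixColumns (full image width, px)
    total_rows: int     # DICOM TotalPixelMatrixRows (full image height, px)
    tile_columns: int   # DICOM Columns (tile width, px)
    tile_rows: int      # DICOM Rows (tile height, px)

    def grid(self) -> tuple[int, int]:
        """Return (tiles across, tiles down); partial edge tiles count as one."""
        return (
            _ceil_div(self.total_columns, self.tile_columns),
            _ceil_div(self.total_rows, self.tile_rows),
        )

    def tile_count(self) -> int:
        """Number of tiles needed to cover the level."""
        across, down = self.grid()
        return across * down

    def tile_origin(self, frame_index: int) -> tuple[int, int]:
        """0-based (x, y) pixel offset of a tile, assuming row-major tile order."""
        if not 0 <= frame_index < self.tile_count():
            raise IndexError(f"frame_index {frame_index} is out of range")
        across, _ = self.grid()
        row, col = divmod(frame_index, across)
        return (col * self.tile_columns, row * self.tile_rows)


level = PyramidLevel(total_columns=1000, total_rows=600,
                     tile_columns=256, tile_rows=256)
print(level.grid(), level.tile_count(), level.tile_origin(5))
```

In practice the tools listed above handle this bookkeeping for you; the sketch is only meant to make the tiled layout less mysterious.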

We have new tutorials to help you get started with IDC digital pathology images: find them here. They walk you through, hands-on, how DICOM slide images are organized, how to search them using metadata, and how to download, visualize, and process them with open source tools.
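As a taste of metadata-based search, here is a rough sketch that builds a query for slide microscopy series in one collection and runs it with the `idc-index` Python package. This is not taken from the tutorials. It assumes `idc-index` is installed (`pip install idc-index`), that its index table exposes the columns `collection_id`, `Modality`, `PatientID`, `SeriesInstanceUID` and `series_size_MB`, and that the collection ID is spelled `gtex`; please check those against the IDC documentation. `build_slide_query` and `find_slides` are hypothetical helper names.

```python
import re


def build_slide_query(collection_id: str, limit: int = 10) -> str:
    """Build SQL listing the smallest slide microscopy (SM) series of a collection."""
    # Only allow simple identifiers, since the value is placed into the SQL text.
    if not re.fullmatch(r"[a-z0-9_]+", collection_id):
        raise ValueError(f"unexpected collection_id: {collection_id!r}")
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    return (
        "SELECT PatientID, SeriesInstanceUID, series_size_MB "
        "FROM index "
        f"WHERE Modality = 'SM' AND collection_id = '{collection_id}' "
        "ORDER BY series_size_MB "
        f"LIMIT {int(limit)}"
    )


def find_slides(collection_id: str, limit: int = 10):
    """Run the query against the local IDC index (requires idc-index)."""
    # Imported lazily so the query builder works without the package installed.
    from idc_index import IDCClient

    client = IDCClient()
    return client.sql_query(build_slide_query(collection_id, limit))


# Assumed collection ID; the real identifier may be spelled differently.
print(build_slide_query("gtex", limit=5))
```

Calling `find_slides("gtex", limit=5)` should return a table of small series that are convenient for a first download, provided the assumptions above hold.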

We are here to help you turn the new data we release in IDC v19 into new discoveries!


Congratulations to the IDC team! :+1:
