New study utilizing IDC: Evaluation of digital pathology image compression schemes

If you are working with digital pathology image analysis, check out this new preprint by Maximilian Fischer from German Cancer Research Center in Germany, which relied on the IDC RMS-Mutation-Prediction collection:

Fischer, M., Neher, P., Schüffler, P., Ziegler, S., Xiao, S., Peretzke, R., Clunie, D., Ulrich, C., Baumgartner, M., Muckenhuber, A., Almeida, S. D., Götz, M., Kleesiek, J., Nolden, M., Braren, R. & Maier-Hein, K. Unlocking the potential of digital pathology: Novel baselines for compression. arXiv [eess.IV] (2024). at https://arxiv.org/abs/2412.13137

In this preprint Max compared traditional (JPEG, JPEG-XL and WebP) and neural compression schemes (SQLC, SPL2 and CAI) on several publicly available datasets with respect to the impact on perceptual image quality and downstream computational tasks.

Studies evaluating impact of compression approaches can benefit from original, uncompressed image data, which is not easy to find. Quoting from the preprint “[IDC RMS-Mutation-Prediction] is one of the few large-scale WSI cohorts […] now publicly available that has not been previously lossy compressed.”

Abstract of the study follows for your reference.

Digital pathology offers a groundbreaking opportunity to transform clinical practice in histopathological image analysis, yet faces a significant hurdle: the substantial file sizes of pathological Whole Slide Images (WSI). While current digital pathology solutions rely on lossy JPEG compression to address this issue, lossy compression can introduce color and texture disparities, potentially impacting clinical decision-making. While prior research addresses perceptual image quality and downstream performance independently of each other, we jointly evaluate compression schemes for perceptual and downstream task quality on four different datasets. In addition, we collect an initially uncompressed dataset for an unbiased perceptual evaluation of compression schemes. Our results show that deep learning models fine-tuned for perceptual quality outperform conventional compression schemes like JPEG-XL or WebP for further compression of WSI. However, they exhibit a significant bias towards the compression artifacts present in the training data and struggle to generalize across various compression schemes. We introduce a novel evaluation metric based on feature similarity between original files and compressed files that aligns very well with the actual downstream performance on the compressed WSI. Our metric allows for a general and standardized evaluation of lossy compression schemes and mitigates the requirement to independently assess different downstream tasks. Our study provides novel insights for the assessment of lossy compression schemes for WSI and encourages a unified evaluation of lossy compression schemes to accelerate the clinical uptake of digital pathology.