Distilling Knowledge from Ensembles of Cluster-Constrained-Attention Multiple-Instance Learners for Whole Slide Image Classification
Abstract
The peculiar nature of whole slide imaging (WSI), digitizing conventional glass slides to obtain multiple high-resolution images that capture microscopic details of a patient’s histopathological features, has garnered increasing interest from the computer vision research community over the last two decades. Given the computational space and time complexity inherent to gigapixel-size whole slide image data, researchers have proposed novel machine learning algorithms to aid diagnostic tasks in clinical pathology. One effective approach represents a whole slide image as a bag of smaller image patches, each encoded as a low-dimensional patch embedding. Weakly supervised deep learning methods, such as clustering-constrained-attention multiple-instance learning (CLAM), have shown promising results when combined with image patch embeddings. While traditional ensemble classifiers yield improved task performance, they come at a steep cost in model complexity. Through knowledge distillation, it is possible to retain some of an ensemble's performance improvements while minimizing the cost in model complexity. In this work, we implement a weakly supervised ensemble of CLAM models, which use attention and instance-level clustering to identify task-salient regions and extract features from whole slides. By applying logit-based and attention-based knowledge distillation, we show it is possible to retain some of the performance improvements resulting from the ensemble at zero cost to model complexity.
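The logit-based distillation the abstract refers to is commonly implemented as a temperature-scaled divergence between the averaged logits of the ensemble (teacher) and the single student's logits. The sketch below is a minimal, hypothetical illustration of that general technique, not the paper's actual implementation; all function names and values are assumptions for demonstration.

```python
import numpy as np

def softmax(z, T=1.0):
    # Temperature-scaled softmax; higher T softens the class distribution.
    z = np.asarray(z, dtype=float) / T
    z = z - z.max(axis=-1, keepdims=True)  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def kd_loss(student_logits, teacher_logits, T=2.0):
    # Logit-based knowledge distillation: KL(teacher || student) on
    # temperature-softened probabilities, scaled by T^2 as is conventional.
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    kl = np.sum(p_t * (np.log(p_t) - np.log(p_s)), axis=-1)
    return float(kl.mean() * T * T)

# Hypothetical two-class example: the teacher is the mean of the logits
# produced by several ensemble members for one whole slide.
member_logits = [np.array([2.0, 0.5]), np.array([1.5, 0.8]), np.array([2.2, 0.3])]
teacher = np.mean(member_logits, axis=0)
student = np.array([1.0, 0.9])
loss = kd_loss(student[None, :], teacher[None, :], T=2.0)
```

In training, this distillation term would typically be combined with the ordinary supervised loss on the slide-level label; attention-based distillation adds an analogous penalty between teacher and student attention maps.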
- Authors:
- Alamudun, Folami; Dash, Sajal; Hinkle, Jacob; Hernandez Arreguin, Benjamin; Tsaris, Aristeidis; Yoon, Hong-Jun
- ORNL
- Publication Date:
- 2022-12
- Research Org.:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC)
- OSTI Identifier:
- 1965255
- DOE Contract Number:
- AC05-00OR22725
- Resource Type:
- Conference
- Resource Relation:
- Conference: The 4th International Workshop on Big Data Tools, Methods, and Use Cases for Innovative Scientific Discovery, Osaka, Japan, 12/17/2022
- Country of Publication:
- United States
- Language:
- English
Citation Formats
Alamudun, Folami, Dash, Sajal, Hinkle, Jacob, Hernandez Arreguin, Benjamin, Tsaris, Aristeidis, and Yoon, Hong-Jun. Distilling Knowledge from Ensembles of Cluster-Constrained-Attention Multiple-Instance Learners for Whole Slide Image Classification. United States: N. p., 2022.
Web. doi:10.1109/BigData55660.2022.10020938.
Alamudun, Folami, Dash, Sajal, Hinkle, Jacob, Hernandez Arreguin, Benjamin, Tsaris, Aristeidis, & Yoon, Hong-Jun. Distilling Knowledge from Ensembles of Cluster-Constrained-Attention Multiple-Instance Learners for Whole Slide Image Classification. United States. https://doi.org/10.1109/BigData55660.2022.10020938
Alamudun, Folami, Dash, Sajal, Hinkle, Jacob, Hernandez Arreguin, Benjamin, Tsaris, Aristeidis, and Yoon, Hong-Jun. 2022.
"Distilling Knowledge from Ensembles of Cluster-Constrained-Attention Multiple-Instance Learners for Whole Slide Image Classification". United States. https://doi.org/10.1109/BigData55660.2022.10020938. https://www.osti.gov/servlets/purl/1965255.
@article{osti_1965255,
title = {Distilling Knowledge from Ensembles of Cluster-Constrained-Attention Multiple-Instance Learners for Whole Slide Image Classification},
author = {Alamudun, Folami and Dash, Sajal and Hinkle, Jacob and Hernandez Arreguin, Benjamin and Tsaris, Aristeidis and Yoon, Hong-Jun},
abstractNote = {The peculiar nature of whole slide imaging (WSI), digitizing conventional glass slides to obtain multiple high-resolution images that capture microscopic details of a patient’s histopathological features, has garnered increasing interest from the computer vision research community over the last two decades. Given the computational space and time complexity inherent to gigapixel-size whole slide image data, researchers have proposed novel machine learning algorithms to aid diagnostic tasks in clinical pathology. One effective approach represents a whole slide image as a bag of smaller image patches, each encoded as a low-dimensional patch embedding. Weakly supervised deep learning methods, such as clustering-constrained-attention multiple-instance learning (CLAM), have shown promising results when combined with image patch embeddings. While traditional ensemble classifiers yield improved task performance, they come at a steep cost in model complexity. Through knowledge distillation, it is possible to retain some of an ensemble's performance improvements while minimizing the cost in model complexity. In this work, we implement a weakly supervised ensemble of CLAM models, which use attention and instance-level clustering to identify task-salient regions and extract features from whole slides. By applying logit-based and attention-based knowledge distillation, we show it is possible to retain some of the performance improvements resulting from the ensemble at zero cost to model complexity.},
doi = {10.1109/BigData55660.2022.10020938},
url = {https://www.osti.gov/biblio/1965255},
place = {United States},
year = {2022},
month = {12}
}