U.S. Department of Energy
Office of Scientific and Technical Information

Wootz: a compiler-based framework for fast CNN pruning via composability

Conference ·
Convolutional Neural Networks (CNNs) are widely used for deep learning tasks. CNN pruning is an important method for adapting a large CNN model trained on general datasets to a more specialized task or a smaller device. The key challenge is deciding which filters to remove to maximize the quality of the pruned networks while satisfying the constraints; the process is time-consuming due to the enormous configuration space and the slowness of CNN training. The problem has drawn many efforts from the machine learning field, which try to reduce the set of network configurations to explore. This work tackles the problem distinctively from a programming-systems perspective, speeding up the evaluation of the remaining configurations through computation reuse via a compiler-based framework. We empirically uncover the existence of composability in the training of a collection of pruned CNN models and point out the opportunities for computation reuse. We then propose composability-based CNN pruning and design a compression-based algorithm to efficiently identify the set of CNN layers to pre-train so as to maximize their reuse benefits in CNN pruning. We further develop a compiler-based framework named Wootz which, for an arbitrary CNN, automatically generates code that builds a Teacher-Student scheme to materialize composability-based pruning. Experiments show that network pruning enabled by Wootz shortens the state-of-the-art pruning process by up to 186X while producing significantly better pruning results.
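The core observation behind composability-based pruning is that many pruning configurations share identical blocks of layers (the same layers pruned at the same rates), so such a block only needs to be (pre-)trained once and can then be reused across every configuration that contains it. The sketch below is a minimal, hypothetical illustration of that reuse analysis, not the actual Wootz algorithm: it represents each configuration as a tuple of per-layer keep-ratios, enumerates contiguous layer blocks, and flags blocks recurring often enough to be worth pre-training. The function name `shared_blocks` and the `min_reuse`/`max_len` parameters are assumptions for the example.

```python
from collections import Counter

def shared_blocks(configs, min_reuse=2, max_len=3):
    """Count contiguous layer blocks that recur across pruned-network
    configurations. A block is a tuple of (layer_index, keep_ratio)
    pairs; blocks seen at least `min_reuse` times are candidates for
    one-time pre-training, whose result all containing configurations
    can then reuse instead of retraining from scratch."""
    counts = Counter()
    for cfg in configs:
        for start in range(len(cfg)):
            for length in range(1, max_len + 1):
                if start + length <= len(cfg):
                    block = tuple(enumerate(cfg[start:start + length], start))
                    counts[block] += 1
    return {block: n for block, n in counts.items() if n >= min_reuse}

# Four pruning configurations of a 4-layer CNN (per-layer keep-ratios).
configs = [
    (1.0, 0.5, 0.5, 1.0),
    (1.0, 0.5, 0.5, 0.7),
    (0.7, 0.5, 0.5, 1.0),
    (0.7, 0.7, 0.5, 1.0),
]
reusable = shared_blocks(configs)
# e.g. the block "layers 1-2 both pruned to 0.5" appears in three of
# the four configurations, so pre-training it once saves two retrainings.
```

The real framework uses a compression-based algorithm over the configuration set to pick which blocks to pre-train, and a Teacher-Student scheme to train them; this toy frequency count only conveys why reuse opportunities exist at all.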
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1543204
Country of Publication:
United States
Language:
English


Similar Records

Composability-Centered Convolutional Neural Network Pruning
Technical Report · February 2018 · OSTI ID: 1427608

A Novel Pruning Method for Convolutional Neural Networks Based off Identifying Critical Filters
Conference · July 2019 · OSTI ID: 1557493

Ps and Qs: Quantization-Aware Pruning for Efficient Low Latency Neural Network Inference
Journal Article · July 2021 · Frontiers in Artificial Intelligence · OSTI ID: 1824191
