Towards Efficient Convolutional Neural Networks Through Low-Error Filter Saliency Estimation

by Zi Wang, Chengcheng Li, Xiangyang Wang, Dali Wang

Publication Type

Conference Paper

Book Title

Trends in AI

Publication Date

August, 2019

Page Numbers

255 to 267

Volume

11671

Conference Name

16th Pacific Rim International Conference on Artificial Intelligence (PRICAI 2019)

Conference Location

Cuvu, Fiji

Conference Sponsor

Various

Conference Date

Aug 29, 2019 - Aug 31, 2019


Abstract

Filter-saliency-based channel pruning is a state-of-the-art method for compressing and accelerating deep convolutional neural networks. It ranks the importance of individual filters by estimating the impact of each filter’s removal on the training loss, then removes the least important filters and fine-tunes the remaining network. In this work, we propose a systematic channel pruning method that significantly reduces the estimation error of filter saliency. Unlike existing approaches, our method substantially reduces the magnitude of the network's parameters by introducing the alternating direction method of multipliers (ADMM) into the pre-training procedure. As a result, the Taylor-expansion-based estimate of filter saliency becomes significantly more accurate. Extensive experiments with various benchmark network architectures and datasets demonstrate that the proposed method is much better at selecting unimportant filters and outperforms state-of-the-art channel pruning methods.
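The link the abstract draws between parameter magnitude and estimation error can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a first-order Taylor criterion applied to filter weights, and the function names and the quadratic toy loss are hypothetical, chosen only because the exact loss change is then known in closed form. ADMM pre-training is not modelled here; shrinking the weights by hand stands in for its effect.

```python
import numpy as np

def taylor_loss_change(weights, grads):
    """First-order Taylor estimate of the loss change when each filter is zeroed.

    weights, grads: arrays of shape (num_filters, ...), with grads = dL/dweights.
    Zeroing filter i perturbs the weights by -w_i, so the estimate is -g_i . w_i.
    """
    w = weights.reshape(weights.shape[0], -1)
    g = grads.reshape(grads.shape[0], -1)
    return -(w * g).sum(axis=1)

def filter_saliency(weights, grads):
    """Saliency as the absolute value of the estimated loss change."""
    return np.abs(taylor_loss_change(weights, grads))

def select_filters_to_prune(saliency, num_prune):
    """Indices of the num_prune filters with the lowest saliency."""
    return np.argsort(saliency)[:num_prune]

def toy_loss(weights, target):
    """Hypothetical quadratic loss, used only so the exact change is computable."""
    return 0.5 * np.sum((weights - target) ** 2)

def estimation_error(weights, target):
    """Absolute gap between the exact and the estimated loss change per filter."""
    grads = weights - target              # gradient of toy_loss w.r.t. weights
    estimate = taylor_loss_change(weights, grads)
    base = toy_loss(weights, target)
    exact = np.empty(weights.shape[0])
    for i in range(weights.shape[0]):
        pruned = weights.copy()
        pruned[i] = 0.0                   # remove filter i
        exact[i] = toy_loss(pruned, target) - base
    return np.abs(exact - estimate)
```

For this toy loss the per-filter error works out to half the squared norm of the filter's weights, so scaling all weights by 0.1 shrinks the error by a factor of 100. A real network's loss is not quadratic, but the same second-order dependence on weight magnitude is what motivates reducing parameter magnitudes before estimating saliency.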
