Deep Compression: Compressing Deep Neural Networks with Pruning,...
Neural networks are both computationally intensive and memory intensive, making them difficult to deploy on embedded systems with limited hardware resources. To address this limitation, we...