TANGRAM | Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems
We show that DNN accelerator micro-architectures and their program mappings represent
specific choices of loop order and hardware parallelism for computing the seven nested
loops of DNNs, which enables us to create a formal taxonomy of all existing dense
...
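For readers unfamiliar with the "seven nested loops" the abstract refers to, a minimal sketch of a dense convolutional layer written as an explicit loop nest follows. The dimension names (N for batch, K for output channels, C for input channels, OY/OX for output rows and columns, FY/FX for filter rows and columns), the function name, and the stride-1, no-padding setup are illustrative assumptions, not notation taken from the paper.

```python
def conv2d_seven_loops(inputs, weights):
    """Naive dense convolution as seven nested loops (illustrative sketch).

    inputs  is indexed as inputs[n][c][y][x]
    weights is indexed as weights[k][c][fy][fx]
    Assumes stride 1 and no padding; all names here are hypothetical.
    """
    N, C = len(inputs), len(inputs[0])
    H, W = len(inputs[0][0]), len(inputs[0][0][0])
    K = len(weights)
    FY, FX = len(weights[0][0]), len(weights[0][0][0])
    OY, OX = H - FY + 1, W - FX + 1

    # Output tensor out[n][k][oy][ox], initialised to zero.
    out = [[[[0.0] * OX for _ in range(OY)] for _ in range(K)] for _ in range(N)]

    for n in range(N):                          # 1: batch
        for k in range(K):                      # 2: output channels
            for c in range(C):                  # 3: input channels
                for oy in range(OY):            # 4: output rows
                    for ox in range(OX):        # 5: output columns
                        for fy in range(FY):    # 6: filter rows
                            for fx in range(FX):  # 7: filter columns
                                out[n][k][oy][ox] += (
                                    inputs[n][c][oy + fy][ox + fx]
                                    * weights[k][c][fy][fx]
                                )
    return out


# Small usage example: one 3x3 single-channel image, one 2x2 all-ones filter.
image = [[[[1, 2, 3], [4, 5, 6], [7, 8, 9]]]]
kernel = [[[[1, 1], [1, 1]]]]
result = conv2d_seven_loops(image, kernel)
```

Every multiply-accumulate in this nest is independent apart from the accumulation into `out`, so the seven loops can be reordered, or some of them unrolled across parallel hardware, without changing the result. That freedom is the design space the abstract describes in terms of loop order and hardware parallelism.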