ET: Re-Thinking Self-Attention for Transformer Models on GPUs
Published in SC 21 (to appear), 2021
Download here
Published in SC 21 (to appear), 2021
Download here
Published in arXiv, 2021
Download here
Published in IEEE Transactions on Parallel and Distributed Systems, 2021
Download here
Published in IEEE Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC20), 2020
Download here
Published in 2019 IEEE HPEC, 2019
Download here