TTLG - An Efficient Tensor Transposition Library for GPUs

Conference Proceedings
2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2018
Authors
Jyothi Vedurada, Arjun Suresh, Aravind Sukumaran Rajam, Jinsung Kim, Changwan Hong, Ajay Panyala, Sriram Krishnamoorthy, V. Krishna Nandivada, Rohit Kumar Srivastava, P. Sadayappan
English