Kronecker Matrix-Matrix Multiplication (Kron-Matmul) is the multiplication of a matrix with the Kronecker Product of several smaller matrices. Kron-Matmul is a core operation for many scientific and machine learning computations. State-of-the-art Kron-Matmul implementations utilize existing tensor algebra operations, such as matrix multiplication, transpose, and tensor matrix multiplication. However, this design choice prevents several Kron-Matmul specific optimizations, thus, leaving significant performance on the table. To address this issue, we present FastKron, an efficient technique for Kron-Matmul on single and multiple GPUs. FastKron is independent of linear algebra operations enabling several new optimizations for Kron-Matmul. Thus, it performs up to 8.50× and 4.15× faster than existing implementations on 1 and 16 GPUs respectively.
Wed 6 MarDisplayed time zone: London change
10:00 - 11:00 | Linear AlgebraMain Conference at Moorfoot Chair(s): I-Ting Angelina Lee Washington University in St. Louis, USA | ||
10:00 20mTalk | A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs Main Conference Pang Meng Department of Computer Science and Technology, Tsinghua University, Xiang Fei Department of Computer Science and Technology, Tsinghua University, Peng Qu Department of Computer Science and Technology, Tsinghua University, Youhui Zhang Department of Computer Science and Technology, Tsinghua University, Zhaolin Li Department of Computer Science and Technology, Tsinghua University Link to publication DOI | ||
10:20 20mTalk | Fast Kronecker Matrix-Matrix Multiplications on GPUs Main Conference Link to publication DOI | ||
10:40 20mTalk | Arrow Matrix Decomposition: A Novel Approach for Communication-Efficient Sparse Matrix Multiplication Main Conference Lukas Gianinazzi ETH Zurich, Alexandros Nikolaos Ziogas ETH Zurich, Piotr Luczynski ETH Zurich, Langwen Huang ETH Zurich, Saleh Ashkboosh ETH Zurich, Florian Scheidl ETH Zurich, Armon Carigiet ETH Zurich, Chio Ge ETH Zurich, Nabil Abubaker ETH Zurich, Maciej Besta ETH Zurich, Tal Ben-Nun Lawrence Livermore National Laboratory, Torsten Hoefler ETH Zurich Link to publication DOI |