LU Factorization of Small Matrices: Accelerating Batched DGETRF on the GPU (conference paper)