Draft: Matmul Micro-kernels F32 <- (QSI8D32) LHS x (QAI4C32) RHS (!309) · Merge requests · Kleidi / KleidiAI · GitLab

Anitha Raj requested to merge f32_qai4_gemm into main Feb 20, 2025

GEMM Micro-kernel to compute the matrix multiplication of dynamically quantized symmetric signed 8-bit integer with per-block quantization (QSI8D32) LHS matrix and quantized asymmetric 4-bit signed integer with per-block quantization (QAI4C32) RHS matrix and the accumulation of the result into a single-precision (F32), optimized with FEAT_I8MM.

Signed-off-by: Anitha Raj anitha.raj@arm.com