RUS  ENG
Full version
JOURNALS // Proceedings of the Institute for System Programming of the RAS // Archive

Proceedings of ISP RAS, 2025 Volume 37, Issue 1, Pages 87–106 (Mi tisp953)

Approach to building AI-compilers using the MLIR framework

I. I. Kulaginab, R. A. Buchatskiya, M. V. Pantilimonova, A. V. Vyazovtsevac, M. M. Romanovab, D. M. Melnikab

a Ivannikov Institute for System Programming of the RAS
b Lomonosov Moscow State University
c Moscow Institute of Physics and Technology

Abstract: The development of matrix extensions of processor architectures, as well as the implementation of these extensions in specialized AI processors, can significantly improve the efficiency of artificial neural networks. The paper provides an overview of the basic functionality of some popular matrix extensions of processor architectures, in particular, ARM SME, RISC-V IME, RISC-V AME extensions, as well as the DaVinci processor architecture. As a result of the analysis, a model of an abstract matrix processor was proposed. This model reflects the features of modern processor architectures supporting matrix extensions. For the introduced model of the matrix processor, a heterogeneous matrix intermediate representation was developed, which can be used to build compilers for neural networks. The proposed intermediate representation was implemented in the MLIR infrastructure as a heteroMx dialect. The paper also describes an approach to building an AI compiler using the heteroMx dialect. The developed intermediate representation can be adapted or specified for other matrix processor architectures.

Keywords: matrix extension, RISC-V, ARM SME, AI-compiler intermediate representation, MLIR infrastructure

DOI: 10.15514/ISPRAS-2025-37(1)-5



© Steklov Math. Inst. of RAS, 2025