Zap. Nauchn. Sem. POMI, 2023 Volume 530, Pages 24–37 (Mi znsl7430)

Vector graphics generation with LLMs: approaches and models

B. Timofeenkoᵃ, V. Efimovaᵃ, A. Filchenkovᵇ

ᵃ ITMO University
ᵇ GO AI LAB

Abstract: The task of generating vector graphics with AI is under-researched. Recently, large language models (LLMs) have been successfully applied to many downstream tasks. For example, modern LLMs achieve remarkable quality in code generation and are publicly accessible. This study compares approaches to vector graphics generation with two LLMs, ChatGPT (GPT-3.5) and GPT-4. GPT-4 shows noticeable improvement over ChatGPT. Both models easily generate geometric primitives but struggle even with simple objects. The results produced by GPT-4 visually resemble the prompts but are inaccurate. GPT-4 is able to correct its output according to instructions. Recognizing an object from an SVG image is also challenging for both models: they correctly recognize only primitive objects.
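
As an illustration only, and not the authors' evaluation code, the kind of output discussed above can be inspected with a short Python sketch. It assumes the model's reply is SVG markup, checks that the markup is well-formed, and counts the basic shape elements it contains. The function name `summarize_svg`, the `PRIMITIVES` set, and the sample string are hypothetical and are not taken from the paper.

```python
import xml.etree.ElementTree as ET

# Basic SVG shape elements treated as "geometric primitives" in this sketch
# (an assumption made for illustration; the paper does not define this set).
PRIMITIVES = {"rect", "circle", "ellipse", "line", "polyline", "polygon", "path"}


def local_name(tag):
    """Strip an XML namespace prefix such as '{http://www.w3.org/2000/svg}'."""
    return tag.rsplit("}", 1)[-1]


def summarize_svg(svg_text):
    """Count shape elements in SVG markup.

    Returns a dict mapping element name to count, or None if the text is not
    well-formed XML or its root element is not <svg>.
    """
    try:
        root = ET.fromstring(svg_text)
    except ET.ParseError:
        return None
    if local_name(root.tag) != "svg":
        return None
    counts = {}
    for element in root.iter():
        name = local_name(element.tag)
        if name in PRIMITIVES:
            counts[name] = counts.get(name, 0) + 1
    return counts


# Hypothetical model output: a red circle and a small square.
sample = (
    '<svg xmlns="http://www.w3.org/2000/svg" width="100" height="100">'
    '<circle cx="50" cy="50" r="40" fill="red"/>'
    '<rect x="10" y="10" width="20" height="20"/>'
    "</svg>"
)

if __name__ == "__main__":
    print(summarize_svg(sample))
```

A check like this only confirms that the markup parses and which shapes it uses; it says nothing about whether the drawing resembles the prompted object, which is the harder question the abstract raises.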

Key words and phrases: large language models, vector graphics, generative AI, image generation, text-to-image synthesis.

UDC: 004.932

Received: 06.09.2023

Language: English


