{"ID":2846208,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.02530","arxiv_id":"2511.02530","title":"Implementation and Evaluation of Stable Diffusion on a General-Purpose CGLA Accelerator","abstract":"This paper presents the first implementation and in-depth evaluation of the primary computational kernels from the stable-diffusion.cpp image generation framework on IMAX3, a general-purpose Coarse-Grained Reconfigurable Array (CGRA) accelerator. We designed IMAX3 as a versatile computational platform, and this work assesses its capabilities by executing a demanding image generation workload. We evaluate its performance on a current Field-Programmable Gate Array (FPGA) prototype to establish a baseline and project its potential for a future Application-Specific Integrated Circuit (ASIC) implementation. Our results demonstrate that, despite its general-purpose architecture, IMAX3 achieves promising performance and power efficiency, particularly in its projected ASIC form. This work provides concrete guidelines for future IMAX architectural designs and establishes a foundation for developing next-generation, AI-specialized Coarse-Grained Linear Array (CGLA) accelerators by refining this versatile platform. Ultimately, this achievement contributes to the realization of energy-efficient, on-device, multi-modal AI platforms.","short_abstract":"This paper presents the first implementation and in-depth evaluation of the primary computational kernels from the stable-diffusion.cpp image generation framework on IMAX3, a general-purpose Coarse-Grained Reconfigurable Array (CGRA) accelerator. We designed IMAX3 as a versatile computational platform, and this work as...","url_abs":"https://arxiv.org/abs/2511.02530","url_pdf":"https://arxiv.org/pdf/2511.02530v1","authors":"[\"Takuto Ando\",\"Yu Eto\",\"Yasuhiko Nakashima\"]","published":"2025-11-04T12:33:58Z","proceeding":"cs.AR","tasks":"[\"cs.AR\"]","methods":"[\"Diffusion Model\"]","has_code":false}
