2025 AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models
페이지 정보
작성자 연구소장 작성일 26-01-07 16:01본문
- Year
- 2025
Abstract
We present in this paper a novel post-training quantization (PTQ) method, dubbed
AccuQuant, for diffusion models. We show analytically and empirically that quan-
tization errors for diffusion models are accumulated over denoising steps in a
sampling process. To alleviate the error accumulation problem, AccuQuant mini-
mizes the discrepancies between outputs of a full-precision diffusion model and its
quantized version within a couple of denoising steps. That is, it simulates multiple
denoising steps of a diffusion sampling process explicitly for quantization, account-
ing the accumulated errors over multiple denoising steps, which is in contrast to
previous approaches to imitating a training process of diffusion models, namely,
minimizing the discrepancies independently for each step. We also present an
efficient implementation technique for AccuQuant, together with a novel objective,
which reduces a memory complexity significantly from O(n) to O(1), where n
is the number of denoising steps. We demonstrate the efficacy and efficiency of
AccuQuant across various tasks and diffusion models on standard benchmarks.
첨부파일
-
AccuQuant- Simulating Multiple Denoising Steps for Quantizing Diffusion Models.pdf (20.9M)
11회 다운로드 | DATE : 2026-01-07 16:01:25