PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Read original: arXiv:2405.07510 - Published 9/4/2024 by Hanshu Yan, Xingchao Liu, Jiachun Pan, Jun Hao Liew, Qiang Liu, Jiashi Feng


Overview

Introduces a new flow-based method called Piecewise Rectified Flow (PeRFlow) for accelerating diffusion models
PeRFlow divides the sampling process into time windows and straightens the trajectories in each interval, creating piecewise linear flows
PeRFlow achieves superior performance in few-step generation and shows advantageous transfer ability as a universal plug-and-play accelerator for pre-trained diffusion models

Plain English Explanation

PeRFlow is a new technique that can make diffusion models, a type of generative AI model, generate samples much faster. Diffusion models work by gradually transforming random noise into realistic images or other data. PeRFlow divides this transformation process into shorter time intervals and "straightens out" the trajectories in each interval, resulting in piecewise linear flows that are more efficient.

This approach allows PeRFlow to generate high-quality samples in just a few steps, much faster than standard diffusion models. Additionally, PeRFlow models can be used as universal "plug-and-play" accelerators that work with different types of pre-trained diffusion models, making them broadly applicable. The researchers have made the code for training and using PeRFlow publicly available, allowing others to build on this work.

Technical Explanation

PeRFlow is a flow-based method that aims to accelerate the sampling process of diffusion models. The key idea is to divide the diffusion process into several time windows and apply a "reflow" operation to straighten the trajectories within each interval, creating a piecewise linear flow.
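The windowing idea described above can be illustrated with a toy one-dimensional example. This is only a sketch under stated assumptions, not the authors' implementation: `curved_velocity` is a hypothetical stand-in for the ODE velocity of a pre-trained model, and the "straightened" velocity is computed directly from one trajectory's window endpoints, whereas PeRFlow trains a network to predict it for all trajectories.

```python
import numpy as np

def curved_velocity(z, t):
    # Hypothetical stand-in for a pre-trained model's ODE velocity (yields a curved path).
    return -2.0 * z + np.sin(4.0 * t)

def integrate(velocity, z0, t0, t1, n_steps):
    """Explicit Euler integration of dz/dt = velocity(z, t) from t0 to t1."""
    z, dt = z0, (t1 - t0) / n_steps
    for i in range(n_steps):
        z = z + dt * velocity(z, t0 + i * dt)
    return z

def window_endpoints(z0, boundaries, fine_steps=2000):
    """Trace the curved trajectory with a fine solver and record z at each window boundary."""
    zs = [z0]
    for a, b in zip(boundaries[:-1], boundaries[1:]):
        zs.append(integrate(curved_velocity, zs[-1], a, b, fine_steps))
    return np.array(zs)

def piecewise_linear_sample(z0, boundaries, endpoints):
    """Take one Euler step per window along the straightened (chord) velocity."""
    z = z0
    for k in range(len(boundaries) - 1):
        dt = boundaries[k + 1] - boundaries[k]
        v_k = (endpoints[k + 1] - endpoints[k]) / dt  # constant inside window k
        z = z + dt * v_k
    return z

boundaries = np.linspace(0.0, 1.0, 5)  # 4 time windows
z0 = 1.5
endpoints = window_endpoints(z0, boundaries)
reference = endpoints[-1]  # fine-solver result at t = 1

few_step_curved = integrate(curved_velocity, z0, 0.0, 1.0, 4)
few_step_straight = piecewise_linear_sample(z0, boundaries, endpoints)

print("4-step error on curved flow:      ", abs(few_step_curved - reference))
print("4-step error on straightened flow:", abs(few_step_straight - reference))
```

With the same budget of four steps, the solver on the curved flow drifts away from the reference, while the piecewise linear flow reaches it up to floating-point rounding, because a constant velocity inside each window is integrated exactly by a single Euler step.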

Within each time window, the reflow operation trains the model to follow a nearly straight path between the window's start and end points, so the full trajectory approaches a piecewise linear flow. Because a straight segment can be traversed accurately with a single large solver step, PeRFlow achieves superior performance in few-step generation.
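One way to write the per-window reflow objective is sketched below. The notation is assumed for illustration and does not come from this summary: the windows are $0 = t_0 < t_1 < \dots < t_K = 1$, $\Phi$ denotes an ODE solver run on the pre-trained model, and $v_\theta$ is the velocity network being trained. The time direction and the weighting inside the expectation are likewise assumptions.

```latex
% Assumed notation: windows 0 = t_0 < t_1 < \dots < t_K = 1,
% \Phi = ODE solver of the pre-trained model, v_\theta = trained velocity network.
\begin{aligned}
z_{t_k} &= \Phi\left(z_{t_{k-1}},\, t_{k-1},\, t_k\right), \\
z_t &= z_{t_{k-1}} + \frac{t - t_{k-1}}{t_k - t_{k-1}}\left(z_{t_k} - z_{t_{k-1}}\right),
      \qquad t \in [t_{k-1}, t_k], \\
\mathcal{L}(\theta) &= \mathbb{E}_{k,\, t,\, z_{t_{k-1}}}
      \left\| v_\theta(z_t, t) - \frac{z_{t_k} - z_{t_{k-1}}}{t_k - t_{k-1}} \right\|^2 .
\end{aligned}
```

The regression target is the chord between the two window endpoints, so a model that fits it well has a constant velocity inside each window. The solver $\Phi$ only has to cover one short window per training sample rather than the whole trajectory.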

Moreover, the researchers demonstrate that the obtained PeRFlow models can serve as universal plug-and-play accelerators that are compatible with various workflows built on pre-trained diffusion models. This transfer ability comes from dedicated parameterizations through which PeRFlow inherits knowledge from the pre-trained model, which also lets training converge quickly.
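A common way to realize this kind of plug-and-play transfer is to treat the accelerated model as a weight offset from its base model and add that offset to another model fine-tuned from the same base. The sketch below illustrates only that general idea. The weight-delta mechanism, the function `apply_delta`, and the toy parameter dictionaries are assumptions for illustration and are not described in this summary.

```python
import numpy as np

def apply_delta(base, accelerated, target):
    """Return target + (accelerated - base), computed parameter by parameter.

    All three arguments are dicts mapping parameter names to arrays of equal shape.
    """
    return {name: target[name] + (accelerated[name] - base[name]) for name in target}

rng = np.random.default_rng(0)

# Hypothetical toy "models" with a single 2x2 weight matrix each.
base = {"w": rng.normal(size=(2, 2))}                             # pre-trained model
accelerated = {"w": base["w"] + 0.1}                              # accelerated version of base
target = {"w": base["w"] + rng.normal(scale=0.01, size=(2, 2))}   # another fine-tune of base

merged = apply_delta(base, accelerated, target)
print(merged["w"] - target["w"])  # the offset that was transferred
```

The merge needs no retraining of the target model, which is what makes an accelerator of this kind cheap to reuse. Whether quality carries over depends on how far the target has moved from the shared base.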

Critical Analysis

The paper presents a novel and promising approach to accelerating diffusion models. By introducing piecewise linear flows, PeRFlow is able to generate high-quality samples in just a few steps, which is a significant improvement over standard diffusion models.

One potential limitation mentioned in the paper is that the reflow operation may introduce some bias in the sampling process, which could impact the fidelity of the generated samples. The researchers acknowledge this issue and suggest further research to address this trade-off between sample quality and generation speed.

Additionally, while the transfer ability of PeRFlow models is a notable advantage, the paper does not provide a comprehensive evaluation of their performance across a wide range of pre-trained diffusion models and tasks. Further research could explore the versatility and limitations of PeRFlow as a universal accelerator.

Overall, the PeRFlow method presents an interesting and valuable contribution to the field of flow-based generative models, demonstrating the potential for improving the efficiency of diffusion-based approaches to generative AI.

Conclusion

PeRFlow is a novel flow-based method that can significantly accelerate the sampling process of diffusion models, enabling the generation of high-quality samples in just a few steps. By dividing the sampling process into time windows and straightening the trajectory within each one, PeRFlow achieves superior performance in few-step generation while also showing advantageous transfer ability as a universal plug-and-play accelerator for pre-trained diffusion models.

The open-sourcing of the PeRFlow implementation allows researchers and developers to build upon this work, potentially leading to further advancements in the efficiency and practicality of diffusion-based generative models. As the field of generative AI continues to evolve, techniques like PeRFlow that can enhance the performance and applicability of these models will likely play an increasingly important role.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!


Related Papers


PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Hanshu Yan, Xingchao Liu, Jiachun Pan, Jun Hao Liew, Qiang Liu, Jiashi Feng

We present Piecewise Rectified Flow (PeRFlow), a flow-based method for accelerating diffusion models. PeRFlow divides the sampling process of generative flows into several time windows and straightens the trajectories in each interval via the reflow operation, thereby approaching piecewise linear flows. PeRFlow achieves superior performance in a few-step generation. Moreover, through dedicated parameterizations, the PeRFlow models inherit knowledge from the pretrained diffusion models. Thus, the training converges fast and the obtained models show advantageous transfer ability, serving as universal plug-and-play accelerators that are compatible with various workflows based on the pre-trained diffusion models. Codes for training and inference are publicly released. https://github.com/magic-research/piecewise-rectified-flow

9/4/2024

Improving the Training of Rectified Flows

Sangyun Lee, Zinan Lin, Giulia Fanti

Diffusion models have shown great promise for image and video generation, but sampling from state-of-the-art models requires expensive numerical integration of a generative ODE. One approach for tackling this problem is rectified flows, which iteratively learn smooth ODE paths that are less susceptible to truncation error. However, rectified flows still require a relatively large number of function evaluations (NFEs). In this work, we propose improved techniques for training rectified flows, allowing them to compete with knowledge distillation methods even in the low NFE setting. Our main insight is that under realistic settings, a single iteration of the Reflow algorithm for training rectified flows is sufficient to learn nearly straight trajectories; hence, the current practice of using multiple Reflow iterations is unnecessary. We thus propose techniques to improve one-round training of rectified flows, including a U-shaped timestep distribution and LPIPS-Huber premetric. With these techniques, we improve the FID of the previous 2-rectified flow by up to 72% in the 1 NFE setting on CIFAR-10. On ImageNet 64$\times$64, our improved rectified flow outperforms the state-of-the-art distillation methods such as consistency distillation and progressive distillation in both one-step and two-step settings and rivals the performance of improved consistency training (iCT) in FID. Code is available at https://github.com/sangyun884/rfpp.

5/31/2024

Text-to-Image Rectified Flow as Plug-and-Play Priors

Xiaofeng Yang, Cheng Chen, Xulei Yang, Fayao Liu, Guosheng Lin

Large-scale diffusion models have achieved remarkable performance in generative tasks. Beyond their initial training applications, these models have proven their ability to function as versatile plug-and-play priors. For instance, 2D diffusion models can serve as loss functions to optimize 3D implicit models. Rectified flow, a novel class of generative models, enforces a linear progression from the source to the target distribution and has demonstrated superior performance across various domains. Compared to diffusion-based methods, rectified flow approaches surpass in terms of generation quality and efficiency, requiring fewer inference steps. In this work, we present theoretical and experimental evidence demonstrating that rectified flow based methods offer similar functionalities to diffusion models - they can also serve as effective priors. Besides the generative capabilities of diffusion priors, motivated by the unique time-symmetry properties of rectified flow models, a variant of our method can additionally perform image inversion. Experimentally, our rectified flow-based priors outperform their diffusion counterparts - the SDS and VSD losses - in text-to-3D generation. Our method also displays competitive performance in image inversion and editing.

6/6/2024

SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow

Yuanzhi Zhu, Xingchao Liu, Qiang Liu

Diffusion models excel in high-quality generation but suffer from slow inference due to iterative sampling. While recent methods have successfully transformed diffusion models into one-step generators, they neglect model size reduction, limiting their applicability in compute-constrained scenarios. This paper aims to develop small, efficient one-step diffusion models based on the powerful rectified flow framework, by exploring joint compression of inference steps and model size. The rectified flow framework trains one-step generative models using two operations, reflow and distillation. Compared with the original framework, squeezing the model size brings two new challenges: (1) the initialization mismatch between large teachers and small students during reflow; (2) the underperformance of naive distillation on small student models. To overcome these issues, we propose Annealing Reflow and Flow-Guided Distillation, which together comprise our SlimFlow framework. With our novel framework, we train a one-step diffusion model with an FID of 5.02 and 15.7M parameters, outperforming the previous state-of-the-art one-step diffusion model (FID=6.47, 19.4M parameters) on CIFAR10. On ImageNet 64$\times$64 and FFHQ 64$\times$64, our method yields small one-step diffusion models that are comparable to larger models, showcasing the effectiveness of our method in creating compact, efficient one-step diffusion models.

7/19/2024