Build Custom Diffusion Workflows with the New Modular Diffusers Architecture

Hugging Face · Feature Update · 2026-03-05 · notable

Briefing for: Engineering

What happened

Hugging Face released Diffusers 0.37.0, introducing 'Modular Diffusers' which allows developers to build pipelines by composing reusable blocks rather than writing monolithic classes. The update also adds compatibility for Transformers v5, new context-parallelism backends for distributed inference, and significant new caching methods like MagCache.

Why it matters

This architectural shift significantly reduces code duplication and makes it easier to create hybrid pipelines (e.g., mixing Flux components with custom guiders). The library improvements in parallelism and caching are critical for maintaining performance as video generation models grow in parameter size.

What this enables

If you develop custom image generation apps, use the modular blocks to swap specific components like schedulers or encoders without overriding the entire pipeline class.
If you run distributed inference across multiple GPUs, the new Ulysses and Sequence Parallel attention backends optimize memory usage for high-resolution video generation.
If you are migrating to Transformers v5, you can now update your core environment without breaking your diffusion-based generation code.

Get personalized AI briefings for your role at Changecast →