Add NewbiePipeline and NextDiT_3B_GQA_patch2_Adaln_Refiner_WHIT_CLIP transformer #12789

E-Anlia · 2025-12-04T05:27:05Z

This PR introduces a new text-to-image pipeline named NewbiePipeline, as well as a new
NextDiT-based transformer architecture,
NextDiT_3B_GQA_patch2_Adaln_Refiner_WHIT_CLIP, fully implemented following
Diffusers' pipeline and model design principles.

🚀 Main additions

• New pipeline
Adds NewbiePipeline under diffusers.pipelines.newbie/.
The pipeline follows the standard Diffusers structure (DiffusionPipeline subclass) and
supports loading via from_pretrained.

• New transformer architecture
Adds transformer_newbie.py, implementing:

NextDiT backbone with grouped-query attention (GQA)
Adaln-Refiner blocks
Patch-size 2 vision encoder
36 transformer layers
2304 hidden dims
WHIT CLIP–style text conditioning

The transformer inherits from ModelMixin, enabling standard save/load, weight
serialization and integration with Diffusers utilities.

• RMSNorm implementation
Adds RMSNorm to diffusers.models.components, using a PyTorch fallback and supporting
Apex fused RMSNorm if available.

• Scheduler compatibility
The pipeline is compatible with FlowMatchEulerDiscreteScheduler without requiring
additional custom scheduler code.

🧩 Motivation

This PR provides an implementation of a modern NextDiT-style text-to-image architecture
with high-resolution capability and strong conditioning support.
The goal is to enable researchers and users to load, run, and fine-tune this model
directly through Diffusers with minimal friction.

📁 Files added

src/diffusers/models/components.py
src/diffusers/models/transformers/transformer_newbie.py
src/diffusers/pipelines/newbie/pipeline_newbie.py
src/diffusers/pipelines/newbie/init.py

shell
Copy code

📁 Files modified

src/diffusers/init.py
src/diffusers/models/init.py
src/diffusers/models/transformers/init.py
src/diffusers/pipelines/init.py

yaml
Copy code

✔ Notes

No external dependencies required
Apex is optional; PyTorch RMSNorm is the default path
The pipeline has been tested locally with from_pretrained and produces expected outputs
Follows the established structure of Diffusers pipelines & transformer modules

Fixes # (no issue linked)

Before submitting

I have read the contributor guidelines
This PR introduces a new pipeline and model
All necessary registration points are updated
The implementation is consistent with existing Diffusers conventions

Who can review?

Tagging pipeline & transformer reviewers:
@asomoza @yiyixuxu @sayakpaul

sayakpaul · 2025-12-04T11:01:03Z

Can you link the original codebase, paper, and some results of this model?

E-Anlia · 2025-12-05T03:11:10Z

https://huggingface.co/NewBie-AI/NewBie-image-Exp0.1
https://github.com/[NewBieAI-Lab/NewBie-image-Exp0.1

This model is based on improvements made to research on lumina.
Based on NextDiT
Example：

Add NewbiePipeline and NextDiT_3B_GQA_patch2_Adaln_Refiner_WHIT_CLIP

5e1b2d3

Merge branch 'main' into add-newbie-pipeline

6cc072c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add NewbiePipeline and NextDiT_3B_GQA_patch2_Adaln_Refiner_WHIT_CLIP transformer #12789

Add NewbiePipeline and NextDiT_3B_GQA_patch2_Adaln_Refiner_WHIT_CLIP transformer #12789

E-Anlia commented Dec 4, 2025

Uh oh!

sayakpaul commented Dec 4, 2025

Uh oh!

E-Anlia commented Dec 5, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add NewbiePipeline and NextDiT_3B_GQA_patch2_Adaln_Refiner_WHIT_CLIP transformer #12789

Are you sure you want to change the base?

Add NewbiePipeline and NextDiT_3B_GQA_patch2_Adaln_Refiner_WHIT_CLIP transformer #12789

Conversation

E-Anlia commented Dec 4, 2025

🚀 Main additions

🧩 Motivation

📁 Files added

📁 Files modified

✔ Notes

Before submitting

Who can review?

Uh oh!

sayakpaul commented Dec 4, 2025

Uh oh!

E-Anlia commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

E-Anlia commented Dec 5, 2025 •

edited

Loading