add mxfp4 qat, mainly packing code. #2347
Conversation
Pull request overview
This PR adds support for MXFP4 (4-bit microscaling floating point) quantization-aware training (QAT), extending the existing MXFP8 support. The implementation includes packing utilities that convert FP4 values into a packed uint8 format (two 4-bit codes per byte) and export functionality for serialization.
Key changes:
- Added MXFP4 to QAT module mappings alongside MXFP8
- Implemented FP4 packing/unpacking utilities with bit manipulation (a sketch of the packing scheme appears after this list)
- Extended export logic to handle MXFP4 format with packed weight buffers
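To make the bit-level packing concrete, here is a minimal sketch of packing two 4-bit codes into one uint8 and unpacking them again. The function names and the low-nibble-first byte order are illustrative assumptions, not taken from the PR's mxfp4_packing.py:

```python
import torch

def pack_uint4_to_uint8(codes: torch.Tensor) -> torch.Tensor:
    """Pack pairs of 4-bit codes (one per uint8 element) into single bytes.

    Assumes the last dimension is even and every element fits in 4 bits.
    Even-indexed codes go in the low nibble, odd-indexed codes in the high
    nibble (the nibble order here is an assumption, not the PR's layout).
    """
    codes = codes.to(torch.uint8)
    low = codes[..., 0::2] & 0x0F
    high = codes[..., 1::2] & 0x0F
    return low | (high << 4)

def unpack_uint8_to_uint4(packed: torch.Tensor) -> torch.Tensor:
    """Inverse of pack_uint4_to_uint8: split each byte back into two codes."""
    low = packed & 0x0F
    high = (packed >> 4) & 0x0F
    # Interleave so that (low, high) pairs restore the original element order.
    return torch.stack((low, high), dim=-1).flatten(start_dim=-2)
```

Packing this way halves the storage of the quantized weights, which is consistent with the export path above storing the packed result in weight buffers.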
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| neural_compressor/torch/quantization/config.py | Adds MXFP4 to QAT module mappings for torch.nn.Linear |
| neural_compressor/torch/export/export_hf.py | Adds MXFP4 export path that packs weights into buffers |
| neural_compressor/torch/algorithms/qat/tensor_quantizer.py | Implements MXFP4 weight packing using new packing utilities |
| neural_compressor/torch/algorithms/qat/quant_utils.py | Adds MXFP4 detection and sets float-quantized format |
| neural_compressor/torch/algorithms/qat/mxfp4_packing.py | New file with FP4 casting and uint4-to-uint8 packing functions |
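Since mxfp4_packing.py is described as handling FP4 casting as well, the sketch below illustrates one way to cast floats to 4-bit E2M1 codes, the element format used by MXFP4, whose representable magnitudes are 0, 0.5, 1, 1.5, 2, 3, 4, and 6. In MXFP4 this elementwise cast would apply after dividing each block of values by its shared scale; that scaling step is omitted here. The function name, round-to-nearest rule, and code layout are assumptions for illustration, not the PR's actual implementation:

```python
import torch

# Representable E2M1 magnitudes, indexed by the low 3 bits of the FP4 code.
_E2M1_VALUES = torch.tensor([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])

def cast_to_fp4_code(x: torch.Tensor) -> torch.Tensor:
    """Map each float to the 4-bit E2M1 code of its nearest representable value.

    Returns uint8 codes in [0, 15]: bit 3 holds the sign, bits 0-2 index the
    magnitude table. Ties resolve to the first match via argmin, which may
    differ from the rounding rule used in the PR.
    """
    values = _E2M1_VALUES.to(device=x.device, dtype=x.dtype)
    sign = (x < 0).to(torch.uint8) << 3
    mag = x.abs().clamp(max=values[-1])
    # Distance from each element to every representable magnitude.
    dist = (mag.unsqueeze(-1) - values).abs()
    code = dist.argmin(dim=-1).to(torch.uint8)
    return sign | code
```

The resulting 4-bit codes are exactly what a uint4-to-uint8 packer (like the sketch above) would then compress two-per-byte.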
yiliu30 left a comment:
Please add some UTs.
add ut
.../pytorch/nlp/huggingface_models/language-modeling/quantization/llm_qat/quantize_autoround.py
Code issue not related to this PR.
Description
Add MXFP4 QAT (mainly packing code).