📝 Description
While going through your YouTube video explanation on quantization, I came across a doubt when validating the formulas for scale and zero_point for asymmetric and symmetric quantization: the values I compute don't match those in the notebook code examples.
🤯 Observation
In the screenshot above from the Post-Training Quantisation notebook:
the MinMaxObserver for Linear layer 1 has calculated the min (beta) and max (alpha) values as min_val=-53.58397674560547, max_val=34.898128509521484.
Using these min and max values, the scale and zero_point for the QuantizedLinear layer 1 are scale=0.6967094540596008, zero_point=77.
❔ Question/Doubt
Formulae for calculating s and z for asymmetric quantization:
Scale = (Xmax - Xmin) / (2^n - 1)
Zero_point = -Xmin / Scale
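The two formulas above can be checked numerically with a short sketch (plain floats, no torch needed), plugging in the min/max values reported by the MinMaxObserver:

```python
# Asymmetric (affine) quantization parameters for unsigned 8-bit integers,
# using the observer's min/max values from layer 1 of the notebook.
x_min = -53.58397674560547  # beta
x_max = 34.898128509521484  # alpha
n_bits = 8

scale = (x_max - x_min) / (2 ** n_bits - 1)  # (Xmax - Xmin) / 255
zero_point = round(-x_min / scale)           # -Xmin / Scale, rounded to an int

print(scale)       # ≈ 0.34699
print(zero_point)  # 154
```

This yields scale ≈ 0.34699 and zero_point = 154, i.e. roughly half the scale and double the zero_point that the notebook reports.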
Considering that the default qscheme for MinMaxObserver is torch.per_tensor_affine with dtype=torch.quint8, the quantization used by the PyTorch quantization library is asymmetric.
Shouldn't the scale and zero_point for the QuantizedLinear layer, according to asymmetric quantization to 8-bit integers, be:
scale = 0.34698863, zero_point = -(-53.58397674560547) / 0.34698863 ≈ 154.43?
Why is the scale value in the notebook screenshot roughly twice the value calculated from the formula (2 × 0.34698863 ≈ 0.694, vs. 0.6967094540596008 in the notebook)?
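One possible explanation for the factor of ~2 (an assumption on my part, not confirmed from the notebook) is PyTorch's `reduce_range` option: with `reduce_range=True`, which the default fbgemm qconfig uses for activations to avoid intermediate overflow, the quint8 quantized range is restricted to 0..127 (7 usable bits) instead of 0..255, which roughly doubles the scale. A minimal sketch under that assumption:

```python
# Compare the scale for the full 8-bit range vs. the reduced 7-bit range.
x_min = -53.58397674560547
x_max = 34.898128509521484

scale_8bit = (x_max - x_min) / 255  # full quint8 range 0..255
scale_7bit = (x_max - x_min) / 127  # reduce_range=True: quint8 restricted to 0..127

print(scale_8bit)  # ≈ 0.34699
print(scale_7bit)  # ≈ 0.69671 — close to the notebook's 0.6967094540596008

zero_point_7bit = round(-x_min / scale_7bit)
print(zero_point_7bit)  # 77 — matches the notebook's zero_point
```

With the 127-step range, both the scale (≈0.6967) and the zero_point (77) line up with the notebook values, so reduce_range may be what's happening here.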
@hkproj Can you please shed some light on the calculation?
Thank you