Custom Quantization #1785

anthony-lemurian · 2024-05-10T22:18:18Z

anthony-lemurian
May 10, 2024

Hello, Im just wondering if its possible to define a custom data type to do WOQ in this repo? Im following the MX branch to see how they add that data type, however i wonder if there is a more straightforward approach since im only after WOQ

Answered by yiliu30

May 13, 2024

Hey @anthony-lemurian, thanks for showing interest in our project!

For WOQ, the basic idea is quantize and dequantize the tensor (weight) to mimic the quantization error. The main function for this is quant_tensor, which takes a tensor (weight) and certain configurations to select the qdq_weight_actor.

Taking 4-bits as an example, qdq_weight_asym applies asymmetrical quantization and dequantization to the provided weight.

Hope this can give you some insights to define new data type :)

View full answer

yiliu30 · 2024-05-13T14:37:53Z

yiliu30
May 13, 2024
Collaborator

Hey @anthony-lemurian, thanks for showing interest in our project!

For WOQ, the basic idea is quantize and dequantize the tensor (weight) to mimic the quantization error. The main function for this is quant_tensor, which takes a tensor (weight) and certain configurations to select the qdq_weight_actor.

Taking 4-bits as an example, qdq_weight_asym applies asymmetrical quantization and dequantization to the provided weight.

Hope this can give you some insights to define new data type :)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Custom Quantization #1785

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Custom Quantization #1785

Uh oh!

anthony-lemurian May 10, 2024

Replies: 1 comment

Uh oh!

yiliu30 May 13, 2024 Collaborator

anthony-lemurian
May 10, 2024

yiliu30
May 13, 2024
Collaborator