Skip to content

Custom Quantization #1785

Answered by yiliu30
anthony-lemurian asked this question in Q&A
Discussion options

You must be logged in to vote

Hey @anthony-lemurian, thanks for showing interest in our project!

For WOQ, the basic idea is quantize and dequantize the tensor (weight) to mimic the quantization error. The main function for this is quant_tensor, which takes a tensor (weight) and certain configurations to select the qdq_weight_actor.

Taking 4-bits as an example, qdq_weight_asym applies asymmetrical quantization and dequantization to the provided weight.

Hope this can give you some insights to define new data type :)

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by xin3he
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants