MarlinQQQTensor¶
- class torchao.dtypes.MarlinQQQTensor(tensor_impl: AQTTensorImpl, block_size: Tuple[int, ...], shape: Size, quant_min: Optional[Union[int, float]] = None, quant_max: Optional[Union[int, float]] = None, zero_point_domain: ZeroPointDomain = ZeroPointDomain.INT, dtype=None, strides=None)[source]¶
MarlinQQQ 量化張量子類,繼承自 AffineQuantizedTensor 類。
要了解在 choose_qparams_and_quantize_affine_qqq、marllin qqq 量化的量化和反量化過程中發生的情況,請訪問 https://github.com/pytorch/ao/blob/main/torchao/quantization/quant_primitives.py 並檢視兩個量化原始操作:choose_qparams_and_quantize_affine_qqq 和 dequantize_affine_qqq