vllm.model_executor.layers.quantization.kernels.mixed_precision.xpu
_XPUWNA16_SUPPORTED_QUANT_TYPES module-attribute
XPUwNa16LinearKernel
Bases: MPLinearKernel
Source code in vllm/model_executor/layers/quantization/kernels/mixed_precision/xpu.py
apply_weights
can_implement classmethod
can_implement(
c: MPLinearLayerConfig,
) -> tuple[bool, str | None]
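The `tuple[bool, str | None]` return type suggests a "can this kernel handle the layer, and if not, why" contract. The following is a minimal sketch of that pattern only, not the vLLM implementation: `ToyLayerConfig`, `ToyKernel`, `SUPPORTED_WEIGHT_TYPES`, and `pick_kernel` are hypothetical names invented for illustration, and the check shown is an assumed example rather than the condition `XPUwNa16LinearKernel` actually tests.

```python
from __future__ import annotations

from dataclasses import dataclass


# Hypothetical stand-in for a layer config; not vLLM's MPLinearLayerConfig.
@dataclass
class ToyLayerConfig:
    weight_type: str
    group_size: int


# Hypothetical kernel illustrating the (ok, reason) return contract.
class ToyKernel:
    # Assumed set of supported types, chosen only for this example.
    SUPPORTED_WEIGHT_TYPES = {"uint4", "uint4b8"}

    @classmethod
    def can_implement(cls, c: ToyLayerConfig) -> tuple[bool, str | None]:
        # On failure, return False plus a human-readable reason.
        if c.weight_type not in cls.SUPPORTED_WEIGHT_TYPES:
            return False, f"weight type {c.weight_type} is not supported"
        # On success, the reason slot is None.
        return True, None


def pick_kernel(config: ToyLayerConfig, candidates: list[type]):
    """Return the first candidate that accepts the config, plus the
    reasons collected from every candidate that rejected it."""
    failures: list[str] = []
    for kernel in candidates:
        ok, reason = kernel.can_implement(config)
        if ok:
            return kernel, failures
        failures.append(f"{kernel.__name__}: {reason}")
    return None, failures
```

Returning the reason alongside the flag lets a caller report why each candidate was skipped instead of failing silently.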
process_weights_after_loading
process_weights_after_loading(layer: Module)