vllm.model_executor.layers.quantization.kernels.mixed_precision.MPLinearKernel
MPLinearKernel
Bases: ABC
Source code in vllm/model_executor/layers/quantization/kernels/mixed_precision/MPLinearKernel.py
__init__
__init__(
    c: MPLinearLayerConfig,
    w_q_param_name: str,
    w_s_param_name: str,
    w_zp_param_name: Optional[str] = None,
    w_gidx_param_name: Optional[str] = None,
) -> None
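The constructor takes parameter names rather than tensors. A minimal sketch of how such names might be used is shown here, assuming each name refers to an attribute registered on the layer object and that the optional zero-point and group-index names may be absent. `NameHoldingKernel`, its attribute names, and `get_weight_params` are hypothetical stand-ins for illustration, not vLLM's implementation, and the config argument is treated as opaque.

```python
from types import SimpleNamespace
from typing import Any, Optional


class NameHoldingKernel:
    """Hypothetical stand-in that mirrors the constructor signature above."""

    def __init__(
        self,
        c: Any,  # opaque here; the real class takes an MPLinearLayerConfig
        w_q_param_name: str,
        w_s_param_name: str,
        w_zp_param_name: Optional[str] = None,
        w_gidx_param_name: Optional[str] = None,
    ) -> None:
        self.config = c
        self.w_q_name = w_q_param_name
        self.w_s_name = w_s_param_name
        self.w_zp_name = w_zp_param_name
        self.w_gidx_name = w_gidx_param_name

    def get_weight_params(self, layer: Any) -> tuple:
        # Assumption: names are attribute names on the layer; a missing
        # optional name yields None instead of a lookup.
        def lookup(name: Optional[str]) -> Any:
            return getattr(layer, name) if name is not None else None

        return (
            lookup(self.w_q_name),
            lookup(self.w_s_name),
            lookup(self.w_zp_name),
            lookup(self.w_gidx_name),
        )


# Usage: a plain namespace stands in for a layer holding quantized weights.
layer = SimpleNamespace(qweight=[1, 2], scales=[0.5])
kernel = NameHoldingKernel(None, "qweight", "scales")
weights = kernel.get_weight_params(layer)
```

Passing names instead of tensors lets one kernel class work with layers that register their quantized weights, scales, zero points, and group indices under different attribute names.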
_get_weight_params
_transform_param
apply_weights
abstractmethod
can_implement
abstractmethod
classmethod
can_implement(
    c: MPLinearLayerConfig,
) -> tuple[bool, Optional[str]]
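The return type suggests a pair of a support flag and an optional message. The sketch below illustrates one way that contract could be used, assuming the string carries a failure reason when the flag is False. `SketchConfig`, `SketchKernel`, the two subclasses, and `choose_kernel` are hypothetical stand-ins; the fields of the real MPLinearLayerConfig are not shown on this page and are not reproduced here.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Optional


@dataclass
class SketchConfig:
    # Hypothetical stand-in for MPLinearLayerConfig; fields are illustrative.
    weight_bits: int
    group_size: int


class SketchKernel(ABC):
    @classmethod
    @abstractmethod
    def can_implement(cls, c: SketchConfig) -> tuple[bool, Optional[str]]:
        """Return (True, None) if supported, else (False, reason)."""


class FourBitKernel(SketchKernel):
    @classmethod
    def can_implement(cls, c: SketchConfig) -> tuple[bool, Optional[str]]:
        if c.weight_bits != 4:
            return False, f"only 4-bit weights supported, got {c.weight_bits}"
        if c.group_size not in (-1, 128):
            return False, f"unsupported group_size {c.group_size}"
        return True, None


class FallbackKernel(SketchKernel):
    @classmethod
    def can_implement(cls, c: SketchConfig) -> tuple[bool, Optional[str]]:
        return True, None


def choose_kernel(c: SketchConfig, candidates: list) -> tuple:
    """Return the first candidate that accepts c, plus earlier rejections."""
    reasons = []
    for kernel_cls in candidates:
        ok, why = kernel_cls.can_implement(c)
        if ok:
            return kernel_cls, reasons
        reasons.append(f"{kernel_cls.__name__}: {why}")
    raise ValueError("no kernel can implement config: " + "; ".join(reasons))


chosen, rejected = choose_kernel(
    SketchConfig(weight_bits=8, group_size=128),
    [FourBitKernel, FallbackKernel],
)
```

Because the check is a classmethod, a caller can query each candidate class before constructing any instance, and the collected reasons make a failed selection easier to diagnose.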