vllm.models.deepseek_v4.attention ¶
DeepseekV4 MLA Attention Layer
_select_v4_sparse_impl ¶
_select_v4_sparse_impl() -> type[
DeepseekV4SparseMLAAttentionImpl
]
Pick the platform-specific V4 sparse MLA impl class. Sole platform check.