Skip to content

vllm.models.deepseek_v4.common.ops.sparse_attn_compress_cutedsl

CuTe DSL sparse-attention compressor for DeepSeek V4.

The public wrappers provide the C4 fused and C128 split kernels.