vllm.model_executor.layers.mamba.mamba2_metadata
Mamba2Metadata
dataclass
¶
Source code in vllm/model_executor/layers/mamba/mamba2_metadata.py
_query_start_loc_to_chunk_indices_offsets
¶
_query_start_loc_to_chunk_indices_offsets(
query_start_loc: Tensor,
chunk_size: int,
total_seqlens: int,
)
Source code in vllm/model_executor/layers/mamba/mamba2_metadata.py
get_platform_metadata_classes
¶
get_platform_metadata_classes() -> tuple[
type[AttentionMetadata], ...
]
Returns the appropriate metadata classes for the current platform.
Source code in vllm/model_executor/layers/mamba/mamba2_metadata.py
prepare_mamba2_metadata
¶
prepare_mamba2_metadata(
chunk_size: int, attn_metadata: AttentionMetadata
) -> Mamba2Metadata