vllm.v1.kv_offload.tiering.fs.io ¶
_get_tmp_suffix ¶
_get_tmp_suffix() -> str
Generate a thread-local unique suffix for temporary files.
Source code in vllm/v1/kv_offload/tiering/fs/io.py
load_block ¶
load_block(
source_path: str,
view: memoryview,
offset: int,
block_size: int,
) -> None
Load callback: read one KV block from disk. Remove the file on failure.
Source code in vllm/v1/kv_offload/tiering/fs/io.py
store_block ¶
store_block(
dest_path: str,
buffer: memoryview,
offset: int,
block_size: int,
) -> None
Store callback: Writes to a temp file then atomically replaces the destination.