vllm.entrypoints.cli.benchmark.latency ¶
BenchmarkLatencySubcommand ¶
Bases: BenchmarkSubcommandBase
The latency subcommand for vllm bench.
Source code in vllm/entrypoints/cli/benchmark/latency.py
help class-attribute instance-attribute ¶
add_cli_args classmethod ¶
add_cli_args(parser: ArgumentParser) -> None