vllm.entrypoints.cli.launch ¶
LaunchSubcommand ¶
Bases: CLISubcommand
The launch subcommand for the vLLM CLI.
Uses nested sub-subcommands so each component can define its own arguments independently (e.g. vllm launch render).
Source code in vllm/entrypoints/cli/launch.py
LaunchSubcommandBase ¶
Bases: CLISubcommand
The base class of subcommands for vllm launch.
Source code in vllm/entrypoints/cli/launch.py
add_cli_args classmethod ¶
add_cli_args(parser: FlexibleArgumentParser) -> None
Add the CLI arguments to the parser.
By default, adds the standard vLLM serving arguments. Subclasses can override to add component-specific arguments.
Source code in vllm/entrypoints/cli/launch.py
RenderSubcommand ¶
Bases: LaunchSubcommandBase
The render subcommand for vllm launch.
Source code in vllm/entrypoints/cli/launch.py
run_launch_fastapi async ¶
run_launch_fastapi(args: Namespace) -> None
Run the online serving layer with FastAPI (no GPU inference).