vLLM
vllm.lora.ops
ops
vllm.lora.ops
Modules:

| Name | Description |
| --- | --- |
| torch_ops | |
| triton_ops | |
| xla_ops | |