vllm.v1.sample.ops.penalties
_convert_to_tensors
_convert_to_tensors(
    output_token_ids: list[list[int]],
    vocab_size: int,
    device: device,
) -> Tensor
Convert the per-request lists of output token IDs (`output_token_ids`) into a tensor on the given `device`.
Source code in vllm/v1/sample/ops/penalties.py
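A ragged `list[list[int]]` has to be made rectangular before it can become a single tensor. The sketch below shows one way to do that in plain Python. It assumes `vocab_size` serves as the padding value, since it lies just outside the valid token ID range; that assumption and the helper name `pad_token_lists` are illustrative and are not taken from the vLLM source.

```python
def pad_token_lists(
    output_token_ids: list[list[int]],
    vocab_size: int,
) -> list[list[int]]:
    """Pad ragged per-request token lists to a common length.

    Hypothetical helper. Assumption: vocab_size is used as the pad value
    because no real token can have that ID.
    """
    # Longest row decides the width; an empty batch has width 0.
    max_len = max((len(ids) for ids in output_token_ids), default=0)
    # Right-pad each row with the sentinel value.
    return [ids + [vocab_size] * (max_len - len(ids)) for ids in output_token_ids]


padded = pad_token_lists([[5, 7, 7], [2]], vocab_size=10)
```

The rectangular result could then be handed to a tensor constructor and moved to the target device.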
apply_all_penalties
apply_all_penalties(
    logits: Tensor,
    prompt_token_ids: Tensor,
    presence_penalties: Tensor,
    frequency_penalties: Tensor,
    repetition_penalties: Tensor,
    output_token_ids: list[list[int]],
) -> Tensor
Applies presence, frequency and repetition penalties to the logits.
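To make the three penalties concrete, here is a minimal pure-Python sketch for a single request. It follows the commonly used definitions and is an assumption about the semantics, not a copy of the vLLM implementation: the repetition penalty rescales logits of tokens seen in the prompt or the output, the frequency penalty subtracts an amount proportional to each token's output count, and the presence penalty subtracts a fixed amount for any token that appears in the output. The function name `apply_penalties_to_row` is hypothetical.

```python
from collections import Counter


def apply_penalties_to_row(
    logits: list[float],
    prompt_ids: list[int],
    output_ids: list[int],
    presence: float,
    frequency: float,
    repetition: float,
) -> list[float]:
    """Apply the three penalties to one request's logits (illustrative only)."""
    out = list(logits)  # do not mutate the caller's list
    counts = Counter(output_ids)

    # Repetition penalty: covers tokens seen in the prompt or the output.
    # Positive logits are divided, non-positive logits are multiplied,
    # so the token becomes less likely in both cases when repetition > 1.
    for tok in set(prompt_ids) | set(output_ids):
        if out[tok] > 0:
            out[tok] /= repetition
        else:
            out[tok] *= repetition

    # Frequency and presence penalties: assumed to depend on output tokens only.
    for tok, n in counts.items():
        out[tok] -= frequency * n  # grows with each repeat
        out[tok] -= presence       # flat cost for having appeared at all

    return out


penalized = apply_penalties_to_row(
    logits=[2.0, -1.0, 0.5, 3.0],
    prompt_ids=[0],
    output_ids=[1, 1, 3],
    presence=0.5,
    frequency=0.25,
    repetition=2.0,
)
# penalized == [1.0, -3.0, 0.5, 0.75]
```

Token 2 never appears, so its logit is unchanged. Token 1 appears twice in the output and is hit by all three penalties.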