vllm.tracing.utils ¶
LoadingSpanAttributes ¶
Custom attributes for code-level tracing (file, line number).
Source code in vllm/tracing/utils.py
SpanAttributes ¶
Standard attributes for spans.
These are largely based on OpenTelemetry Semantic Conventions but are defined here as constants so they can be used by any backend or logger.
Source code in vllm/tracing/utils.py
GEN_AI_LATENCY_TIME_IN_MODEL_DECODE class-attribute instance-attribute ¶
GEN_AI_LATENCY_TIME_IN_MODEL_EXECUTE class-attribute instance-attribute ¶
GEN_AI_LATENCY_TIME_IN_MODEL_FORWARD class-attribute instance-attribute ¶
GEN_AI_LATENCY_TIME_IN_MODEL_INFERENCE class-attribute instance-attribute ¶
GEN_AI_LATENCY_TIME_IN_MODEL_PREFILL class-attribute instance-attribute ¶
GEN_AI_LATENCY_TIME_IN_QUEUE class-attribute instance-attribute ¶
GEN_AI_LATENCY_TIME_IN_SCHEDULER class-attribute instance-attribute ¶
GEN_AI_LATENCY_TIME_TO_FIRST_TOKEN class-attribute instance-attribute ¶
GEN_AI_REQUEST_MAX_TOKENS class-attribute instance-attribute ¶
GEN_AI_REQUEST_TEMPERATURE class-attribute instance-attribute ¶
GEN_AI_REQUEST_TOP_P class-attribute instance-attribute ¶
GEN_AI_RESPONSE_MODEL class-attribute instance-attribute ¶
GEN_AI_USAGE_COMPLETION_TOKENS class-attribute instance-attribute ¶
GEN_AI_USAGE_NUM_SEQUENCES class-attribute instance-attribute ¶
GEN_AI_USAGE_PROMPT_TOKENS class-attribute instance-attribute ¶
contains_trace_headers ¶
extract_trace_headers ¶
Extract only trace-related headers from a larger header dictionary. Useful for logging or passing context to a non-OTel client.