vllm.v1.core.sched.request_queue
FCFSRequestQueue
¶
Bases: deque[Request]
, RequestQueue
A first-come-first-served queue that supports deque operations.
Source code in vllm/v1/core/sched/request_queue.py
__iter__
¶
__reversed__
¶
prepend_requests
¶
prepend_requests(requests: RequestQueue) -> None
Prepend all requests from another queue to the front of this queue.
remove_requests
¶
Remove multiple specific requests from the queue.
Source code in vllm/v1/core/sched/request_queue.py
PriorityRequestQueue
¶
Bases: RequestQueue
A priority queue that supports heap operations.
Requests with a smaller value of priority
are processed first.
If multiple requests have the same priority, the one with the earlier
arrival_time
is processed first.
Source code in vllm/v1/core/sched/request_queue.py
139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 |
|
__init__
¶
__iter__
¶
Iterate over the queue according to priority policy.
__reversed__
¶
add_request
¶
add_request(request: Request) -> None
Add a request to the queue according to priority policy.
prepend_request
¶
prepend_request(request: Request) -> None
Add a request to the queue according to priority policy.
Note: In a priority queue, there is no concept of prepending to the front. Requests are ordered by (priority, arrival_time).
Source code in vllm/v1/core/sched/request_queue.py
prepend_requests
¶
prepend_requests(requests: RequestQueue) -> None
Add all requests from another queue according to priority policy.
Note: In a priority queue, there is no concept of prepending to the front. Requests are ordered by (priority, arrival_time).
Source code in vllm/v1/core/sched/request_queue.py
remove_requests
¶
Remove multiple specific requests from the queue.
Source code in vllm/v1/core/sched/request_queue.py
RequestQueue
¶
Bases: ABC
Abstract base class for request queues.
Source code in vllm/v1/core/sched/request_queue.py
__iter__
abstractmethod
¶
__reversed__
abstractmethod
¶
prepend_requests
abstractmethod
¶
prepend_requests(requests: RequestQueue) -> None
remove_requests
abstractmethod
¶
SchedulingPolicy
¶
create_request_queue
¶
create_request_queue(
policy: SchedulingPolicy,
) -> RequestQueue
Create request queue based on scheduling policy.