vllm.compilation.collective_fusion
AllGatherGEMMPattern
¶
Bases: BasePattern
Source code in vllm/compilation/collective_fusion.py
get_inputs
¶
register
¶
Source code in vllm/compilation/collective_fusion.py
AsyncTPPass
¶
Bases: VllmInductorPass
Source code in vllm/compilation/collective_fusion.py
patterns
instance-attribute
¶
__init__
¶
__init__(config: VllmConfig)
Source code in vllm/compilation/collective_fusion.py
is_applicable_for_shape
¶
BasePattern
¶
Source code in vllm/compilation/collective_fusion.py
GEMMReduceScatterPattern
¶
Bases: BasePattern