vllm.compilation.passes.fusion.collective_fusion ¶
AllGatherCutlassScaledMMPattern ¶
Bases: BasePattern
Source code in vllm/compilation/passes/fusion/collective_fusion.py
get_inputs ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
register ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
AllGatherGEMMPattern ¶
Bases: BasePattern
Source code in vllm/compilation/passes/fusion/collective_fusion.py
get_inputs ¶
register ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
AllGatherScaledMMPattern ¶
Bases: BasePattern
Source code in vllm/compilation/passes/fusion/collective_fusion.py
get_inputs ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
register ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
AsyncTPPass ¶
Bases: VllmPatternMatcherPass
Source code in vllm/compilation/passes/fusion/collective_fusion.py
patterns instance-attribute ¶
__init__ ¶
__init__(config: VllmConfig) -> None
Source code in vllm/compilation/passes/fusion/collective_fusion.py
is_applicable_for_range ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
CutlassScaledMMReduceScatterPattern ¶
Bases: BasePattern
Source code in vllm/compilation/passes/fusion/collective_fusion.py
get_inputs ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
register ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
GEMMReduceScatterPattern ¶
Bases: BasePattern
Source code in vllm/compilation/passes/fusion/collective_fusion.py
get_inputs ¶
register ¶
Source code in vllm/compilation/passes/fusion/collective_fusion.py
ScaledMMReduceScatterPattern ¶
Bases: BasePattern