vllm.model_executor.layers.fused_moe.router.fused_topk_router ¶
FusedTopKRouter ¶
Bases: BaseRouter
Default router using standard fused top-k routing.
Source code in vllm/model_executor/layers/fused_moe/router/fused_topk_router.py
_compute_routing ¶
_compute_routing(
hidden_states: Tensor,
router_logits: Tensor,
indices_type: dtype | None,
) -> tuple[Tensor, Tensor]
Compute routing using standard fused top-k.