Ban&Pick Achieving Free Performance Gains and Inference Speedup via Smarter Routing in MoE-LLMs

AI Review

Please note the paper has not yet undergone AI review.

Keywords

Click the button to extract keywords

Insights

Click the button to extract insights