nemo-mbridge-perf-expert-parallel-overlap

安装量: 576
排名: #9208

安装

npx skills add https://github.com/nvidia/skills --skill nemo-mbridge-perf-expert-parallel-overlap

MoE Expert-Parallel Overlap Skill References Stable docs: @docs/training/communication-overlap.md Structured metadata: @skills/nemo-mbridge-perf-expert-parallel-overlap/card.yaml What It Is Expert-parallel (EP) overlap hides the cost of token dispatch/combine all-to-all communication by running it concurrently with expert FFN compute. Optionally, delayed expert weight-gradient computation ( delay_wgrad_compute ) provides additional overlap by deferring wgrad to overlap with the next layer's forward. Bridge supports two dispatcher paths: Show more Installs 543 Repository nvidia/skills GitHub Stars 1.3K First Seen May 29, 2026 Security Audits Gen Agent Trust Hub Pass Socket Pass Snyk Pass

返回排行榜