[Dev] Add VAE Inference Example #38

Merged: jiahy0825 merged 3 commits into SandAI-org:main from themistbeforedawn:dev/inference-example on Jun 30, 2026

@themistbeforedawn (Collaborator)

🗂️ PR Category

  • ✨ New Feature
  • 🚀 Optimization (performance, memory, etc.)
  • 💥 Breaking Change
  • 🐛 Bug Fix
  • 🛠️ Development / Refactoring
  • 📚 Documentation
  • 🧹 Chore (Dependencies, CI/CD, Configuration, etc.)
  • 🧪 Testing

📝 Description

This PR adds only the WAN 2.2 VAE inference benchmark/example, which measures the performance impact of the optimization passes introduced in previous PRs.

The following numbers are the CUDA HW sum, averaged over the profiled iterations, for the WAN 2.2 VAE 540p workload, measured on an NVIDIA H100 80GB HBM3 GPU. Values in parentheses show the MagiCompiler (MAGI_COMPILE) speedup over the corresponding baseline.

Decode

| Shape mode | MAGI_COMPILE | TORCH_COMPILE | EAGER |
| --- | --- | --- | --- |
| Static H/W | 457.943 ms | 526.973 ms (1.15x) | 855.131 ms (1.87x) |
| Dynamic H/W | 553.543 ms | 768.700 ms (1.39x) | 855.131 ms (1.54x) |

Encode

| Shape mode | MAGI_COMPILE | TORCH_COMPILE | EAGER |
| --- | --- | --- | --- |
| Static H/W | 134.444 ms | 151.183 ms (1.12x) | 269.702 ms (2.01x) |
| Dynamic H/W | 179.025 ms | 289.522 ms (1.62x) | 269.702 ms (1.51x) |
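
The speedups in parentheses appear to be the baseline latency divided by the MAGI_COMPILE latency in the same row. The sketch below recomputes them from the latencies reported above, as a quick consistency check. The names `LATENCY_MS`, `speedup`, and `speedup_table` are hypothetical helpers written for this illustration; they are not part of the benchmark script added in this PR.

```python
# Latencies in milliseconds, copied from the Decode and Encode tables above.
# Keys are (stage, shape mode); the dict layout is an illustrative assumption.
LATENCY_MS = {
    ("decode", "static"): {"MAGI_COMPILE": 457.943, "TORCH_COMPILE": 526.973, "EAGER": 855.131},
    ("decode", "dynamic"): {"MAGI_COMPILE": 553.543, "TORCH_COMPILE": 768.700, "EAGER": 855.131},
    ("encode", "static"): {"MAGI_COMPILE": 134.444, "TORCH_COMPILE": 151.183, "EAGER": 269.702},
    ("encode", "dynamic"): {"MAGI_COMPILE": 179.025, "TORCH_COMPILE": 289.522, "EAGER": 269.702},
}


def speedup(baseline_ms: float, magi_ms: float) -> float:
    """Speedup of MAGI_COMPILE over a baseline: baseline time / MAGI_COMPILE time."""
    return baseline_ms / magi_ms


def speedup_table(latency_ms: dict) -> dict:
    """Return, per (stage, shape mode), the speedup over each baseline, rounded to 2 decimals."""
    table = {}
    for key, row in latency_ms.items():
        magi_ms = row["MAGI_COMPILE"]
        table[key] = {
            name: round(speedup(ms, magi_ms), 2)
            for name, ms in row.items()
            if name != "MAGI_COMPILE"
        }
    return table


if __name__ == "__main__":
    for (stage, shape), row in speedup_table(LATENCY_MS).items():
        cells = ", ".join(f"{name}: {value:.2f}x" for name, value in row.items())
        print(f"{stage:6s} {shape:7s} {cells}")
```

Running the script prints one line per stage and shape mode, which can be compared directly with the parenthesized values in the tables.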

jiahy0825 previously approved these changes Jun 30, 2026

@jiahy0825 (Collaborator) left a comment

LGTM

@jiahy0825 merged commit ee74106 into SandAI-org:main on Jun 30, 2026
1 check passed