Uh oh!

There was an error while loading. Please reload this page.

SemiAnalysisAI / InferenceX Public

Notifications You must be signed in to change notification settings
Fork 210
Star 1.2k

Code
Issues 111
Pull requests 107
Discussions
Actions
Projects
Models
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Models
Security and quality
Insights

Pull requests: SemiAnalysisAI/InferenceX

Labels 45 Milestones 6

New pull request New

107 Open 1,472 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

chore(deps): bump the github-actions group across 1 directory with 2 updates dependencies

Pull requests that update a dependency file

github_actions

Pull requests that update GitHub Actions code

#1960 opened Jun 30, 2026 by dependabot Bot

Loading…

[Klaud Cold] [AMD] Enable AITER MoE for MiniMax-M3 MI355X FP4 vLLM MTP benchmark full-sweep-fail-fast

#1958 opened Jun 30, 2026 by functionstackx Collaborator

Loading…

Update Qwen3.5 FP4 MI355X MTP recipe with tuned env/flags

#1957 opened Jun 29, 2026 by amd-fuyuajin Collaborator

Loading…

[merging June 30 at 4pm PT] making this an hard guideline & enforcing consistent reviews on upstream sglang/vllm docker repo to PR CheckList

#1956 opened Jun 29, 2026 by functionstackx Collaborator

Loading…

[AMD] Enable AITER MoE for MiniMax-M3 MI355X vLLM MTP benchmarks

#1955 opened Jun 29, 2026 by Fangzhou-Ai Collaborator • Draft

2 of 3 tasks

Add MTP evaluation

#1953 opened Jun 29, 2026 by hjjq Collaborator

Loading…

[AMD] Tune MiniMax-M3 MXFP8 MI300X vLLM: async scheduling + big-prefill, fix conc256 EP8→EP1 full-sweep-enabled

#1951 opened Jun 29, 2026 by ZhengGong-amd Collaborator

Loading…

7 of 8 tasks

Amd/vllm disagg minimax fp8 cdna3

#1949 opened Jun 28, 2026 by haic0 Collaborator • Draft

Add /run-evals comment command

#1948 opened Jun 27, 2026 by adibarra Collaborator

Loading…

[WIP] add SWE-bench Lite accuracy eval

#1947 opened Jun 26, 2026 by adibarra Collaborator

Loading…

[AMD] Add MiniMax-M3-FP4 MI355X ATOMESH update 0623 AMD evals-only

Suppress throughput and run only eval jobs; combine with all-evals to expand selection

#1940 opened Jun 26, 2026 by seungrokj Collaborator

Loading…

8 tasks

Add MiniMax-M3 MXFP8 B300 1k/1k sweep and update image full-sweep-enabled

#1937 opened Jun 25, 2026 by RohitNagraj Collaborator

Loading…

Add MiniMax-M3 NVFP4 B200 single-node vLLM benchmark (EAGLE3 spec decode) full-sweep-enabled

#1933 opened Jun 25, 2026 by Ankur-singh Collaborator

Loading…

[AMD] Add MiniMax-M3-FP8 MI355X ATOMESH update 0623 AMD evals-only

Suppress throughput and run only eval jobs; combine with all-evals to expand selection

#1930 opened Jun 25, 2026 by seungrokj Collaborator

Loading…

8 tasks

[WIP] agentic: add Kimi Mooncake LMCacheMP disagg recipe sweep-enabled

#1924 opened Jun 24, 2026 by YukioZzz Collaborator • Draft

[Do Not Merge][NV] dsv4-fp4-b200 sglang image to nightly full-sweep-enabled

#1923 opened Jun 24, 2026 by hshrivastava-droid Collaborator

Loading…

Add GLM-5-FP8 GB300 multinode dynamo-sglang MTP benchmark full-sweep-enabled

#1907 opened Jun 23, 2026 by hshrivastava-droid Collaborator

Loading…

glm5.1-fp4-mi355x-sglang: bump image to v0.5.13.post1-20260622 + enable aiter allreduce fusion full-sweep-enabled

#1905 opened Jun 23, 2026 by jiacao-amd Collaborator

Loading…

CollectiveX: experimental cross-vendor collective/EP benchmark

#1896 opened Jun 23, 2026 by Oseltamivir Collaborator

Loading…

Add GLM-5-FP8 GB200 dynamo-sglang multinode benchmark full-sweep-enabled

#1895 opened Jun 23, 2026 by hshrivastava-droid Collaborator

Loading…

[AMD] dsv4 atom-disagg eval sweep — validate reduced ATOM logging all-evals

Expand eval selection to every fixed-sequence config

evals-only

Suppress throughput and run only eval jobs; combine with all-evals to expand selection

full-sweep-enabled

#1882 opened Jun 22, 2026 by Oseltamivir Collaborator

Loading…

[CI] Validate aggregate benchmark results before upload

#1881 opened Jun 21, 2026 by edwingao28

Loading…

[codex] Enforce complete eval validation and quiet ATOM logs

#1878 opened Jun 21, 2026 by Oseltamivir Collaborator • Draft

job.slurm: opt-in model auto-download fallback (SGLang path)

#1864 opened Jun 19, 2026 by andyluo7 Collaborator • Draft

[Klaud Cold] MI300X MiniMax-M3 nightly image and FP8 KV cache full-sweep-fail-fast

#1858 opened Jun 19, 2026 by cquil11 Collaborator

Loading…

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!