Allow overriding popcorn id with POPCORN_SUBMITTER_ID env arg by Jack-Khuu · Pull Request #57 · gpu-mode/popcorn-cli

Jack-Khuu · 2026-04-21T01:42:48Z

When submitting via proxy or in a setting with multiple accounts it is difficult to switch submitter identites (currently always reads from .popcorn.yaml)

This PR adds POPCORN_SUBMITTER_ID as an env flag override

Build

./build.sh

Normal run

> ./target/release/popcorn-cli submissions list --leaderboard matmul_v2

ID       Leaderboard          File                 Time                 GPU(s)       Status          Score
---------------------------------------------------------------------------------------------------------

Override

> POPCORN_SUBMITTER_ID=0987654 ./target/release/popcorn-cli submissions list --leaderboard matmul_v2

**Application error: Server returned status 401 Unauthorized: Invalid or unauthorized auth header elaine**

>   POPCORN_SUBMITTER_ID=$(grep 'cli_id:' ~/.popcorn.yaml | awk '{print $2}') ./target/release/popcorn-cli submissions list --leaderboard matmul_v2

ID       Leaderboard          File                 Time                 GPU(s)       Status          Score
---------------------------------------------------------------------------------------------------------

Super important ascii art

feat/pre launch update

Rust port

* Create symlink for popcorn-cli after installation I found that after installation, the default binary name is actually `popcorn-cli`. If we want to use `popcorn` as binary name in the subsequent steps, I believe we need to create a symlink. * Add `popcorn` alias to install.sh and remove manual symlink step from docs Move the symlink creation into install.sh so users get the `popcorn` command automatically. Uses symlink on Linux/macOS and copy on Windows.

* docs: add ACF (booster pack) usage guide to helion-hackathon.md Documents how to use PTXAS Advanced Controls Files from /opt/booster_pack/ during autotuning (autotune_search_acf) and in hardcoded submissions (advanced_controls_file). Includes the important caveat that ACF search only works when the autotuner actually runs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: remove "How ACFs work" subsection Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* docs: add TileIR backend usage guide to helion-hackathon.md Documents ENABLE_TILE=0 vs ENABLE_TILE=1 and the TileIR compilation pipeline available via nvtriton on B200 instances. Covers how to enable TileIR with Helion (ENABLE_TILE=1 + HELION_BACKEND=tileir), the different tunables (num_ctas/occupancy vs num_warps/maxnreg), and how to hardcode TileIR configs in submissions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: restructure ACF + TileIR as optional performance knobs Group both sections under a single "Optional: Extra Performance Knobs" heading to emphasize neither is required. Streamline both into step 1 (autotune) / step 2 (hardcode) format. Add a "Which combination" section showing all 4 options to try. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: remove "(Booster Pack)" from ACF heading Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: consolidate TileIR env var instructions Remove duplicate bash export block — the Python os.environ in the code example is sufficient for both local autotuning and submissions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: clarify TileIR tunables come from autotuner output Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: shorten "Which should I use?" section Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: be explicit about ENABLE_TILE=0 vs ENABLE_TILE=1 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: simplify TileIR comparison table to just backend names Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* docs: add TileIR backend usage guide to helion-hackathon.md Documents ENABLE_TILE=0 vs ENABLE_TILE=1 and the TileIR compilation pipeline available via nvtriton on B200 instances. Covers how to enable TileIR with Helion (ENABLE_TILE=1 + HELION_BACKEND=tileir), the different tunables (num_ctas/occupancy vs num_warps/maxnreg), and how to hardcode TileIR configs in submissions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: restructure ACF + TileIR as optional performance knobs Group both sections under a single "Optional: Extra Performance Knobs" heading to emphasize neither is required. Streamline both into step 1 (autotune) / step 2 (hardcode) format. Add a "Which combination" section showing all 4 options to try. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: remove "(Booster Pack)" from ACF heading Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: consolidate TileIR env var instructions Remove duplicate bash export block — the Python os.environ in the code example is sufficient for both local autotuning and submissions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: clarify TileIR tunables come from autotuner output Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: shorten "Which should I use?" section Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: be explicit about ENABLE_TILE=0 vs ENABLE_TILE=1 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: simplify TileIR comparison table to just backend names Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: add scoring system, rules, and open-ended contribution track Add point allocation table, scoring formula (correctness + performance ranking), rules & requirements, and the separate open-ended contribution track for non-kernel Helion contributions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: allow unlimited submissions, best one counts Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: clarify rules to match actual submission format Each submission uses one static helion.Config for all shapes, not per-shape configs. Simplified rules to reflect this. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Revert "docs: clarify rules to match actual submission format" This reverts commit 4fac3a8. * Add per-shape config dispatch pattern to all submissions Use a factory function (_make_kernel) to create kernel variants with different helion.Config objects, and dispatch in custom_kernel() based on input tensor shapes. This lets participants optimize each benchmark shape independently. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: update example to show all shapes, remove DEFAULT_CONFIG Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: use Config(...) placeholders with distinct TODO comments for test vs benchmark shapes Test shapes: TODO to replace with default config or any config that passes correctness. Benchmark shapes: TODO to replace with autotuned config. Also add instructions on getting default config via autotune_effort="none". Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: remove references to single-config-for-all-shapes pattern Per-shape configs are the recommended approach. Remove mentions of using a single config across all shapes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: remove references to default config in rules section Configs are always participant-provided. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: add tips for version control, tmux, and machine reboots Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: move GPU machine tips to standalone section Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: fix performance metric description to match actual eval method The previous description incorrectly stated geometric mean of 100 runs. The actual helion eval uses CUDA graphs with L2 cache clearing, 10 measurements, and arithmetic mean. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Replace hard 30% LOC limit with judges' discretion for inline triton/asm The LOC-based rule was gameable (denominator inflation with padding code), so switch to a qualitative rule: inline triton/asm is allowed as escape hatches, but predominantly inline submissions may be disqualified. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Add spawn mode tip for autotuning in GPU machine section Spawn mode isolates each autotuner trial in a subprocess with timeout protection, preventing hangs or crashes from killing the entire run. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Clarify that spawn mode is slower than fork mode Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat: create dedicated kernel folder on `popcorn setup` Instead of writing files directly into the current directory (which overwrites existing files), `popcorn setup` now creates a subfolder named after the problem directory (e.g. `softmax/`). If a folder with that name already exists, a `-N` suffix is appended (`softmax-1/`, `softmax-2/`, etc.) to avoid collisions. * docs: update setup docs to reflect new project folder behavior * style: fix rustfmt formatting in setup.rs

) The TileIR backend requires both env vars to be set. Update the table, step-by-step instructions, and "Which should I use?" section to consistently mention both ENABLE_TILE=1 and HELION_BACKEND=tileir.

Replace the rank-based correctness/performance formula with a simpler top-3 system: 5 pts (1st), 3 pts (2nd), 1 pt (3rd) per scored problem. Mark fp8_quant as an unscored warm-up. Ties decided by kernel quality.

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Reorder so the unscored warm-up problem (fp8_quant) is listed first, matching the scoring note that says "Problem 1 is not scored".

…lem-1

* fix: writing multiple profile zip * test: cover profile trace helpers * ci: fix PR validation workflows --------- Co-authored-by: Mark Saroufim <marksaroufim@gmail.com>

* document submission inspection and deletion flow * reframe README note as reward hack section

@burtenshaw

* add to cli docstring * details in AGENTS.md * Apply suggestion from @burtenshaw Co-authored-by: burtenshaw <ben.burtenshaw@gmail.com> --------- Co-authored-by: Mark Saroufim <marksaroufim@gmail.com>

* Add aarch64 Linux support (DGX Spark / GB10) - Add aarch64-unknown-linux-gnu build target to the release workflow - Add .cargo/config.toml to configure the cross-linker for aarch64 - Update install.sh to detect arm64/aarch64 and download the correct binary (popcorn-cli-linux-aarch64.tar.gz) instead of the x86-64 build * validate arm64 installer in CI * move arm64 validation into test workflow * fold arm64 into test matrix * build arm64 release on native runner --------- Co-authored-by: brandonin <brandonin@users.noreply.github.com> Co-authored-by: Mark Saroufim <marksaroufim@gmail.com>

codecov · 2026-04-21T01:47:06Z

Codecov Report

❌ Patch coverage is 73.33333% with 4 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/cmd/mod.rs	73.33%	4 Missing ⚠️

📢 Thoughts on this report? Let us know!

dirs::home_dir() on Windows uses the shell API (SHGetKnownFolderPath), not the HOME env var, so we cannot redirect config lookup in tests. Gate config-fallback tests with #[cfg(not(windows))]. The env var override path is still tested cross-platform. Also recover from poisoned mutex so one test failure doesn't cascade.

Jack-Khuu · 2026-04-29T21:37:20Z

@msaroufim @S1ro1 lmk what you think? We'd use it on the HLH server side to submit on behalf of users

msaroufim and others added 30 commits February 26, 2025 17:25

Super important ascii art

3aa74a4

Merge pull request gpu-mode#2 from gpu-mode/msaroufim-patch-1

e2ae16d

Super important ascii art

Merge branch 'fix/launch'

90281ef

Feat: works without popcorn directives

7c235a7

Feat: works

37759d6

Merge pull request gpu-mode#3 from gpu-mode/feat/pre-launch-update

b86af3e

feat/pre launch update

WIP rust port

224a1de

Update README.md

98fbbe8

Feat: screen fix

a04ac61

Refactor: huge refactor

c8ffe91

Feat: login + extra refactor

9ad1ff0

Feat: reregister

ea3f4ab

Feat: github

eb0cdcd

feat: refactor + auth

d6f654b

Feat: build

3cc5a99

Fix: build

85b639a

Fix: build

3fb6315

Fix: build2

e00e952

Fix: build.

f1e983f

Fix: build.

8698a7d

Fix: build.

97c5052

Fix: build.

5333f75

Fix: timeouts

1aa5582

Feat: readme

01b3896

Fix: readme

3205bea

Merge pull request gpu-mode#5 from gpu-mode/rust

2df243d

Rust port

Fix: build.

be54afd

Fix; build

45ba09c

Fix; build

1acb7be

Fix; build

cba3153

msaroufim and others added 23 commits March 9, 2026 11:14

docs: simplify auth flow and remove get-api-url step (gpu-mode#40)

8735788

Create helion-hackathon.md

3d5fed8

Enable closed workflow

6ab2e4a

Update helion-hackathon.md

d3f8160

docs: add local testing section to helion-hackathon.md (gpu-mode#42)

f141be6

docs: mention HELION_BACKEND=tileir alongside ENABLE_TILE=1 (gpu-mode#47

5b9f29d

) The TileIR backend requires both env vars to be set. Update the table, step-by-step instructions, and "Which should I use?" section to consistently mention both ENABLE_TILE=1 and HELION_BACKEND=tileir.

docs: simplify hackathon scoring to top-3 points system (gpu-mode#49)

68fd44a

Replace the rank-based correctness/performance formula with a simpler top-3 system: 5 pts (1st), 3 pts (2nd), 1 pt (3rd) per scored problem. Mark fp8_quant as an unscored warm-up. Ties decided by kernel quality.

docs: add Discord auth hint after register step (gpu-mode#50)

66daac5

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

More ui friendly register flow

3fd2da9

More ui friendly register flow2

fa1291f

docs: move fp8_quant to problem 1 in hackathon problems table

81720ae

Reorder so the unscored warm-up problem (fp8_quant) is listed first, matching the scoring note that says "Problem 1 is not scored".

Merge pull request gpu-mode#52 from yf225/docs/move-fp8-quant-to-prob…

1d36991

…lem-1

add princeton 2026 quickstart doc (gpu-mode#54)

4ee4737

fix: writing multiple profile zip (gpu-mode#53)

45e4ae5

* fix: writing multiple profile zip * test: cover profile trace helpers * ci: fix PR validation workflows --------- Co-authored-by: Mark Saroufim <marksaroufim@gmail.com>

[codex] document submission inspection and deletion flow (gpu-mode#55)

79e8cd2

* document submission inspection and deletion flow * reframe README note as reward hack section

Mention and add details in AGENTS.md (gpu-mode#39)

b94cc1a

* add to cli docstring * details in AGENTS.md * Apply suggestion from @burtenshaw Co-authored-by: burtenshaw <ben.burtenshaw@gmail.com> --------- Co-authored-by: Mark Saroufim <marksaroufim@gmail.com>

Override popcorn id with env arg

d2a55b4

Jack-Khuu added 2 commits April 29, 2026 10:11

More tests

40f9492

Jack-Khuu force-pushed the id-env branch from 4f4d3a1 to 88c50f5 Compare April 29, 2026 19:46

Scope imports

a1b51d5

msaroufim force-pushed the main branch from e871f85 to e33831e Compare June 15, 2026 04:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow overriding popcorn id with POPCORN_SUBMITTER_ID env arg#57

Allow overriding popcorn id with POPCORN_SUBMITTER_ID env arg#57
Jack-Khuu wants to merge 122 commits into
gpu-mode:mainfrom
Jack-Khuu:id-env

Jack-Khuu commented Apr 21, 2026

Uh oh!

codecov Bot commented Apr 21, 2026 •

edited

Loading

Uh oh!

Jack-Khuu commented Apr 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

12 participants

Conversation

Jack-Khuu commented Apr 21, 2026

Uh oh!

codecov Bot commented Apr 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Jack-Khuu commented Apr 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

12 participants

codecov Bot commented Apr 21, 2026 •

edited

Loading