fix(onboard): recreate sandbox on model/provider drift for same-name … #1521
ChunkyMonkey11 wants to merge 5 commits into NVIDIA:main
Conversation
No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID:
📒 Files selected for processing (3)
✅ Files skipped from review due to trivial changes (2)
📝 Walkthrough
Downloads and robustly parses a sandbox's persisted config.json.
Changes
Sequence Diagram

```mermaid
sequenceDiagram
    participant User
    participant Onboard as Onboarding Process
    participant OpenShell as OpenShell CLI
    participant FS as Temp FS
    participant Drift as Drift Detector
    participant Confirm as Confirmation Dialog
    participant Sandbox as Sandbox Manager
    User->>Onboard: start onboarding (sandboxName, provider, model)
    Onboard->>OpenShell: run "sandbox download" for /.nemoclaw/config.json
    OpenShell->>FS: extract archive into temp dir
    Onboard->>FS: locate config.json (recursive) & robust-parse JSON
    Onboard->>Drift: compare stored provider/model -> requested provider/model
    alt drift detected
        Drift->>Confirm: require recreate confirmation
        alt interactive
            Confirm->>User: prompt "Recreate sandbox to apply provider/model change?"
            User-->>Confirm: accept / decline
            alt accept
                Confirm->>Sandbox: delete sandbox
                Sandbox->>Sandbox: create sandbox with requested selection
            else decline
                Confirm->>Onboard: abort onboarding (exit 1)
            end
        else non-interactive
            Drift->>Sandbox: auto-recreate sandbox (log reason)
        end
    else no drift
        Onboard->>Sandbox: reuse existing sandbox (upsert messaging providers if needed)
    end
    Sandbox->>FS: cleanup temp dir
```
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~25 minutes
🚥 Pre-merge checks: ✅ 4 passed | ❌ 1 failed
❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings.
@coderabbitai please re-review. The branch head changed after I merged.

✅ Actions performed: Full review triggered.
Actionable comments posted: 1
🧹 Nitpick comments (1)
test/onboard.test.js (1)
1977-1999: These drift tests grep source text instead of exercising the behavior. They can miss runtime regressions and also fail on harmless helper/log renames. Please reuse the existing spawn-based harness patterns in this file to assert actual createSandbox() outcomes for provider-only drift, model drift, and unreadable config.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@test/onboard.test.js` around lines 1977-1999, replace the static source-text grep assertions with real integration tests that spawn the onboarding binary and assert createSandbox() behavior: use the existing spawn-based harness pattern already present in this test suite to run scenarios that trigger provider-only drift, model drift, and unreadable config; invoke the code paths that call getSelectionDrift and confirmRecreateForSelectionDrift, then assert the process output/exit codes and that createSandbox was actually recreated or left unchanged as expected (including interactive prompt behavior for confirmRecreateForSelectionDrift), rather than matching log helper text.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@bin/lib/onboard.js`:
- Around line 924-967: readSandboxSelectionConfig can return null on
download/parse failures, but getSelectionDrift currently treats null as "no
drift"; change getSelectionDrift so that when
readSandboxSelectionConfig(sandboxName) returns null (unknown state) it does not
silently report changed: false — instead set changed: true to force
reconfiguration (or, if you prefer to surface unknown explicitly, set changed:
true and set providerChanged/modelChanged to null to indicate unknown rather
than false). Update the getSelectionDrift function to detect existing === null,
return changed: true, keep existingProvider/existingModel as null, and set
providerChanged/modelChanged to null (or true if you choose force-recreate
semantics) so callers can distinguish an actual mismatch from an
unreadable/unknown config.
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID: 57e767ad-4022-4702-a0e2-9a7d62fb935c
📒 Files selected for processing (2)
bin/lib/onboard.js, test/onboard.test.js
♻️ Duplicate comments (1)
bin/lib/onboard.js (1)
950-963: ⚠️ Potential issue | 🟠 Major: Unreadable selection config is still treated as "no drift."
At line 951, readSandboxSelectionConfig() can return null; at line 962 this currently resolves to changed: false, which can silently reuse a sandbox when provider/model state is actually unknown.

🔧 Proposed fix
```diff
 function getSelectionDrift(sandboxName, requestedProvider, requestedModel) {
   const existing = readSandboxSelectionConfig(sandboxName);
+  if (existing === null) {
+    return {
+      changed: true,
+      providerChanged: false,
+      modelChanged: false,
+      existingProvider: null,
+      existingModel: null,
+      unknown: true,
+    };
+  }
   const existingProvider = existing && typeof existing.provider === "string" ? existing.provider : null;
   const existingModel = existing && typeof existing.model === "string" ? existing.model : null;
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@bin/lib/onboard.js` around lines 950 - 963, getSelectionDrift currently treats a missing or unreadable selection config as "no drift"; change it so that when readSandboxSelectionConfig(sandboxName) returns null/invalid, the function treats the state as drifted. Specifically, in getSelectionDrift (and using the existing variables existing, existingProvider, existingModel, providerChanged, modelChanged), detect if existing is falsy or has non-string provider/model and set the returned changed flag to true (and/or set providerChanged/modelChanged appropriately) so we don't silently reuse a sandbox when the selection config is unknown.
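Under the reviewer's proposed semantics, a minimal standalone version of the drift check could look like the following. It takes the already-parsed config object instead of a sandbox name (so it can run without the real readSandboxSelectionConfig), making it a sketch rather than the file's actual function.

```javascript
// Sketch of the reviewer's proposed semantics: a null (unreadable) config
// is reported as drift with unknown: true, never as "no drift".
function getSelectionDrift(existing, requestedProvider, requestedModel) {
  if (existing === null) {
    return {
      changed: true,
      providerChanged: false,
      modelChanged: false,
      existingProvider: null,
      existingModel: null,
      unknown: true,
    };
  }
  const existingProvider = typeof existing.provider === "string" ? existing.provider : null;
  const existingModel = typeof existing.model === "string" ? existing.model : null;
  const providerChanged = existingProvider !== requestedProvider;
  const modelChanged = existingModel !== requestedModel;
  return {
    changed: providerChanged || modelChanged,
    providerChanged,
    modelChanged,
    existingProvider,
    existingModel,
    unknown: false,
  };
}
```

This lets callers distinguish a real provider/model mismatch (`changed: true, unknown: false`) from an unreadable config (`changed: true, unknown: true`).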
🧹 Nitpick comments (1)
test/onboard.test.js (1)
1977-1999: Prefer behavior-driven assertions over source-text regex for drift logic. These checks can pass even if runtime drift handling regresses but strings remain. Consider adding one execution-path test that stubs sandbox download + prompt result and asserts reuse vs recreate/abort outcomes directly.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@test/onboard.test.js` around lines 1977 - 1999, Replace the brittle source-text regex tests with an execution-path test that stubs the runtime behavior: import the onboard flow (the module that calls getSelectionDrift), stub sandbox.download (or the sandbox object used by onboard.js) to simulate an existing sandbox, stub getSelectionDrift to return changed=true and changed=false cases, and stub the interactive prompt used by confirmRecreateForSelectionDrift to return yes/no; then assert the actual outcome (that the code calls the recreate path vs the reuse/abort path) by spying on the functions that perform recreate (e.g., sandbox.recreate or the recreate handler) and reuse (e.g., sandbox.connect) — exercise both changed=true with prompt=confirm (expect recreate) and changed=true with prompt=abort (expect abort/no recreate) and changed=false (expect reuse), so tests validate runtime behavior rather than matching strings.
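One minimal shape for such an execution-path test, with hypothetical names (`handleDrift`, `recreate`, `reuse`, `abort`) standing in for the real onboarding code paths — the point is to spy on outcomes instead of grepping source text:

```javascript
// Hypothetical drift handler mirroring the PR's described flow; the real
// logic lives in bin/lib/onboard.js and is not reproduced here.
function handleDrift(drift, { interactive, confirm, recreate, reuse, abort }) {
  if (!drift.changed) return reuse();
  if (!interactive) return recreate("non-interactive drift");
  return confirm() ? recreate("user confirmed") : abort();
}

// Behavior-driven checks: stub the prompt, spy on the outcome.
const calls = [];
handleDrift({ changed: false }, {
  interactive: true,
  reuse: () => calls.push("reuse"),
});
handleDrift({ changed: true }, {
  interactive: true,
  confirm: () => true,
  recreate: () => calls.push("recreate"),
});
handleDrift({ changed: true }, {
  interactive: true,
  confirm: () => false,
  abort: () => calls.push("abort"),
});
```

These assertions survive log-message and helper renames, because they exercise the branch taken rather than the strings emitted.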
All good to go past the test cases.
✨ Thanks for submitting this fix, which proposes a way to detect provider/model drift during re-onboarding and prevent silent reuse of stale sandboxes. Possibly related open issues:
Actionable comments posted: 2
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@test/onboard.test.js`:
- Around line 2039-2134: The test currently only asserts a recorded "'sandbox'
'delete' 'my-assistant'" command but doesn't verify the recreate step because
childProcess.spawn isn't recorded; update the in-test fixture (the script string
that requires createSandbox) to record spawn invocations (override
childProcess.spawn to push the command into the same commands array as
runner.run does) and then assert payload.commands contains a spawn/create entry
(e.g., command string includes "'sandbox' 'create' 'my-assistant'") in addition
to the existing delete assertion so the test ensures both delete and recreate
occur; reference the createSandbox invocation, the childProcess.spawn override,
and payload.commands when making the changes.
📒 Files selected for processing (3)
bin/lib/onboard.js, docs/reference/commands.md, test/onboard.test.js
✅ Files skipped from review due to trivial changes (1)
- docs/reference/commands.md
🚧 Files skipped from review as they are similar to previous changes (1)
- bin/lib/onboard.js
|
All required checks pass; docstring coverage warning noted, can be handled in a follow-up cleanup PR if needed. |
|
@coderabbitai resume |
✅ Actions performedReviews resumed. |
♻️ Duplicate comments (1)
test/onboard.test.js (1)
2209-2218: ⚠️ Potential issue | 🟡 Minor: Test may pass spuriously when NEMOCLAW_NON_INTERACTIVE is set in CI.
The spawn env spreads process.env without explicitly clearing NEMOCLAW_NON_INTERACTIVE. If the test runner sets this variable (common in CI), the spawned script inherits it, skips the prompt entirely, and the test passes without exercising the interactive confirmation path.

🛡️ Proposed fix: explicitly clear the env variable
```diff
 const result = spawnSync(process.execPath, [scriptPath], {
   cwd: repoRoot,
   encoding: "utf-8",
   env: {
     ...process.env,
     HOME: tmpDir,
     PATH: `${fakeBin}:${process.env.PATH || ""}`,
+    NEMOCLAW_NON_INTERACTIVE: "",
   },
   timeout: 12000,
 });
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@test/onboard.test.js` around lines 2209 - 2218, The spawned child inherits process.env which can contain NEMOCLAW_NON_INTERACTIVE from CI, causing the interactive branch to be skipped; modify the test to explicitly clear that variable when building the env for spawnSync (change the env passed to spawnSync for the invocation that uses spawnSync/process.execPath/scriptPath/tmpDir/fakeBin to set NEMOCLAW_NON_INTERACTIVE to undefined or remove it from the copied env before adding HOME and PATH) so the child always runs in interactive mode for this test.
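A small helper capturing this fix: copy process.env, then drop the flags that would skip the prompt. The NEMOCLAW_* variable names come from the PR; the helper itself is an illustration, not code from the repository.

```javascript
// Build a child env for interactive tests: inherit the parent env but
// explicitly drop flags that would short-circuit the confirmation prompt.
// NEMOCLAW_* names are taken from the PR; the helper is illustrative.
function interactiveTestEnv(extra = {}) {
  const env = { ...process.env, ...extra };
  delete env.NEMOCLAW_NON_INTERACTIVE;
  delete env.NEMOCLAW_RECREATE_SANDBOX;
  return env;
}
```

A test would then pass `env: interactiveTestEnv({ HOME: tmpDir, PATH: ... })` to spawnSync, so the child always sees an interactive environment regardless of what CI exports.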
Branch updated: 30b365f to 4f5eecf.
Actionable comments posted: 1
♻️ Duplicate comments (1)
test/onboard.test.js (1)
2784-2793: ⚠️ Potential issue | 🟡 Minor: Clear the recreate/non-interactive env flags for this interactive subprocess.
This case inherits process.env verbatim, unlike the nearby interactive tests. If the runner already has NEMOCLAW_NON_INTERACTIVE or NEMOCLAW_RECREATE_SANDBOX set, the confirmation path is skipped and this test no longer covers the decline flow.

🔧 Suggested fix
```diff
-const result = spawnSync(process.execPath, [scriptPath], {
-  cwd: repoRoot,
-  encoding: "utf-8",
-  env: {
-    ...process.env,
-    HOME: tmpDir,
-    PATH: `${fakeBin}:${process.env.PATH || ""}`,
-  },
-  timeout: 12000,
-});
+const env = {
+  ...process.env,
+  HOME: tmpDir,
+  PATH: `${fakeBin}:${process.env.PATH || ""}`,
+};
+delete env.NEMOCLAW_NON_INTERACTIVE;
+delete env.NEMOCLAW_RECREATE_SANDBOX;
+const result = spawnSync(process.execPath, [scriptPath], {
+  cwd: repoRoot,
+  encoding: "utf-8",
+  env,
+  timeout: 12000,
+});
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@test/onboard.test.js` around lines 2784 - 2793, The spawned subprocess in the test (the spawnSync call that runs scriptPath with cwd repoRoot and env built from ...process.env, HOME: tmpDir and PATH: `${fakeBin}:${process.env.PATH || ""}`) unintentionally inherits NEMOCLAW_NON_INTERACTIVE and NEMOCLAW_RECREATE_SANDBOX from the parent; explicitly clear those flags in the env you pass to spawnSync (e.g., set NEMOCLAW_NON_INTERACTIVE and NEMOCLAW_RECREATE_SANDBOX to undefined or an empty string in the env object) so the interactive confirmation path is exercised and the decline flow is tested.
```js
it("detects provider/model drift and avoids silent reuse", () => {
  const source = fs.readFileSync(
    path.join(import.meta.dirname, "..", "bin", "lib", "onboard.js"),
    "utf-8",
  );
  assert.match(source, /const selectionDrift = getSelectionDrift\(sandboxName, provider, model\);/);
  assert.match(source, /unknown:\s*true/);
  assert.match(source, /else if \(selectionDrift\.changed\)/);
  assert.match(source, /Recreating sandbox due to provider\/model drift/);
  assert.match(
    source,
    /Sandbox '\$\{sandboxName\}' exists — recreating to apply model\/provider change\./,
  );
  assert.ok(
    payload.commands.some((entry) => entry.command.includes("'sandbox' 'create'")),
    "expected sandbox recreate when selection config cannot be read",
  );
});
```
payload is undefined here, so this test will fail before it verifies drift handling.
Lines 2713-2716 dereference payload.commands, but this test never creates a payload. That turns the new coverage into a hard ReferenceError and leaves the recreate assertion disconnected from the end-to-end case above.
💡 Suggested fix

```diff
 it("detects provider/model drift and avoids silent reuse", () => {
   const source = fs.readFileSync(
     path.join(import.meta.dirname, "..", "bin", "lib", "onboard.js"),
     "utf-8",
   );
   assert.match(source, /const selectionDrift = getSelectionDrift\(sandboxName, provider, model\);/);
   assert.match(source, /unknown:\s*true/);
   assert.match(source, /else if \(selectionDrift\.changed\)/);
   assert.match(source, /Recreating sandbox due to provider\/model drift/);
   assert.match(
     source,
     /Sandbox '\$\{sandboxName\}' exists — recreating to apply model\/provider change\./,
   );
-  assert.ok(
-    payload.commands.some((entry) => entry.command.includes("'sandbox' 'create'")),
-    "expected sandbox recreate when selection config cannot be read",
-  );
 });
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@test/onboard.test.js` around lines 2700 - 2717, The test "detects
provider/model drift and avoids silent reuse" references payload.commands but
never defines payload, causing a ReferenceError; fix by creating or reusing a
payload object with a commands array before the assertion (e.g., initialize
payload = { commands: [] } or obtain the same payload used in the prior
end-to-end case) so the assertion on payload.commands.some(...) can run; update
the test setup near the assertions that check for "'sandbox' 'create'" so it
uses the defined payload variable used elsewhere in the test suite (or mock the
command payload) before the final assert.ok call.
cv left a comment
Security Review — WARNING (3 issues)
The drift-detection approach is sound and the interactive confirmation is good. But three issues need fixing.
Required changes
1. Unknown config should not auto-destroy (HIGH):
When the config file can't be read (drift.unknown === true), non-interactive mode unconditionally returns true (recreate). A transient download failure or unreachable sandbox would destroy a perfectly good sandbox. Default to reuse when config is unreadable in non-interactive mode, or require NEMOCLAW_RECREATE_SANDBOX=1 explicitly.
2. Not-ready sandbox guard dropped (HIGH):
The old code had explicit handling for existingSandboxState !== "ready" — it prompted in interactive mode or exited in non-interactive mode. The new code falls straight through to delete-and-recreate with no prompt. This is a behavioral regression. Restore the not-ready guard.
3. Broken test — payload undefined (MEDIUM):
The "detects provider/model drift" test references payload.commands.some(...) but payload is never defined in that scope. Likely copy-paste from the "unreadable config" test. Will throw ReferenceError.
What's good
- Interactive confirmation guard is correct
- Config file only stores provider/model (no secrets)
- Temp directory handling with `finally` cleanup is clean
- `stdio: ["ignore", ...]` suppresses download output
Summary
Fixes same-name re-onboard behavior where OpenClaw UI could keep showing a stale model/provider.
On onboarding with an existing ready sandbox, we now detect provider/model drift and avoid silent reuse by triggering recreate flow (with an interactive safety confirmation).
Related Issue
Fixes #1457
Changes
- openshell sandbox download (/sandbox/.nemoclaw/config.json).
- test/onboard.test.js covering:
Type of Change
Testing
- npx prek run --all-files passes (or equivalently make check).
- npm test passes.
- make docs builds without warnings. (for doc-only changes)
Checklist
General
Code Changes
npx prek run --all-files auto-fixes formatting (or make format for targeted runs).
update-docs agent skill to draft changes while complying with the style guide. For example, prompt your agent with "/update-docs catch up the docs for the new changes I made in this PR."
Signed-off-by: Revant Patel revant.h.patel@gmail.com
Summary by CodeRabbit
New Features
Tests
Documentation