extension source rm leaves stale @local/. rows in catalog, blocking later pulls of registry types

The swamp-extension-model and swamp-troubleshooting skills currently tell agents to invoke deno check, deno test, deno fmt, and deno lint directly when developing an extension. This works on a developer's box that already has a system Deno, but it has two underlying problems:

Implementation lock-in. Inviting agents (and humans) to invoke Deno directly implicitly contracts that the runtime is Deno, that its CLI surface is part of swamp's public API, and that flag/output changes across Deno major versions are swamp's problem to absorb. Any future change to the bundled runtime — version bump with breaking CLI changes, or a swap to a different runtime entirely — becomes a user-visible break.
Version skew on extension code. Extensions are bundled and executed by swamp using its bundled deno. Tests run on a developer's system Deno are tested against a different runtime than the one swamp will actually use in production. Most of the time the difference is invisible; sometimes it surfaces as npm: resolution drift, JSR caching differences, or stdlib semantics mismatches.

This is the architectural follow-up to #195, which is being closed won't-fix. #195 proposed exposing ~/.swamp/deno/deno (or a swamp deno test passthrough) so agents could find swamp's bundled runtime when no system Deno was on PATH. That fix would have made both problems above worse, not better — it would have publicly committed swamp to "the bundled runtime is Deno, the binary lives at this path."

The right shape is a swamp-owned operation that runs the extension developer loop without exposing any Deno-shaped surface.

Proposed solution

Add swamp extension verify <manifest-or-extension-path> as an opaque extension-lifecycle verification primitive.

What it does (internally, today)

Type check (deno check against the extension file using the bundled deno + swamp's import-map context)
Lint (deno lint)
Format check (deno fmt --check)
Run colocated unit tests (*_test.ts next to the extension file, using the bundled deno with the same permission policy swamp uses for execution)
Quality rubric scoring (the same checks swamp extension push --dry-run already runs)

All of these already exist somewhere in swamp's internals (src/libswamp/extensions/fmt.ts, quality.ts, push.ts, EmbeddedDenoRuntime). verify is composition over existing infra, not new infrastructure.

What it deliberately does NOT do

Expose any Deno flags, paths, or runner semantics.
Accept arbitrary test-runner options (--filter, --reporter, --shuffle, --parallel, --no-check). The escape hatch for that is "bring your own Deno."
Become a generic Deno wrapper (swamp deno <args>).
Run smoke tests against live APIs — that's the existing swamp model method run smoke-testing protocol, separate concern.

The principle: the primitive must be extension-shaped, not deno-shaped. If it ever drifts toward swamp extension test, swamp extension check, swamp extension lint, etc. — separate per-tool subcommands — we've just rebranded the Deno CLI under a swamp prefix and inherited every flag the original surface had. That's worse than direct exposure: same lock-in, more code to maintain.

CLI shape

swamp extension verify <manifest-or-extension-path>
  --json              # structured output
  --skip-tests        # quick lint-only checks during iteration
  --skip-quality      # skip rubric scoring during iteration

Default exit codes: 0 = all stages pass, non-zero = at least one stage failed. JSON mode returns a { stages: [{ name, passed, output }], passed: bool } envelope so CI and agents can pivot on individual failures.

Skill update (in scope for the same PR)

Once verify exists, every place in .claude/skills/swamp-extension-model/ and .claude/skills/swamp-troubleshooting/ that currently says deno check / deno test / deno fmt / deno lint against extension code flips to swamp extension verify. That's a sweep, not a one-line edit, and it's the actual user-visible value of the new primitive — without it, agents keep reaching for the raw Deno commands and the abstraction does nothing.

The bundled-deno path (~/.swamp/deno/deno) stays undocumented and unsupported as a public path.

Alternatives considered

Expose ~/.swamp/deno/deno directly in skills. The original ask in #195. Rejected: locks swamp into "the runtime is Deno, the binary is at this path" forever.
swamp deno <args> passthrough command. Same lock-in as the path, just dressed up as a swamp subcommand. Every Deno flag becomes part of swamp's contract.
Per-tool subcommands (swamp extension test, swamp extension check, swamp extension fmt, swamp extension lint). Rebrands the Deno CLI under a swamp prefix without removing the lock-in.
Tell agents to install a system Deno. The "do nothing" option for the bundled binary, but it leaves the version-skew problem for extension tests unsolved and adds friction for users who'd rather not.
Fold test execution into swamp extension push --dry-run. Tempting because --dry-run already runs quality checks. Rejected because verify should be runnable mid-development, not only as a pre-publish step. (Worth checking during triage that --dry-run and verify end up sharing implementation rather than duplicating it.)

Open questions to settle during triage

Test discovery rule. Colocated *_test.ts next to the extension file is the obvious answer, but extensions sometimes have shared _lib/ test fixtures. Need to define what counts as "the extension's tests."
Permissions. What --allow-* set should verify pass to the bundled deno when running tests? Probably --allow-all (matching how swamp executes models), but worth being explicit so we don't surprise users whose tests touch the network or filesystem.
Bundle cache interaction. Should verify invalidate the model's bundle cache, run tests against the cached bundle, or run against source? Source is the obvious answer for tests-of-the-source; bundle is what runs in production. Possibly both in different stages.
Relationship to swamp extension push --dry-run. --dry-run already runs quality checks. Either verify is --dry-run minus the upload, or --dry-run becomes verify + upload simulation. Worth deduping rather than ending up with two near-identical commands.
Skill rewrite scope. The sweep across swamp-extension-model and swamp-troubleshooting should land in the same PR as the new primitive. The exact list of files touched should fall out during planning.

Impact

Closes the architectural gap that #195 surfaced. After this lands:

The original incident from #195 (an agent on a no-system-deno machine couldn't run deno test) becomes a non-issue: the agent runs swamp extension verify, swamp owns the runtime invocation, no Deno path lookup needed.
swamp keeps the freedom to upgrade the bundled Deno (or replace it) without coordinating with users.
Extension tests run on the runtime they'll actually execute on in production, eliminating the version-skew failure mode.

02Bog Flow

Open

5/1/2026, 10:46:38 PM

No activity in this phase yet.

03Sludge Pulse

bixu commented 5/4/2026, 12:21:02 PM

I like!

bixu commented 5/10/2026, 4:44:02 PM

Ran into another situation today where this would have been great (not having to think about where Deno is).

swamp issue get should rate-limit unauthenticated users instead of blocking

swamp issue get should not require authentication

telemetry: emit child entries for follow-up action method invocations

Publish release-candidate / unstable extension versions

extension source rm leaves stale @local/. rows in catalog, blocking later pulls of registry types

extension push drops binaries: field from re-emitted archive manifest.yaml

macOS launchd autoupdate (club.swamp.autoupdate) silently fails — binary stays stale

Clicking @hivemq/honeycomb extension card on /extensions shows 'Something went wrong'

Docs: Update model-definitions.md and workflows.md for direct type execution

Docs: document binaries manifest field in extension-manifest.md

Add an official @swamp/ssh extension for general-purpose SSH (brownfield-friendly)

Accept and display binaries field from extension push metadata

Direct type execution: collapse model create + method run into one command

Per-method telemetry events for workflow runs

Workflow run liveness: orphaned 'running' records when originating CLI process dies mid-run

Provide a CLI-shape primer for AI agents to reduce rediscovery overhead

quality rubric: don't penalize extensions whose upstream constrains them to a single platform

Extension update rejects multiple .ts files extending the same target type within one local extension (regression)

swamp extension push deadlocks when invoked from inside a swamp workflow step

Collective-scoped auth keys + OIDC federation for CI publishing

forEach self.* in modelIdOrName not resolved in runtime execution path

Award leaderboard points for referrals and collective invites

Add agent harness detection and AiTool to telemetry

Workflow-level runtime expressions (env.*, vault.*) not resolved in driverConfig — docker driver receives literal ${{ ... }} strings

Implement W5: Per-fingerprint import URLs + subprocess test harness (extension catalog rearchitecture)

swamp config set crashes with YAML serialization error

datastore compact: VACUUM fails in compiled binary (SQLITE_LIMIT_ATTACHED=0)

Repo-level version gating: minSwampVersion high-water mark for team consistency

Docs: document self.* expressions in modelIdOrName during forEach

Docs: How-to guide for background autoupdating

Manifest version bumps silently ignored for existing local extension aggregates

materialiseExtensions misclassifies pulled rows when manifest name collides with a pulled extension

Local extension edits don't reliably trigger rebundle

Missing unique indexes on user.email and user.username allow duplicate users

Resolve self.* expressions in modelIdOrName during forEach expansion

discord-bot double-sends sign_up notifications

Discord bot sends duplicate signup notifications

Add 'swamp workflow list' as alias for 'swamp workflow search'

Add 'swamp auth status' as alias for 'swamp auth whoami'

Extension bundle cache does not invalidate on source edits

extension pull fails on @local/[email protected] phantom-claim collision when local repo has its own extension

Lab search by numeric issue ID returns no results

W3 sourceToRow writes empty source_mtime — should carry filesystem mtime through Source entity

Warm-start rebundleAndUpdateCatalog should respect terminal RowStates set by reconcile

Implement W4: KindAdapter + unified loader (extension catalog rearchitecture)

Docs: add vault read-secret command to reference manual

Extension layer garbage collection: prune catalog rows + evict orphaned bundles

Docs: document workflow concurrency limits in reference manual

Locally-sourced extension: source_mtime updates without regenerating stale bundle

Extension push: allow shipping executable host helpers (bin/mudroom blocker)

Vault expressions silently deliver __SWAMP_VSEC__ sentinels under the docker driver

First-class shell-shim support for extensions, with registry-level visibility

Local extension model bundles don't rebuild when source changes (no rebuild CLI; manual cache delete breaks the runner)

Configurable concurrency limits for workflow fan-out (forEach, parallel jobs/steps)

feat(security): redact sensitive method arg values from audit log

docs: document swamp datastore compact and GC WAL behaviour

UAT: swamp datastore compact reclaims WAL and catalog space

Plan v4 step 9 literal test untestable under current catalog PK semantics

Cross-process concurrency stress for W2 lifecycle services

UAT additions for W2 lifecycle services (Install/Remove/Upgrade)

Implement W3: ReconcileFromDisk + freshness-as-aggregate-query (extension catalog rearchitecture)

doctor extensions repair: clean catalog-only orphans

Extensions should be able to ship Claude Code skills

Pre-existing TOCTOU windows in YAML repo walkers (findAll directory level + findById)

`swamp datastore setup` migration is not resumable / leaves the repo in a partial state on failure

Built-in models must honor AbortSignal so --timeout works in practice

Reader-lock or lock-free read path for data list/get/search/query

data search: surface jobTag alongside workflowTag and stepTag (follow-up to #237)

global-arg silently strips unknown keys; should reject via strict Zod schema

redmine extension: adopt state machine patterns from @magistr for agent-driven workflows

performance degrades significantly with large SQLite catalog

summarise: timeout/hang on repos with many workflow runs

extension quality/fmt fail on pulled extensions (path mismatch)

vault: add read-secret CLI command for agent-driven secret retrieval

data query: stepName and jobName fields always empty in CEL results

Extension ecosystem: shared utility library, version sync tooling, and HTTP resilience patterns

Agentic CLI improvements: --json stdout isolation, array inputs, and --repo-dir consistency

data delete fails with "Directory not empty (os error 39)" when concurrent writes are active

Extract LockfileRepository (W2 prequel for swamp-club#231)

Extend DatastoreSyncService.markDirty() with optional relPath argument

Workflow-level runtime expressions (env., vault.) not resolved in driverConfig — docker driver receives literal ${{ ... }} strings