Lab #525: feat: giga-swamp phase 6 — Namespace-scoped sync

Problem

Phases 1-4 established namespace infrastructure, catalog schema, filesystem layout, and CEL cross-namespace queries. A single repo with a namespace works correctly against a filesystem datastore. But the multi-repo sharing scenario — the core value of giga-swamp — requires namespace-scoped sync. Today, push/pull syncs the entire datastore, and each repo's catalog only sees its own writes. Two repos sharing an S3 datastore can't see each other's data.

Context: Previous Scoped Sync Failures

Two previous scoped sync attempts were reverted:

PR #1386: Pulled files landed in cache but catalog wasn't re-indexed. data list returned 0 rows after cross-repo pull. Post-pull catalog rebuild is load-bearing.
PR #134: Treated directory dirty paths as file paths. Scoped push called Deno.readFile on a directory and failed silently.

The failure mode is always the same: push works, pull works in isolation, but the writer-pushes/reader-pulls/reader-queries flow breaks silently. Unit tests don't catch it. Only swamp-uat catches the cross-repo scenario.
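The PR #134 failure can be illustrated with a minimal sketch that expands a dirty path into the files beneath it before any read is attempted. The FsView interface, the expandDirtyPath helper, and the in-memory tree are hypothetical names introduced for illustration; they are not swamp's real APIs.

```typescript
// Hypothetical minimal filesystem view; not swamp's real interface.
interface FsView {
  isDirectory(path: string): boolean;
  list(path: string): string[]; // immediate children, as full paths
}

// Expand a dirty path into the files beneath it. Directories are walked
// recursively; a plain file path is returned unchanged.
function expandDirtyPath(fs: FsView, dirtyPath: string): string[] {
  if (!fs.isDirectory(dirtyPath)) return [dirtyPath];
  const files: string[] = [];
  for (const child of fs.list(dirtyPath)) {
    files.push(...expandDirtyPath(fs, child));
  }
  return files.sort();
}

// In-memory tree used only for illustration (assumed layout).
const tree: Record<string, string[]> = {
  "ns-a/model": ["ns-a/model/v1", "ns-a/model/meta.json"],
  "ns-a/model/v1": ["ns-a/model/v1/data.json"],
};
const memFs: FsView = {
  isDirectory: (p) => p in tree,
  list: (p) => tree[p] ?? [],
};

// The dirty path "ns-a/model" is a directory, so it is walked, not read.
const expandedFiles = expandDirtyPath(memFs, "ns-a/model");
```

A real implementation would sit on top of the actual filesystem calls; the point of the sketch is only that the directory check happens before any file read.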

Solution: Four Sub-Phases (Each Independently Shippable)

Each sub-phase ships as a separate PR, gated independently on swamp-uat. If one breaks, revert it without touching the others. The system works in a degraded-but-correct state after each sub-phase.

Sub-phase 6a: Plumbing — namespace on DatastoreSyncOptions

Add optional namespace field to DatastoreSyncOptions. Pass it through the sync coordinator to pullChanged/pushChanged. Extensions that ignore it keep syncing everything — identical to today. Zero behavior change.
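A minimal sketch of the 6a plumbing, under assumed shapes: only the names DatastoreSyncOptions, namespace, pullChanged, and pushChanged come from the text above. The SyncExtension interface, the string return values, and syncBoth are hypothetical stand-ins chosen to make the "zero behavior change" property observable.

```typescript
// Assumed shape: the proposal only states that `namespace` is optional.
interface DatastoreSyncOptions {
  namespace?: string; // absent means "sync everything", as today
}

// Hypothetical extension surface; return values describe what was synced.
interface SyncExtension {
  pushChanged(options: DatastoreSyncOptions): string;
  pullChanged(options: DatastoreSyncOptions): string;
}

// An extension written before 6a: it never looks at `namespace`.
const legacyExtension: SyncExtension = {
  pushChanged: (_options) => "push:*",
  pullChanged: (_options) => "pull:*",
};

// Hypothetical coordinator step: passes the options through untouched.
function syncBoth(
  ext: SyncExtension,
  options: DatastoreSyncOptions = {},
): string[] {
  return [ext.pushChanged(options), ext.pullChanged(options)];
}
```

Because the field is optional and the legacy extension ignores it, calling syncBoth with or without a namespace yields the same result, which is the property the 6a ship gate checks for.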

Ship gate: all tests pass, swamp-uat passes, behavior is byte-identical to before.

Sub-phase 6b: Per-namespace index partitioning

Replace the monolithic .datastore-index.json with per-namespace indexes at {namespace}/.datastore-index.json. Solo mode keeps one index. The sync coordinator passes namespace to push/pull so only the repo's own namespace subtree is synced.

This is the highest-risk sub-phase. The markDirty contract has 8 load-bearing rules. Namespace support is additive (it adds a field to DatastoreSyncOptions) and does NOT change how dirty tracking works. The dirty path is a directory, not a file.
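The index layout in this sub-phase can be sketched as a pair of small helpers. This assumes that solo mode keeps its single index at the datastore root and that a namespace subtree is identified by a path prefix; indexPathFor and selectForSync are hypothetical names, not existing functions.

```typescript
const INDEX_FILE = ".datastore-index.json";

// Per-namespace index at {namespace}/.datastore-index.json.
// Assumption: solo mode (no namespace) keeps the single root-level index.
function indexPathFor(namespace?: string): string {
  return namespace ? `${namespace}/${INDEX_FILE}` : INDEX_FILE;
}

// Keep only paths inside the repo's own namespace subtree.
// The trailing slash prevents "team-a" from matching "team-ab/...".
function selectForSync(paths: string[], namespace?: string): string[] {
  if (!namespace) return paths;
  const prefix = `${namespace}/`;
  return paths.filter((p) => p.startsWith(prefix));
}
```

For example, selectForSync(["team-a/x", "team-ab/y"], "team-a") keeps only "team-a/x", while omitting the namespace returns every path unchanged.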

Ship gate: swamp-uat datastore suite passes. Critical test path: writer repo pushes data, reader repo pulls, reader runs data list, data is visible with correct provenance. Post-pull catalog rebuild must fire.

Sub-phase 6c: Foreign catalog export/pull

Each namespace publishes a lightweight .catalog-export.json containing its catalog rows as a flat JSON array. Foreign catalog pull fetches these exports and upserts into the local catalog. This is net-new sync infrastructure — additive, nothing existing depends on it.

The local catalog backfill (full-replace) needs to become namespace-scoped: delete own-namespace rows, insert own-namespace rows, preserve foreign rows. This was deferred from Phase 3 and is now required.
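The namespace-scoped backfill described above amounts to replacing only the rows owned by the local namespace. A minimal sketch over an in-memory array follows; the CatalogRow fields and the backfillOwnNamespace name are assumptions for illustration, and the real catalog is presumably a database table rather than an array.

```typescript
// Hypothetical row shape; the real catalog schema has more columns.
interface CatalogRow {
  namespace: string;
  model: string;
  name: string;
}

// Delete own-namespace rows, insert fresh own-namespace rows,
// and leave rows from every other namespace untouched.
function backfillOwnNamespace(
  existing: CatalogRow[],
  ownNamespace: string,
  fresh: CatalogRow[],
): CatalogRow[] {
  const foreign = existing.filter((row) => row.namespace !== ownNamespace);
  const own = fresh.filter((row) => row.namespace === ownNamespace);
  return [...foreign, ...own];
}
```

Filtering the fresh rows by namespace as well guards against a backfill accidentally writing rows it does not own.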

Also adds a swamp datastore catalog pull command and a lastSynced timestamp on foreign catalog data (Design Decision 7).
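The foreign catalog pull in this sub-phase can be sketched as parsing a flat JSON array and upserting each row with a lastSynced stamp. The row fields, the composite key, and the upsertForeignCatalog name are assumptions; only .catalog-export.json being a flat JSON array and the lastSynced field come from the text.

```typescript
// Hypothetical exported row; `version` is an illustrative field only.
interface ExportedRow {
  namespace: string;
  model: string;
  name: string;
  version: number;
}

interface ForeignRow extends ExportedRow {
  lastSynced: string; // ISO 8601 timestamp of the pull
}

// Upsert rows from a .catalog-export.json payload into a local map.
// Existing rows with the same key are overwritten, new rows are added.
function upsertForeignCatalog(
  local: Map<string, ForeignRow>,
  exportJson: string,
  now: Date,
): void {
  const rows = JSON.parse(exportJson) as ExportedRow[];
  for (const row of rows) {
    const key = `${row.namespace}:${row.model}:${row.name}`;
    local.set(key, { ...row, lastSynced: now.toISOString() });
  }
}
```

Pulling the same export twice leaves one row per key, with the latest version and timestamp, which is the behavior an upsert is meant to provide.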

Ship gate: writer pushes data, exports catalog. Reader pulls foreign catalog. Reader queries cross-namespace data via CEL (ns:model syntax from Phase 4). Metadata is visible, content is not yet (that's 6d).

Sub-phase 6d: On-demand cross-namespace content fetch

When a cross-namespace CEL expression accesses another namespace's data content (e.g. attributes), the content is fetched on demand from the remote datastore. Single-file GET, not a full sync. Cached locally for the command duration but not persisted — foreign content is ephemeral.

Adds optional fetchForeignContent method to DatastoreSyncService. Extensions that don't implement it return null (content unavailable).
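A minimal sketch of the on-demand fetch with a command-scoped cache, under assumptions: the fetchForeignContent parameter list and string return type are guesses, and ForeignContentCache, remoteService, and legacyService are hypothetical names. Only the method name, its optional status, and the null result for non-implementing extensions come from the text.

```typescript
// Assumed signature; the proposal only names the optional method.
interface DatastoreSyncService {
  fetchForeignContent?(namespace: string, path: string): Promise<string | null>;
}

// In-memory cache intended to live for one command and then be discarded,
// so foreign content is never persisted.
class ForeignContentCache {
  private readonly entries = new Map<string, string | null>();
  private readonly service: DatastoreSyncService;

  constructor(service: DatastoreSyncService) {
    this.service = service;
  }

  async get(namespace: string, path: string): Promise<string | null> {
    const key = `${namespace}/${path}`;
    const cached = this.entries.get(key);
    if (cached !== undefined) return cached;
    // Extensions without the method yield null (content unavailable).
    const content =
      (await this.service.fetchForeignContent?.(namespace, path)) ?? null;
    this.entries.set(key, content);
    return content;
  }
}

// Illustrative services: one that implements the method, one that does not.
let remoteGets = 0;
const remoteService: DatastoreSyncService = {
  fetchForeignContent: async (_namespace, path) => {
    remoteGets++;
    return `content of ${path}`;
  },
};
const legacyService: DatastoreSyncService = {};
```

Creating one ForeignContentCache per command and dropping it afterwards gives the "cached for the command duration but not persisted" behavior without any on-disk state.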

Ship gate: data.latest('ns:model', 'name').attributes returns the actual content from the foreign namespace, fetched on demand.

Critical Constraints

Do NOT change the markDirty contract — 8 load-bearing rules extension authors depend on
Do NOT change DEFAULT_DATASTORE_SUBDIRS — destructive migration side effects
Do NOT restructure acquireModelLocks without verifying catalog rebuild is preserved
Do NOT assume markDirty relPath is a file path — it's a directory path from the data repo
Do NOT trust unit tests alone for sync changes — swamp-uat is the only reliable validation
Do NOT benchmark as validation — Phase 2 of the previous scoped sync showed 5x improvement by accidentally dropping most of the work

Verification Requirements (Non-Negotiable)

Before EVERY sub-phase PR:

deno check, deno lint, deno fmt, deno run test
deno run compile — build candidate binary
Run swamp-uat datastore suite against candidate binary
Critical test path: writer repo pushes → reader repo pulls → reader runs data list → data is visible with correct provenance
Verify catalog rebuild fires after pull
Solo mode regression gate: byte-identical behavior

Design Context

Full design doc: resources/giga-swamp.md (see the Sync, Namespace-Scoped Sync, and Relationship to Scoped Sync Work sections)

This is Phase 6 of 7. Depends on Phases 1-4 (all shipped). Phase 5 (CLI output) and Phase 7 (migration commands) will follow after Phase 6.
