smart_ide/docs/features/anythingllm-pull-sync-after-pull.md
Nicolas Cantu 58cc2493e5 chore: consolidate ia_dev module, sync tooling, and harden gateways (0.0.5)
Initial state:
- ia_dev was historically referenced as ./ia_dev in docs and integrations, while the vendored module lives under services/ia_dev.
- AnythingLLM sync and hook installation had error masking / weak exit signaling.
- Proxy layers did not validate proxy path segments, allowing path normalization tricks.

Motivation:
- Make the IDE-oriented workflow usable (sync -> act -> deploy/preview) with explicit errors.
- Reduce security footguns in proxying and script automation.

Resolution:
- Standardize IA_DEV_ROOT usage and documentation to services/ia_dev.
- Add SSH remote data mirroring + optional AnythingLLM ingestion.
- Extend AnythingLLM pull sync to support upload-all/prefix and fail on upload errors.
- Harden smart-ide-sso-gateway and smart-ide-global-api proxying with safe-path checks and non-leaking error responses.
- Improve ia-dev-gateway runner validation and reduce sensitive path leakage.
- Add site scaffold tool (Vite/React) with OIDC + chat via sso-gateway -> orchestrator.

Root cause:
- Historical layout changes (submodule -> vendored tree) and missing central contracts for path resolution.
- Missing validation for proxy path traversal patterns.
- Overuse of silent fallbacks (|| true, exit 0 on partial failures) in automation scripts.

Impacted features:
- Project sync: git pull + AnythingLLM sync + remote data mirror ingestion.
- Site frontends: SSO gateway proxy and orchestrator intents (rag.query, chat.local).
- Agent execution: ia-dev-gateway script runner and SSE output.

Code modified:
- scripts/remote-data-ssh-sync.sh
- scripts/anythingllm-pull-sync/sync.mjs
- scripts/install-anythingllm-post-merge-hook.sh
- cron/git-pull-project-clones.sh
- services/smart-ide-sso-gateway/src/server.ts
- services/smart-ide-global-api/src/server.ts
- services/smart-ide-orchestrator/src/server.ts
- services/ia-dev-gateway/src/server.ts
- services/ia_dev/tools/site-generate.sh

Documentation modified:
- docs/** (architecture, API docs, ia_dev module + integration, scripts)

Configurations modified:
- config/services.local.env.example
- services/*/.env.example

Files in deploy modified:
- services/ia_dev/deploy/*

Files in logs impacted:
- logs/ia_dev.log (runtime only)
- .logs/* (runtime only)

Databases and other sources modified:
- None

Off-project modifications:
- None

Files in .smartIde modified:
- .smartIde/agents/*.md
- services/ia_dev/.smartIde/**

Files in .secrets modified:
- None

New patch version in VERSION:
- 0.0.5

CHANGELOG.md updated:
- yes
2026-04-04 18:36:43 +02:00

40 lines
1.6 KiB
Markdown

# AnythingLLM — sync after `git pull`
## Goal
Upload files **added or modified** by a `git pull` (fast-forward or merge) to AnythingLLM without manual IDE actions.
## What it changes
- Each target repo can install a Git **`post-merge`** hook that calls `scripts/anythingllm-pull-sync/sync.mjs`.
- The same exclusions as **`.4nkaiignore`** (plus a few system patterns) apply.
- Deletions/renames are not mirrored as deletions in AnythingLLM in this version (upload only).
## Implementation (smart_ide repo)
- `scripts/anythingllm-pull-sync/`: Node (ESM) script + dependency `ignore`.
- `scripts/install-anythingllm-post-merge-hook.sh`: installs `.git/hooks/post-merge` with an absolute path to `sync.mjs`.
## Repo configuration (workspace slug)
Slug resolution order (first match wins):
1. `ANYTHINGLLM_WORKSPACE_SLUG` (env)
2. `.anythingllm.json` at repo root: `{ "workspaceSlug": "<slug>" }`
3. smart_ide `projects/<id>/conf.json` (matches `project_path` to the repo root; reads `smart_ide.anythingllm_workspace_slug[SMART_IDE_ENV]`, default env `test`)
## Deployment on the host
1. Run `npm install` in `scripts/anythingllm-pull-sync`.
2. Create `~/.config/4nk/anythingllm-sync.env` with:
- `ANYTHINGLLM_BASE_URL`
- `ANYTHINGLLM_API_KEY`
3. Install the hook:
- per repo: `./scripts/install-anythingllm-post-merge-hook.sh /path/to/repo`
- or for configured clones: `./scripts/install-anythingllm-post-merge-hook.sh --all`
## Observability
- stderr summary: `uploaded=`, `skipped=`, `errors=`, plus up to 20 error lines.
- If `ORIG_HEAD` is missing, or if URL/key/slug is missing: explicit message and exit code `0` (does not block the pull).