BrowserOS

mirror of https://github.com/browseros-ai/BrowserOS.git synced 2026-05-21 04:45:12 +00:00

Author	SHA1	Message	Date
Nikhil	c07d3d95d4	feat: add sqlite drizzle persistence (#919 ) * feat: add drizzle agent schema * feat: run sqlite drizzle migrations * refactor: remove old sql identity dependency * feat: store harness agents in sqlite * build: package db migrations * refactor: remove sqlite oauth token store * feat: restore oauth token storage * fix: handle empty install id * chore: ignore server runtime state * fix: address review feedback for PR 919	2026-05-02 15:19:57 -07:00
Nikhil	d38b01a8c7	feat(dev): add guided cleanup and reset commands (#890 ) * feat(dev): add guided cleanup and reset commands * fix: address cleanup reset review feedback	2026-04-30 12:27:15 -07:00
Nikhil	fd5aba249b	fix: stabilize OpenClaw gateway startup (#888 ) * feat(server): add shared process lock helper * feat(container): add container name reconciliation helpers * feat(openclaw): serialize lifecycle across processes * fix(openclaw): reconcile fixed gateway container startup * test(openclaw): cover lifecycle race recovery * fix(server): satisfy process lock error override * fix(openclaw): address review feedback * test(openclaw): align serialization mock with image check	2026-04-30 11:31:40 -07:00
Dani Akash	0c84547e8f	feat(agents): migrate OpenClaw chat onto the unified harness/ACP path (#859 ) * chore(acp): smoke-test ACP capabilities against running gateway Adds apps/server/scripts/acp-smoke.ts which spawns `openclaw acp` inside the gateway container and exercises every method we plan to depend on: initialize, newSession, prompt (text + image), cancel, listSessions, loadSession. SDK pinned to 0.19.1 (Bun's minimum-release-age policy blocks 0.20+ which were released < 7 days ago). Findings (full notes in plan outcomes): - promptCapabilities advertises image:true but the model does NOT see image bytes — silently dropped at the bridge. - sessionCapabilities advertises {list:{}} but session/list throws "Method not found": stale capability advertising. - loadSession works; replays user/assistant/thought text and session_info/usage/commands updates. No tool_call replay, as documented. - cancel works end-to-end: stopReason=cancelled. - closeSession/resumeSession are not on ClientSideConnection in 0.19.1; kill child to close, use loadSession for rebind. Plan revisions triggered by spike are recorded in plans/browseros-ai/BrowserOS/features/2026-04-28-2310-claude-code-acp-implementation-roadmap.md. * chore(acp): re-run smoke on SDK 0.21.0 and add mode/config/auth scenarios After bypassing Bun's minimum-release-age and upgrading the SDK to 0.21.0, restore the previously-skipped resume/close paths and add three new scenarios: mode (setSessionMode), config (setSessionConfigOption, correct configId field), and auth (authenticate noop). Findings, all bridge-side (independent of SDK): - session/list, session/resume, session/close all throw -32601 on OpenClaw 2026.4.12 — capability advertising is stale. - Image content blocks silently dropped; model never sees the bytes. - setSessionMode and setSessionConfigOption work; latter requires `configId` (not `optionId`) per the schema. - loadSession replays user/assistant/thought text + session_info + usage + available_commands; no tool_call replay (documented). - authenticate is a noop on OpenClaw (no authMethods advertised). Plan outcomes updated with full method-support matrix. * chore(deps): promote @agentclientprotocol/sdk to a runtime dependency The smoke script in apps/server/scripts/acp-smoke.ts used the SDK as devDependency. The upcoming ACP bridge (apps/server/src/api/services/acp/) needs it at runtime, not just for tooling. Move the entry from devDependencies to dependencies, alphabetically first under @a. Pinned to 0.21.0 — same version the smoke script validated against. README gains a small Dependencies note pointing at the future bridge location. No code changes yet. The bridge wiring lands in subsequent commits. fix(openclaw): wire LlmProvider.supportsImages through to OpenClaw model config When BrowserOS sets up a custom OpenAI-compat provider on the gateway, the agent UI's "Supports Image" flag (LlmProviderConfig.supportsImages) was being dropped on the floor. As a result the persisted model entry had no `input` field, OpenClaw defaulted it to ['text'], and image_url content parts were silently stripped before the model saw them. Fix: - Extend OpenClawSetupInput / OpenClawAgentMutationInput on the agent side (useOpenClaw.ts) and the route body schema + SetupInput + createAgent input on the server side with `supportsImages?: boolean`. - AgentsPage forwards `llmOption?.supportsImages` from the selected LlmProviderConfig in both handleSetup and handleCreate. - provider-map.resolveSupportedOpenClawProvider emits `input: ['text', 'image']` on the model entry when the flag is truthy; otherwise emits the explicit `['text']` so the value is always pinned (avoids relying on OpenClaw's implicit default). - applyBrowserosConfig adds `tools.media.image.enabled = true` to the bootstrap batch so the gateway's image-understanding pipeline is always wired up — per-model `input` still gates which models see images, this just enables the global path. ACP image content blocks are still dropped by the OpenClaw bridge — that's a separate bridge bug, not addressed here. This commit restores image support for the OpenAI-compat /v1/chat/completions path that the upcoming ACP chat panel will use as a carve-out for image-bearing prompts. Existing custom-provider configs are NOT auto-migrated; users will re-acquire image support either by re-running setup or by editing their model entries' `input` field manually. A migration pass for legacy installs is not in scope for this commit because the "supportsImages" intent isn't recoverable from the persisted config alone — the source of truth is the LlmProvider record on the agent side. * feat(agents): add OpenClaw to AgentAdapter union and catalog Extends AgentAdapter to 'claude' \| 'codex' \| 'openclaw' and adds the OpenClaw entry to AGENT_ADAPTER_CATALOG. The new entry has: - defaultModelId: 'default' — OpenClaw's ACP bridge does not surface per-session model selection (verified during the ACP spike), so models live in the OpenClawService config, not in the adapter catalog. AgentDefinition.modelId carries the gateway-side model name for display only. - models: [] — empty list signals "no per-session model picker" in the UI; isSupportedAgentModel('openclaw', undefined\|'default') returns true via the existing fallback path. - reasoningEfforts mirror OpenClaw's session-level `thought_level` config option (off / minimal / low / medium / high / adaptive). Also extends: - isAgentAdapter type guard recognizes 'openclaw' - HarnessAgentAdapter union on the extension side - agents.test.ts createAgent fake type - agent-catalog.test.ts asserts on the new entry, empty model list passthrough behavior, and OpenClaw's reasoning effort set Lockfile delta is the workspace SDK pin reconciling 0.20.0 (taken from dev's lock) up to our package.json's 0.21.0 (added in `c1d987ea`). acpx still uses 0.20.0 transitively — both are present. No runtime wiring yet — the registry override and AcpxRuntime plumbing land in subsequent commits. * feat(agents): plumb OpenClaw gateway accessors into AcpxRuntime Adds an optional `openclawGateway` accessor to AcpxRuntime so the upcoming registry override (Step 4) can spawn `openclaw acp` inside the gateway container with the right port, token, and container/VM identity. All accessors are getter-shaped so values stay live across gateway restarts (port can change, token can rotate). The accessor is threaded: server.ts → createAgentRoutes → AgentHarnessService → AcpxRuntime ↘ sidepanel lazy AcpxRuntime Also adds OpenClawService.getGatewayToken() returning the in-memory token string. We pass it via OPENCLAW_GATEWAY_TOKEN env var on the spawn (per OpenClaw's documented env-var precedence) instead of via `--token` flag (which leaks to ps aux) or `--token-file` path (no discrete token file lives inside the container — the token is nested inside openclaw.json). Wiring is dormant — the registry override that consumes these accessors lands in Step 4. Typecheck + existing acpx/harness/routes tests pass unchanged. * refactor(agents): scrub local plan-step references from code comments Replaces forward-looking comments that referenced internal plan steps (e.g. "Step 4 wires this into…") with comments that justify the code on its own merits. Plan files live locally on the contributor's machine, so cross-references are noise to the rest of the project. No behavior change. * feat(agents): spawn openclaw ACP adapter inside the gateway container When the harness resolves the `openclaw` adapter, it now returns a command that runs `openclaw acp` inside the bundled gateway container via `limactl shell <vm> -- nerdctl exec -i ... openclaw acp --url ws://127.0.0.1:<port>`. This reuses the openclaw binary already installed alongside the gateway — no host-side openclaw install is required. Auth: the gateway token is injected via OPENCLAW_GATEWAY_TOKEN on the container exec rather than `--token` on the openclaw CLI, so the secret never appears in `ps aux`. Banner output: OPENCLAW_HIDE_BANNER=1 and OPENCLAW_SUPPRESS_NOTES=1 keep stdout JSON-RPC-clean. LIMA_HOME: prefixed via `env LIMA_HOME=<path>` on the resolved command so the spawned limactl finds the BrowserOS-owned VM (the server doesn't set LIMA_HOME on its own process env). When the gateway accessor is absent, falls through to acpx's built-in openclaw adapter which assumes a host-side install — that branch will fail at spawn time with a descriptive error. Verified end-to-end via the existing acp-smoke script during the Step 0 spike. * feat(agents): dual-create OpenClaw harness agents on the gateway When the harness creates an `openclaw` adapter agent, it now also provisions a matching agent on the OpenClaw gateway via the existing CLI path (OpenClawService.createAgent). Symmetric on delete: gateway removeAgent runs alongside the harness-store delete. - Adds an OpenClawProvisioner interface (decoupled from OpenClawService for testability) and injects it through AgentHarnessService. - createAgent rolls back the harness record if gateway provisioning fails; deleteAgent tolerates gateway-side failures so harness identity stays consistent with the user-facing UI. - New OpenClawProvisionerUnavailableError surfaces as a 503 when an openclaw create request lands on a harness with no provisioner wired in (instead of a generic 500). - FileAgentStore mints openclaw agent ids with an 'oc-' prefix so the id satisfies the gateway's `^[a-z][a-z0-9-]$` agent name pattern. Other adapters keep raw UUIDs to preserve compatibility. - POST /agents body schema accepts providerType / providerName / baseUrl / apiKey / supportsImages, forwarded to the provisioner when adapter='openclaw'. The agents-page UI still routes openclaw create through the legacy /claw/agents flow; switching that path to the harness is a separate UI cutover. Tests cover: gateway dual-create on success, rollback on gateway failure, 503 when provisioner is missing, and tolerant delete on gateway-side failure. fix(agents): skip catalog model validation for OpenClaw adapter OpenClaw agents resolve their model from the gateway-side provider config (set at agent-create time via OpenClawService) rather than from the harness catalog, which has an empty `models: []` entry by design. Without this carve-out, every OpenClaw create body fails parsing with "Invalid modelId" because no concrete model id can satisfy isSupportedAgentModel('openclaw', ...). The reasoning-effort check still runs against the catalog (those values map directly to OpenClaw's session `thought_level` config option). * fix(agents): pass --session to openclaw bridge so newSession routes correctly acpx's AcpClient.createSession calls connection.newSession with cwd and mcpServers but never forwards the sessionKey. Without it, the openclaw bridge falls back to a synthetic acp:<uuid> session that doesn't resolve to any provisioned gateway agent — every harness chat returns a generic "Internal error" from -32603. Fix: bake `--session <key>` into the resolved spawn command. The bridge then uses that as the default session key for any newSession the bridge receives, routing the turn to the matching gateway agent. Per-session keying means each openclaw agent gets its own AcpxCoreRuntime instance (cached by sessionKey on top of the existing cwd/permissionMode key). This adds one extra runtime per active openclaw session — claude/codex are unaffected. Test asserts the resolved command includes the right --session arg. * fix(agents): suppress BrowserOS MCP for openclaw bridge The openclaw ACP bridge rejects newSession when mcpServers is non-empty because its provider tooling comes from the gateway, not from ACP-side MCP servers. Forwarding the BrowserOS HTTP MCP made every harness chat fail with a JSON-RPC -32603 "Internal error" before the session was even opened. Claude/codex still need the BrowserOS MCP for browser tooling, so the carve-out is keyed off whether the runtime is for an openclaw session. * feat(agents): route OpenClaw chat through the harness behind a flag Adds the `feature.useAcpxForOpenClaw` extension storage flag. When on, OpenClaw agents in the agent-command chat panel use the harness /agents/<id>/chat SSE and harness history hook instead of the legacy /claw/agents/<id>/chat. When off, behavior is unchanged. Also dedupes the agent rail when the same id appears in both stores (dual-created agents from /claw/agents and /agents) by preferring the harness entry — without this, every dual-created OpenClaw agent shows up twice after Step 5. Image attachments are temporarily disabled when the harness path is active; the carve-out lands in the next commit. * fix(agents): keep legacy OpenClaw agents on ClawChat The previous commit's flag-gated branch routed every `source='openclaw'` agent through `/agents/<id>/chat` when the flag was on, but the layout dedup means the only agents that ever reach that branch are legacy gateway-only entries (`main`, orphan agents from rolled-back creates) — which by definition have no harness record, so the harness path 404s and chat is unusable. Source is the only routing signal again: harness agents go through the harness, legacy agents stay on ClawChat. The storage flag stays for Step 9/10's migration story. * feat(agents): expose OpenClaw in sidepanel and route through gateway main `buildSidepanelChatTargets` now emits a single default ACP target for adapters with no per-session model picker (OpenClaw, whose model is configured on the gateway-side agent). Without this, OpenClaw never appeared in the sidepanel target picker because the catalog entry has `models: []`. Sidepanel sessions don't have a dedicated provisioned gateway agent. The openclaw bridge `--session` flag previously got the raw sidepanel key (`sidepanel:<convId>:openclaw:...`), which doesn't match any gateway agent — newSession was accepted but every prompt hung forever. The bridge command now rewrites non-harness session keys onto the always-present `main` gateway agent, encoding the original key as a channel suffix to keep state segregated per conversation. Verified end-to-end via curl: sidepanel openclaw chat streams `text-delta` + `finish: stop`. * feat(agents): backfill harness records for legacy gateway agents Reframes Step 9 of the OpenClaw-on-acpx migration. The plan's literal Step 9 (route OpenClaw history through the harness when the flag is on) was already a no-op after the Step 6 walkback — history is routed by source today. The actual blocker for Steps 10–13 was that legacy gateway-only agents (e.g. `main`, orphans from rolled-back creates) had no harness record, so they could never migrate to the harness path without breaking chat. `AgentHarnessService.reconcileWithGateway()` now lists every gateway agent and upserts a matching harness record for any that are missing. The pass runs lazily on first `listAgents()` call (memoized on success, retried on failure so a gateway-down boot doesn't permanently disable backfill). Verified end-to-end: the legacy `agent` agent now streams `text_delta` + `done(end_turn)` through `/agents/agent/chat`, with the bridge resolving to the gateway's `agent` record via the existing `agent:<name>:main` session-key format. After this, every OpenClaw agent surfaces as `source='agent-harness'` post-dedup, the legacy `useClawChatHistory` hook becomes unreachable for OpenClaw, and Steps 11–13 (delete legacy chat/history paths) are unblocked. * fix(agents): drop duplicate OpenClaw entry from NewAgentDialog adapter list The adapter Select hardcoded an `<SelectItem value="openclaw">OpenClaw</SelectItem>` on top of iterating `adapters`, which now includes OpenClaw post the catalog change. The dropdown rendered "OpenClaw" twice — once at the top, once at the bottom of the list. The literal was a pre-catalog artifact; removing it leaves a single OpenClaw entry sourced from the catalog. Routing into `handleOpenClawCreate` is unchanged because the value (`'openclaw'`) is identical either way. * fix(agents): always reconcile harness with gateway on list, just dedupe concurrent calls Memoizing the first successful reconcile meant new gateway agents (created via the legacy /claw/agents path or out-of-band CLI) never appeared in the harness until server restart. The Promise now serves as a concurrent-call dedupe only — cleared on settle — so every listAgents call picks up the current gateway state. Reconcile is one cheap idempotent CLI call. * chore(agents): remove dormant useAcpxForOpenClaw flag The flag was scaffolded in Step 6 but its routing effect was walked back the same day after it broke chat for legacy gateway-only agents. After the Step 9 backfill, every OpenClaw agent has a harness record and routes through the harness path purely from `source='agent-harness'` — no flag is consulted anywhere. Remove the dead storage item, hook, and stale comment. * refactor(agents): drop legacy /claw/agents/:id/history endpoint The harness /agents/:id/sessions/main/history endpoint replaced this once every OpenClaw agent got a harness record (Step 9 backfill). Routing is fully source-driven now, so the UI's useClawChatHistory hook is never enabled today — verified live: legacy URL returns 404, harness history hydrates correctly for the same agent. Removes the GET /claw/agents/:id/history route, OpenClawService's getAgentHistoryPage method plus its cursor/limit helpers and the history-only types it owned (BrowserOSOpenClawHistoryPageResponse, HistoryPageInput, normalizeHistoryLimit, encodeHistoryCursor, decodeHistoryCursor, jsonlEventsToHistoryItems), and the route + service tests that covered the dropped endpoint. OpenClawJsonlReader stays alive — still feeds /claw/dashboard, /claw/agents/:id/sessions, and the boot-time clawSession seed. Removing those is its own follow-up since the dashboard would need a harness-side replacement first. * feat(agents): wire image attachments through the harness ACP path Composer attachments now flow into the ACP `prompt` request as spec-compliant `image` content blocks alongside the user's text. End to end: composer → chatWithHarnessAgent({attachments}) → POST /agents/:id/chat {message, attachments} → parseChatBody decodes data: URLs to {mediaType, base64} → AgentHarnessService.send forwards → AcpxRuntime.send forwards → acpx startTurn({attachments}) → ACP image blocks UI no longer disables the attach button on harness agents — the gating was just a placeholder before this commit landed. Verified end to end with a 1×1 red PNG against a Claude harness agent: model replies "Red." correctly. OpenClaw's `acp` bridge still drops image content blocks upstream (verified by the same probe — Kimi-k2p5 reports "I don't see an image"). That's an upstream openclaw limitation, not a harness-side gap; Claude/Codex agents work as advertised today. * chore(openclaw): delete OpenClawJsonlReader and JSONL-backed routes * chore(openclaw): remove legacy /claw/agents/:id/chat and /queue routes * chore(agents): collapse chat panel to harness-only path * feat(agents): route OpenClaw image turns through the gateway HTTP client The OpenClaw `acp` bridge silently drops ACP `image` content blocks (verified during dogfood — model says "I don't see an image"). When the user attaches images to an OpenClaw agent, the harness now diverts that turn to the gateway's HTTP `/v1/chat/completions` endpoint, which accepts OpenAI-style `image_url` parts and forwards them natively to the provider. - New `OpenClawGatewayChatClient` translates an OpenAI streaming response into the same `AgentStreamEvent` shape the rest of the harness already consumes, so the chat panel renders identically whether a turn went through ACP or the gateway carve-out. - `AcpxRuntime.send` forks at the top: openclaw + any image attachment + a wired gateway client → `sendOpenclawViaGateway`. Other turns (text-only openclaw, claude, codex) take the existing ACP path unchanged. - The diverted path reads the prior turn history from the acpx session record so context is preserved, builds the OpenAI multimodal user message with text + image_url parts, and pumps the gateway SSE back to the caller through a tee that accumulates the assistant text. On natural completion, persists a synthetic user+assistant message pair to the acpx session record so reload shows the image turn in history. - Wired `OpenClawGatewayChatClient` into `AgentHarnessService` via `server.ts` (gateway port + token accessor, just like the existing `openclawGateway`). Persistence note: the acpx record requires User messages to carry an `id` and Agent messages to carry `tool_results` — without them the record fails to round-trip through `parseSessionRecord`. The persist helper now sets both. Limitation by design: image recognition only works if the OpenClaw agent's provider supports vision (e.g. Claude-via-OpenClaw, GPT-4o). The pipeline routes images correctly to the provider regardless; text-only providers like Kimi-k2p5 will reply "I don't see an image" because the model itself has no vision capability — that's a provider config issue, not a routing bug. The unit test asserts the image_url part is present in the OpenAI request the gateway client sends. The wider plan (background-resilient chat, queue, replay) remains in `plans/.../2026-04-29-1527-...-background-resilient-chat-and-image-uploads.md` as Phases 3–12; this commit ships only Phases 1–2. * feat(agents): validate inbound image attachments on /agents/:id/chat The harness chat body parser was accepting any mediaType and any dataUrl length. The composer enforces these caps client-side but the endpoint also serves direct curl/script callers, so the server has to defend itself. Restores the same caps the legacy /claw/agents/:id/chat parser had before it was deleted in the migration: - 10 attachments per message - 5 MB raw image bytes (≈ 6.7 MB once base64-encoded plus prefix) - PNG / JPEG / WebP / GIF only - Must start with `data:` Each violation returns 400 with a specific error message instead of silently dropping or forwarding the payload.	2026-04-29 16:37:03 +05:30
Nikhil	1946ca0cf8	chore: clean up unused agent sdk (#855 )	2026-04-28 17:21:46 -07:00
Nikhil	91d3285aa0	feat: add ACP agent harness (#849 ) * feat: add acp agent runtime spike * feat: add agent harness catalog * feat: persist harness agents in json * feat: persist agent transcripts * feat: route harness service through agent records * feat: expose generic agent harness routes * feat: add harness agent frontend api * feat: create harness agents from agents page * feat: chat with persisted harness agents * chore: remove obsolete agent profile spike * chore: self-review fixes * fix: combine openclaw and harness agents UI * refactor: split agents page components * fix: hide persisted harness turns	2026-04-28 15:29:38 -07:00
Nikhil	0b91c735ab	chore: bump server version, offset and patch for release (#814 )	2026-04-24 12:05:47 -07:00
Nikhil	4d660874ad	feat: consolidate build tools package (#785 ) * feat(build-tools): scaffold package + cache dir helpers * feat(build-tools): manifest types + R2 helper * feat(build-tools): build-disk script with virt-customize + zstd * feat(build-tools): build-tarball script * feat(build-tools): emit-manifest + cache:sync * ci(build-tools): independent build-vm + build-agent workflows * chore: remove legacy container packages + workflows * fix: address review feedback for PR #785 * fix: stabilize VM build DNS in CI * fix: prioritize arm64 build workflows * fix: keep arm64 VM recipe simple * fix: set VM build DNS in apt command * fix: avoid guest DNS for VM package install * fix: limit VM PR checks to build-tools validation	2026-04-22 16:23:11 -07:00
Nikhil	819887a2c5	feat(vm-container): WS1 VM disk image pipeline (#783 ) * feat(vm-container): ship the WS1 VM disk image pipeline New Bun/TS workspace package @browseros/vm-container that produces a reproducible, versioned Debian 12 + Podman qcow2 disk image for arm64 and x64, and publishes it to Cloudflare R2 under vm/<version>/ with a per- version manifest.json and a latest.json pointer. - virt-customize-driven build with a git-tracked recipe DSL. - zstd-compressed artifacts; sha256 sidecars for compressed + uncompressed. - Public surface at @browseros/vm-container/schema exposes zod-validated VmManifest + R2 key helpers for WS4 to import; /download is a stub landing pad for WS4 to fill in. - Rollback on partial upload failure: any exception after the first successful put deletes all previously uploaded keys for that version. - GHA workflow build-vm-container.yml runs a matrix build per arch on native runners, an x64 Lima boot smoke test, and a gated publish job. - Full unit coverage for arch, r2-keys, manifest, recipe parser, and publish (rollback + happy path via aws-sdk-client-mock). * fix(vm-container): address review comments - Split buildDisk into prepareCustomizedDisk + finalizeArtifacts for testability. - Replace resolvePinnedSha's sentinel-prefix check with a positive sha256-hex regex test, switch base-image.ts placeholder to empty string. - Drop unused R2_VM_PREFIX from .env.example; document CDN_BASE_URL override precedence in README. - Replace SSH host-key explicit list in recipe with `ssh_host_` glob so .pub keys and future key types are also removed. - lima-boot: introduce BunRequestInit type for the unix fetch option and reject empty limactlPath loudly. - Extend publish test suite: mid-manifest-upload failure path verifies both arches' qcow+sha are rolled back and latest.json is never written. - Add missing tests: parseArch('ARM64') case-sensitivity rejection, composeVirtCustomizeArgv unresolved-substitution pass-through. fix(vm-container): pin a real Debian snapshot, switch verify to SHA-512, streaming download - Pin Debian base to bookworm/20260413-2447 with real SHA-512 values from upstream SHA512SUMS (the sentinel placeholder never corresponded to a real build). Debian cloud images only publish SHA512SUMS today, so switch base-image verification to SHA-512 throughout: rename BaseImage.sha256 → sha512, manifest field base_image_sha256 → base_image_sha512, base_image.sha256_url → sha512_url, debianSha256SumsUrl → debianSha512SumsUrl. Our own artifact hashes (compressed_sha256, uncompressed_sha256, recipe_sha256) stay SHA-256. - Fix downloadTo: previous Bun.write(dest, response) buffered the entire 300 MB response before writing (100% CPU, empty dir). Replace with a getReader() loop that streams chunks through Bun.file().writer(). - build CLI now auto-derives --version from today's date when omitted (defaults to YYYY.MM.DD-dev1); explicit --version still overrides. Broaden CALVER_REGEX to accept alphanumeric suffixes so -dev1/-rc1 tags are valid. New todayCalver() helper. - Update GHA workflow fallback to github.run_number (shorter) instead of run_id. * fix(vm-container): resolve copy-in paths against recipeDir after substitution The copy-in path resolver checked op.src.startsWith('/') before running the {placeholder} substitution, so an absolute-after-substitution path like {manifest_tmp} → /tmp/vm-dist/manifest-stub-arm64.json was treated as relative and joined against recipeDir, producing a nonexistent path. Check the substituted value for absoluteness via path.isAbsolute. * fix: address review comments for 0422-ws1_vm_disk_pipeline * fix(ci): repair vm-container workflow * fix(ci): expose vm build logs on failure * fix(vm-container): expose base_image_sha256 in manifest per PRD The published manifest contract (consumed by WS4) now uses base_image_sha256 as the PRD specified. Internally the build still verifies the downloaded Debian base against the pinned sha512 (that's what Debian actually signs in SHA512SUMS) — then hashes the same bytes as sha256 and records that in the manifest. One extra digest pass of a ~300 MB file; negligible. - manifest.json: base_image_sha256 replaces base_image_sha512; sha512_url removed (not needed — sha256 is the consumer-facing hash). - CLI: --base-image-sha256 override validates against the locally-computed sha256 after download. - BuildResult.baseImage gains sha256 alongside sha512. - Tests updated to the new field. The auth.json bug (reviewer #2) is resolved: the source file is recipe/auth.json and the recipe emits `copy-in auth.json:/etc/containers/` so libguestfs writes /etc/containers/auth.json. * ci(vm-container): fix supermin kernel-read + rename sha512 inputs to sha256 - Ubuntu 24.04 GHA runners ship /boot/vmlinuz-* as mode 0600, which blocks libguestfs's supermin appliance builder when virt-customize runs as a non-root user. Chmod 0644 before the build — canonical CI workaround. - Rename workflow_dispatch input base_image_sha512 → base_image_sha256 and CLI flag --base-image-sha512 → --base-image-sha256 to match the orchestrator's renamed override. * ci(vm-container): give runner KVM access + install passt for libguestfs The supermin fix got us past appliance-build, but virt-customize then hit "passt exited with status 1". The passt networking helper misbehaves when libguestfs falls back to TCG emulation, which happens because the runner user isn't in the kvm group even though /dev/kvm exists on the GHA host. - chmod 0666 /dev/kvm → libguestfs uses hardware acceleration, avoids TCG. - install passt explicitly so the networking helper is present and current. * ci(vm-container): disable passt to force libguestfs slirp fallback libguestfs 1.54+ prefers passt for guest networking, but the passt binary on GHA ubuntu-24.04 exits with status 1 when invoked from the appliance — an AppArmor/capability issue that doesn't surface a useful diagnostic. The reliable workaround is to remove passt so libguestfs picks QEMU's built-in user-mode SLIRP as the network backend. SLIRP is slower but functional and doesn't require escalated privileges.	2026-04-22 14:04:00 -07:00
Nikhil	114d5e3a9f	feat: add agent container tarball pipeline (#782 ) * feat: add agent container tarball pipeline * docs: add agent-container env sample * refactor: simplify agent container pipeline * fix: address review feedback for PR #782 * fix: emit clean matrix JSON in CI * fix: align agent container artifact paths	2026-04-22 13:14:27 -07:00
Nikhil	7baee8d57e	chore: release server alpha - 0.0.88 (#747 )	2026-04-17 12:44:41 -07:00
Nikhil	c1b1e53a86	feat(ota): bundle full server resources tree in Sparkle payload (#726 ) * feat(ota): bundle full server resources tree (server + third_party bins) The OTA Sparkle payload now ships the complete resources/ tree the agent build produced, not just browseros_server. Every third-party binary (bun, ripgrep, podman, gvproxy, vfkit, krunkit, podman-mac-helper, win-sshproxy) flows to OTA-updated installs so podman integration works for users on the OTA channel, matching fresh Chromium-build installs. Extract the per-binary sign table into build/common/server_binaries.py so the Chromium-build sign path (modules/sign/) and OTA sign path (modules/ota/) share a single source of truth. Adding a new third-party dep is now a one-file edit that both paths pick up automatically; unknown executables under resources/bin/ are a hard error at release time. * fix(ota): address review comments on bundle signing flow - Avoid double-zipping during notarization: add notarize_macos_zip for pre-built Sparkle bundles so notarytool submits the zip directly instead of re-wrapping it through ditto --keepParent (Apple's service does not descend into nested archives). Keep notarize_macos_binary for single-binary callers. Share credential setup + submit logic via internal helpers. - Fail fast on unknown executables in sign_server_bundle_macos: collect the unknown-files list before any codesign call so a missing shared- table entry aborts in seconds, not after a full signing round. - Drop dead get_entitlements_path helper (no callers remain after the bundle refactor). * fix(ota): address PR review comments (greptile + claude) - sign_server_bundle_macos filters to executables only (p.is_file() + not p.is_symlink() + os.access X_OK) before applying the unknown-file guard. Non-Mach-O files (configs, dylibs, etc.) under resources/bin/ no longer cause misleading 'unknown executable' hard failures. - sign_server_bundle_windows now hard-errors on a missing expected binary instead of silently skipping it. Symmetric with the macOS guard — an incomplete bundle must not publish. - ServerOTAModule.execute() uses tempfile.TemporaryDirectory context managers for both the download and staging roots so they are cleaned up on every path, including failures. - Per-platform sign/notarize/Sparkle-sign failures now raise RuntimeError instead of silently skipping the platform — a release pipeline can no longer omit a target while reporting success. - Move import os and import shutil to the top of ota/sign_binary.py. - Drop unused log_error import from ota/server.py. * chore: bump server	2026-04-16 12:59:49 -07:00
Nikhil	f521ebc8dc	chore: bump server version, offset and patch for release (#721 )	2026-04-15 18:17:09 -07:00
Nikhil	dc26ff2554	chore: bump server, offset & patch for release (#715 )	2026-04-15 14:43:22 -07:00
Nikhil	0397d3e393	chore: release alpha: 0.0.83 (#695 )	2026-04-13 18:00:52 -07:00
Nikhil	ce7c209ba6	feat: add OpenClaw agent command center and terminal (#692 ) * feat: agent command center new tab with OpenClaw conversation history * feat: add web terminal for Podman container shell access * feat: align agent command center with new tab * fix: simplify agent command center styling * style: polish agent terminal layout and theming * style: simplify agent terminal styling * fix: address PR review comments for OpenClaw routes * fix: handle OpenClaw client start and error states * fix: resolve remaining OpenClaw review comments	2026-04-13 17:06:48 -07:00
Neel Gupta	14eeba7c20	Feat: Improved ACL robustness with semantic and fuzzy matching (#665 ) * feat: Add enhanced python-based ACL * fix: Port enhanced ACL to TypeScript * fix: greptile suggested bugs	2026-04-13 09:43:33 -07:00
Nikhil Sonti	3c629c5929	feat: tool approvals, governance dashboard, and execution history - Add tool approval system with per-category approval configuration - Build unified Governance dashboard (renamed from Admin) with pending approvals view and execution audit log - Move execution history tracking into the app shell - Extract buildChatRequestBody helper and add newtab system prompt - Add approval config change detection for mid-conversation rebuilds	2026-04-13 09:43:30 -07:00
Nikhil	6712e1d321	chore: bump server and extension version (#659 )	2026-04-08 10:18:24 -07:00
Nikhil	e5a852dd3d	chore: update server version (#644 )	2026-04-03 14:29:07 -07:00
shivammittal274	81350c0d7f	feat: replace model picker with shadcn Combobox + fuse.js fuzzy search (#617 ) The model picker in NewProviderDialog rendered inline, causing dialog resizing and lacked keyboard navigation. Replace it with a Popover + Command (shadcn Combobox) pattern and add fuse.js for fuzzy search. - Replace custom ModelPickerList with Popover + Command dropdown - Add fuse.js for fuzzy model search (replaces string.includes) - Add MODEL_SELECTED_EVENT and AI_PROVIDER_UPDATED_EVENT analytics - Enrich PROVIDER_SELECTED_EVENT with model_id in chat sessions	2026-03-30 16:38:21 +05:30
Nikhil	9bdb2413ec	feat: clean-up - remove obsolete controller extension (#610 ) * refactor(server): remove obsolete controller extension backend * fix: address review feedback for PR #610	2026-03-27 17:01:04 -07:00
Nikhil	d02b3f74e6	chore: update agent version (#608 )	2026-03-27 13:58:42 -07:00
Nikhil	86c62f14a5	chore: fix version number for extension (#606 )	2026-03-27 13:18:10 -07:00
Dani Akash	cee318a40b	fix: improve chat history freshness and reduce query payload (#598 ) * fix: add refresh indicator to chat history when fetching latest conversations Show a non-blocking "Fetching latest conversations" indicator at the top of the history list while the cached data is being refreshed. Users can still interact with the cached conversation list during the refresh. * perf: reduce chat history query payload — fetch last 2 messages instead of 5 The conversation list only displays the last user message as a preview. Fetching 5 messages per conversation was wasteful — each message contains the full UIMessage object (tool calls, reasoning, etc.) multiplied by 50 conversations per page. Reduced to last 2 which is sufficient to find the last user message in a user→assistant exchange. * perf: use first+DESC instead of last+ASC to push LIMIT down to SQL PostGraphile's `last: N` doesn't map to SQL LIMIT — it uses a padded LIMIT 10 and slices in application code. Changing to `first: 2` with ORDER_INDEX_DESC generates a true SQL LIMIT 2, reducing rows scanned from 500 to 100 per page (50 conversations × 2 vs 10 messages each). No UX impact — extractLastUserMessage() filters by role regardless of message order. * chore: update react query packages * feat: replace localforage with idb-keyval	2026-03-27 19:49:47 +05:30
Nikhil	aba7a10430	chore: server release (#592 )	2026-03-26 19:13:56 -07:00
Nikhil	220577b41c	feat: add CDN-hosted CLI installer flow (#588 ) * feat: add CDN upload flow for cli installers * fix: move cli install docs to top-level readme * fix: bun.lock update	2026-03-26 17:41:03 -07:00
shivammittal274	4e90b4561a	feat(eval): weekly eval pipeline with R2 uploads and trend dashboard (#516 ) * feat(eval): weekly eval pipeline with R2 uploads and trend dashboard Add infrastructure for running weekly evaluations and tracking score trends over time: - Auto-generated output dirs: results/{config-name}/{timestamp}/ Each eval run gets its own timestamped folder, nothing is overwritten. - upload-run.ts: uploads eval results to Cloudflare R2. Supports uploading a specific run or all un-uploaded runs for a config. - weekly-report.ts: generates an interactive HTML dashboard from R2 data. Config dropdown, trend chart with hover tooltips, searchable runs table. Groups runs by config name. - viewer.html: client-facing 3-column run viewer (task list, screenshots with autoplay, agent stream with messages.jsonl). Shows performance grader axis breakdown with per-axis scores. - browseros-agent-weekly.json: weekly benchmark config (kimi-k2p5, webbench-2of4-50, 10 workers, performance grader, headless). - eval-weekly.yml: GitHub Actions workflow with cron (Saturday 6am) and manual trigger. Runs on self-hosted Mac Studio runner. Concurrency group ensures only one eval runs at a time. - Dashboard updates: load previous runs, messages.jsonl viewer, grade badges show percentages, async stream loading. - Grader updates: timeout 30min, max turns 100, DOM content verification guidance for performance grader. * fix(eval): address Greptile review — injection, nested dirs, escaping - Fix script injection in eval-weekly.yml: pass github.event.inputs through env var instead of interpolating into shell - Fix /api/runs to enumerate nested results/{config}/{timestamp}/ dirs - Fix /api/load-run to allow single-slash run names (config/timestamp) - Add HTML escaping for R2-sourced values in weekly-report.ts - Escape axis names in viewer.html renderAxesBreakdown * fix(eval): fix biome lint — non-null assertion, template literals * fix(eval): fix biome errors — replace var with let, fix inner function declaration * fix(eval): address Greptile P2 issues - isRunDir: check all subdirs for metadata.json, not just first 3 - eval-runner: guard configPath for dashboard-driven runs (fallback to 'eval') - load-run: default unknown termination_reason to 'failed' not 'completed' * feat(eval): make BROWSEROS_BINARY configurable via env var	2026-03-21 22:12:52 +05:30
Nikhil	149cde118d	chore: bump server version, offset and patch for release (#512 )	2026-03-20 11:45:12 -07:00
Nikhil	6f398f0b36	fix: replace sharp with jimp to fix compiled binary crash (#510 ) sharp is a native C module (libvips) whose .node binaries can't be embedded in Bun compiled executables. It was imported at the top level in copilot-fetch.ts, crashing the entire server at startup. Replace with jimp (pure JavaScript, zero native deps) which bundles cleanly into compiled binaries. Same resize algorithm preserved. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 11:06:05 -07:00
Nikhil	1c737b0f02	chore: bump server version (#501 )	2026-03-19 16:17:50 -07:00
shivammittal274	720baaed3e	feat: add GitHub Copilot as OAuth LLM provider (#500 ) * feat: add GitHub Copilot as OAuth-based LLM provider Add GitHub Copilot as a second OAuth provider using the Device Code flow (RFC 8628). Users authenticate via github.com/login/device, and the server polls for token completion. Supports 25+ models through a single Copilot subscription. Key changes: - Device Code OAuth flow in token manager (poll with safety margin) - Custom fetch wrapper injecting Copilot headers + vision detection - Provider factory using createOpenAICompatible for Chat Completions API - Extension UI with template card, auto-create on auth, and disconnect * fix: address PR review comments for GitHub Copilot OAuth - Validate device code response for error fields (GitHub can return 200 with error payload) - Store empty refreshToken instead of access token for GitHub tokens - Add closeButton to Toaster for dismissing device code toast * fix: add github-copilot to agent provider factory The chat route uses a separate provider-factory.ts (agent layer) from the test-provider route (llm/provider.ts). Added createGitHubCopilotFactory to the agent factory so chat works with GitHub Copilot. * fix: add github-copilot to provider icons, models, and dialog - Add Github icon from lucide-react to providerIcons map - Add 8 Copilot models (GPT-4o, Claude, Gemini, Grok) to models.ts - Add github-copilot to NewProviderDialog zod enum, validation skip, canTest check, and OAuth credential message * fix: reorder copilot models with free-tier models first Put models available on Copilot Free at the top (gpt-4o, gpt-4.1, gpt-5-mini, claude-haiku-4.5, grok-code-fast-1), followed by premium models that require paid Copilot subscription. * fix: set correct 64K context window for Copilot models Copilot API enforces a 64K input token limit regardless of the underlying model's native context window. Updated all model entries and the default template to 64000 so compaction triggers correctly. * fix: use actual per-model prompt limits from Copilot /models API Queried api.githubcopilot.com/models for real max_prompt_tokens values. GPT-4o/4.1 have 64K, Claude/gpt-5-mini have 128K, GPT-5.x have 272K. Also updated model list to match what's actually available on the API (e.g. claude-sonnet-4.6 instead of 4.5, added gpt-5.4/5.2-codex). * feat: resize images for Copilot using VS Code's algorithm Large screenshots cause 413 errors on Copilot's API. Resize images following VS Code's approach: max 2048px longest side, 768px shortest side, re-encode as JPEG at 75% quality. Uses sharp for server-side image processing. * fix: address all Greptile P1 review comments - Add .catch() on fire-and-forget pollDeviceCode to prevent unhandled rejection crashes (Node 15+) - Add deduplication guard (activeDeviceFlows Set) to prevent concurrent device code flows for the same provider - Add runtime validation of server response in frontend before calling window.open() and showing toast - Remove dead GITHUB_DEVICE_VERIFICATION constant from urls.ts * fix: upgrade biome to 2.4.8, fix all lint errors, and address review bugs - Upgrade biome from 2.4.5 to 2.4.8 (matches CI) and migrate configs - Fix image resize: only re-encode when dimensions actually change - Fix device code polling: retry on transient network errors instead of aborting - Allow restarting device code flow (clear old flow instead of throwing 500) - Fix pre-existing noNonNullAssertion and noExplicitAny lint errors globally * fix: address Greptile P2 review — image resize and config guard - Fix early-return guard: check max/min sides against their respective limits (MAX_LONG_SIDE/MAX_SHORT_SIDE) instead of both against SHORT - Preserve PNG alpha: detect hasAlpha and keep PNG format instead of unconditionally converting to lossy JPEG - Keep browserosId guard in resolveGitHubCopilotConfig consistent with ChatGPT Pro pattern (safety check that caller context is valid) * feat: update Copilot models to full list from pricing page, default to gpt-5-mini Added all 23 models from GitHub Copilot pricing page. Ordered with free-tier models first (gpt-5-mini, claude-haiku-4.5), then premium. Changed default from gpt-4o to gpt-5-mini since it's unlimited on Pro plan and has 128K context (vs gpt-4o's 64K limit).	2026-03-20 02:33:09 +05:30
shivammittal274	cee9c764b1	fix(skills): read-only view mode for built-in skills (#494 ) * fix(skills): read-only view mode for built-in skills - SkillCard shows Eye icon + "View" for built-in, Pencil + "Edit" for user - SkillDialog in read-only mode: disabled fields, no toolbar on markdown editor, "View Skill" title, "Close" button, no "Update Skill" - Hide tip section in read-only mode * fix(skills): use react-markdown for read-only skill view Replace MDXEditor with react-markdown for viewing built-in skills. MDXEditor chokes on code fences, angle brackets, and image syntax causing content truncation. react-markdown handles standard markdown correctly with no rendering issues.	2026-03-19 23:48:51 +05:30
Dani Akash	d965698905	fix: biome & tsc setup across repo (#493 ) * fix: biome lint issues * fix: code quality workflow * fix: all lint issues * chore: test lefthook pre-commit hook * chore: test lefthook with agent file * chore: revert test comment from lefthook verification * feat: setup tsgo for typechecking agent * fix: typecheck cli command * fix: early return to prevent errors	2026-03-19 18:18:24 +05:30
Nikhil	22c5e85707	chore: bump server version (#478 )	2026-03-17 17:12:23 -07:00
Dani Akash	2a6848bc1d	feat: improved system prompt (#466 ) * feat: added ai-sdk dev tools * feat: new system prompt section * feat: tests to maintain prompt integrity * feat: update mcp sync to use react query * fix: refetch logic for sync * chore: remove limits on fetching integrations * fix: refetch integrations on delete * fix: review comment * chore: update tests * fix: improved memory classification * fix: lint issues * fix: core memory prompts * fix: handle scenario where soul file is empty	2026-03-17 19:01:10 +05:30
Nikhil	e2069bc999	chore: bump server version (#459 )	2026-03-16 16:42:54 -07:00
shivammittal274	29056226bb	feat: add eval framework and coordinate-based input tools (#453 ) - Add hover_at, type_at, drag_at coordinate tools to server - Add hoverAt, typeAt, dragAt methods to Browser class - Export server internals (browser, tool-loop, registry) for eval imports - Copy eval app from enterprise repo with agents, graders, runner, dashboard - Nest eval-targets inside apps/eval - Adapt sessionExecutionDir → workingDir for current server API - Add biome ignore for dashboard HTML to prevent lint breaking onclick handlers	2026-03-16 23:12:23 +05:30
Dani Akash	290ee91a8b	Add 'packages/browseros-agent/' from commit '90bd4be3008285bf3825aad3702aff98f872671a' git-subtree-dir: packages/browseros-agent git-subtree-mainline: `8f148d0918` git-subtree-split: `90bd4be300`	2026-03-13 21:22:09 +05:30

39 Commits