BrowserOS

mirror of https://github.com/browseros-ai/BrowserOS.git synced 2026-05-20 20:39:10 +00:00

Author	SHA1	Message	Date
Nikhil	f2ac87d7c3	feat: show created agents in sidepanel (#865 ) * feat(agent): list created agents in sidepanel target catalog * feat(agent): show created agents in sidepanel selector * feat(server): add sidepanel chat route for created agents * feat(agent): route sidepanel agent sends by agent id * chore(agent): retire virtual sidepanel acp targets * fix: address review feedback for PR #865	2026-04-29 10:15:58 -07:00
Dani Akash	0c84547e8f	feat(agents): migrate OpenClaw chat onto the unified harness/ACP path (#859 ) * chore(acp): smoke-test ACP capabilities against running gateway Adds apps/server/scripts/acp-smoke.ts which spawns `openclaw acp` inside the gateway container and exercises every method we plan to depend on: initialize, newSession, prompt (text + image), cancel, listSessions, loadSession. SDK pinned to 0.19.1 (Bun's minimum-release-age policy blocks 0.20+ which were released < 7 days ago). Findings (full notes in plan outcomes): - promptCapabilities advertises image:true but the model does NOT see image bytes — silently dropped at the bridge. - sessionCapabilities advertises {list:{}} but session/list throws "Method not found": stale capability advertising. - loadSession works; replays user/assistant/thought text and session_info/usage/commands updates. No tool_call replay, as documented. - cancel works end-to-end: stopReason=cancelled. - closeSession/resumeSession are not on ClientSideConnection in 0.19.1; kill child to close, use loadSession for rebind. Plan revisions triggered by spike are recorded in plans/browseros-ai/BrowserOS/features/2026-04-28-2310-claude-code-acp-implementation-roadmap.md. * chore(acp): re-run smoke on SDK 0.21.0 and add mode/config/auth scenarios After bypassing Bun's minimum-release-age and upgrading the SDK to 0.21.0, restore the previously-skipped resume/close paths and add three new scenarios: mode (setSessionMode), config (setSessionConfigOption, correct configId field), and auth (authenticate noop). Findings, all bridge-side (independent of SDK): - session/list, session/resume, session/close all throw -32601 on OpenClaw 2026.4.12 — capability advertising is stale. - Image content blocks silently dropped; model never sees the bytes. - setSessionMode and setSessionConfigOption work; latter requires `configId` (not `optionId`) per the schema. - loadSession replays user/assistant/thought text + session_info + usage + available_commands; no tool_call replay (documented). - authenticate is a noop on OpenClaw (no authMethods advertised). Plan outcomes updated with full method-support matrix. * chore(deps): promote @agentclientprotocol/sdk to a runtime dependency The smoke script in apps/server/scripts/acp-smoke.ts used the SDK as devDependency. The upcoming ACP bridge (apps/server/src/api/services/acp/) needs it at runtime, not just for tooling. Move the entry from devDependencies to dependencies, alphabetically first under @a. Pinned to 0.21.0 — same version the smoke script validated against. README gains a small Dependencies note pointing at the future bridge location. No code changes yet. The bridge wiring lands in subsequent commits. fix(openclaw): wire LlmProvider.supportsImages through to OpenClaw model config When BrowserOS sets up a custom OpenAI-compat provider on the gateway, the agent UI's "Supports Image" flag (LlmProviderConfig.supportsImages) was being dropped on the floor. As a result the persisted model entry had no `input` field, OpenClaw defaulted it to ['text'], and image_url content parts were silently stripped before the model saw them. Fix: - Extend OpenClawSetupInput / OpenClawAgentMutationInput on the agent side (useOpenClaw.ts) and the route body schema + SetupInput + createAgent input on the server side with `supportsImages?: boolean`. - AgentsPage forwards `llmOption?.supportsImages` from the selected LlmProviderConfig in both handleSetup and handleCreate. - provider-map.resolveSupportedOpenClawProvider emits `input: ['text', 'image']` on the model entry when the flag is truthy; otherwise emits the explicit `['text']` so the value is always pinned (avoids relying on OpenClaw's implicit default). - applyBrowserosConfig adds `tools.media.image.enabled = true` to the bootstrap batch so the gateway's image-understanding pipeline is always wired up — per-model `input` still gates which models see images, this just enables the global path. ACP image content blocks are still dropped by the OpenClaw bridge — that's a separate bridge bug, not addressed here. This commit restores image support for the OpenAI-compat /v1/chat/completions path that the upcoming ACP chat panel will use as a carve-out for image-bearing prompts. Existing custom-provider configs are NOT auto-migrated; users will re-acquire image support either by re-running setup or by editing their model entries' `input` field manually. A migration pass for legacy installs is not in scope for this commit because the "supportsImages" intent isn't recoverable from the persisted config alone — the source of truth is the LlmProvider record on the agent side. * feat(agents): add OpenClaw to AgentAdapter union and catalog Extends AgentAdapter to 'claude' \| 'codex' \| 'openclaw' and adds the OpenClaw entry to AGENT_ADAPTER_CATALOG. The new entry has: - defaultModelId: 'default' — OpenClaw's ACP bridge does not surface per-session model selection (verified during the ACP spike), so models live in the OpenClawService config, not in the adapter catalog. AgentDefinition.modelId carries the gateway-side model name for display only. - models: [] — empty list signals "no per-session model picker" in the UI; isSupportedAgentModel('openclaw', undefined\|'default') returns true via the existing fallback path. - reasoningEfforts mirror OpenClaw's session-level `thought_level` config option (off / minimal / low / medium / high / adaptive). Also extends: - isAgentAdapter type guard recognizes 'openclaw' - HarnessAgentAdapter union on the extension side - agents.test.ts createAgent fake type - agent-catalog.test.ts asserts on the new entry, empty model list passthrough behavior, and OpenClaw's reasoning effort set Lockfile delta is the workspace SDK pin reconciling 0.20.0 (taken from dev's lock) up to our package.json's 0.21.0 (added in `c1d987ea`). acpx still uses 0.20.0 transitively — both are present. No runtime wiring yet — the registry override and AcpxRuntime plumbing land in subsequent commits. * feat(agents): plumb OpenClaw gateway accessors into AcpxRuntime Adds an optional `openclawGateway` accessor to AcpxRuntime so the upcoming registry override (Step 4) can spawn `openclaw acp` inside the gateway container with the right port, token, and container/VM identity. All accessors are getter-shaped so values stay live across gateway restarts (port can change, token can rotate). The accessor is threaded: server.ts → createAgentRoutes → AgentHarnessService → AcpxRuntime ↘ sidepanel lazy AcpxRuntime Also adds OpenClawService.getGatewayToken() returning the in-memory token string. We pass it via OPENCLAW_GATEWAY_TOKEN env var on the spawn (per OpenClaw's documented env-var precedence) instead of via `--token` flag (which leaks to ps aux) or `--token-file` path (no discrete token file lives inside the container — the token is nested inside openclaw.json). Wiring is dormant — the registry override that consumes these accessors lands in Step 4. Typecheck + existing acpx/harness/routes tests pass unchanged. * refactor(agents): scrub local plan-step references from code comments Replaces forward-looking comments that referenced internal plan steps (e.g. "Step 4 wires this into…") with comments that justify the code on its own merits. Plan files live locally on the contributor's machine, so cross-references are noise to the rest of the project. No behavior change. * feat(agents): spawn openclaw ACP adapter inside the gateway container When the harness resolves the `openclaw` adapter, it now returns a command that runs `openclaw acp` inside the bundled gateway container via `limactl shell <vm> -- nerdctl exec -i ... openclaw acp --url ws://127.0.0.1:<port>`. This reuses the openclaw binary already installed alongside the gateway — no host-side openclaw install is required. Auth: the gateway token is injected via OPENCLAW_GATEWAY_TOKEN on the container exec rather than `--token` on the openclaw CLI, so the secret never appears in `ps aux`. Banner output: OPENCLAW_HIDE_BANNER=1 and OPENCLAW_SUPPRESS_NOTES=1 keep stdout JSON-RPC-clean. LIMA_HOME: prefixed via `env LIMA_HOME=<path>` on the resolved command so the spawned limactl finds the BrowserOS-owned VM (the server doesn't set LIMA_HOME on its own process env). When the gateway accessor is absent, falls through to acpx's built-in openclaw adapter which assumes a host-side install — that branch will fail at spawn time with a descriptive error. Verified end-to-end via the existing acp-smoke script during the Step 0 spike. * feat(agents): dual-create OpenClaw harness agents on the gateway When the harness creates an `openclaw` adapter agent, it now also provisions a matching agent on the OpenClaw gateway via the existing CLI path (OpenClawService.createAgent). Symmetric on delete: gateway removeAgent runs alongside the harness-store delete. - Adds an OpenClawProvisioner interface (decoupled from OpenClawService for testability) and injects it through AgentHarnessService. - createAgent rolls back the harness record if gateway provisioning fails; deleteAgent tolerates gateway-side failures so harness identity stays consistent with the user-facing UI. - New OpenClawProvisionerUnavailableError surfaces as a 503 when an openclaw create request lands on a harness with no provisioner wired in (instead of a generic 500). - FileAgentStore mints openclaw agent ids with an 'oc-' prefix so the id satisfies the gateway's `^[a-z][a-z0-9-]$` agent name pattern. Other adapters keep raw UUIDs to preserve compatibility. - POST /agents body schema accepts providerType / providerName / baseUrl / apiKey / supportsImages, forwarded to the provisioner when adapter='openclaw'. The agents-page UI still routes openclaw create through the legacy /claw/agents flow; switching that path to the harness is a separate UI cutover. Tests cover: gateway dual-create on success, rollback on gateway failure, 503 when provisioner is missing, and tolerant delete on gateway-side failure. fix(agents): skip catalog model validation for OpenClaw adapter OpenClaw agents resolve their model from the gateway-side provider config (set at agent-create time via OpenClawService) rather than from the harness catalog, which has an empty `models: []` entry by design. Without this carve-out, every OpenClaw create body fails parsing with "Invalid modelId" because no concrete model id can satisfy isSupportedAgentModel('openclaw', ...). The reasoning-effort check still runs against the catalog (those values map directly to OpenClaw's session `thought_level` config option). * fix(agents): pass --session to openclaw bridge so newSession routes correctly acpx's AcpClient.createSession calls connection.newSession with cwd and mcpServers but never forwards the sessionKey. Without it, the openclaw bridge falls back to a synthetic acp:<uuid> session that doesn't resolve to any provisioned gateway agent — every harness chat returns a generic "Internal error" from -32603. Fix: bake `--session <key>` into the resolved spawn command. The bridge then uses that as the default session key for any newSession the bridge receives, routing the turn to the matching gateway agent. Per-session keying means each openclaw agent gets its own AcpxCoreRuntime instance (cached by sessionKey on top of the existing cwd/permissionMode key). This adds one extra runtime per active openclaw session — claude/codex are unaffected. Test asserts the resolved command includes the right --session arg. * fix(agents): suppress BrowserOS MCP for openclaw bridge The openclaw ACP bridge rejects newSession when mcpServers is non-empty because its provider tooling comes from the gateway, not from ACP-side MCP servers. Forwarding the BrowserOS HTTP MCP made every harness chat fail with a JSON-RPC -32603 "Internal error" before the session was even opened. Claude/codex still need the BrowserOS MCP for browser tooling, so the carve-out is keyed off whether the runtime is for an openclaw session. * feat(agents): route OpenClaw chat through the harness behind a flag Adds the `feature.useAcpxForOpenClaw` extension storage flag. When on, OpenClaw agents in the agent-command chat panel use the harness /agents/<id>/chat SSE and harness history hook instead of the legacy /claw/agents/<id>/chat. When off, behavior is unchanged. Also dedupes the agent rail when the same id appears in both stores (dual-created agents from /claw/agents and /agents) by preferring the harness entry — without this, every dual-created OpenClaw agent shows up twice after Step 5. Image attachments are temporarily disabled when the harness path is active; the carve-out lands in the next commit. * fix(agents): keep legacy OpenClaw agents on ClawChat The previous commit's flag-gated branch routed every `source='openclaw'` agent through `/agents/<id>/chat` when the flag was on, but the layout dedup means the only agents that ever reach that branch are legacy gateway-only entries (`main`, orphan agents from rolled-back creates) — which by definition have no harness record, so the harness path 404s and chat is unusable. Source is the only routing signal again: harness agents go through the harness, legacy agents stay on ClawChat. The storage flag stays for Step 9/10's migration story. * feat(agents): expose OpenClaw in sidepanel and route through gateway main `buildSidepanelChatTargets` now emits a single default ACP target for adapters with no per-session model picker (OpenClaw, whose model is configured on the gateway-side agent). Without this, OpenClaw never appeared in the sidepanel target picker because the catalog entry has `models: []`. Sidepanel sessions don't have a dedicated provisioned gateway agent. The openclaw bridge `--session` flag previously got the raw sidepanel key (`sidepanel:<convId>:openclaw:...`), which doesn't match any gateway agent — newSession was accepted but every prompt hung forever. The bridge command now rewrites non-harness session keys onto the always-present `main` gateway agent, encoding the original key as a channel suffix to keep state segregated per conversation. Verified end-to-end via curl: sidepanel openclaw chat streams `text-delta` + `finish: stop`. * feat(agents): backfill harness records for legacy gateway agents Reframes Step 9 of the OpenClaw-on-acpx migration. The plan's literal Step 9 (route OpenClaw history through the harness when the flag is on) was already a no-op after the Step 6 walkback — history is routed by source today. The actual blocker for Steps 10–13 was that legacy gateway-only agents (e.g. `main`, orphans from rolled-back creates) had no harness record, so they could never migrate to the harness path without breaking chat. `AgentHarnessService.reconcileWithGateway()` now lists every gateway agent and upserts a matching harness record for any that are missing. The pass runs lazily on first `listAgents()` call (memoized on success, retried on failure so a gateway-down boot doesn't permanently disable backfill). Verified end-to-end: the legacy `agent` agent now streams `text_delta` + `done(end_turn)` through `/agents/agent/chat`, with the bridge resolving to the gateway's `agent` record via the existing `agent:<name>:main` session-key format. After this, every OpenClaw agent surfaces as `source='agent-harness'` post-dedup, the legacy `useClawChatHistory` hook becomes unreachable for OpenClaw, and Steps 11–13 (delete legacy chat/history paths) are unblocked. * fix(agents): drop duplicate OpenClaw entry from NewAgentDialog adapter list The adapter Select hardcoded an `<SelectItem value="openclaw">OpenClaw</SelectItem>` on top of iterating `adapters`, which now includes OpenClaw post the catalog change. The dropdown rendered "OpenClaw" twice — once at the top, once at the bottom of the list. The literal was a pre-catalog artifact; removing it leaves a single OpenClaw entry sourced from the catalog. Routing into `handleOpenClawCreate` is unchanged because the value (`'openclaw'`) is identical either way. * fix(agents): always reconcile harness with gateway on list, just dedupe concurrent calls Memoizing the first successful reconcile meant new gateway agents (created via the legacy /claw/agents path or out-of-band CLI) never appeared in the harness until server restart. The Promise now serves as a concurrent-call dedupe only — cleared on settle — so every listAgents call picks up the current gateway state. Reconcile is one cheap idempotent CLI call. * chore(agents): remove dormant useAcpxForOpenClaw flag The flag was scaffolded in Step 6 but its routing effect was walked back the same day after it broke chat for legacy gateway-only agents. After the Step 9 backfill, every OpenClaw agent has a harness record and routes through the harness path purely from `source='agent-harness'` — no flag is consulted anywhere. Remove the dead storage item, hook, and stale comment. * refactor(agents): drop legacy /claw/agents/:id/history endpoint The harness /agents/:id/sessions/main/history endpoint replaced this once every OpenClaw agent got a harness record (Step 9 backfill). Routing is fully source-driven now, so the UI's useClawChatHistory hook is never enabled today — verified live: legacy URL returns 404, harness history hydrates correctly for the same agent. Removes the GET /claw/agents/:id/history route, OpenClawService's getAgentHistoryPage method plus its cursor/limit helpers and the history-only types it owned (BrowserOSOpenClawHistoryPageResponse, HistoryPageInput, normalizeHistoryLimit, encodeHistoryCursor, decodeHistoryCursor, jsonlEventsToHistoryItems), and the route + service tests that covered the dropped endpoint. OpenClawJsonlReader stays alive — still feeds /claw/dashboard, /claw/agents/:id/sessions, and the boot-time clawSession seed. Removing those is its own follow-up since the dashboard would need a harness-side replacement first. * feat(agents): wire image attachments through the harness ACP path Composer attachments now flow into the ACP `prompt` request as spec-compliant `image` content blocks alongside the user's text. End to end: composer → chatWithHarnessAgent({attachments}) → POST /agents/:id/chat {message, attachments} → parseChatBody decodes data: URLs to {mediaType, base64} → AgentHarnessService.send forwards → AcpxRuntime.send forwards → acpx startTurn({attachments}) → ACP image blocks UI no longer disables the attach button on harness agents — the gating was just a placeholder before this commit landed. Verified end to end with a 1×1 red PNG against a Claude harness agent: model replies "Red." correctly. OpenClaw's `acp` bridge still drops image content blocks upstream (verified by the same probe — Kimi-k2p5 reports "I don't see an image"). That's an upstream openclaw limitation, not a harness-side gap; Claude/Codex agents work as advertised today. * chore(openclaw): delete OpenClawJsonlReader and JSONL-backed routes * chore(openclaw): remove legacy /claw/agents/:id/chat and /queue routes * chore(agents): collapse chat panel to harness-only path * feat(agents): route OpenClaw image turns through the gateway HTTP client The OpenClaw `acp` bridge silently drops ACP `image` content blocks (verified during dogfood — model says "I don't see an image"). When the user attaches images to an OpenClaw agent, the harness now diverts that turn to the gateway's HTTP `/v1/chat/completions` endpoint, which accepts OpenAI-style `image_url` parts and forwards them natively to the provider. - New `OpenClawGatewayChatClient` translates an OpenAI streaming response into the same `AgentStreamEvent` shape the rest of the harness already consumes, so the chat panel renders identically whether a turn went through ACP or the gateway carve-out. - `AcpxRuntime.send` forks at the top: openclaw + any image attachment + a wired gateway client → `sendOpenclawViaGateway`. Other turns (text-only openclaw, claude, codex) take the existing ACP path unchanged. - The diverted path reads the prior turn history from the acpx session record so context is preserved, builds the OpenAI multimodal user message with text + image_url parts, and pumps the gateway SSE back to the caller through a tee that accumulates the assistant text. On natural completion, persists a synthetic user+assistant message pair to the acpx session record so reload shows the image turn in history. - Wired `OpenClawGatewayChatClient` into `AgentHarnessService` via `server.ts` (gateway port + token accessor, just like the existing `openclawGateway`). Persistence note: the acpx record requires User messages to carry an `id` and Agent messages to carry `tool_results` — without them the record fails to round-trip through `parseSessionRecord`. The persist helper now sets both. Limitation by design: image recognition only works if the OpenClaw agent's provider supports vision (e.g. Claude-via-OpenClaw, GPT-4o). The pipeline routes images correctly to the provider regardless; text-only providers like Kimi-k2p5 will reply "I don't see an image" because the model itself has no vision capability — that's a provider config issue, not a routing bug. The unit test asserts the image_url part is present in the OpenAI request the gateway client sends. The wider plan (background-resilient chat, queue, replay) remains in `plans/.../2026-04-29-1527-...-background-resilient-chat-and-image-uploads.md` as Phases 3–12; this commit ships only Phases 1–2. * feat(agents): validate inbound image attachments on /agents/:id/chat The harness chat body parser was accepting any mediaType and any dataUrl length. The composer enforces these caps client-side but the endpoint also serves direct curl/script callers, so the server has to defend itself. Restores the same caps the legacy /claw/agents/:id/chat parser had before it was deleted in the migration: - 10 attachments per message - 5 MB raw image bytes (≈ 6.7 MB once base64-encoded plus prefix) - PNG / JPEG / WebP / GIF only - Must start with `data:` Each violation returns 400 with a specific error message instead of silently dropping or forwarding the payload.	2026-04-29 16:37:03 +05:30
Nikhil	2ff5c12840	feat: add sidepanel ACP chat targets (#857 ) * feat(agent): add sidepanel chat target catalog * feat(agent): show acp models in sidepanel selector * feat(server): adapt acp events to ui message streams * feat(server): add sidepanel acp chat route * feat(agent): route sidepanel chat through acp targets * chore: self-review fixes * fix: address review feedback for PR #857	2026-04-28 18:23:38 -07:00
Nikhil Sonti	3c629c5929	feat: tool approvals, governance dashboard, and execution history - Add tool approval system with per-category approval configuration - Build unified Governance dashboard (renamed from Admin) with pending approvals view and execution audit log - Move execution history tracking into the app shell - Extract buildChatRequestBody helper and add newtab system prompt - Add approval config change detection for mid-conversation rebuilds	2026-04-13 09:43:30 -07:00
Nikhil	77dcd37000	feat: ACLs and support enforcing (#583 ) * feat: add ACL rules for per-site element-level agent restrictions Implement Access Control List (ACL) rules that let users block the agent from interacting with specific elements on specific websites. Rules are defined in a new Settings > ACL Rules page and enforced server-side in executeTool() before any input tool handler runs. - Shared ACL types and site pattern matching (packages/shared) - Extension storage, settings UI with rule cards and add dialog - Server-side guard in executeTool() checking tool+page+element - Browser class extensions for element property resolution via CDP - Visual overlay injection (red "BLOCKED" mask) via Runtime.evaluate - Rules transported in chat request body alongside declinedApps * fix: address review comments for ACL rules - Add selector-to-property matching in matchesElement (tag, id, class) - Remove scroll from guarded tools set (read-like action) * fix: ACL site pattern matching fails on multi-segment URL paths The glob-to-regex conversion used [^/]* for wildcard () which only matches a single path segment. ".amazon.com/" failed to match "www.amazon.com/cart/smart-wagon" because the trailing couldn't cross the slash between "cart" and "smart-wagon". Fix: Split URL matching into hostname vs path parts. Path wildcards now use .* to match across slashes. Also add simple domain matching so users can just type "amazon.com" instead of ".amazon.com/". * fix: wire up ACL overlay injection after take_snapshot applyAclOverlays was defined but never called. Now triggers after take_snapshot completes on pages matching ACL rules, so the agent sees red "BLOCKED" overlays on restricted elements. * refactor: rework 0326-acl_rules based on feedback	2026-04-13 09:42:45 -07:00
Felarof	df7873562d	Revert Kimi partnership UI, restore daily limit survey (#663 ) * docs: add uBlock Origin install info to getting started and ad-blocking pages Chrome dropped support for the full uBlock Origin extension — highlight that BrowserOS brings it back and make it easy to install from both the getting started guide and the dedicated ad-blocking page. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: revert Kimi partnership UI, restore daily limit survey Remove Kimi/Moonshot AI partnership branding from the rate limit banner, provider card, provider templates, and LLM hub. Restore the original survey CTA on daily limit errors. Moonshot AI remains as a regular provider template without the "Recommended" badge. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: address Greptile review comments - Guard survey CTA with !isCreditsExhausted to avoid showing it for credits-exhausted users who already see "View Usage & Billing" - Remove dead kimi-launch feature flag files (kimi-launch.ts, useKimiLaunch.ts) - Remove unused KIMI_RATE_LIMIT analytics events - Remove VITE_PUBLIC_KIMI_LAUNCH from env schema and .env.example Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 16:39:00 -07:00
shivammittal274	81350c0d7f	feat: replace model picker with shadcn Combobox + fuse.js fuzzy search (#617 ) The model picker in NewProviderDialog rendered inline, causing dialog resizing and lacked keyboard navigation. Replace it with a Popover + Command (shadcn Combobox) pattern and add fuse.js for fuzzy search. - Replace custom ModelPickerList with Popover + Command dropdown - Add fuse.js for fuzzy model search (replaces string.includes) - Add MODEL_SELECTED_EVENT and AI_PROVIDER_UPDATED_EVENT analytics - Enrich PROVIDER_SELECTED_EVENT with model_id in chat sessions	2026-03-30 16:38:21 +05:30
Dani Akash	cee318a40b	fix: improve chat history freshness and reduce query payload (#598 ) * fix: add refresh indicator to chat history when fetching latest conversations Show a non-blocking "Fetching latest conversations" indicator at the top of the history list while the cached data is being refreshed. Users can still interact with the cached conversation list during the refresh. * perf: reduce chat history query payload — fetch last 2 messages instead of 5 The conversation list only displays the last user message as a preview. Fetching 5 messages per conversation was wasteful — each message contains the full UIMessage object (tool calls, reasoning, etc.) multiplied by 50 conversations per page. Reduced to last 2 which is sufficient to find the last user message in a user→assistant exchange. * perf: use first+DESC instead of last+ASC to push LIMIT down to SQL PostGraphile's `last: N` doesn't map to SQL LIMIT — it uses a padded LIMIT 10 and slices in application code. Changing to `first: 2` with ORDER_INDEX_DESC generates a true SQL LIMIT 2, reducing rows scanned from 500 to 100 per page (50 conversations × 2 vs 10 messages each). No UX impact — extractLastUserMessage() filters by role regardless of message order. * chore: update react query packages * feat: replace localforage with idb-keyval	2026-03-27 19:49:47 +05:30
Dani Akash	aacb47f7ee	feat: isolate new-tab agent navigation from origin tab (#593 ) * feat: isolate new-tab agent navigation from origin tab Add origin-aware navigation isolation so the agent never navigates away from the new-tab chat UI. This is a two-layer defense: 1. Prompt adaptation: When origin is 'newtab', the system prompt's execution and tool-selection sections are rewritten to prohibit navigating the active tab and default all lookups to new_page. 2. Tool-level guards: navigate_page and close_page reject attempts to act on the origin tab when in newtab mode, returning an error that teaches the agent to self-correct. The client now sends an `origin` field ('sidepanel' \| 'newtab') instead of injecting a soft NEWTAB_SYSTEM_PROMPT that LLMs could ignore. Backwards compatible — defaults to 'sidepanel'. Closes TKT-592, addresses TKT-564 * test: add newtab origin navigation guard tests - 14 new prompt tests verifying the system prompt adapts correctly for newtab vs sidepanel origin (execution rules, tool selection table, absence of conflicting single-tab guidance) - 6 new integration tests for navigate_page and close_page guards: rejects origin tab in newtab mode, allows non-origin tabs, allows all tabs in sidepanel mode, backwards compatible with no session	2026-03-27 12:06:32 +05:30
Dani Akash	0f193055c7	fix: broaden connection error detection for main page and sidepanel (#563 ) * fix: broaden connection error detection for main page and sidepanel The connection error check required both "Failed to fetch" AND "127.0.0.1" in the error message. On the main page, the browser only produces "Failed to fetch" without the IP, so users saw a generic "Something went wrong" instead of the troubleshooting link. Broaden detection to also match "localhost" and bare "Failed to fetch" errors that don't contain an external URL. Also pass providerType in NewTabChat so provider-specific errors render correctly. Closes #526 * fix: simplify connection error detection All chat requests go through the local BrowserOS agent server, so any "Failed to fetch" error is always a local connection issue. Remove the unnecessary 127.0.0.1/localhost/URL checks. * fix: pass providerType to agentUrlError ChatError instances	2026-03-26 20:55:40 +05:30
shivammittal274	c8204efab6	feat: improve rate limit UX, usage page, and provider selector (#544 ) * feat: improve rate limit UX, usage page, and provider selector - Show "Add your own provider for unlimited usage" CTA when BrowserOS credits are exhausted or daily limit is reached - Fix credit exhaustion detection to match actual error message - Improve Usage page: remove disabled Add Credits button, add "Coming soon" badge, add "Want unlimited usage?" section linking to providers - Add "+ Add Provider" button at bottom of chat provider selector dropdown * fix: use asChild pattern for Button+anchor in usage page Replace nested <a><Button> (invalid HTML) with Button asChild pattern per shadcn/ui convention.	2026-03-24 18:01:42 +05:30
Dani Akash	fe257cd8d1	feat: only parse browseros provider errors (#542 )	2026-03-24 14:43:05 +05:30
shivammittal274	8548bcf50a	feat: credit-based tracking for BrowserOS provider (#489 ) * feat: add credit-based tracking for BrowserOS provider Send X-BrowserOS-ID header on all LLM requests through the BrowserOS gateway for per-installation credit tracking. Handle 429 CREDITS_EXHAUSTED as non-retryable. Add GET/PUT /credits endpoints to check and manage credit balance. * docs: add credits tracking UI design Design for showing credit balance in side panel chat header (color-coded badge) and a dedicated Usage & Billing settings page. Credits refresh after each completed message turn or on exhaustion error. * docs: add credits tracking UI implementation plan 8-task plan covering useCredits hook, CreditBadge component, ChatHeader integration, message completion refresh, ChatError CREDITS_EXHAUSTED handling, Usage & Billing settings page, and route/sidebar registration. * feat: add useCredits React Query hook * feat: add CreditBadge component with color thresholds * feat: show credit badge in chat header for BrowserOS provider * feat: refresh credits after chat message completion and on error * feat: handle CREDITS_EXHAUSTED error in chat * feat: add Usage & Billing settings page * feat: register usage page route and sidebar entry * fix: lint and formatting fixes for credit tracking UI * fix: separate credits exhausted from Kimi rate limit in ChatError, redesign Usage page * chore: remove PUT /credits endpoint and setCredits function * fix: extract shared credit colors, add error state to UsagePage, use dailyLimit from gateway * fix: make dailyLimit required in CreditsInfo (gateway always returns it) * feat: gate credits UI behind CREDITS_SUPPORT feature flag (server >= 0.0.78)	2026-03-20 22:49:00 +05:30
Dani Akash	2b4fdf1aad	feat: improved multi tab agent workflow (#507 ) * feat: updated multitab workflow * fix: updated prompt with fix for test cases * fix: active agent glow * fix: review comments	2026-03-20 18:31:36 +05:30
shivammittal274	720baaed3e	feat: add GitHub Copilot as OAuth LLM provider (#500 ) * feat: add GitHub Copilot as OAuth-based LLM provider Add GitHub Copilot as a second OAuth provider using the Device Code flow (RFC 8628). Users authenticate via github.com/login/device, and the server polls for token completion. Supports 25+ models through a single Copilot subscription. Key changes: - Device Code OAuth flow in token manager (poll with safety margin) - Custom fetch wrapper injecting Copilot headers + vision detection - Provider factory using createOpenAICompatible for Chat Completions API - Extension UI with template card, auto-create on auth, and disconnect * fix: address PR review comments for GitHub Copilot OAuth - Validate device code response for error fields (GitHub can return 200 with error payload) - Store empty refreshToken instead of access token for GitHub tokens - Add closeButton to Toaster for dismissing device code toast * fix: add github-copilot to agent provider factory The chat route uses a separate provider-factory.ts (agent layer) from the test-provider route (llm/provider.ts). Added createGitHubCopilotFactory to the agent factory so chat works with GitHub Copilot. * fix: add github-copilot to provider icons, models, and dialog - Add Github icon from lucide-react to providerIcons map - Add 8 Copilot models (GPT-4o, Claude, Gemini, Grok) to models.ts - Add github-copilot to NewProviderDialog zod enum, validation skip, canTest check, and OAuth credential message * fix: reorder copilot models with free-tier models first Put models available on Copilot Free at the top (gpt-4o, gpt-4.1, gpt-5-mini, claude-haiku-4.5, grok-code-fast-1), followed by premium models that require paid Copilot subscription. * fix: set correct 64K context window for Copilot models Copilot API enforces a 64K input token limit regardless of the underlying model's native context window. Updated all model entries and the default template to 64000 so compaction triggers correctly. * fix: use actual per-model prompt limits from Copilot /models API Queried api.githubcopilot.com/models for real max_prompt_tokens values. GPT-4o/4.1 have 64K, Claude/gpt-5-mini have 128K, GPT-5.x have 272K. Also updated model list to match what's actually available on the API (e.g. claude-sonnet-4.6 instead of 4.5, added gpt-5.4/5.2-codex). * feat: resize images for Copilot using VS Code's algorithm Large screenshots cause 413 errors on Copilot's API. Resize images following VS Code's approach: max 2048px longest side, 768px shortest side, re-encode as JPEG at 75% quality. Uses sharp for server-side image processing. * fix: address all Greptile P1 review comments - Add .catch() on fire-and-forget pollDeviceCode to prevent unhandled rejection crashes (Node 15+) - Add deduplication guard (activeDeviceFlows Set) to prevent concurrent device code flows for the same provider - Add runtime validation of server response in frontend before calling window.open() and showing toast - Remove dead GITHUB_DEVICE_VERIFICATION constant from urls.ts * fix: upgrade biome to 2.4.8, fix all lint errors, and address review bugs - Upgrade biome from 2.4.5 to 2.4.8 (matches CI) and migrate configs - Fix image resize: only re-encode when dimensions actually change - Fix device code polling: retry on transient network errors instead of aborting - Allow restarting device code flow (clear old flow instead of throwing 500) - Fix pre-existing noNonNullAssertion and noExplicitAny lint errors globally * fix: address Greptile P2 review — image resize and config guard - Fix early-return guard: check max/min sides against their respective limits (MAX_LONG_SIDE/MAX_SHORT_SIDE) instead of both against SHORT - Preserve PNG alpha: detect hasAlpha and keep PNG format instead of unconditionally converting to lossy JPEG - Keep browserosId guard in resolveGitHubCopilotConfig consistent with ChatGPT Pro pattern (safety check that caller context is valid) * feat: update Copilot models to full list from pricing page, default to gpt-5-mini Added all 23 models from GitHub Copilot pricing page. Ordered with free-tier models first (gpt-5-mini, claude-haiku-4.5), then premium. Changed default from gpt-4o to gpt-5-mini since it's unlimited on Pro plan and has 128K context (vs gpt-4o's 64K limit).	2026-03-20 02:33:09 +05:30
Dani Akash	5bb6143373	feat: display selected text from page in sidepanel (#496 ) * feat: select text and pass to sidepanel * fix: lint issues * fix: persist selection across tabs * fix: review comments * fix: change when the selection is cleared * feat: sanitize url	2026-03-19 20:21:31 +05:30
Dani Akash	d965698905	fix: biome & tsc setup across repo (#493 ) * fix: biome lint issues * fix: code quality workflow * fix: all lint issues * chore: test lefthook pre-commit hook * chore: test lefthook with agent file * chore: revert test comment from lefthook verification * feat: setup tsgo for typechecking agent * fix: typecheck cli command * fix: early return to prevent errors	2026-03-19 18:18:24 +05:30
Dani Akash	1b88ade021	feat: updated homepage chat (#481 ) * feat: updated chat ui from homepage * fix: vertical scroll * fix: horizontal scroll issue * fix: lint issues * fix: header width * fix: message input from home to chat * feat: created sidebar header support in new tab chat * fix: remove history from new tab chat * fix: remove the shared element transition * fix: lint issues * fix: review comments * fix: defer the sendMessage callback * fix: all code concerns * fix: preserve state of chat on homepage * fix: review comments	2026-03-19 15:24:05 +05:30
shivammittal274	46a8326140	feat: add ChatGPT Pro OAuth as LLM provider (#476 ) * feat: add ChatGPT Pro OAuth as LLM provider Adds OAuth 2.0 (Authorization Code + PKCE) flow so users can authenticate with their ChatGPT Pro subscription to power BrowserOS's agent, matching the pattern used by Codex CLI, OpenCode, and Pi. Server: - OAuth token lifecycle (PKCE, exchange, refresh, SQLite storage) - Dedicated callback server on port 1455 (Codex client ID registration) - Codex fetch wrapper routing API calls to chatgpt.com/backend-api - Config resolution + provider factories for all code paths (chat, test, refine) Extension: - ChatGPT Pro template card with OAuth flow trigger - Status polling hook + auto-create provider on auth success - Model list with Codex-supported models (gpt-5.x-codex family) * fix: address Greptile PR review comments - Wire OAuth callback server stop handle into onShutdown (P1: port 1455 leak) - Guard against missing refresh token + clear stale tokens on failed refresh (P1) - Add logger.warn to silent catch in codex-fetch body mutation - Document JWT trust assumption in parseAccessTokenClaims - Source model ID from provider template instead of hard-coding * simplify: remove unnecessary OAuth shutdown wiring and useCallback - Revert OAuthHandle interface — callback server port releases on process exit - Remove stopCallbackServer from shutdown flow (dead code) - Remove all useCallback from useOAuthStatus per CLAUDE.md guidance * style: add readonly modifiers and braces per TS style guide * docs: add E2E test screenshots for ChatGPT Pro OAuth * fix: strip item IDs from Codex requests to fix multi-turn conversations * fix: preserve function_call_output IDs in Codex requests * fix: resolve Codex store=false + tool-use incompatibility - Pass providerOptions { openai: { store: false } } to ToolLoopAgent so the AI SDK inlines content instead of using item_reference - Strip item IDs and previous_response_id in codex-fetch (safety net) - Use .responses() model (Codex only speaks Responses API format) * fix: remove non-Codex model gpt-5.2 from chatgpt-pro model list * fix: strip unsupported Codex params and update model list - Strip temperature, max_tokens, top_p from Codex requests (unsupported) - Add all available Codex models including gpt-5.4, gpt-5.2, gpt-5.1 * chore: remove screenshots containing email * feat: enable reasoning events for ChatGPT Pro Codex models * chore: set reasoning effort to high for ChatGPT Pro * feat: add configurable reasoning effort and summary for ChatGPT Pro - Add reasoningEffort (none/low/medium/high) and reasoningSummary (auto/concise/detailed) dropdowns in the Edit Provider dialog - Pass through extension → chat request → agent config → providerOptions - Defaults: effort=high, summary=auto * fix: strip max_output_tokens from Codex requests (fixes compaction) * fix: address Greptile P1 issues - Fix default model fallback: gpt-4o → gpt-5.3-codex (Codex endpoint) - Clear stale tokens on refresh failure (prevents infinite retry loop) - Only auto-create provider after explicit OAuth flow, not on page load - Add catch block to auto-create effect with error toast	2026-03-18 22:07:43 +05:30
Dani Akash	2a6848bc1d	feat: improved system prompt (#466 ) * feat: added ai-sdk dev tools * feat: new system prompt section * feat: tests to maintain prompt integrity * feat: update mcp sync to use react query * fix: refetch logic for sync * chore: remove limits on fetching integrations * fix: refetch integrations on delete * fix: review comment * chore: update tests * fix: improved memory classification * fix: lint issues * fix: core memory prompts * fix: handle scenario where soul file is empty	2026-03-17 19:01:10 +05:30
shivammittal274	e67c17a0f8	feat: add voice input to agent chat sidebar (#467 ) * feat: add voice input to agent chat sidebar Allow users to record voice and transcribe to text in the chat input. Mic button shows when input is empty, waveform visualizer during recording, transcription via OpenAI (llm.browseros.com/api/transcribe). - Extract shared useVoiceInput hook to lib/voice/ - Time-domain waveform bars that bounce per-frequency-band - Bar height capped to fit input container - Analytics events for recording lifecycle * fix: address review — add fetch timeout, await stopRecording, deduplicate VoiceInputState - Add AbortSignal.timeout(30s) to transcription fetch - Await stopRecording() and track analytics after completion - Export VoiceInputState from useVoiceInput, import in consumers * fix: await startRecording before tracking, narrow SurveyChat effect deps - Await startRecording() so analytics only fires after mic permission granted - Narrow SurveyChat useEffect dependency from [voice] to [voice.transcript, voice.isTranscribing] * fix: analytics only tracks on success, clean up stream on failure, type API response - startRecording returns boolean; track(RECORDING_STARTED) only fires on success - Catch block cleans up MediaStream tracks and AudioContext on partial failure - Type transcription API response with TranscribeResponse interface * fix: keep mic button always visible alongside send button Mic and send are now separate buttons, both always visible. Mic is disabled while AI is streaming. Send is disabled during recording/transcribing. Buttons are no longer absolutely positioned inside the textarea — they sit beside it in the flex row. * fix: keep mic button always visible inside input alongside send Both mic and send buttons are always visible inside the input field, positioned on the right side (ChatGPT-style). Mic is disabled while AI is streaming. Send is disabled during recording/transcribing. * fix: remove unreachable CSS branch in recording waveform div	2026-03-17 18:28:19 +05:30
shivammittal274	41c9b1547c	feat: add per-task LLM provider selection for scheduled tasks (#450 ) * feat: add per-task LLM provider selection for scheduled tasks Allow users to choose which AI provider a scheduled task runs with, using the same ChatProviderSelector component from the new-tab page. Falls back to the global default provider when none is selected or if the selected provider has been deleted. * fix: lint issues * chore: updated to latest schema.graphql file --------- Co-authored-by: Dani Akash <DaniAkash@users.noreply.github.com>	2026-03-16 18:03:21 +05:30
Felarof	4bee76253d	fix: prevent undefined provider in chat requests on fresh install (#442 ) * fix: fallback to default BrowserOS provider when provider is null When the extension first loads, provider config is loaded async from storage. If a chat request fires before loading completes (race condition), provider is null and the server receives provider: undefined, causing a Zod validation error. This adds a fallback to createDefaultBrowserOSProvider() in both chat paths (sidepanel and scheduled tasks) so provider.type is always defined. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: fallback to first provider when default provider ID is stale When defaultProviderId in storage doesn't match any loaded provider (e.g. after Kimi/Moonshot rollout), selectedProvider was null causing provider: undefined in chat requests. Now falls back to providers[0]. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: repair stale defaultProviderId in storage on load When the stored default provider ID doesn't match any loaded provider, write back the corrected ID (providers[0].id) to storage so it doesn't silently persist across sessions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-14 09:05:27 -07:00
Felarof	95c855a091	feat: replace rate limit CTAs with Kimi/Moonshot partnership links (#437 ) * feat: replace rate limit CTAs with Kimi/Moonshot partnership links Comment out old "Learn more" and "take a quick survey" links on the daily limit error banner. Replace with Kimi API key docs link and direct Moonshot AI platform link for conversion tracking. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: remove partnership tagline from rate limit banner Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 12:45:41 -07:00
Dani Akash	290ee91a8b	Add 'packages/browseros-agent/' from commit '90bd4be3008285bf3825aad3702aff98f872671a' git-subtree-dir: packages/browseros-agent git-subtree-mainline: `8f148d0918` git-subtree-split: `90bd4be300`	2026-03-13 21:22:09 +05:30

25 Commits