BrowserOS

mirror of https://github.com/browseros-ai/BrowserOS.git synced 2026-05-18 19:16:22 +00:00

Author	SHA1	Message	Date
shivammittal274	f13b375a95	feat: update Copilot models to full list from pricing page, default to gpt-5-mini Added all 23 models from GitHub Copilot pricing page. Ordered with free-tier models first (gpt-5-mini, claude-haiku-4.5), then premium. Changed default from gpt-4o to gpt-5-mini since it's unlimited on Pro plan and has 128K context (vs gpt-4o's 64K limit).	2026-03-20 02:30:46 +05:30
shivammittal274	ac7cb1ebaf	fix: address Greptile P2 review — image resize and config guard - Fix early-return guard: check max/min sides against their respective limits (MAX_LONG_SIDE/MAX_SHORT_SIDE) instead of both against SHORT - Preserve PNG alpha: detect hasAlpha and keep PNG format instead of unconditionally converting to lossy JPEG - Keep browserosId guard in resolveGitHubCopilotConfig consistent with ChatGPT Pro pattern (safety check that caller context is valid)	2026-03-20 02:16:29 +05:30
shivammittal274	9b25c0eb53	fix: upgrade biome to 2.4.8, fix all lint errors, and address review bugs - Upgrade biome from 2.4.5 to 2.4.8 (matches CI) and migrate configs - Fix image resize: only re-encode when dimensions actually change - Fix device code polling: retry on transient network errors instead of aborting - Allow restarting device code flow (clear old flow instead of throwing 500) - Fix pre-existing noNonNullAssertion and noExplicitAny lint errors globally	2026-03-20 02:12:53 +05:30
shivammittal274	1fbcf091f6	fix: address all Greptile P1 review comments - Add .catch() on fire-and-forget pollDeviceCode to prevent unhandled rejection crashes (Node 15+) - Add deduplication guard (activeDeviceFlows Set) to prevent concurrent device code flows for the same provider - Add runtime validation of server response in frontend before calling window.open() and showing toast - Remove dead GITHUB_DEVICE_VERIFICATION constant from urls.ts	2026-03-20 01:51:07 +05:30
shivammittal274	3292903c88	feat: resize images for Copilot using VS Code's algorithm Large screenshots cause 413 errors on Copilot's API. Resize images following VS Code's approach: max 2048px longest side, 768px shortest side, re-encode as JPEG at 75% quality. Uses sharp for server-side image processing.	2026-03-20 01:47:57 +05:30
shivammittal274	dcd5d617fe	fix: use actual per-model prompt limits from Copilot /models API Queried api.githubcopilot.com/models for real max_prompt_tokens values. GPT-4o/4.1 have 64K, Claude/gpt-5-mini have 128K, GPT-5.x have 272K. Also updated model list to match what's actually available on the API (e.g. claude-sonnet-4.6 instead of 4.5, added gpt-5.4/5.2-codex).	2026-03-20 01:29:25 +05:30
shivammittal274	844b710b77	fix: set correct 64K context window for Copilot models Copilot API enforces a 64K input token limit regardless of the underlying model's native context window. Updated all model entries and the default template to 64000 so compaction triggers correctly.	2026-03-20 01:20:11 +05:30
shivammittal274	bf98e5bbe8	fix: reorder copilot models with free-tier models first Put models available on Copilot Free at the top (gpt-4o, gpt-4.1, gpt-5-mini, claude-haiku-4.5, grok-code-fast-1), followed by premium models that require paid Copilot subscription.	2026-03-20 01:01:53 +05:30
shivammittal274	a180946f70	fix: add github-copilot to provider icons, models, and dialog - Add Github icon from lucide-react to providerIcons map - Add 8 Copilot models (GPT-4o, Claude, Gemini, Grok) to models.ts - Add github-copilot to NewProviderDialog zod enum, validation skip, canTest check, and OAuth credential message	2026-03-20 00:58:41 +05:30
shivammittal274	8a649bdd07	fix: add github-copilot to agent provider factory The chat route uses a separate provider-factory.ts (agent layer) from the test-provider route (llm/provider.ts). Added createGitHubCopilotFactory to the agent factory so chat works with GitHub Copilot.	2026-03-20 00:52:31 +05:30
shivammittal274	e9f1b0ce89	fix: address PR review comments for GitHub Copilot OAuth - Validate device code response for error fields (GitHub can return 200 with error payload) - Store empty refreshToken instead of access token for GitHub tokens - Add closeButton to Toaster for dismissing device code toast	2026-03-20 00:49:07 +05:30
shivammittal274	699b5a7cd6	feat: add GitHub Copilot as OAuth-based LLM provider Add GitHub Copilot as a second OAuth provider using the Device Code flow (RFC 8628). Users authenticate via github.com/login/device, and the server polls for token completion. Supports 25+ models through a single Copilot subscription. Key changes: - Device Code OAuth flow in token manager (poll with safety margin) - Custom fetch wrapper injecting Copilot headers + vision detection - Provider factory using createOpenAICompatible for Chat Completions API - Extension UI with template card, auto-create on auth, and disconnect	2026-03-20 00:36:51 +05:30
shivammittal274	cee9c764b1	fix(skills): read-only view mode for built-in skills (#494 ) * fix(skills): read-only view mode for built-in skills - SkillCard shows Eye icon + "View" for built-in, Pencil + "Edit" for user - SkillDialog in read-only mode: disabled fields, no toolbar on markdown editor, "View Skill" title, "Close" button, no "Update Skill" - Hide tip section in read-only mode * fix(skills): use react-markdown for read-only skill view Replace MDXEditor with react-markdown for viewing built-in skills. MDXEditor chokes on code fences, angle brackets, and image syntax causing content truncation. react-markdown handles standard markdown correctly with no rendering issues.	2026-03-19 23:48:51 +05:30
Nikhil	7bdeeb85d5	fix: revert: convert settings to popup dialog (#477 ) (#498 ) * Revert "feat: convert settings to popup dialog (#477)" This reverts commit `42aa0ff1ef`. * fix: address review feedback for PR #498 - Remove erroneous SETTINGS_PAGE_VIEWED_EVENT tracking from SidebarLayout (was firing on every non-settings page navigation) - Fix mobile settings sidebar not closing on route change by merging setMobileOpen(false) into the pathname-dependent analytics useEffect Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 11:13:14 -07:00
Dani Akash	19069cb9c4	fix: newtab layout (#497 )	2026-03-19 20:40:38 +05:30
Dani Akash	5bb6143373	feat: display selected text from page in sidepanel (#496 ) * feat: select text and pass to sidepanel * fix: lint issues * fix: persist selection across tabs * fix: review comments * fix: change when the selection is cleared * feat: sanitize url	2026-03-19 20:21:31 +05:30
Dani Akash	f4d4b73a24	fix: improved memory tools (#495 ) * fix: new prompt update tool * fix: memory search tool * fix: all review comments * chore: remove dead code	2026-03-19 19:01:25 +05:30
Dani Akash	d965698905	fix: biome & tsc setup across repo (#493 ) * fix: biome lint issues * fix: code quality workflow * fix: all lint issues * chore: test lefthook pre-commit hook * chore: test lefthook with agent file * chore: revert test comment from lefthook verification * feat: setup tsgo for typechecking agent * fix: typecheck cli command * fix: early return to prevent errors	2026-03-19 18:18:24 +05:30
shivammittal274	50b2f45590	fix(skills): UI section separation and fix find-alternatives rendering (#492 ) * fix(skills): UI section separation and fix find-alternatives rendering - Split skills page into "My Skills" (user) and "BrowserOS Skills" (built-in) sections - Fix find-alternatives SKILL.md — replace angle bracket placeholders with curly braces to prevent MDXEditor from parsing them as JSX and rendering empty content * fix(skills): bump find-alternatives to v1.1 for CDN sync	2026-03-19 17:38:28 +05:30
Dani Akash	1b88ade021	feat: updated homepage chat (#481 ) * feat: updated chat ui from homepage * fix: vertical scroll * fix: horizontal scroll issue * fix: lint issues * fix: header width * fix: message input from home to chat * feat: created sidebar header support in new tab chat * fix: remove history from new tab chat * fix: remove the shared element transition * fix: lint issues * fix: review comments * fix: defer the sendMessage callback * fix: all code concerns * fix: preserve state of chat on homepage * fix: review comments	2026-03-19 15:24:05 +05:30
shivammittal274	079a254fa4	fix(skills): separate built-in and user skills into distinct directories (#487 ) * fix(skills): separate built-in and user skills into distinct directories - Move built-in skills to ~/.browseros/skills/builtin/, user skills stay in root - Unify seed + sync into single syncBuiltinSkills() function, delete seed.ts - Preserve user's enabled/disabled state during remote sync version updates - Add catalog reconciliation — remove built-in skills dropped from remote catalog - Fallback to bundled defaults per-skill when remote sync fails - One-time migration moves existing default skills from root to builtin/ - Add builtIn field to SkillMeta, determined by directory (not metadata) - UI shows "Built-in" badge, hides delete button for built-in skills - Reject deletion of built-in skills in service layer - Check both dirs for ID collision on skill creation * fix(skills): address review — dedup by id, guard applyEnabled regex - loader.ts: deduplication now keys on skill.id (directory slug) not skill.name (display name), preventing silent drops on name collision - remote-sync.ts: applyEnabled checks if regex matched before writing, logs warning if remote content lacks an enabled field * fix(skills): reconciliation preserves bundled defaults, delete returns 403 - reconcileRemovedSkills now keeps DEFAULT_SKILLS IDs in the safe set, preventing delete-then-reinstall cycle that lost enabled:false state - DELETE /skills/:id returns 403 for built-in skills instead of 500 * refactor(skills): simplify syncBuiltinSkills to single clean pass Build content map (bundled + remote), iterate once, preserve enabled, reconcile deletions. Removes 7 helper functions, 70 lines of code. * refactor(skills): extract syncOneSkill, patch content before writing - syncBuiltinSkills is now 15 lines: build map, iterate, clean up - syncOneSkill: flat, patches enabled state before writing (single write) - setEnabled: pure function for content patching - removeObsoleteSkills: extracted from inline block	2026-03-19 13:35:47 +05:30
Felarof	42aa0ff1ef	feat: convert settings to popup dialog (#477 ) * feat: convert settings page to popup dialog, move workflows to main nav Replace the dedicated settings page layout (SettingsSidebarLayout) with a modal dialog (SettingsDialog) that opens on top of the current page. Settings are now accessible via a dialog triggered from the main sidebar, eliminating the confusing dual-sidebar navigation pattern. - Create SettingsDialog with tabbed left panel and content area - Move Workflows into main sidebar navigation (feature-gated) - Remove /settings/* routes (except /settings/survey) - Delete SettingsSidebarLayout and SettingsSidebar components - Update backward compatibility redirects Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: setup new urls for the dialog box * fix: dialog close button * fix: settings analytics * fix: address review comments --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Dani Akash <DaniAkash@users.noreply.github.com>	2026-03-18 23:26:13 +05:30
shivammittal274	4000f094f6	Feat/chatgpt pro polish (#484 ) * fix: ChatGPT Pro UI polish — fix undefined display and add icon - Fix "gpt-5.3-codex · undefined" — hide baseUrl when not set - Add OpenAI icon for chatgpt-pro provider in icon map * chore: rename ChatGPT Pro to ChatGPT Plus/Pro (supports both plans) * chore: remove accidentally committed files	2026-03-18 22:51:22 +05:30
shivammittal274	151be81cee	fix: ChatGPT Pro UI polish — fix undefined display and add icon (#483 ) - Fix "gpt-5.3-codex · undefined" — hide baseUrl when not set - Add OpenAI icon for chatgpt-pro provider in icon map	2026-03-18 22:23:28 +05:30
shivammittal274	46a8326140	feat: add ChatGPT Pro OAuth as LLM provider (#476 ) * feat: add ChatGPT Pro OAuth as LLM provider Adds OAuth 2.0 (Authorization Code + PKCE) flow so users can authenticate with their ChatGPT Pro subscription to power BrowserOS's agent, matching the pattern used by Codex CLI, OpenCode, and Pi. Server: - OAuth token lifecycle (PKCE, exchange, refresh, SQLite storage) - Dedicated callback server on port 1455 (Codex client ID registration) - Codex fetch wrapper routing API calls to chatgpt.com/backend-api - Config resolution + provider factories for all code paths (chat, test, refine) Extension: - ChatGPT Pro template card with OAuth flow trigger - Status polling hook + auto-create provider on auth success - Model list with Codex-supported models (gpt-5.x-codex family) * fix: address Greptile PR review comments - Wire OAuth callback server stop handle into onShutdown (P1: port 1455 leak) - Guard against missing refresh token + clear stale tokens on failed refresh (P1) - Add logger.warn to silent catch in codex-fetch body mutation - Document JWT trust assumption in parseAccessTokenClaims - Source model ID from provider template instead of hard-coding * simplify: remove unnecessary OAuth shutdown wiring and useCallback - Revert OAuthHandle interface — callback server port releases on process exit - Remove stopCallbackServer from shutdown flow (dead code) - Remove all useCallback from useOAuthStatus per CLAUDE.md guidance * style: add readonly modifiers and braces per TS style guide * docs: add E2E test screenshots for ChatGPT Pro OAuth * fix: strip item IDs from Codex requests to fix multi-turn conversations * fix: preserve function_call_output IDs in Codex requests * fix: resolve Codex store=false + tool-use incompatibility - Pass providerOptions { openai: { store: false } } to ToolLoopAgent so the AI SDK inlines content instead of using item_reference - Strip item IDs and previous_response_id in codex-fetch (safety net) - Use .responses() model (Codex only speaks Responses API format) * fix: remove non-Codex model gpt-5.2 from chatgpt-pro model list * fix: strip unsupported Codex params and update model list - Strip temperature, max_tokens, top_p from Codex requests (unsupported) - Add all available Codex models including gpt-5.4, gpt-5.2, gpt-5.1 * chore: remove screenshots containing email * feat: enable reasoning events for ChatGPT Pro Codex models * chore: set reasoning effort to high for ChatGPT Pro * feat: add configurable reasoning effort and summary for ChatGPT Pro - Add reasoningEffort (none/low/medium/high) and reasoningSummary (auto/concise/detailed) dropdowns in the Edit Provider dialog - Pass through extension → chat request → agent config → providerOptions - Defaults: effort=high, summary=auto * fix: strip max_output_tokens from Codex requests (fixes compaction) * fix: address Greptile P1 issues - Fix default model fallback: gpt-4o → gpt-5.3-codex (Codex endpoint) - Clear stale tokens on refresh failure (prevents infinite retry loop) - Only auto-create provider after explicit OAuth flow, not on page load - Add catch block to auto-create effect with error toast	2026-03-18 22:07:43 +05:30
Dani Akash	4b18723a21	fix: undo shortcut in rewrite button (#472 ) * fix: undo shortcut in rewrite button * fix: address reviews	2026-03-18 07:04:48 +05:30
Nikhil	4909927c03	chore: bump PATCH and OFFSET (#479 )	2026-03-17 17:41:45 -07:00
Nikhil	22c5e85707	chore: bump server version (#478 )	2026-03-17 17:12:23 -07:00
shivammittal274	59b00a6837	feat: remote skill download and auto-sync (#468 ) * feat: add remote skill download and auto-sync Download default skills from remote catalog on first setup with bundled fallback when offline. Background sync every 45 minutes checks for new/updated skills without overwriting user-customized ones. Tracks installed defaults via content hashes in a local manifest file. * feat: make skills catalog URL configurable and add generation script Add SKILLS_CATALOG_URL env var (following CODEGEN_SERVICE_URL pattern) with fallback to the default constant. Add script to generate catalog.json from bundled defaults for static hosting. * feat: add R2 upload script and use cdn.browseros.com for catalog URL Add upload-skills-catalog.ts that generates and uploads catalog.json to Cloudflare R2 (same infra as existing build artifacts). Update default catalog URL to cdn.browseros.com/skills/v1/catalog.json. * test: add E2E tests for remote skill sync against live CDN * fix: address code review findings — security, validation, DRY - Add path traversal protection via safeSkillDir in writeSkillFile and readSkillContent (reuses existing validation from service.ts) - Add runtime type guards for catalog JSON and manifest JSON parsing - Fix seedFromRemote to return false on partial failure so bundled fallback kicks in - Add per-skill error handling in syncRemoteSkills so one bad skill doesn't crash the entire sync - Wire stopSkillSync into Application.stop() shutdown path - Extract version from frontmatter in seedFromBundled instead of hardcoding '1.0' - Consolidate duplicated logic: reuse installSkill/writeSkillFile/ contentHash/saveManifest from remote-sync.ts in seed.ts - Extract shared catalog generation into scripts/catalog-utils.ts * test: add flow tests for all four sync scenarios against live CDN * refactor: remove redundant scripts and inline catalog generation Drop generate-skills-catalog.ts, catalog-utils.ts, and e2e-remote-sync.test.ts (covered by flows.test.ts). Inline catalog generation into upload-skills-catalog.ts. * test: add full E2E server flow test against live CDN Tests all 7 steps of the real server lifecycle: fresh seed from CDN, no-op sync, user edit preservation, skill reinstall, custom skill protection, background timer firing, and second startup skip. * chore: remove e2e-server-flow test * fix: address Greptile review — entry validation, size limit, DRY, no-op saves - Validate individual skill entries in catalog (id, version, content must all be strings) not just the top-level shape - Add 1MB response size limit on catalog fetch to prevent resource exhaustion from compromised/misconfigured CDN - Skip manifest save when sync cycle had no changes (avoids unnecessary disk I/O every 45 minutes) - Share extractVersion via remote-sync.ts export, remove duplicate from seed.ts * fix: prevent bundled fallback from overwriting partial remote seeds When seedFromRemote partially fails, the bundled fallback now skips skills already in the manifest (installed by the partial remote seed). Also adds Content-Length early check before downloading the full catalog response body. * fix: run sync immediately on startup, not just on interval Previously the first sync fired 45 minutes after boot. Now startSkillSync runs one sync immediately so returning users get skill updates right away. * refactor: simplify sync — remote always wins, remove manifest Remote catalog is the source of truth. If a skill exists in the catalog, its version is compared against local frontmatter and overwritten when newer. No manifest file, no content hashes. User-created skills (IDs not in catalog) are never touched. * fix: skip bundled skills already installed by partial remote seed * chore: remove unreliable Content-Length check * chore: remove size limit checks, fetch timeout is sufficient	2026-03-17 21:40:45 +05:30
Nikhil	44af9aea6d	fix: clean-up old scripts (#474 ) * fix: remove old scripts * fix: remove vscode	2026-03-17 08:56:55 -07:00
Nikhil	1779e1e7bd	fix: create user-data dir if missing (#473 )	2026-03-17 08:30:39 -07:00
shivammittal274	2597cdbc70	feat: add Rewrite with AI for scheduled task prompts (#465 ) * feat: add "Rewrite with AI" prompt refinement for scheduled tasks Add a lightweight /refine-prompt endpoint that uses generateText to rewrite rough scheduled task prompts into clear, actionable instructions. The UI adds a sparkle-icon button next to the Prompt label in the NewScheduledTaskDialog with loading state, undo support, and disabled state when the textarea is empty. * fix: clear stale undo ref on dialog re-open and pass providerId to refinePrompt - Reset originalPromptRef when dialog opens and on form submit to prevent stale "Undo rewrite" button on re-open - Accept optional providerId in refinePrompt() so the form's selected provider is used for refinement instead of always the system default * fix: hide undo rewrite link while refinement is in flight * fix: reset isRefining state on dialog re-open * fix: ignore stale refine-prompt responses after dialog re-open Use a request generation counter so that if the dialog is closed and re-opened while a rewrite is in flight, the stale response is silently discarded instead of overwriting the fresh form state. * fix: invalidate stale refine requests on dialog reopen and rename to kebab-case - Increment refineRequestIdRef on dialog open so in-flight requests from a previous session are discarded when they complete - Rename refinePrompt.ts to refine-prompt.ts per CLAUDE.md file naming	2026-03-17 19:40:56 +05:30
shivammittal274	515ad44826	fix: resolve biome v2 config and lint errors (#471 ) Migrate `files.ignore` to `files.includes` for Biome v2 compatibility, fix forEach callback return value, unused variable, import ordering, and formatting violations.	2026-03-17 19:14:01 +05:30
Dani Akash	2a6848bc1d	feat: improved system prompt (#466 ) * feat: added ai-sdk dev tools * feat: new system prompt section * feat: tests to maintain prompt integrity * feat: update mcp sync to use react query * fix: refetch logic for sync * chore: remove limits on fetching integrations * fix: refetch integrations on delete * fix: review comment * chore: update tests * fix: improved memory classification * fix: lint issues * fix: core memory prompts * fix: handle scenario where soul file is empty	2026-03-17 19:01:10 +05:30
Dani Akash	74f6a2dff1	fix: issue with fill tool (#469 )	2026-03-17 18:58:17 +05:30
Dani Akash	58adac17db	feat: new workflows (#470 )	2026-03-17 18:56:55 +05:30
shivammittal274	e67c17a0f8	feat: add voice input to agent chat sidebar (#467 ) * feat: add voice input to agent chat sidebar Allow users to record voice and transcribe to text in the chat input. Mic button shows when input is empty, waveform visualizer during recording, transcription via OpenAI (llm.browseros.com/api/transcribe). - Extract shared useVoiceInput hook to lib/voice/ - Time-domain waveform bars that bounce per-frequency-band - Bar height capped to fit input container - Analytics events for recording lifecycle * fix: address review — add fetch timeout, await stopRecording, deduplicate VoiceInputState - Add AbortSignal.timeout(30s) to transcription fetch - Await stopRecording() and track analytics after completion - Export VoiceInputState from useVoiceInput, import in consumers * fix: await startRecording before tracking, narrow SurveyChat effect deps - Await startRecording() so analytics only fires after mic permission granted - Narrow SurveyChat useEffect dependency from [voice] to [voice.transcript, voice.isTranscribing] * fix: analytics only tracks on success, clean up stream on failure, type API response - startRecording returns boolean; track(RECORDING_STARTED) only fires on success - Catch block cleans up MediaStream tracks and AudioContext on partial failure - Type transcription API response with TranscribeResponse interface * fix: keep mic button always visible alongside send button Mic and send are now separate buttons, both always visible. Mic is disabled while AI is streaming. Send is disabled during recording/transcribing. Buttons are no longer absolutely positioned inside the textarea — they sit beside it in the flex row. * fix: keep mic button always visible inside input alongside send Both mic and send buttons are always visible inside the input field, positioned on the right side (ChatGPT-style). Mic is disabled while AI is streaming. Send is disabled during recording/transcribing. * fix: remove unreachable CSS branch in recording waveform div	2026-03-17 18:28:19 +05:30
shivammittal274	94e3f99adb	feat: add test-ui skill for visual testing of agent extension via CDP (#464 ) * feat: add CDP UI inspector script for dev self-testing * fix: address code review feedback for inspect-ui script - Use Delete key (not Backspace) to match server's keyboard.ts clearField - Add windowId resolution to open-sidepanel (chrome.sidePanel.open requires it) - Make target matching case-insensitive - Replace process.exit(1) in eval with thrown error for proper cleanup - Add comment referencing DEV_PORTS source of truth * docs: add self-testing workflow for UI changes via CDP inspector * fix: runtime fixes for inspect-ui discovered during live testing - Remove Input.enable (domain has no enable method) - Add DOM.getDocument before DOM operations (required by protocol) - Use BrowserOS-specific sidePanel.browserosToggle API instead of standard chrome.sidePanel.open (side panel starts disabled) - Enable side panel with setOptions before toggling * feat: add test-ui skill for visual testing of agent extension UI Adds a Claude Code skill that lets the agent visually test both surfaces of the BrowserOS extension: - New tab page (app.html) — left sidebar with Home, Scheduled Tasks, Settings, Skills, Memory, Soul, Connect Apps - Right side panel (sidepanel.html) — chat interface Includes all gotchas discovered through real testing: randomized ports, fresh profile onboarding redirect, stale element IDs after navigation, BrowserOS-specific sidePanel APIs, DOM.getDocument requirement. * feat: add press_key, scroll, hover, select_option, wait_for to inspect-ui Brings inspect-ui.ts to parity with server's MCP input tools: - press_key: key combos like Enter, Control+A, Meta+Shift+P (ported from keyboard.ts pressCombo) - scroll: up/down/left/right with configurable amount - hover: hover over element by ID for tooltip/hover state testing - select_option: select dropdown option by value or visible text (ported from browser.ts selectOption) - wait_for: poll for text or CSS selector with 10s timeout Updated skill documentation with new commands and examples. * docs: prefer snapshot over screenshot, add holistic debugging guidance - Add snapshot vs screenshot guidance table — prefer snapshot for structural checks, screenshot only for visual/layout verification - Add server log checking instructions ([agent], [server], [build] tags) - Add JS error checking via eval - Add API connectivity verification - Add common issues troubleshooting table - Update all examples to use snapshot as default verification * fix: address Greptile review feedback - Replace process.exit(1) with process.exitCode + return in cmdWaitFor to allow async CDP cleanup in finally blocks - Fix cmdScroll enabling Runtime instead of Page domain - Add BROWSEROS_EXTENSION_ID env var override for extension ID - Align CLAUDE.md dev server command with SKILL.md canonical command	2026-03-17 15:18:00 +05:30
Nikhil	e2069bc999	chore: bump server version (#459 )	2026-03-16 16:42:54 -07:00
shivammittal274	2d51c82722	fix: detect custom clickable elements in take_snapshot (#452 ) take_snapshot only used the AX tree, which misses custom components (cursor:pointer divs, onclick handlers, etc.) that lack ARIA roles. These elements appeared as role="generic" and were invisible to the agent. Changes: - Merge findCursorInteractiveElements into snapshot() so take_snapshot catches cursor:pointer, onclick, and tabindex elements - Add DisclosureTriangle to INTERACTIVE_ROLES for <summary> elements - Use aria-label as text fallback in cursor detection for icon-only buttons - Fix dedup bug in enhancedSnapshot that was silently dropping all cursor-detected elements by checking against all AX node IDs instead of only already-included output IDs	2026-03-17 02:01:15 +05:30
shivammittal274	29056226bb	feat: add eval framework and coordinate-based input tools (#453 ) - Add hover_at, type_at, drag_at coordinate tools to server - Add hoverAt, typeAt, dragAt methods to Browser class - Export server internals (browser, tool-loop, registry) for eval imports - Copy eval app from enterprise repo with agents, graders, runner, dashboard - Nest eval-targets inside apps/eval - Adapt sessionExecutionDir → workingDir for current server API - Add biome ignore for dashboard HTML to prevent lint breaking onclick handlers	2026-03-16 23:12:23 +05:30
shivammittal274	d1d2074abc	feat: add get_console_logs tool for browser console output (#454 ) * feat: add get_console_logs tool to surface browser console output Captures Runtime.consoleAPICalled, Runtime.exceptionThrown, and Log.entryAdded CDP events per page with a FIFO ring buffer (500 entries). - ConsoleCollector: per-page buffers with O(1) session routing via Map lookup - Session-aware CDP event dispatching (onSessionEvent) in CdpBackend - Log.enable() added alongside Runtime.enable() in attachToPage - Single tool with level hierarchy, text search, limit, and clear params - Buffer clears on main-frame navigation, cleaned up on page close * fix: address review — handle session re-attach, remove dead code - ConsoleCollector.attach() now updates session mapping on re-attach instead of early-returning, preventing silent event drops after target detach/re-attach (e.g. tab crash, cross-process navigation) - Remove unused clearConsoleLogs() and ConsoleCollector.clear()	2026-03-16 22:20:40 +05:30
shivammittal274	41c9b1547c	feat: add per-task LLM provider selection for scheduled tasks (#450 ) * feat: add per-task LLM provider selection for scheduled tasks Allow users to choose which AI provider a scheduled task runs with, using the same ChatProviderSelector component from the new-tab page. Falls back to the global default provider when none is selected or if the selected provider has been deleted. * fix: lint issues * chore: updated to latest schema.graphql file --------- Co-authored-by: Dani Akash <DaniAkash@users.noreply.github.com>	2026-03-16 18:03:21 +05:30
shivammittal274	8b0e6dbfd3	Merge pull request #448 from browseros-ai/fix/filter-empty-conversation-messages fix: filter empty messages from conversation history	2026-03-16 13:30:42 +05:30
github-actions[bot]	07a2d13f16	docs: shivammittal274 signed the CLA in browseros-ai/BrowserOS#$pullRequestNo	2026-03-15 12:27:03 +00:00
shivammittal274	46031ed573	fix: filter empty messages from conversation history to prevent validation errors The AI SDK can produce assistant messages with empty parts (parts:[]) when a stream is aborted, and providers reject assistant messages with empty text content. This adds a validation utility that filters both cases before sending messages to createAgentUIStreamResponse and when persisting them.	2026-03-15 17:42:34 +05:30
Nikhil	ecd31efcb0	fix: remove Git LFS tracking for docs images so Mintlify can serve them (#446 ) Mintlify deploys docs by cloning the repo but does not run `git lfs pull`. The `.gitattributes` rule `docs/images/** filter=lfs` caused all doc images to be stored as ~130-byte LFS pointer files, which Mintlify served as-is — breaking every image on the site. Removing the LFS rule and re-adding the files as regular git blobs fixes all images without changing any paths or MDX files. Also fixes broken Slack link placeholder in troubleshooting page. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 21:29:26 -07:00
Nikhil	c79c775fb8	fix: co-locate troubleshooting images to fix broken CDN rendering (#444 ) Images in docs/images/ are served as broken 130-byte placeholders by Mintlify CDN. Co-locating images with the MDX file (matching the working pattern in features/workflow/ and features/cowork/) bypasses this issue. Also fixes the Slack link placeholder. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 21:12:26 -07:00
Felarof	4bee76253d	fix: prevent undefined provider in chat requests on fresh install (#442 ) * fix: fallback to default BrowserOS provider when provider is null When the extension first loads, provider config is loaded async from storage. If a chat request fires before loading completes (race condition), provider is null and the server receives provider: undefined, causing a Zod validation error. This adds a fallback to createDefaultBrowserOSProvider() in both chat paths (sidepanel and scheduled tasks) so provider.type is always defined. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: fallback to first provider when default provider ID is stale When defaultProviderId in storage doesn't match any loaded provider (e.g. after Kimi/Moonshot rollout), selectedProvider was null causing provider: undefined in chat requests. Now falls back to providers[0]. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: repair stale defaultProviderId in storage on load When the stored default provider ID doesn't match any loaded provider, write back the corrected ID (providers[0].id) to storage so it doesn't silently persist across sessions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-14 09:05:27 -07:00
Felarof	5b1b4e22cb	chore: disable Canva and Exa from Klavis MCP server list Comment out non-working Canva and Exa integrations from the OAuth MCP servers list and remove their imports/icon mappings from the UI. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 15:30:50 -07:00

1 2 3 4 5 ...

2198 Commits