BrowserOS

mirror of https://github.com/browseros-ai/BrowserOS.git synced 2026-05-21 12:55:09 +00:00

Author	SHA1	Message	Date
Nikhil	91d3285aa0	feat: add ACP agent harness (#849 ) * feat: add acp agent runtime spike * feat: add agent harness catalog * feat: persist harness agents in json * feat: persist agent transcripts * feat: route harness service through agent records * feat: expose generic agent harness routes * feat: add harness agent frontend api * feat: create harness agents from agents page * feat: chat with persisted harness agents * chore: remove obsolete agent profile spike * chore: self-review fixes * fix: combine openclaw and harness agents UI * refactor: split agents page components * fix: hide persisted harness turns	2026-04-28 15:29:38 -07:00
Felarof	df7873562d	Revert Kimi partnership UI, restore daily limit survey (#663 ) * docs: add uBlock Origin install info to getting started and ad-blocking pages Chrome dropped support for the full uBlock Origin extension — highlight that BrowserOS brings it back and make it easy to install from both the getting started guide and the dedicated ad-blocking page. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: revert Kimi partnership UI, restore daily limit survey Remove Kimi/Moonshot AI partnership branding from the rate limit banner, provider card, provider templates, and LLM hub. Restore the original survey CTA on daily limit errors. Moonshot AI remains as a regular provider template without the "Recommended" badge. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: address Greptile review comments - Guard survey CTA with !isCreditsExhausted to avoid showing it for credits-exhausted users who already see "View Usage & Billing" - Remove dead kimi-launch feature flag files (kimi-launch.ts, useKimiLaunch.ts) - Remove unused KIMI_RATE_LIMIT analytics events - Remove VITE_PUBLIC_KIMI_LAUNCH from env schema and .env.example Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-08 16:39:00 -07:00
Dani Akash	94540d9e87	chore(agent): remove workflows feature (#656 )	2026-04-08 08:42:22 +05:30
shivammittal274	81350c0d7f	feat: replace model picker with shadcn Combobox + fuse.js fuzzy search (#617 ) The model picker in NewProviderDialog rendered inline, causing dialog resizing and lacked keyboard navigation. Replace it with a Popover + Command (shadcn Combobox) pattern and add fuse.js for fuzzy search. - Replace custom ModelPickerList with Popover + Command dropdown - Add fuse.js for fuzzy model search (replaces string.includes) - Add MODEL_SELECTED_EVENT and AI_PROVIDER_UPDATED_EVENT analytics - Enrich PROVIDER_SELECTED_EVENT with model_id in chat sessions	2026-03-30 16:38:21 +05:30
shivammittal274	890d3406dd	feat: promote BrowserOS as MCP with UI improvements (#541 ) - Add MCP promo banner on AI providers page with "New" badge and "66+ tools" highlight, linking to /settings/mcp - Add Quick Setup section on MCP settings page with copy-paste commands for Claude Code, Gemini CLI, Codex, Claude Desktop, OpenClaw - Consolidate MCP settings: move restart button inline with server URL, remove separate MCP Server Settings card - Add analytics event for promo banner clicks	2026-03-24 03:08:08 +05:30
shivammittal274	11d15d079f	feat: alibaba qwen oauth (#506 ) * feat: add Qwen Code as OAuth LLM provider with refactored OAuth hooks Add Alibaba Qwen Code as a third OAuth provider using Device Code flow with PKCE. Free tier: 2,000 requests/day, up to 1M token context. Refactoring: - Extract useOAuthProviderFlow hook (eliminates ~180 lines of duplicated OAuth logic from AISettingsPage for ChatGPT Pro + Copilot + Qwen) - Extract resolveOAuthConfig in config.ts (shared resolver for all OAuth providers, parameterized by provider name, default model, refresh flag) - Generalize token-manager device code flow to support PKCE (code_challenge/code_verifier) and form-urlencoded content type New code: - Qwen Code provider config with PKCE + form encoding flags - Provider factories (both provider.ts and provider-factory.ts) - Extension UI (template card, models, analytics, dialog) * fix: use portal.qwen.ai as API base URL for OAuth tokens DashScope (dashscope.aliyuncs.com) expects Alibaba Cloud API keys, not OAuth tokens from chat.qwen.ai. The correct endpoint for OAuth Bearer tokens is portal.qwen.ai/v1. * fix: correct Qwen Code model IDs and context windows - coder-model (1M context): virtual alias that routes to best model - qwen3-coder-plus (1M): was incorrectly 131K - qwen3-coder-flash (1M): new, speed-optimized variant - qwen3.5-plus (1M): was incorrectly 1048576 (power-of-two vs decimal) - Removed qwen3-coder-next (local/self-hosted, not available via OAuth) - Default model changed to coder-model (auto-routes server-side) * fix: move Qwen device code request to extension (bypasses WAF) Alibaba WAF blocks server-side requests to chat.qwen.ai. Move the initial device code request to the extension (browser context with cookies), then hand off the deviceCode + codeVerifier to the server for background polling via new POST /oauth/:provider/poll endpoint. * fix: persist OAuth flow-started flag in sessionStorage The flowStartedRef was lost when the component remounted (e.g. user navigated to onboarding then back to settings). Use sessionStorage to persist the flag so auto-create works after navigation. * revert: remove sessionStorage for OAuth flow flag Revert to simple useRef pattern matching the original ChatGPT Pro implementation. The auto-create works when the user stays on the AI settings page during auth. * revert: move Qwen back to server-side device code flow WAF block was temporary (rate-limiting), not permanent. Server-side fetch to chat.qwen.ai now works. Reverted client-side device code approach — Qwen now uses the same clean server-side flow as Copilot. Removed: clientSideDeviceCode config, startClientSideDeviceCode(), POST /oauth/:provider/poll endpoint, startDeviceCodePolling(). * feat: add WAF detection, rate-limit protection, and token storage endpoint - Detect WAF captcha responses (HTML instead of JSON) in device code request and token polling, with user-friendly error messages - Add 30s cooldown on "USE" button to prevent rapid clicks triggering WAF - WAF-blocked poll requests silently retry instead of aborting - Add POST /oauth/:provider/token endpoint for storing externally-provided tokens (useful for future fallback flows) - Add storeTokens() method to OAuthTokenManager - Pass server error messages through to extension toast notifications * refactor: remove 30s cooldown, simplify OAuth hook The hook is now identical for all providers — server handles retries via activeDeviceFlows.delete(). Removed flowStartedAtRef cooldown that was blocking legitimate retries. * feat: client-side OAuth for Copilot and Qwen Code Move device code OAuth flow to the extension for GitHub Copilot and Qwen Code. The extension makes requests using Chrome's network stack, which bypasses Alibaba WAF TLS fingerprint detection that blocks server-side Bun/Node.js fetch. New files: - client-oauth.ts: Client-side device code + PKCE + token polling Changes: - useOAuthProviderFlow: handleClientAuth() for providers with clientAuth config, handleServerAuth() for others (ChatGPT Pro) - AISettingsPage: clientAuth config for Copilot and Qwen Code - WAF detection: opens provider site for captcha solving on block Server-side device code flow preserved as fallback (token-manager.ts, providers.ts). Token storage via POST /oauth/:provider/token endpoint. * fix: export OAuthProviderFlowConfig type, fix typecheck errors - Export OAuthProviderFlowConfig interface so AISettingsPage can use it instead of duplicating the type inline - Fix string \| null → string \| undefined for agentServerUrl parameter	2026-03-20 17:46:48 +05:30
shivammittal274	720baaed3e	feat: add GitHub Copilot as OAuth LLM provider (#500 ) * feat: add GitHub Copilot as OAuth-based LLM provider Add GitHub Copilot as a second OAuth provider using the Device Code flow (RFC 8628). Users authenticate via github.com/login/device, and the server polls for token completion. Supports 25+ models through a single Copilot subscription. Key changes: - Device Code OAuth flow in token manager (poll with safety margin) - Custom fetch wrapper injecting Copilot headers + vision detection - Provider factory using createOpenAICompatible for Chat Completions API - Extension UI with template card, auto-create on auth, and disconnect * fix: address PR review comments for GitHub Copilot OAuth - Validate device code response for error fields (GitHub can return 200 with error payload) - Store empty refreshToken instead of access token for GitHub tokens - Add closeButton to Toaster for dismissing device code toast * fix: add github-copilot to agent provider factory The chat route uses a separate provider-factory.ts (agent layer) from the test-provider route (llm/provider.ts). Added createGitHubCopilotFactory to the agent factory so chat works with GitHub Copilot. * fix: add github-copilot to provider icons, models, and dialog - Add Github icon from lucide-react to providerIcons map - Add 8 Copilot models (GPT-4o, Claude, Gemini, Grok) to models.ts - Add github-copilot to NewProviderDialog zod enum, validation skip, canTest check, and OAuth credential message * fix: reorder copilot models with free-tier models first Put models available on Copilot Free at the top (gpt-4o, gpt-4.1, gpt-5-mini, claude-haiku-4.5, grok-code-fast-1), followed by premium models that require paid Copilot subscription. * fix: set correct 64K context window for Copilot models Copilot API enforces a 64K input token limit regardless of the underlying model's native context window. Updated all model entries and the default template to 64000 so compaction triggers correctly. * fix: use actual per-model prompt limits from Copilot /models API Queried api.githubcopilot.com/models for real max_prompt_tokens values. GPT-4o/4.1 have 64K, Claude/gpt-5-mini have 128K, GPT-5.x have 272K. Also updated model list to match what's actually available on the API (e.g. claude-sonnet-4.6 instead of 4.5, added gpt-5.4/5.2-codex). * feat: resize images for Copilot using VS Code's algorithm Large screenshots cause 413 errors on Copilot's API. Resize images following VS Code's approach: max 2048px longest side, 768px shortest side, re-encode as JPEG at 75% quality. Uses sharp for server-side image processing. * fix: address all Greptile P1 review comments - Add .catch() on fire-and-forget pollDeviceCode to prevent unhandled rejection crashes (Node 15+) - Add deduplication guard (activeDeviceFlows Set) to prevent concurrent device code flows for the same provider - Add runtime validation of server response in frontend before calling window.open() and showing toast - Remove dead GITHUB_DEVICE_VERIFICATION constant from urls.ts * fix: upgrade biome to 2.4.8, fix all lint errors, and address review bugs - Upgrade biome from 2.4.5 to 2.4.8 (matches CI) and migrate configs - Fix image resize: only re-encode when dimensions actually change - Fix device code polling: retry on transient network errors instead of aborting - Allow restarting device code flow (clear old flow instead of throwing 500) - Fix pre-existing noNonNullAssertion and noExplicitAny lint errors globally * fix: address Greptile P2 review — image resize and config guard - Fix early-return guard: check max/min sides against their respective limits (MAX_LONG_SIDE/MAX_SHORT_SIDE) instead of both against SHORT - Preserve PNG alpha: detect hasAlpha and keep PNG format instead of unconditionally converting to lossy JPEG - Keep browserosId guard in resolveGitHubCopilotConfig consistent with ChatGPT Pro pattern (safety check that caller context is valid) * feat: update Copilot models to full list from pricing page, default to gpt-5-mini Added all 23 models from GitHub Copilot pricing page. Ordered with free-tier models first (gpt-5-mini, claude-haiku-4.5), then premium. Changed default from gpt-4o to gpt-5-mini since it's unlimited on Pro plan and has 128K context (vs gpt-4o's 64K limit).	2026-03-20 02:33:09 +05:30
Dani Akash	1b88ade021	feat: updated homepage chat (#481 ) * feat: updated chat ui from homepage * fix: vertical scroll * fix: horizontal scroll issue * fix: lint issues * fix: header width * fix: message input from home to chat * feat: created sidebar header support in new tab chat * fix: remove history from new tab chat * fix: remove the shared element transition * fix: lint issues * fix: review comments * fix: defer the sendMessage callback * fix: all code concerns * fix: preserve state of chat on homepage * fix: review comments	2026-03-19 15:24:05 +05:30
shivammittal274	46a8326140	feat: add ChatGPT Pro OAuth as LLM provider (#476 ) * feat: add ChatGPT Pro OAuth as LLM provider Adds OAuth 2.0 (Authorization Code + PKCE) flow so users can authenticate with their ChatGPT Pro subscription to power BrowserOS's agent, matching the pattern used by Codex CLI, OpenCode, and Pi. Server: - OAuth token lifecycle (PKCE, exchange, refresh, SQLite storage) - Dedicated callback server on port 1455 (Codex client ID registration) - Codex fetch wrapper routing API calls to chatgpt.com/backend-api - Config resolution + provider factories for all code paths (chat, test, refine) Extension: - ChatGPT Pro template card with OAuth flow trigger - Status polling hook + auto-create provider on auth success - Model list with Codex-supported models (gpt-5.x-codex family) * fix: address Greptile PR review comments - Wire OAuth callback server stop handle into onShutdown (P1: port 1455 leak) - Guard against missing refresh token + clear stale tokens on failed refresh (P1) - Add logger.warn to silent catch in codex-fetch body mutation - Document JWT trust assumption in parseAccessTokenClaims - Source model ID from provider template instead of hard-coding * simplify: remove unnecessary OAuth shutdown wiring and useCallback - Revert OAuthHandle interface — callback server port releases on process exit - Remove stopCallbackServer from shutdown flow (dead code) - Remove all useCallback from useOAuthStatus per CLAUDE.md guidance * style: add readonly modifiers and braces per TS style guide * docs: add E2E test screenshots for ChatGPT Pro OAuth * fix: strip item IDs from Codex requests to fix multi-turn conversations * fix: preserve function_call_output IDs in Codex requests * fix: resolve Codex store=false + tool-use incompatibility - Pass providerOptions { openai: { store: false } } to ToolLoopAgent so the AI SDK inlines content instead of using item_reference - Strip item IDs and previous_response_id in codex-fetch (safety net) - Use .responses() model (Codex only speaks Responses API format) * fix: remove non-Codex model gpt-5.2 from chatgpt-pro model list * fix: strip unsupported Codex params and update model list - Strip temperature, max_tokens, top_p from Codex requests (unsupported) - Add all available Codex models including gpt-5.4, gpt-5.2, gpt-5.1 * chore: remove screenshots containing email * feat: enable reasoning events for ChatGPT Pro Codex models * chore: set reasoning effort to high for ChatGPT Pro * feat: add configurable reasoning effort and summary for ChatGPT Pro - Add reasoningEffort (none/low/medium/high) and reasoningSummary (auto/concise/detailed) dropdowns in the Edit Provider dialog - Pass through extension → chat request → agent config → providerOptions - Defaults: effort=high, summary=auto * fix: strip max_output_tokens from Codex requests (fixes compaction) * fix: address Greptile P1 issues - Fix default model fallback: gpt-4o → gpt-5.3-codex (Codex endpoint) - Clear stale tokens on refresh failure (prevents infinite retry loop) - Only auto-create provider after explicit OAuth flow, not on page load - Add catch block to auto-create effect with error toast	2026-03-18 22:07:43 +05:30
shivammittal274	2597cdbc70	feat: add Rewrite with AI for scheduled task prompts (#465 ) * feat: add "Rewrite with AI" prompt refinement for scheduled tasks Add a lightweight /refine-prompt endpoint that uses generateText to rewrite rough scheduled task prompts into clear, actionable instructions. The UI adds a sparkle-icon button next to the Prompt label in the NewScheduledTaskDialog with loading state, undo support, and disabled state when the textarea is empty. * fix: clear stale undo ref on dialog re-open and pass providerId to refinePrompt - Reset originalPromptRef when dialog opens and on form submit to prevent stale "Undo rewrite" button on re-open - Accept optional providerId in refinePrompt() so the form's selected provider is used for refinement instead of always the system default * fix: hide undo rewrite link while refinement is in flight * fix: reset isRefining state on dialog re-open * fix: ignore stale refine-prompt responses after dialog re-open Use a request generation counter so that if the dialog is closed and re-opened while a rewrite is in flight, the stale response is silently discarded instead of overwriting the fresh form state. * fix: invalidate stale refine requests on dialog reopen and rename to kebab-case - Increment refineRequestIdRef on dialog open so in-flight requests from a previous session are discarded when they complete - Rename refinePrompt.ts to refine-prompt.ts per CLAUDE.md file naming	2026-03-17 19:40:56 +05:30
shivammittal274	e67c17a0f8	feat: add voice input to agent chat sidebar (#467 ) * feat: add voice input to agent chat sidebar Allow users to record voice and transcribe to text in the chat input. Mic button shows when input is empty, waveform visualizer during recording, transcription via OpenAI (llm.browseros.com/api/transcribe). - Extract shared useVoiceInput hook to lib/voice/ - Time-domain waveform bars that bounce per-frequency-band - Bar height capped to fit input container - Analytics events for recording lifecycle * fix: address review — add fetch timeout, await stopRecording, deduplicate VoiceInputState - Add AbortSignal.timeout(30s) to transcription fetch - Await stopRecording() and track analytics after completion - Export VoiceInputState from useVoiceInput, import in consumers * fix: await startRecording before tracking, narrow SurveyChat effect deps - Await startRecording() so analytics only fires after mic permission granted - Narrow SurveyChat useEffect dependency from [voice] to [voice.transcript, voice.isTranscribing] * fix: analytics only tracks on success, clean up stream on failure, type API response - startRecording returns boolean; track(RECORDING_STARTED) only fires on success - Catch block cleans up MediaStream tracks and AudioContext on partial failure - Type transcription API response with TranscribeResponse interface * fix: keep mic button always visible alongside send button Mic and send are now separate buttons, both always visible. Mic is disabled while AI is streaming. Send is disabled during recording/transcribing. Buttons are no longer absolutely positioned inside the textarea — they sit beside it in the flex row. * fix: keep mic button always visible inside input alongside send Both mic and send buttons are always visible inside the input field, positioned on the right side (ChatGPT-style). Mic is disabled while AI is streaming. Send is disabled during recording/transcribing. * fix: remove unreachable CSS branch in recording waveform div	2026-03-17 18:28:19 +05:30
Felarof	95c855a091	feat: replace rate limit CTAs with Kimi/Moonshot partnership links (#437 ) * feat: replace rate limit CTAs with Kimi/Moonshot partnership links Comment out old "Learn more" and "take a quick survey" links on the daily limit error banner. Replace with Kimi API key docs link and direct Moonshot AI platform link for conversion tracking. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: remove partnership tagline from rate limit banner Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 12:45:41 -07:00
Dani Akash	290ee91a8b	Add 'packages/browseros-agent/' from commit '90bd4be3008285bf3825aad3702aff98f872671a' git-subtree-dir: packages/browseros-agent git-subtree-mainline: `8f148d0918` git-subtree-split: `90bd4be300`	2026-03-13 21:22:09 +05:30

13 Commits