mirror of https://github.com/pocketpaw/pocketpaw.git synced 2026-05-21 17:24:57 +00:00

Files

Ragini Pandey c905e22c49 feat: add AGENTS.md support (#456 ) (#551 )

Implement the AGENTS.md standard (https://github.com/anthropics/agents-md)
so PocketPaw reads project-specific agent constraints from target repos and
publishes its own capability declaration.

Changes:
- src/pocketpaw/agents_md/loader.py — AgentsMdLoader: walks up from the
  working directory to the repo root (.git boundary), reads and parses
  AGENTS.md, mtime-based caching, 32 KiB size cap.
- src/pocketpaw/agents_md/__init__.py — package exports.
- src/pocketpaw/bootstrap/context_builder.py — new `agents_md_dir` param on
  build_system_prompt(); injects AGENTS.md constraints as step 8 of the
  system prompt (after health state).
- src/pocketpaw/agents/loop.py — passes `agents_md_dir` (from file_context
  current_dir or file_jail_path fallback) to build_system_prompt(); emits
  SystemEvent(event_type='agents_md_loaded') for the dashboard Activity panel
  when a file is found.
- AGENTS.md — PocketPaw's own capability declaration (tools, safety
  boundaries, forbidden operations, coding conventions).
- tests/test_agents_md.py — 23 tests covering loader, caching, walk-up,
  .git boundary, section parsing, context injection, and event emission.

Closes #456"

Co-authored-by: Rohit Kushwaha <rohitk290106@gmail.com>

2026-03-10 21:17:29 +05:30

4.2 KiB

Raw Permalink Blame History

AGENTS.md

This file declares PocketPaw's capabilities, supported tools, and safety boundaries for other AI agents and tools that interact with this repository. Format: AGENTS.md specification

Agent Identity

Name: PocketPaw
Type: Self-hosted AI assistant agent
Description: PocketPaw is a locally-run AI agent controlled via Telegram, Discord, Slack, WhatsApp, or a web dashboard. It operates on the user's machine with configurable tool access and safety guardrails.

Supported Backends

PocketPaw can delegate to any of the following AI backends:

claude_agent_sdk — Anthropic Claude via the official Agent SDK (default)
openai_agents — OpenAI GPT models or Ollama (local)
google_adk — Google Gemini via the Agent Development Kit
codex_cli — OpenAI Codex CLI subprocess
opencode — External OpenCode server via REST API
copilot_sdk — GitHub Copilot SDK

Capabilities

File System

✅ Read files within the configured file_jail_path (default: $HOME)
✅ Write and edit files within file_jail_path
✅ List directories and traverse directory trees
❌ Access files outside file_jail_path

Shell Execution

✅ Execute shell commands (trust level: critical — requires explicit user permission)
❌ Commands matching dangerous patterns in security/rails.py are always blocked (e.g., rm -rf /, fork bombs, privilege escalation)
❌ Shell commands require trust_level = "critical" policy approval

Web & Network

✅ Web search (via configured search provider)
✅ URL content extraction
✅ HTTP requests via the browser tool (Playwright-based, accessibility-tree snapshots)
❌ No direct socket access or raw network operations

Memory

✅ Session memory (per-conversation history)
✅ Long-term memory (remember, recall, forget tools)
✅ Memory stored locally at ~/.pocketpaw/memory/

Integrations (optional — require explicit configuration)

Gmail (read, send, label)
Google Calendar (list, create)
Google Drive (list, download, upload)
Google Docs (read, create, search)
Spotify (search, playback control)
Discord, Slack, Telegram, WhatsApp channels

Security & Guardrails

All shell-executing tools pass through security/rails.py before execution.
A secondary Guardian AI performs safety checks on high-trust and critical operations.
All trust_level = "high" and trust_level = "critical" actions are audit-logged to ~/.pocketpaw/audit.jsonl.
Prompt injection scanning is enabled by default on all inbound messages.
Channel identifiers are validated against configured allowlists before processing.

Forbidden Operations

❌ Hardcoded credentials or API keys — all secrets use encrypted CredentialStore
❌ asyncio.run() inside library code
❌ Module-level get_settings() calls
❌ Logging API keys, tokens, or user PII
❌ Unauthenticated REST endpoints — all routes require existing auth middleware
❌ Creating new event types on the message bus without updating bus/events.py

Project-Specific Instructions for AI Agents

When working on this repository:

Follow the event-driven bus pattern: adapters publish InboundMessage, agents publish OutboundMessage and SystemEvent. Never call agent code directly from an adapter.
Use Protocol (not ABC) for all public interfaces.
Lazy-import optional SDK dependencies inside __init__ or first-use methods.
All I/O must be async def. Use asyncio.to_thread() for unavoidable blocking calls.
Run uv run ruff check . and uv run pytest --ignore=tests/e2e before committing.
New secret fields must be added to SECRET_FIELDS in credentials.py.
Every new AgentBackend must yield AgentEvent(type="done", ...) as its final event.

Tool Policy Groups

Group	Tools
`group:fs`	read_file, write_file, edit_file, list_dir, directory_tree
`group:shell`	shell
`group:browser`	browser
`group:memory`	remember, recall, forget
`group:search`	web_search, url_extract
`group:skills`	create_skill, skill

Default profile: full (no restrictions). Configure via tool_profile in settings.

4.2 KiB Raw Permalink Blame History