Changelog¶
All notable project changes are tracked here (code + docs).
[1.7.0] — 2026-04-14¶
Added¶
- Cloudflare integration platform (cf-001 through cf-012):
- Workers RuntimeBridge (
bridges/cloudflare.py) — agent execution on Workers + Durable Objects - Workflow Bridge (
bridges/cloudflare_workflow.py) — durable multi-step workflows with auto-retry and approval gates - Sandbox Bridge (
bridges/cloudflare_sandbox.py) — V8 isolate and container sandboxes for isolated code execution - Browser Rendering Bridge (
bridges/browser_rendering.py) — headless web browsing, screenshots, scraping, PDF generation - R2 Workspace Sync (
bridges/r2_sync.py) — content-addressed delta file sync via Cloudflare R2 - Workers AI Provider (
core/routing/cloudflare_ai.py) — free-tier LLM models (Llama 3.1, Mistral, Gemma, Qwen) for planning - D1 Analytics Client (
core/cost/d1_analytics.py) — usage metering, billing tiers (free/pro/team/enterprise), quota enforcement - Vectorize Semantic Cache (
core/memory/vectorize_cache.py) — embedding-based LLM response caching - MCP Remote Transport (
mcp/remote_transport.py) — streamable HTTP transport for remote MCP server access - Cloud CLI (
cli/commands/cloud_cmd.py) —bernstein cloudsubcommands: login, logout, run, status, runs, cost, deploy - Cloudflare Agents Adapter (
adapters/cloudflare_agents.py) — spawn agents vianpx wrangler dev - Codex-on-Cloudflare Adapter (
adapters/codex_cloudflare.py) — run Codex in Cloudflare sandboxes - Full Cloudflare documentation: overview, setup, bridges, adapters, Workers AI, analytics, CLI, MCP remote (8 new doc pages)
[1.4.11] — 2026-04-03¶
Added¶
- Bernstein doctor — comprehensive pre-flight health check: adapters, API keys, ports,
.sdd/integrity, MCP servers. Auto-repair mode with--fix. - Per-agent token progress — real-time token usage tracking per spawned agent, surfaced in
bernstein status. - Context injection token budget — explicit budgets for injected context (files, lessons, RAG chunks) with graceful truncation and priority ordering.
- Output style customization — configurable agent output format via markdown templates.
- Installation mismatch detection — detects gaps between expected and installed adapter capabilities.
- API preconnect warmup — connection warmup before heavy runs to reduce first-request latency.
- Worker badge identity — process identification visible in
bernstein psand Activity Monitor. - TUI keybinding system — configurable keyboard shortcuts in the Textual dashboard.
- Progressive permission prompts — per-agent permission levels for fine-grained control.
- Activity tracking metrics — session-level activity statistics and agent usage patterns.
- Away summary generation — summarize what happened while you were away.
- Commit attribution stats — per-agent commit statistics.
- Session analytics — cumulative insights across runs.
- Settings snapshot in traces — agent settings preserved in execution traces.
- Side question support — agents can ask clarifying questions mid-task.
- Diff folding display — folded diff rendering in agent output.
- Word-level diff rendering — character-level change highlighting.
- Contextual tips system — in-context hints for agents.
- Session tag system — tag and filter runs.
- Rename session — session renaming command.
- Security review command —
bernstein security-reviewfor vulnerability assessment. - Cumulative progress tracking — progress tracking across runs.
- Plugin trust warning — warns on unverified plugins.
- Plugin error reporting — improved error diagnostics for plugin failures.
- Extra usage provisioning — additional usage quota management.
- Truecolor mode detection — automatic terminal color capability detection.
- Dirty flag layout caching — caching optimizations for dirty project detection.
- Release notes display — show release notes on startup.
Fixed¶
- Context warnings in
bernstein doctoroutput for better diagnostics. - Circuit breaker for repeated compact failures — prevents agent thrashing.
Changed¶
- Documentation overhaul: README, GETTING_STARTED, ARCHITECTURE, FEATURE_MATRIX, BENCHMARKS, CHANGELOG, CONTRIBUTING all rewritten against v1.4.11 codebase.
[1.4.9] — 2026-04-01¶
Added¶
- Process-aware shutdown/drain improvements across CLI and core lifecycle paths.
- Cost analytics enhancements (additional endpoints/aggregation work and routing transparency updates).
- Security enhancements including sensitivity-classification and IP-allowlist related hardening.
- TUI keyboard help (
?) shortcut support.
Changed¶
- Issue triage and documentation alignment pass to reduce roadmap vs shipped-feature drift.
- Retry, lifecycle, and observability narratives updated to better reflect current implementation boundaries.
[1.4.0] — 2026-03-31¶
Added¶
- Plan Files: loadable YAML project plans with stages and steps (
bernstein run plan.yaml) - Server Supervisor: auto-restart on crash with exponential backoff (max 5 restarts / 10 min)
- CrashGuard Middleware: catches unhandled exceptions → 500 instead of process death
- Orchestrator drain mode: loop continues while agents are active, even after stop signal
- Quality gates: PII scan, mutation testing, benchmark regression detection
- Gate Runner: parallel execution of all quality gates (asyncio)
- Benchmark regression gate: block merge when performance degrades beyond threshold
- PII log redaction: auto-installed filter scrubs emails, phones, SSNs, credit cards from all log output
- Agent loop detection: kills agents caught in edit-loop cycles (same file edited N+ times in window)
- Deadlock detection: wait-for graph cycle detection with automatic victim selection
- Cost anomaly detection: Z-score based cost anomaly signaling with configurable thresholds
- Per-agent file/command permissions: role-based matrix restricting which files and commands each role may use
- Premium visual theme: CRT power-off effects, gradient splash, block-art logo
- Live boot log: orchestrator boot progress shown in Agents panel while no agents spawned
- Persistent memory: SQLite-backed cross-session agent memory
- Context handoff: structured context briefs for subtask delegation
- Zero-config mode: auto-detect project type, no bernstein.yaml required
- Worktree environment hooks: auto-symlink node_modules, copy .env
- FIFO merge queue: sequential merge with git merge-tree conflict pre-check
- Ticket Format v1: YAML frontmatter with model routing, janitor signals, tags
- 10 adapters: Claude, Codex, Cursor, Gemini, Kiro, OpenCode, Aider, Amp, Roo Code, Generic
- Futuristic splash screen: full-screen animated boot sequence
- Plan display: mission-briefing style execution plan approval
- test_cli_run_params.py: catches cli() → run() parameter sync bugs
Fixed¶
- Manager always uses opus/max (was falling back to haiku via fast_path)
- Orchestrator no longer exits while agents still running
- Server failure backoff: 5s per failure instead of constant polling
- Startup crash: missing pii_scan fields in QualityGatesConfig
- .yaml/.md backward compatibility in all backlog parsers
Changed¶
- Ticket format migrated from .md to .yaml (YAML frontmatter)
- Version bump 1.3.x → 1.4.0
[1.0.3] — 2026-03-30¶
Added¶
- State-of-the-art CI/CD pipeline: 11 new GitHub Actions workflows
- Three-tier AI PR review (GitHub Models + Gemini CLI + Bernstein deep review)
- Semgrep SAST, license compliance, spelling, dead code analysis, workflow linting
- PR auto-labeling, size warnings, stale cleanup, Dependabot auto-merge
- Release Drafter for automated changelog generation
- Telegram bot notifications on CI completion
- Codecov coverage gating (85% project / 70% patch)
- Concurrency groups on all workflows with cancel-in-progress
- CI and Codecov badges in README
Changed¶
- FEATURE_MATRIX updated with CI/CD section (15 new entries)
- GETTING_STARTED expanded with CI pipeline documentation
- Manual backlog index updated with all setup tickets and status tracking
[1.0.2] — 2026-03-28¶
Changed¶
- Documentation audit: updated outdated model names, CLI references, API endpoints, and GitHub Action version tags
- Default branch references updated from
mastertomainacross all docs
[1.0.0] — 2026-03-28¶
Added¶
- ACP (Agent Communication Protocol) endpoints for agent interoperability
- A2A (Agent-to-Agent) protocol support
- Cluster mode with multi-node coordination (node registration, heartbeat, status)
- Auth routes: OIDC, SAML, CLI device flow, group mappings, user management
- Graduation system for agent promotion based on performance
- Plans routes for plan listing, approval, and rejection
- Slack integration (slash commands and events)
- Quality dashboard with per-model quality metrics
- Cost history, live cost tracking, and cost alerts endpoints
- File lock tracking via dashboard routes
- Task prioritization, force-claim, and progress reporting endpoints
- Chaos testing CLI group
- Audit CLI group
- Verify CLI command
Changed¶
- Version bumped to 1.0.0 (stable release)
- Route modules expanded: acp.py, auth.py, graduation.py, plans.py, slack.py added to core/routes/
[0.3.0] — 2026-03-28¶
Added¶
- Checkpoint and wrap-up CLI commands for session management
- Task snapshots endpoint for viewing task state history
- Webhook alerts endpoint
- SSE event stream at
/eventsfor real-time dashboard updates - Prometheus
/metricsendpoint for observability - Bandit-based model routing stats at
/routing/bandit - Cache stats endpoint at
/cache-stats
Changed¶
- CLI decomposed further: audit_cmd.py, chaos_cmd.py, checkpoint_cmd.py, verify_cmd.py, wrap_up_cmd.py
- Task server routes expanded with block, progress, and prioritize actions
[0.2.0] — 2026-03-28¶
Added¶
- Agent discovery system with multi-provider routing (
cli: auto) - Quality gates for task verification
- Rule enforcement engine
- Token monitor for real-time usage tracking
- Approval gates for high-risk operations
- MCP server integration
- Hot reload for configuration changes
- Aider, Amp, and Roo Code adapters
- Adapter manager and caching adapter layer
- Environment isolation for adapter processes
- Web dashboard with real-time SSE updates
- Workspace management for multi-repo orchestration
- GitHub App integration for webhook-driven tasks
- Auth middleware and checkpoint commands
- Delegate, trigger, and wrap-up CLI commands
Changed¶
- Default CLI adapter is now
auto(detects installed agents) instead ofclaude - Test count badge updated: 2500+ to 4250+ (142 test files, 4257 test functions)
- Server decomposed into
core/routes/(tasks.py, status.py, webhooks.py, costs.py, agents.py, auth.py, dashboard.py, plans.py, quality.py, graduation.py, slack.py) - Orchestrator decomposed into tick_pipeline.py, task_lifecycle.py, agent_lifecycle.py
- CLI decomposed into helpers.py, run_cmd.py, stop_cmd.py, status_cmd.py, agents_cmd.py, evolve_cmd.py, advanced_cmd.py, and more
- TaskStore extracted to task_store.py with PostgreSQL and Redis backends
bernstein catalogcommands renamed tobernstein agents(sync, list, validate)- Adapter listing in DESIGN.md updated to include all current adapters (removed stale kiro.py)
- Example YAML files updated:
cli: claudechanged tocli: auto - All documentation references to
bernstein catalogupdated tobernstein agents - Removed stale "(default)" label from Claude adapter docs (default is now
auto)
[0.1.0] — 2026-03-28¶
Added¶
- License: Apache 2.0
- Per-run cost budgeting (
--budget 5.00) with threshold warnings - CI auto-fix pipeline with GitHub Actions log parser
- GitHub Action (
action.yml) for CI-triggered orchestration - MCP tool access — agents use MCP servers (stdio/SSE)
- TUI session manager (
bernstein live) with Textual - "The Bernstein Way" architecture tenets document
- Quickstart demo (
examples/quickstart/) - Comparison pages (
docs/compare/) - GitHub Action documentation (
docs/github-action.md) - Feature cards for cost budgeting, GitHub Action, MCP on index page
docs/competitive-matrix.md— feature comparison vs CrewAI, AutoGen, LangGraph, etc.docs/zero-lock-in.md— model-agnostic architecture deep divedocs/CHANGELOG.md— this filedocs/VERSION— documentation version tracking
Changed¶
- All license references updated to Apache 2.0 across all HTML and markdown docs
- README: quickstart section with full install → init → run flow
- README: test count badge, license badge, benchmark badge
- Getting Started: fixed test command to use isolated runner
- Comparison table: added cost budgeting and GitHub Action rows