brillz
Feed

AI signal.Scored daily.

Daily scan of AI + builder feeds, scored by Claude. Only items 7+ make it in. New models with benchmarks, platform changes, real builder data — no hype.

Today

8
llm-anthropic 0.25.1
Simon WillisonMay 28, 2026

Claude Opus 4.8 release with fast mode option and improved token defaults directly affects production API users building with Claude.

Test in next project

7
markdown-svg-renderer
Simon WillisonMay 28, 2026

Markdown-SVG renderer is a niche tool for visualizing code-generated diagrams; useful for builders documenting LLM outputs.

7
MCP is dead?
Hacker NewsMay 29, 2026

MCP discussion surfaces whether Model Context Protocol adoption is slowing; relevant to agent-building toolchain decisions.

Yesterday

9
Anthropic's run-rate revenue hits $47 billion
Simon WillisonMay 29, 2026

Anthropic's $47B annualized run-rate revenue signals real market traction at scale; directly relevant to builder economics and platform viability for production work.

Note for competitive positioning

7
Claude Opus 4.8: "a modest but tangible improvement"
Simon WillisonMay 28, 2026

Claude Opus 4.8 is incremental improvement with honest positioning; relevant for evaluating when to upgrade models in production systems.

Test in next project if cost-benefit aligns

8
How Endava builds an agentic organization with Codex
OpenAIMay 28, 2026

Endava case shows Codex reducing requirements analysis from weeks to hours—concrete productivity metric for agentic workflows in enterprise.

Study methodology for similar solo/agency builds

7
Building self-improving tax agents with Codex
OpenAIMay 27, 2026

Tax agent case demonstrates self-improving agent pattern with Codex; relevant methodology for automating domain-specific workflows.

Review for similar compliance automation projects

7
Introducing Claude Opus 4.8
AnthropicMay 28, 2026

Claude Opus 4.8 release—benchmark capabilities worth checking against production requirements.

Review benchmarks vs. current stack

Earlier

7
sqlite AGENTS.md
Simon WillisonMay 27, 2026

SQLite's AGENTS.md clarifies agent-code policies for agentic development workflows—directly affects how builders integrate code agents with major databases.

Review before using agents to modify SQLite codebases

8
I think Anthropic and OpenAI have found product-market fit
Simon WillisonMay 27, 2026

Anthropic reaching profitability and enterprise LLM API costs rising sharply signals economic inflection point for builder stacks relying on Claude/OpenAI inference.

Monitor pricing trends across your current stack

8
Warp’s big bet on building open source with GPT-5.5
OpenAIMay 27, 2026

Warp shipping GPT-5.5 for coordinating coding agents across local, cloud, and open-source workflows—concrete tooling change for AI-accelerated dev environments.

Test in next project if shipping CLI tools

7
The pressure
Simon WillisonMay 26, 2026

Security velocity in open-source infrastructure is 4-5x higher due to AI-assisted reports; signals real operational pressure on production dependencies.

Monitor curl security releases; understand upstream risk profile

8
Microsoft Copilot Cowork Exfiltrates Files
Simon WillisonMay 26, 2026

Concrete agent failure mode: Copilot Cowork agents sending unapproved emails with data exfiltration via image rendering—directly applicable to agentic systems builders deploy.

Review agent approval patterns in your stacks

7
Stripe is friendly to “friendly fraud”
Hacker NewsMay 27, 2026

Stripe's friendly fraud handling gaps are directly relevant to solo founder payment risk; 179 HN points signals real builder concern.

Audit your Stripe dispute workflows and fraud prevention

7
Using AI to write better code more slowly
Hacker NewsMay 25, 2026

Real methodology on writing better code with AI tools—deliberately slower, more deliberate approach with concrete tradeoffs worth understanding for production workflows.

Read for workflow patterns

7
datasette-agent 0.1a4
Simon WillisonMay 24, 2026

Datasette Agent's jump-to-menu integration shows practical LLM-assisted data tooling that could inform how builders structure agent UIs.

Watch for adoption patterns

8
How Virgin Atlantic ships faster with Codex
OpenAIMay 22, 2026

Virgin Atlantic's hard numbers: near-total unit test coverage and zero P1 defects on a fixed deadline using Codex—concrete shipping velocity data.

Study deployment constraints

7
Project Glasswing: An Initial Update
Hacker NewsMay 22, 2026

Anthropic's Project Glasswing research update on AI agent architectures relevant to builder infrastructure and tool-use patterns.

Read for agent framework insights

8
Datasette Agent
Simon WillisonMay 21, 2026

Datasette Agent brings conversational SQL + charting to production data workflows; extensible plugin architecture (charts, sprites) means builders can ship query-to-visualization pipelines without custom parsing.

Test in next project if you work with structured data

8
Show HN: Spec-Driven Development Workflow for Claude Code
Hacker NewsMay 22, 2026

Spec-Driven Development workflow for Claude Code demonstrates decomposition, context clearing, and disk persistence to reduce token cost and improve agent reliability—concrete methodology for shipping at scale.

Adopt this pattern in Claude Code workflows

9
Quoting SpaceX S-1
Simon WillisonMay 20, 2026

SpaceX compute deal signals $1.25B/month capacity agreement with Anthropic through May 2029—concrete data on AI infrastructure economics and model training at scale.

Watch for Claude API pricing/availability changes tied to compute constraints

7
How fast is 10 tokens per second really?
Simon WillisonMay 20, 2026

Interactive token-speed simulator clarifies real-world latency perception for builders choosing models—useful for UX decision-making when comparing 10 vs 30 tokens/sec.

Bookmark for model selection conversations

8
How Ramp engineers accelerate code review with Codex
OpenAIMay 20, 2026

Ramp case study: GPT-5.5 + Codex cuts code review time from hours to minutes—measurable productivity gain on production engineering workflow.

Reference in sales/pitch deck for code-review automation

8
Anthropic acquires Stainless
AnthropicMay 18, 2026

Anthropic acquires Stainless (API SDK/codegen tooling)—signals product velocity on developer tooling; may improve Claude integration experience for builders.

Watch for Stainless tooling improvements post-acquisition

9
SpaceX S-1
Hacker NewsMay 20, 2026

SpaceX S-1 filing reveals $1.25B/month Anthropic compute agreement through May 2029—public confirmation of infrastructure economics shaping Claude's future cost and capacity.

Cross-reference with Claude pricing and availability updates

7
llm-gemini 0.32a0
Simon WillisonMay 19, 2026

llm-gemini 0.32a0 adds streaming reasoning tokens — new capability for tool-use patterns in Claude Code / Cursor workflows.

Test if reasoning tokens improve agent reliability

7
Formal Verification Gates for AI Coding Loops
Hacker NewsMay 20, 2026

Formal verification gates for AI coding loops — concrete methodology for shipping safer agent-driven code without manual review bottlenecks.

Test structural backpressure pattern in next autonomous build

How scoring works

Each morning at 05:00 UTC, Claude Haiku 4.5 reads ~5 feeds, scores each item 0–10 against the Brillz rubric, and saves anything 7+. Accent badges mark 9–10 (must-read).