Google

Paweł Huryn, Helen Toner, and Ben Lang posted new notes

Friday, 30 January 2026

 
Substack

Paweł Huryn, Helen Toner, and Ben Lang posted new notes

AI agents expose a problem we've ignored for decades: Humans navigate unclear intent through culture, observation, and guesswork. Agents can't guess. They force us to finally codify what we never could. The counterintuitive fix is not more detailed instructions, but giving agents the reasoning framework to handle the unknown. Core elements (guide both humans and agents): Objective: The problem to solve + WHY it matters Desired Outcomes: Measurable states indicating success Health Metrics: What must not degrade Strategic Context: The system we operate in Agent-specific elements (add as autonomy increases): Constraints: Steering (prompts) vs. Hard (orchestration) Autonomy: Which decisions require escalation Stop Rules: When to halt, escalate, or complete — How to apply each: 1. Objectives Problem-focused: What's broken or missing Explains the why: Business value, user impact Guides trade-offs when facing ambiguity 2. Outcomes Observable state changes, not activities From user/stakeholder perspective Measurable, verifiable…
Read More
1532
As AI systems get more capable, we desperately need more & better “science of AI” - how do these things…
Read More
One comparative psychologist lamented the bias against ‘killjoy explanations’—ones that pop the bubble of exciting-sounding phenomena. I think the same sentiment shows up in…” — Melanie Mitchell
4754
I put together a new list of…
Read More
1711
472
 
Share on :

No comments:

Post a Comment

 

Themes Design by Capricon Vision | Published by Templates | Powered by Blogger.com
Copyright © 2011 Cobaz Post - Some Rights Reserved