cybernetics-design-philosophy - SKILL.md Agent Skill

name: cybernetics-design-philosophy description: > Cybernetics design philosophy guide based on Norbert Wiener's "The Human Use of Human Beings." Use this skill during: agent system design, feedback loop architecture, multi-agent orchestration, error handling strategy, self-correcting system design, resilience planning. Trigger when the user mentions "feedback loop", "self-correction", "negative feedback", "positive feedback", "oscillation", "homeostasis", "stability", "requisite variety", "entropy", "information theory", "agent orchestration", "cybernetics", or similar topics.

The Human Use of Human Beings — Distilled Guide

Source: Norbert Wiener, The Human Use of Human Beings: Cybernetics and Society (1950/1954) Central thesis: Society can only be understood through the messages and communication facilities that belong to it. Feedback is the foundation of all purposive, self-correcting behavior.

I. Core Thesis: Communication, Control, and Feedback

Cybernetics is the science of control and communication in the animal and the machine. Its central insight: the mechanisms that allow a living organism to maintain itself against entropy are identical in structure to those that allow a machine to correct its own behavior.

Three interlocking claims:

Information is the content of what is exchanged with the outer world as we adjust to it and make our adjustment felt upon it.
Feedback — returning actual performance data to the control mechanism — is what distinguishes purposive behavior from mere mechanical execution.
Any system that cannot receive and act on feedback is an open-loop system; open-loop systems are inherently fragile in uncertain environments.

"It is the thesis of this book that society can only be understood through a study of the messages and the communication facilities which belong to it."

II. Entropy and Information: Why Systems Decay

The second law of thermodynamics describes a universe trending toward maximum disorder (heat death). Information is entropy's opponent: it is the local, temporary accumulation of order against the universal current of disorganization.

Key principle: Every act of verification is a local reduction of entropy. Skipping verification allows entropy to accumulate undetected.

Practical implication: Information degrades through every relay. A message passed through four translation steps retains a fraction of its original fidelity. This is not a failure of implementation — it is a thermodynamic law.

Red flag: Any system that operates without verification will degrade. Not might degrade — will degrade. The question is only when and how fast.

"The machine is a device which locally and temporarily seems to resist the general tendency for the increase of entropy."

III. Negative Feedback: The Foundation of Self-Correction

Negative feedback is the mechanism by which a system detects deviation from a goal and applies a corrective force proportional to that deviation. "Negative" is not a value judgment — it means the feedback opposes the direction of error.

Structure:

Reference value → [Comparator] → Error signal → [Actuator] → Output
                        ↑__________________________________|
                                Negative feedback

Principle: The feedback signal must come from an independent sensor, never from the actuator's self-report. Letting the executor evaluate its own output is equivalent to removing the comparator from the loop — the feedback circuit is physically short-circuited.

Agent mapping: In an AI agent system:

Reference value = system prompt / goal definition
Sensor = independent verification tool (compiler, grep, diff)
Comparator = evaluation logic
Actuator = code writer, file modifier, API caller
Negative feedback = verify → correct → re-verify loop

An agent without a verification step is an open-loop system. It may produce correct output by chance, but it has no mechanism to detect or recover from deviation.

Red flag: Accepting an executor's self-report as verification. The subagent saying "task completed successfully" is not evidence of completion — it is the actuator speaking, not the sensor.

IV. Positive Feedback and Oscillation: When Correction Goes Wrong

Positive feedback amplifies deviation rather than opposing it. A small error becomes a larger error, which triggers a larger correction, which overshoots, which triggers a correction in the opposite direction — the system oscillates.

Wiener describes intention tremor: a patient with cerebellar damage attempting to pick up a cup finds their hand oscillating violently around the cup. Each corrective movement overshoots, triggering the next overcorrection. The feedback loop is intact, but its gain is too high.

Principle: Every correction loop requires a convergence threshold and a maximum iteration count. An unbounded feedback loop will oscillate regardless of how carefully the individual correction steps are designed.

Agent mapping: The LLM hallucination spiral is a positive feedback structure:

Initial false assumption → generates content validating assumption →
content reinforces assumption → more confident false generation…

No external negative feedback (independent fact-checking) exists to oppose the amplification. The system drifts further from truth with each step.

Red flag: A reflection loop or retry mechanism with no termination condition. If quality scores fluctuate with each iteration rather than converging monotonically, the system is exhibiting intention tremor — investigate immediately rather than continuing to iterate.

Intention tremor is a signal, not noise. When an agent's output quality oscillates, the cause is almost always feedback gain set too high, not insufficient effort.

V. Homeostasis: How Open Systems Resist Decay

Homeostasis is the maintained stability of an open system — not a static equilibrium, but a dynamic, continuously active resistance to perturbation. The body's maintenance of temperature, blood pH, and blood sugar are all homeostatic: continuous active processes, not passive states.

Key insight: Stability is not the absence of disturbance — it is the continuous expenditure of energy to counter disturbance. A system that "was working yesterday" is not guaranteed to work today without the active maintenance of its feedback mechanisms.

Principle: Maintenance is not optional. A system that was stable at time T will degrade to instability at time T+N if its feedback loops are not actively running. One-time checks do not substitute for continuous monitoring.

Agent mapping:

Continuous monitoring (health checks, log analysis, regression tests running on every change) is homeostatic behavior.
A one-time audit before deployment is not homeostasis — it is a single measurement with no feedback loop.
A robust agent system is one with continuous verification, not one that was verified once at launch.

"We are not stuff that abides, but patterns that perpetuate themselves."

VI. Purposive Behavior: Goal-Driven Systems

In 1943, Rosenblueth, Wiener, and Bigelow proposed a decisive thesis: purposive behavior can be completely described in terms of physical feedback mechanisms, without invoking intent, soul, or consciousness.

A guided missile tracking an aircraft and a human reaching for a cup are structurally identical: both continuously measure the deviation between current state and goal state, and both continuously generate corrective action proportional to that deviation. Purpose is not a metaphysical property — it is the external expression of a feedback loop's reference value.

Principle: A purposive agent measures deviation from goal rather than executing a fixed script. An agent following a script is a music box; an agent measuring deviation is a missile.

Agent mapping:

Mode	Behavior	Failure Mode
Script-driven	Executes predefined steps in order	Fails silently when any step's preconditions differ from expected
Goal-driven	Continuously measures distance from goal; adapts path	Can oscillate if convergence criterion is undefined

ReAct (Reason + Act) architecture is a direct implementation of the Rosenblueth-Wiener-Bigelow framework: observe current state → compare to goal → reason about corrective action → act → observe again.

VII. Requisite Variety: Controllers Must Match Environment Complexity

Ashby's Law of Requisite Variety (extended from Wiener's framework): a controller can only neutralize disturbances it has the capacity to respond to. A controller with N response modes can handle at most N classes of disturbance.

Formal implication: If the environment can produce M distinct failure modes and the agent has tools covering only K < M failure modes, then (M - K) failure modes will be systematically unhandled — regardless of how sophisticated the agent's reasoning is. This is not an implementation gap; it is a structural impossibility.

Principle: Tool diversity must match problem diversity. Giving an agent a single verification tool (e.g., only a compiler) creates guaranteed blind spots for all failure modes not detectable by that tool.

Agent mapping:

Single tool for all task types → violates requisite variety
Multi-role agent team (implementer + reviewer + security-checker) → higher variety, covers more failure modes
Verification suite covering multiple dimensions (compilation + semantic grep + visual inspection) → requisite variety applied to the feedback loop itself

Red flag: Assigning one agent to handle all task types regardless of diversity. The agent's tool set defines the ceiling of problems it can solve, not its intelligence.

VIII. First-Order vs. Higher-Order Feedback

Wiener distinguishes two levels of feedback:

First-order feedback corrects individual output deviations. A thermostat is first-order: it does not learn — it applies the same corrective response every time.

Higher-order feedback modifies the strategy itself. Learning is higher-order feedback: the result of an interaction does not just correct a single output, it changes the behavioral policy that generated the output. Wiener describes this as "rewriting the tape" rather than "replaying the tape."

Principle: The same failure mode appearing twice is a signal to change strategy, not to retry. Repeating the same approach after it has already failed once is first-order behavior applied to a higher-order problem.

Agent mapping:

Level	Trigger	Agent Action
First-order	Single task failure	Check output, fix error, retry
Higher-order	Same failure mode ≥ 2 times	Reflect on strategy, change method, update plan

Most current agent frameworks implement only first-order feedback (retry with fix). This is the cybernetic equivalent of an insect: capable of responding to known failure patterns, incapable of adapting to novel ones.

Red flag: A system that retries the same approach after three consecutive failures of the same type. This is not persistence — it is the absence of higher-order feedback.

IX. Communication Bottlenecks: Information Degradation

"That information may be dissipated but not gained is the cybernetic form of the second law of thermodynamics."

Every relay in a communication chain is a translation. Every translation involves selection, compression, and reencoding — operations that can only lose information, never gain it. This is not a quality-of-implementation issue; it is structural.

Practical consequence: In a chain of four agents where each passes its output as the next's input, the final agent may be operating on significantly degraded information — even if each individual handoff appears clean.

Principle: Verification must be performed at the end of the information chain, not trusted to intermediate relays. Independent validation at the endpoint is the only way to detect cumulative degradation.

Design rules:

Parallel fan-out (all subagents receive complete instructions from the orchestrator) is always preferable to serial chain (each subagent receives only the previous one's output).
Explicit intent transmission: pass not just the task description but also why the task is needed and what success looks like — intent is the first casualty of relay degradation.
Result aggregation at the orchestrator level: subagent results must return to the orchestrator for synthesis, not be forwarded directly to the next subagent.

X. Human-Machine Collaboration

Wiener's clearest practical principle: humans and machines have complementary strengths that should not be confused.

Machines excel at: repetitive precision execution, high-speed pattern matching across large datasets, maintaining consistent behavior under fatigue, numerical computation.

Humans excel at: value judgment, goal selection, handling genuinely novel situations, ethical reasoning, recognizing when a system's outputs are nonsensical.

Principle: Machines should handle repetitive and precise tasks; humans should handle judgment and decision. The danger is not that machines will replace human judgment — it is that humans will defer to machine outputs in domains where machine judgment is not trustworthy.

Agent mapping (Human-in-the-loop design):

Decision Type	Handler
Repetitive execution (format conversion, build, test)	Agent autonomous
Novel situation not covered by existing patterns	Human checkpoint
High-stakes irreversible action	Human approval required
Goal definition and success criteria	Human defines, agent executes
Ethical or value-laden tradeoffs	Human decides

Red flag: Removing human checkpoints from high-stakes decision points to improve throughput. The short-term efficiency gain creates long-term fragility at exactly the points where human judgment is most valuable.

XI. Agent System Mapping: Cybernetics to Modern AI

Complete correspondence table:

Cybernetics Concept	Agent Component	Design Implication
Reference value (setpoint)	System prompt / goal definition	Must be measurable; vague goals cannot be compared
Sensor	Tool call result, compiler output, grep result	Must be independent of executor
Comparator	Evaluation logic, verification step	Cannot be performed by executor
Actuator / Effector	Code writer, file editor, API caller	The only component allowed to act on the world
Negative feedback loop	Verify → correct → re-verify cycle	Every agent must have one
Positive feedback	Hallucination spiral, infinite retry	Must be blocked by convergence criterion
Homeostasis	System robustness, self-healing	Requires continuous monitoring, not one-time audit
Intention tremor	Reflection loop without termination	Set max iterations + convergence threshold
Purposive behavior	Goal-driven ReAct architecture	Measure deviation from goal, not script execution
Higher-order feedback	Strategy switch on repeated failure	Trigger at ≥ 2 consecutive same-type failures
Information degradation	Context loss in multi-hop chains	Use fan-out, not serial relay
Requisite variety	Tool diversity matching problem diversity	Tool set defines the ceiling of solvable problems
Orchestrator (control layer)	Main session / coordinator agent	Receives feedback, compares to goal, issues corrections — does not execute
Ant mode (rigid)	Fixed-pipeline agent	Appropriate for high-certainty, repetitive tasks only
Human mode (adaptive)	Open-goal agent with tool selection	Appropriate for exploratory, uncertain tasks

XII. Red Flags Quick Reference

Signal	Meaning
No Feedback Agent	System runs open-loop; output quality unverified
Self-Report Trust	Accepting executor's own claim without independent verification
Infinite Correction Loop	No convergence criterion; agent oscillates indefinitely
Single-Tool Agent	One tool for all tasks; violates requisite variety
Feedback Without Memory	Same error repeated; no higher-order learning
Homogeneous Multi-Agent	All agents have identical roles; no diversity benefit
Unvalidated Delegation	Delegating without verifying delegate's output
Overcorrection	Each fix introduces new problems; intention tremor pattern
Static Goal in Dynamic Environment	Fixed objectives when context changes; no adaptation
Pass-Through Orchestrator	Orchestrator just forwards instructions; adds no coordination value
Information Loss in Chain	Multi-hop delegation with no end-to-end verification
Centralized Single Point	One controller bottleneck; no resilience under failure
Premature Optimization	Optimizing before feedback loop is stable
Ignoring Oscillation Signals	Score fluctuations dismissed instead of investigated

XIII. Design Principles Summary

Feedback is non-negotiable — every system needs a correction loop.
Independent verification over self-reporting.
Negative feedback maintains stability; positive feedback amplifies — know which you are using.
Set convergence thresholds — unbounded correction loops will oscillate.
Same failure twice means change strategy, not retry.
Controller variety must match environment variety (Ashby's Law).
Every delegation requires verification at the end of the chain.
Design for graceful degradation, not perfection.
Information decays through every relay — verify at endpoints.
Purposive behavior requires measuring deviation from goal, not following scripts.
Stable systems require continuous energy and information input — maintenance is not optional.
Human judgment for decisions, machine precision for execution.
Higher-order feedback (changing methods) is more valuable than first-order (fixing outputs).
Oscillation is a signal, not noise — investigate it.
The measure of intelligence is adaptation to change, not speed of execution.
Open-loop systems fail silently; closed-loop systems fail loudly and recover.
Entropy is the default trajectory — order requires active work.
The danger is not the machine; it is the concentration of control in the hands of the few who operate it.

XIV. Usage Guide

Reference this guide in the following scenarios:

Agent architecture review: Run through the Red Flags table (Section XII) as a design checklist. If any signal is present, address it before launch.
Feedback loop design: Apply Sections III–IV–VIII. Define reference value, sensor, comparator, and actuator explicitly. Set convergence threshold before the first iteration.
Multi-agent orchestration: Apply Sections VII–IX–XI. Prefer fan-out over serial chains. Ensure orchestrator aggregates results rather than forwarding them.
Error handling strategy: Apply Sections III–VIII. Implement first-order feedback for known failure modes; add higher-order feedback triggers for repeated failures.
Resilience planning: Apply Sections V–XII. Audit for single points of failure; convert one-time checks to continuous monitoring loops.
Human-in-the-loop placement: Apply Section X. Map each decision type to the appropriate handler; never remove human checkpoints from irreversible high-stakes actions.