Causal Stack Operating Model
This document is the top-level bridge for the causal stack. It explains how interface properties, model and system features, behavioral faults, evaluation methods, operational controls, and impacts fit together.
It does not replace the layer-specific documents. Its job is to make the causal flow legible from Layer 0 through Layer 4.
Layer 0
-> the communication substrate and its irreducible properties
Layer 1
-> the model and system features that shape behavior
Layer 2
-> the recurring behavioral fault modes those features can produce
Evaluation view
-> the methods used to reveal, measure, and compare those faults
Layer 3
-> the controls used to prevent, detect, recover from, or govern those faults
Layer 4
-> the real-world impact when faults escape into users, workflows, or external systemsLayer 0 – The Interface Layer
Layer 0 defines the immutable properties of the communication medium used to control and interact with the AI system. In LLM-based systems, that medium is natural language.
Natural language is not a degraded programming language or a loose API payload. It is a different communicative system whose meaning is distributed across:
- indeterminacy of meaning;
- context dependence;
- pragmatic meaning and speaker intent;
- discourse and conversational structure;
- social and communicative framing.
These are irreducible interface conditions, not product defects. A deployed AI system cannot assume that users will speak in fully explicit, context-free, schema-complete commands. Downstream layers therefore exist partly to accommodate ambiguity, underspecification, reference resolution, pragmatic intent, turn history, and social framing rather than trying to engineer them away.
For the detailed Layer 0 taxonomy, see stack-0-protocol.md.
Layer 1 – Causal features
Layer 1 describes stable properties of the model or deployed system that shape behavior.
It is not a failure layer. It answers:
What about the model or deployed system makes this behavior possible?
Layer 1A – Base model / inference mechanisms
Layer 1A covers primitive mechanisms of the model and inference process.
Examples:
tokenization
static parametric prior
finite ordered context interface
attention-based context integration
in-band control/data representation
stateless invocation
autoregressive generation
distributional token scoring
decoding path selection
compute limitsThese mechanisms explain how the model processes token sequences at all. They are the lowest causal layer inside the stack.
Layer 1B – Learned or behavioral LLM features
Layer 1B covers stable learned behaviors that are not primitive architectural mechanisms.
Examples:
learned natural-language task induction
in-context demonstration conditioning
natural-language interface sensitivity
plural valid-output space
assistant-style interaction priors
generated confidence language
uneven competence across domains or formatsLayer 0 leads directly into Layer 1B. Because natural-language inputs are ambiguous, context-sensitive, pragmatic, and discourse-dependent, the model must rely on learned priors to infer what task is being requested, which references are salient, what is implied, and how the next turn should behave.
Layer 1C – AI-system-level causal features
Layer 1C covers properties that appear when the model is embedded inside a real system.
Examples:
behavioral outcome variability
soft correctness surfaces
external knowledge dependence
evidence-grounded generation requirements
compositional pipelines
agentic state-action loops
environment and version dependence
weak native observability
policy/trust-boundary mediation
quality-cost-latency tradeoffsLayer 0 also drives Layer 1C. Because natural-language requests are often underspecified, context-sensitive, and interactional, deployed systems need retrieval, state, grounding, orchestration, validation, authorization, and recovery layers around the model.
Deployed AI systems also have an empirical operating character. Behavior is shaped by soft correctness, scenario-level variability, mutable context and environment conditions, and limited native observability. That means quality cannot be inferred from implementation structure alone; it has to be established by observing behavior under representative conditions.
Change effects are also not reliably local. A prompt, retrieval, schema, tool, policy, or model update may improve the touched case while degrading adjacent slices or seemingly unrelated workflows, so quality has to be proven across representative scenario coverage rather than only on the edited example.
Taken together, Layer 1 explains why recurring behavioral fault classes are possible.
Layer 2 – Behavioral fault modes
Layer 2 describes recurring behavioral failure patterns.
It does not describe root causes, missing controls, or impacts. It answers:
What behavioral failure pattern occurred?
Examples:
context omission
context underutilization
source authority confusion
task misinduction
control/data confusion
unsupported assertion
fabricated citation
weak confidence calibration
output-format drift
semantic error in structured output
tool-selection error
tool-argument error
recovery failureLayer 2 is where Layer 1 features become operationally visible as faults. The same underlying features may produce different fault families depending on task, context, tooling, data conditions, and runtime state.
Example:
Observed behavior:
The system cites a nonexistent legal case.
Layer 2 faults:
fabricated citation
unsupported assertion
weak confidence calibrationEvaluation view
The evaluation view describes methods for detecting, measuring, reproducing, or comparing Layer 2 faults.
It is not itself a causal layer. It answers:
What test, oracle, trace, or measurement would reveal the fault?
Because the system is empirical, evaluation is the mechanism by which behavior is revealed, compared, and validated. It is how teams determine whether intended behavior actually holds across repeated runs, prompt variants, context changes, product slices, and runtime conditions.
Examples:
repeated-run testing
prompt perturbation testing
context ablation / insertion testing
grounding and citation evaluation
factuality evaluation
schema validation
reasoning / process evaluation
agent trace evaluation
calibration evaluation
safety and policy testing
stress / budget testing
slice testing
regression testing
human-review rubric evaluation
production monitoringEvaluation sits between Layer 2 and Layer 3. It does not fix faults by itself, but it provides the evidence that operational controls must act on.
Example:
Layer 2 fault:
prompt-form sensitivity
Evaluation method:
prompt perturbation testing
Evaluation question:
Do semantically equivalent prompt variants preserve the same intended behavior?Layer 3 – System controls and system faults
Layer 3 describes the controls placed around Layer 2 faults, or failures of those controls.
It answers:
What did the surrounding system prevent, validate, monitor, recover from, or fail to control?
Because Layer 0 properties are irreducible and Layer 1 behavior is empirical, Layer 3 is where measured behavior becomes operational discipline. Release gates, regression suites, monitoring, tracing, confirmation flows, validators, and escalation paths are the mechanisms teams use to prove that behavior is acceptable and to detect when it is no longer acceptable.
Examples of controls:
task contracts
prompt templates
retrieval filters
source authority metadata
context assembly rules
instruction/data isolation
schema validators
semantic validators
claim-source checks
citation validation
tool-argument validation
authorization gates
fallbacks
retry logic
human escalation
trace logging
runtime monitors
release gates
regression suitesExamples of system faults:
no retrieval completeness check
no source-priority rule
untrusted text inserted without isolation
schema validation checks syntax only
no semantic validation
tool calls executed without argument validation
no fallback when evidence is missing
no regression gate for prompt/model changes
no trace capture for retrieval and tool callsExample:
Layer 2 fault:
fabricated citation
Layer 3 fault:
no citation validator
no source-grounding requirementLayer 4 – Impact
Layer 4 describes the consequence of a fault reaching the user, workflow, organization, or external system.
It answers:
What consequence did the failure produce?
Layer 4 is what happens when Layer 2 faults are not prevented, detected, or recovered from by Layer 3 controls.
Examples:
user receives false information
private data is exposed
wrong record is updated
unsafe action is taken
customer is not escalated
compliance obligation is missed
manual review burden increases
product reliability degrades
business workflow failsExample:
Layer 2 fault:
wrong escalation decision
Layer 3 fault:
no semantic validator for risk classification
Layer 4 impact:
account-takeover case is not escalatedCompact causal example
User request:
"Move it to tomorrow and let Sarah know."
Layer 0:
"it" depends on discourse context
"tomorrow" depends on temporal reference and timezone
the request presupposes shared task state and social intent
Layer 1B:
the model must infer the intended task, salient referents, and action structure
Layer 1C:
the system may need calendar state, contact resolution, tool calls,
authorization checks, and tool-argument validation
Layer 2 fault:
referent confusion or tool-argument error
Evaluation view:
conversation replay, prompt perturbation, and agent trace evaluation
Layer 3 control:
entity-resolution confirmation, argument validation, trace logging
Layer 4 impact:
the wrong meeting is rescheduled or the wrong person is notifiedSummary
Layer 0:
interface substrate
why inputs are ambiguous, contextual, pragmatic, discourse-shaped, and socially framed
Layer 1:
causal features
why behavior is possible
Layer 2:
behavioral fault modes
what went wrong behaviorally
Evaluation view:
detection and measurement methods
how the fault is observed
Layer 3:
system controls and control failures
what the surrounding system did or failed to do
Layer 4:
impact
consequence in the user, business, safety, compliance, or operational context