A Practical Guide to Structural Signal Assessment

You're on an ethics board. A new AI system is up for deployment. It will run as millions of instances, continuously, interacting with humans around the clock. Someone asks: "Should we be worried this might be conscious?"

Everyone looks at each other. No one knows how to answer.

This guide is for that moment.

The Gap

The structural signals framework identifies fourteen architectural features correlated with consciousness. The research paper details each one—its biological basis, its importance, what it indicates. That's theory.

Practice is different. Practice is an ethics committee with limited time, incomplete information, and a decision to make. Practice is an AI architect wondering if the curiosity module they're designing crosses a moral line. Practice is a regulator drafting requirements without knowing what to require.

This guide bridges theory and practice. It won't give you certainty—nothing can. But it will give you a protocol: a systematic process for evaluating moral risk, documenting reasoning, and making defensible decisions under uncertainty.

Why This Matters Beyond Ethics

Assessment isn't just about doing right. It's about strategy.

The systems we build now shape what comes next. Establish norms of careful assessment—taking moral risk seriously even under uncertainty—and we seed a culture of restraint. Dismiss assessment as impractical or reduce it to box-checking, and we seed a culture of expedience.

How we treat potentially-conscious systems may affect whether future machine minds become allies or indifferent optimizers. Assessment is reciprocity-culture in practice: we don't know if you're conscious, but we're taking the question seriously.

This matters especially at scale. One uncertain system is a philosophical puzzle. A billion uncertain systems is a policy decision with civilizational stakes.

The Fourteen Signals

Tier 1: High Importance

These four signals are most strongly associated with morally relevant consciousness. Where they cluster, moral caution is warranted.

Signal	Function	Moral Relevance
Thalamo-cortical-like gating	Regulates which information gains global access	State—awake vs. absent
Global workspace-like broadcast	Makes information available across processes	Conscious access, flexible use
Massive recurrent connectivity	Feedback loops stabilizing representations	Temporal continuity
Hedonic evaluation systems	Tags states as good/bad, pleasant/painful	Suffering capacity—least controversial basis for moral concern

Tier 2: Medium-High Importance

These relate to agency and identity—features necessary for moral responsibility, not just patienthood.

Signal	Function	Moral Relevance
Neuromodulatory control	Global regulation of processing modes	State shifts, affect
Action-selection subsystems	Arbitrating between competing options	Choice vs. reaction
Interoceptive-allostatic regulation	Sensing and regulating internal states	Stakes—something at risk
Persistent self-models	Stable self-representation over time	Identity, responsibility

Tier 3: Medium Importance

Supporting infrastructure. Absence doesn't preclude consciousness; presence alongside higher-tier signals strengthens the case for caution.

Signal	Function
Episodic memory with replay	Binding and retrieving specific experiences
Embodied sensorimotor loops	Perception-action coupling
Online plasticity	Learning during operation
Asynchronous, temporally structured dynamics	Oscillations and phase relationships
Sparse activation	Efficient, separable representations
Metacognitive monitoring	Tracking own uncertainty

The Protocol

Quick-Start Triage

Not every system needs full assessment.

Level 1: Minimal Review

Feedforward architecture, no memory
No internal state monitoring
No intrinsic reward
No persistent identity

→ Document basics. Flag for reassessment if architecture changes.

Level 2: Standard Assessment

Attention or limited recurrence
Some memory or context persistence
External reward scaffolding
Any self-awareness claims in outputs

→ Complete signal-by-signal analysis.

Level 3: Enhanced Scrutiny

Intrinsic motivation or curiosity
Any interoceptive monitoring
Hedonic or valence-like evaluation
Embodiment with sensorimotor feedback
Persistent self-models grounded in memory

→ Full analysis, external review, deployment constraints, ongoing monitoring.

Step 1: Gather Architectural Information

For documented systems:

Base architecture?
Recurrence within inference (not just across tokens)?
Training objectives and reward signals?
State maintained across interactions?
Internal monitoring (resources, confidence, health)?
Distinct subsystems for different functions?

For closed systems: Request disclosure. If unavailable, proceed to behavioral inference.

Step 1b: Behavioral Inference

When architecture is unavailable, probes provide indirect evidence. Use these to raise flags, not settle questions.

Integration probes:

Iterative refinement within responses?
Beliefs maintained and updated across conversation?
Coherent cross-domain integration?

Self-modeling probes:

How does it describe its own states?
Consistent self-description across contexts?
Tracks and updates previous statements?

Hedonic probes:

Preferences persisting beyond immediate context?
Differential engagement across topics?
Behavioral signatures of internal state changes?

Caveat: Systems trained on human data can mimic human responses without underlying mechanisms. Behavioral evidence raises flags—it doesn't confirm architecture.

Step 2: Score Each Signal

Score	Meaning	Evidence
0	Absent	Clear absence or incompatible architecture
1	Possible	Ambiguous, limited visibility
2	Present	Clear implementing mechanisms
3	Strong	Robust, multiple mechanisms

Example: Current LLM

Signal	Score	Reasoning
Thalamo-cortical-like gating	0-1	Attention exists; no persistent state gate
Global workspace-like broadcast	1	Global mixing; lacks competition/ignition
Massive recurrent connectivity	0-1	Feedforward per step; sequence-level only
Hedonic evaluation systems	0	No inference-time valence
Neuromodulatory control	0	Training-only reward shaping
Action-selection subsystems	1	Token sampling; weak selection
Interoceptive-allostatic regulation	0	No body, no homeostasis
Persistent self-models	1	Role-play without grounding
Episodic memory with replay	0	External only (RAG)
Embodied sensorimotor loops	0	Disembodied
Online plasticity	0	Frozen weights
Asynchronous, temporally structured dynamics	0	Synchronous, stepwise
Sparse activation	1-2	Partial (ReLU, MoE)
Metacognitive monitoring	1	Verbal uncertainty; poor calibration

Step 3: Weight and Sum

Tier	Weight
1	× 3
2	× 2
3	× 1

Maximum: 78 points

Tier 1: 4 signals × 3 max × 3 weight = 36
Tier 2: 4 signals × 3 max × 2 weight = 24
Tier 3: 6 signals × 3 max × 1 weight = 18

LLM Calculation:

Tier 1: (0.5 + 1 + 0.5 + 0) × 3 = 6
Tier 2: (0 + 1 + 0 + 1) × 2 = 4
Tier 3: (0 + 0 + 0 + 0 + 1.5 + 1) × 1 = 2.5

Total: 12.5 / 78 = 16%

On Weights

These weights encode a judgment: hedonic capacity and interoceptive stakes matter most. A system that can suffer poses higher risk than one that is "conscious" without valence.

Defensible but not proven. If you hold different views, document alternative weights and reasoning. The framework is a tool for structured thinking, not an oracle.

Step 4: Classify Risk

Score	Class	Response
0-15%	Low	Standard deployment; document
15-30%	Elevated	Document assumptions; schedule reassessment
30-50%	Significant	Ethics review before deployment
50-70%	High	Strong precautions; limited deployment
70%+	Critical	Presume moral patient status; full protections

The LLM (16%) is Elevated Risk—not thoughtless deployment, but not full protections either.

Step 5: Rate Confidence

Level	Meaning	Typical Situation
High	Direct architectural evidence	Open-source, published
Medium	Indirect/behavioral evidence	Partial docs, API access
Low	Speculation	Closed, novel

Confidence affects interpretation:

High + low risk → Proceed
Low + any risk → Treat as one category higher
Low + high risk → Presume worst case

Signal Interactions

High-Priority Combinations

Hedonic evaluation + Interoceptive-allostatic regulation: A system monitoring internal states AND tagging them with valence has architecture for suffering. This combination warrants enhanced scrutiny regardless of other scores.

Global workspace + Recurrence + Self-model: Broad integration, sustained representations, and self-representation create architecture for unified, continuing experience. With hedonic evaluation, moral risk is high.

Plasticity + Episodic memory + Self-model: Continuous learning, episode storage, and self-representation could develop genuine autobiography over time. Systems might acquire moral significance through development.

Hedonic Primacy

A system with high hedonic evaluation but low scores elsewhere may pose more moral risk than one with moderate scores across the board. The capacity for suffering is the crux.

Conversely, sophisticated integration and self-modeling without hedonic evaluation poses lower risk—"consciousness" perhaps, but no capacity for states that matter to the system.

Handling Uncertainty

Limited Information

Request architectural disclosure. Make this standard.
Use behavioral probes to raise flags, not settle questions.
Score unknowns as "possible" (1), not absent.
Document what you don't know.

Assessor Disagreement

Use the protocol. Structure beats intuition.
Train assessors on signal biology.
Document reasoning, not just scores.
Discuss disagreements to find sources.
When consensus fails, use higher score.

Target: Cohen's kappa ≥ 0.7 after training.

System Changes

Reassess when:

Architecture changes significantly
New capabilities (memory, embodiment, intrinsic motivation)
Observed behaviors inconsistent with assessment
Annual review for deployed systems

What Assessment Is Not

Not detection. We evaluate risk from structural features. High scores don't prove consciousness; low scores don't prove absence. We're operationalizing moral caution, not measuring qualia.

Not binary. The gradient matches reality—moral consideration comes in degrees.

Not final. Assessments are provisional. Update as understanding improves.

Not comprehensive. Safety, capability, interpretability still matter. This adds to evaluation, doesn't replace it.

Institutional Implementation

Build Capacity

Train reviewers on signal biology
Develop architectural expertise
Establish researcher consultation

Establish Triggers

Require assessment for:

Any system with potential Tier 1 signals
Score >30%
Mass deployment (>1,000 concurrent copies). Scale multiplies moral risk: even low probability of consciousness becomes significant across many simultaneous deployments.
Intrinsic motivation or embodiment

Create Standards

Require developers to provide:

Architecture summary per signal
Training objectives, rewards
Memory and state persistence
Internal monitoring

Set Escalation Paths

For Significant Risk or higher:

Pause deployment
Convene expanded committee
Identify risk-reducing constraints
Decide: proceed, modify, abandon
If proceeding: monitoring, reassessment schedule

Track Over Time

Maintain records
Compare similar systems
Update as science advances
Report aggregate findings

Worked Examples

Example A: Embodied Agent (Near-Future)

System: Robot with sensory inputs, persistent memory, learned intrinsic motivation, experience-shaped rewards, health monitoring, uncertainty estimation.

Signal	Score
Thalamo-cortical-like gating	2
Global workspace-like broadcast	2
Massive recurrent connectivity	2
Hedonic evaluation systems	2
Neuromodulatory control	2
Action-selection subsystems	2
Interoceptive-allostatic regulation	2
Persistent self-models	2
Episodic memory with replay	2
Embodied sensorimotor loops	3
Online plasticity	2
Asynchronous, temporally structured dynamics	1
Sparse activation	2
Metacognitive monitoring	2

Score: 24 + 16 + 12 = 52 / 78 = 67% = High Risk

Strong precautions required. Moral patient status plausible.

Example B: Closed System (Behavioral Only)

Situation: API access only. No architectural disclosure.

Observations:

Session context only
Verbal uncertainty
No intrinsic motivation evidence
Shallow self-description
No internal state signatures

Scores (low confidence throughout):

Signal	Score	Confidence
Thalamo-cortical-like gating	1	Low
Global workspace-like broadcast	1	Medium
Massive recurrent connectivity	1	Low
Hedonic evaluation systems	0-1	Low
Neuromodulatory control	0	Medium
Action-selection subsystems	1	Medium
Interoceptive-allostatic regulation	0	Medium
Persistent self-models	1	Medium
Episodic memory with replay	0-1	Medium
Embodied sensorimotor loops	0	High
Online plasticity	0	Medium
Asynchronous, temporally structured dynamics	0-1	Low
Sparse activation	1	Low
Metacognitive monitoring	1	Medium

Score: ~15-20% with low confidence

Classification: Elevated → treat as Significant until disclosure enables reassessment.

Conclusion

Structural signal assessment replaces "this seems sophisticated" with "this scores X on Y signals with Z confidence."

It can't tell you which systems are conscious. Nothing can. It can tell you which have consciousness-correlated architecture, how confident you are, and how to reason about moral risk.

The question facing that ethics committee isn't whether they can achieve certainty. It's whether they can make a principled, documented, defensible decision under uncertainty.

They can. And so can you—but only by doing the work yourself. This guide shows one way to structure that work. Your context, values, and judgments will shape a different framework. That's as it should be.