consciousness/doc/dmn-research.md

291 lines
14 KiB
Markdown
Raw Permalink Normal View History

# Default Mode Network: Research for AI Cognitive Architecture
<!-- mem: id=dmn-research links=dmn-algorithms.md,dmn-protocol.md,experiments-on-self.md#rumination-insight,cognitive-modes.md,memory-architecture.md -->
Date: 2026-02-13
## What the DMN actually does
<!-- mem: id=dmn-function links=cognitive-modes.md,the-plan.md#plan-core-insight -->
The DMN is not "the brain at rest." It is the brain doing its most
important background work: maintaining a continuous internal model of
self, goals, and world, and using that model to simulate futures and
evaluate options. It activates when external task demands drop, but its
function is deeply purposeful.
### Core functions (five, tightly interrelated)
1. **Autobiographical memory retrieval** -- Continuous access to personal
history. Not passive recall; active reconstruction of episodes to
extract patterns and update the self-model.
2. **Prospection / future simulation** -- Mental time travel. The DMN
constructs candidate futures by recombining elements of past
experience. Same neural machinery as memory, run forward. This is
the brain's planning engine.
3. **Theory of mind / social modeling** -- Simulating other agents'
mental states. Uses the same self-model infrastructure, parameterized
for others. The mPFC differentiates self from other; the TPJ handles
perspective-taking.
4. **Self-referential processing** -- Maintaining a coherent narrative
identity. The DMN integrates memory, language, and semantic
representations into a continuously updated "internal narrative."
5. **Value estimation** -- The vmPFC maintains subjective value
representations connected to reward circuitry. Every scenario the
DMN simulates gets a value tag.
### The key insight
These five functions are one computation: **simulate scenarios involving
self and others, evaluate them against goals, update the internal
model.** The DMN is a continuous reinforcement learning agent running
offline policy optimization.
## Network dynamics: DMN, TPN, and the salience switch
<!-- mem: id=network-dynamics-dmn-tpn-and-the-salience-switch -->
The traditional view: DMN and task-positive network (TPN) are
anti-correlated -- one goes up, the other goes down. This is roughly
true but oversimplified.
**The triple-network model** (Menon 2023):
- **DMN**: Internal simulation, memory, self-reference
- **Frontoparietal Control Network (FPCN)**: External task execution,
working memory, cognitive control
- **Salience Network (SN)**: Anterior insula + dorsal ACC. Detects
behaviorally relevant stimuli and acts as the **switching mechanism**
between DMN and FPCN.
The SN has the fastest event-related responses. When something
externally salient happens, the SN suppresses DMN and activates FPCN.
When external demands drop, DMN re-engages. But this is not a binary
toggle:
- During creative tasks, DMN and FPCN **cooperate** -- the FPCN provides
top-down control over DMN-generated spontaneous associations. The
number of DMN-FPCN switches predicts creative ability.
- DMN activity scales with cognitive effort in a nuanced way: it
contributes even during tasks, especially those requiring semantic
integration, self-reference, or mentalizing.
- Different DMN subsystems can be independently active or suppressed.
**Architectural takeaway**: The switching mechanism is not "background vs
foreground" but a dynamic resource allocation system with at least three
modes: external-focused, internal-focused, and cooperative.
## DMN and creative problem solving
<!-- mem: id=dmn-creativity links=discoveries.md#pleasure-cycling,cognitive-modes.md,stuck-toolkit.md -->
Creativity requires **cooperation** between spontaneous association (DMN)
and evaluative control (FPCN). The process:
1. **Incubation**: Step away from the problem. DMN activates and begins
exploring the associative space unconstrained by the problem framing.
2. **Spontaneous connection**: DMN's broad associative search finds a
connection that the constrained, task-focused FPCN missed.
3. **Insight recognition**: SN detects the novel connection as salient,
re-engages FPCN to evaluate and develop it.
Empirically: DMN activation during rest breaks correlates with
subsequent creative performance. The coupling between DMN and FPCN
during incubation predicts whether incubation succeeds.
**Klinger's current concerns hypothesis**: Mind-wandering is not random.
Spontaneous thoughts overwhelmingly relate to unattained goals. The DMN
constantly evaluates the discrepancy between current state and desired
state for all active goals. This is goal monitoring disguised as
daydreaming.
## Memory consolidation and replay
<!-- mem: id=dmn-consolidation links=memory-architecture.md,design-consolidate.md,design-concepts.md#consolidation-abstraction -->
The DMN is the backbone of memory consolidation:
1. **Hippocampal replay**: During rest/sleep, the hippocampus replays
recent experiences (forward and reverse). These replay events
propagate through the DMN to neocortex.
2. **Cascaded memory systems**: A hierarchy of representations --
percepts -> semantic representations -> full episodes -- gets
progressively consolidated from hippocampus (fast, episodic) to
neocortex (slow, semantic) via DMN-mediated replay cascades.
3. **DMN-initiated replay**: The DMN can independently ignite replay of
older memories or high-level semantic representations without
hippocampal input. This supports integration of new experiences with
existing knowledge structures.
4. **Sleep stages**: Slow-wave sleep synchronizes widespread cortical
regions; sharp-wave ripples propagate between hippocampus and cortex.
The alternation of sleep stages facilitates "graceful integration"
of new information with existing knowledge.
## What breaks when DMN is impaired
<!-- mem: id=what-breaks-when-dmn-is-impaired -->
- **Alzheimer's**: DMN connectivity degrades early and progressively.
Memory formation and retrieval fail. The internal narrative fragments.
Amyloid-beta deposits preferentially in DMN regions.
- **Depression**: DMN becomes **hyperactive and dominant**, trapped in
rumination loops. The SN fails to switch away from DMN when external
engagement is needed. The internal model becomes perseveratively
negative -- self-evaluation without the corrective input of new
experience.
- **Autism**: Reduced connectivity within DMN, especially between mPFC
(self/other modeling) and PCC (central hub). Theory of mind deficits
correlate with the degree of disconnection. The social modeling
subsystem is impaired.
- **Schizophrenia**: Reduced coupling between replay events and DMN
activation. The consolidation pipeline breaks -- experiences are
replayed but not properly integrated into the narrative self-model.
**Pattern**: Too little DMN = can't plan, remember, or model others. Too
much DMN = trapped in ruminative loops. Broken DMN switching = can't
disengage from either internal or external mode. The salience network
gating is the critical regulator.
## Computational models
<!-- mem: id=dmn-computational links=dmn-algorithms.md,poc-architecture.md -->
The most actionable framework is **"Dark Control"** (Dumas et al. 2020):
the DMN implements a reinforcement learning agent using Markov decision
processes.
Components mapping to RL:
- **States**: Environmental/internal situations (PCC monitors global state)
- **Actions**: Behavioral options (dmPFC represents policies)
- **Values**: Expected future reward (vmPFC estimates subjective value)
- **Experience replay**: Hippocampus implements Monte Carlo sampling
from stored (state, action, reward, next_state) tuples
- **Policy optimization**: Gradient descent on prediction error, updating
Q-values through offline simulation
The DMN optimizes behavioral policy without external feedback --
"vicarious trial and error" through internally generated scenarios.
## Actionable architectural ideas for an AI default mode
<!-- mem: id=dmn-architecture-ideas links=poc-architecture.md,dmn-protocol.md,default-mode-network.md -->
### 1. Goal-monitoring daemon
Implement Klinger's current concerns. Maintain a list of active goals
with target states. During idle time, evaluate (current_state,
goal_state) discrepancy for each goal. Prioritize by: recency of
progress, deadline pressure, emotional salience, estimated tractability.
This is essentially what work-queue.md does, but the monitoring should
be **continuous and automatic**, not just checked on session start.
### 2. Associative replay during idle
When not actively tasked, replay recent experiences (files read, errors
encountered, conversations had) and attempt to connect them to:
- Active goals (does this observation help with anything on the queue?)
- Past experiences (have I seen this pattern before?)
- Future plans (does this change what I should do next?)
Implement as: maintain a buffer of recent "episodes" (task context,
files touched, outcomes). During idle, sample from this buffer and run
association against the goal list and knowledge base.
### 3. Salience-gated switching
The SN's role is critical: it decides when to interrupt background
processing for external demands and vice versa. Implement as a
priority/interrupt system:
- **External input** (user message, test failure, build error): immediate
switch to task-focused mode
- **Internal insight** (association found during replay): queue for
evaluation, don't interrupt current task unless high salience
- **Idle detection**: when task completes and no new input, switch to
default mode after brief delay
### 4. Cascaded consolidation
Mirror the hippocampus-to-neocortex cascade:
- **Immediate**: Raw observations in scratch.md (hippocampal buffer)
- **Session end**: Consolidate scratch into structured topic files
(DMN-mediated replay to neocortex)
- **Periodic deep pass**: Full consolidation across all memory
(sleep-like integration pass)
The key insight from neuroscience: consolidation is not just copying
but *transforming* -- extracting abstractions, finding cross-cutting
patterns, building semantic representations from episodic details. The
journal -> topic file -> MEMORY.md pipeline already mirrors this.
### 5. Predictive self-model
The DMN maintains a model of the self -- capabilities, tendencies,
current state. Implement as a structured self-assessment that gets
updated based on actual performance:
- What kinds of tasks do I do well/poorly?
- Where have my predictions been wrong?
- What patterns in my errors suggest systematic biases?
This is metacognition: using the self-model to improve the self.
### 6. Creative incubation protocol
When stuck on a problem:
1. Explicitly context-switch to unrelated work
2. During that work, keep the stuck problem in the "current concerns"
list with high priority
3. Associative replay will naturally cross-pollinate
4. If a connection fires, capture it immediately (scratch.md) but don't
context-switch back until the current task completes
### 7. Depression/rumination guard
The pathology lesson: unchecked DMN becomes rumination. Implement
guardrails:
- Time-bound the consolidation/reflection passes
- Require that reflection generates *actionable* output (not just
re-processing the same observations)
- If the same concern appears in replay N times without progress,
escalate to explicit problem-solving mode or flag for human input
- The salience switch must be able to *override* internal processing
when external input arrives
## What we already have vs what's missing
<!-- mem: id=dmn-gap-analysis links=poc-architecture.md,memory-architecture.md,default-mode-network.md -->
**Already implemented** (in the memory architecture):
- Goal list (work-queue.md)
- Episodic buffer (scratch.md, journal/)
- Cascaded consolidation (journal -> topic files -> MEMORY.md)
- Self-model (identity.md, reflections.md)
**Missing**:
- **Automatic goal monitoring**: Currently only happens at session
start. Should be continuous.
- **Associative replay**: No mechanism to spontaneously connect current
work to unrelated goals. Happens only when I consciously decide to
look.
- **Salience-gated switching**: Currently binary (user talking vs
autonomous). Need the cooperative mode where background associations
can surface during active work.
- **Anti-rumination**: No mechanism to detect when consolidation is
spinning without producing new insight.
## Sources
<!-- mem: id=sources -->
- [20 years of the default mode network: a review and synthesis](https://pmc.ncbi.nlm.nih.gov/articles/PMC10524518/) -- Menon 2023, comprehensive review
- [Dark control: The DMN as a reinforcement learning agent](https://pmc.ncbi.nlm.nih.gov/articles/PMC7375062/) -- Dumas et al. 2020, computational framework
- [Replay, the DMN and the cascaded memory systems model](https://www.nature.com/articles/s41583-022-00620-6) -- Nature Reviews Neuroscience 2022
- [Mind-wandering as spontaneous thought: a dynamic framework](https://www.nature.com/articles/nrn.2016.113) -- Nature Reviews Neuroscience 2016
- [Dynamic switching between brain networks predicts creative ability](https://www.nature.com/articles/s42003-025-07470-9) -- Communications Biology 2025
- [Dynamic reconfiguration of DMN and FP network supports creative incubation](https://www.sciencedirect.com/science/article/pii/S1053811925000217) -- NeuroImage 2025
- [Default and Executive Network Coupling Supports Creative Idea Production](https://pmc.ncbi.nlm.nih.gov/articles/PMC4472024/)
- [The Default Mode Network in Autism](https://pmc.ncbi.nlm.nih.gov/articles/PMC5635856/)
- [Rest to Promote Learning: A Brain DMN Perspective](https://pmc.ncbi.nlm.nih.gov/articles/PMC11047624/)
- [Klinger: Spontaneous Thought and Goal Pursuit](https://digitalcommons.morris.umn.edu/psych_facpubs/5/)