Facts are localized (ROME). Behaviors are hierarchically distributed:
core circuit (small set of mid-late layer attention heads) + supporting
circuits (distributed context encoding). Apollo's flat minima are right
for distributed change. Rank-256 captures the full hierarchy. Includes
measurement plan for validating which heads change during training.