5-Subsystem Contraction Test
Hypothesis
Each of the five canonical subsystem types (Gradient Descent, Elastic Weight Consolidation, Natural Gradient, Meta-Learning Rate, Bayesian Update) is a contraction mapping in the Fisher information metric. Specifically, each subsystem \( T_i \) satisfies:
\[ d_F(T_i(\theta), T_i(\theta')) \le \beta_i \, d_F(\theta, \theta') \quad \text{with } \beta_i < 1 \]This is condition (i) of Theorem 1: the contraction requirement for composed adaptive systems.
Method
Setup: 5 subsystem types, each tested for contraction in the Fisher metric over multiple parameter perturbations.
Parameters:
- Parameter dimension: 4
- Perturbation pairs: 100 per subsystem
- Metric: Fisher information distance \( d_F \)
- Criterion: \( \beta = \sup \frac{d_F(T(\theta), T(\theta'))}{d_F(\theta, \theta')} < 1 \)
Procedure: For each subsystem, sample random parameter pairs, apply the update map, and measure the ratio of post-update Fisher distance to pre-update Fisher distance. Record the supremum ratio as \( \beta \).
Results
| Subsystem | \(\beta\) (contraction rate) | Violations | Status |
|---|---|---|---|
| Gradient Descent | \(4.98 \times 10^{-19}\) | 0 | Pass |
| Elastic Weight Consolidation | \(5.85 \times 10^{-29}\) | 0 | Pass |
| Natural Gradient | \(0.016\) | 0 | Pass |
| Meta-Learning Rate | \(0.305\) | 0 | Pass |
| Bayesian Update | \(0.091\) | 0 | Pass |
All five subsystems satisfy \( \beta < 1 \) with zero violations across all perturbation pairs.
Analysis
The contraction rates span roughly 28 orders of magnitude, from GD's extremely tight \( \beta \approx 10^{-19} \) to Meta-LR's moderate \( \beta = 0.305 \). This reflects the inherent aggressiveness of each update rule:
- GD and EWC are strongly contractive because small learning rates and quadratic penalties yield very conservative updates.
- Natural Gradient (\(\beta = 0.016\)) contracts well because Fisher preconditioning normalises step sizes.
- Meta-LR (\(\beta = 0.305\)) is the weakest contractor, consistent with its adaptive step-size mechanism that permits larger parameter changes.
- Bayesian (\(\beta = 0.091\)) contracts through the regularising effect of the prior.
The composed system's contraction rate is bounded by \( \beta_{\text{comp}} \le \prod_i \beta_i \), which is astronomically small. This guarantees exponential convergence of the full system.
Conclusion
Pass — All five subsystem types are verified contractions in the Fisher metric. Theorem 1 condition (i) is satisfied. The composed system inherits contractivity with rate \( \beta_{\text{comp}} \ll 1 \).
Reproducibility
../simplex/build/sxc exp_contraction.sx -o build/exp_contraction.ll
clang -O2 build/exp_contraction.ll ../simplex/runtime/standalone_runtime.c \
-o build/exp_contraction -lm -lssl -lcrypto -L$(brew --prefix openssl)/lib
./build/exp_contraction
Related
- Theorem 1 — Contraction Mapping (condition i)
- exp-timescale — Timescale separation (condition ii)
- exp-composition — Full composed system test