wave 8: evolutionary methods (4 stubs) by 0bserver07 · Pull Request #12 · cybertronai/schmidhuber-problems

0bserver07 · 2026-05-07T16:57:13Z

Wave 8 — evolutionary methods (gradient-free)

Four stubs implementing Schmidhuber-lineage evolutionary methods per SPEC issue #1. Zero gradient descent across all 4 stubs — algorithmically distinct from gradient-based waves 1-7 and 9-10. Octopus-merged from 4 local-only wave-8-local/<slug> branches.

Stub	Method	Paper	Headline
`pipe-symbolic-regression`	PIPE (PPT + PBIL) on Koza polynomial	Salustowicz & Schmidhuber 1997	Seed 3: discovers `x + x² + x³ + x⁴` exactly at gen 60; 6/20 seeds Koza-hit-solve; 1.3s wall
`pipe-6-bit-parity`	PIPE on Boolean parity	Salustowicz & Schmidhuber 1997	4-bit clean solve (gen 258, 16/16, 2.4s); 6-bit partial 71.9% at 240s budget cap; 4-bit multi-seed 6/11 ≤25s
`evolino-sines-mackey-glass`	Hybrid neuroevo + linear readout	Schmidhuber, Wierstra, Gomez 2007	Sines free-run MSE 0.181 (horizon 299); MG NRMSE@84 = 0.291 (paper: 1.9e-3 with full ESP — partial)
`double-pole-no-velocity`	ESP co-evolution on double cart-pole	Gomez & Schmidhuber 2005	Seed 0 solved at gen 27 / ~60s; 7/10 seeds 20/20 generalize at pop=40 (paper: ≥95% at pop=200, ~5× cheaper)

Audit verdict (separate Explore subagent)

APPROVE across all 4 stubs.

Numpy-only (hard pass): All 4 verified. Imports = numpy/matplotlib/PIL/imageio/stdlib. Zero torch/tensorflow/jax.
Determinism (3 spot-checks): pipe-symbolic-regression, pipe-6-bit-parity, evolino-sines-mackey-glass — all bit-identical across reruns.
Branch protocol: zero wave-8-local/* on origin (verified).
Algorithmic faithfulness (4/4) — CRITICAL: zero gradient calls across all 4 stubs.
- PIPE: PPT + PBIL update (P(s*) ← P(s*) + lr · P_TARGET · (1 − P(s*))), per-component mutation P_M / (NI · √n_visited). No crossover. No gradient.
- Evolino: LSTM hidden weights evolved by population selection + Gaussian mutation + crossover. Linear readout via Moore-Penrose closed form per individual. No backprop on hidden weights.
- ESP: 5 subpopulations of neurons, network assembled by stacking one neuron per subpop. Fitness credited back to constituent neurons. Crossover + mutation per subpop. No policy gradient.
Cleanliness: zero TODO/FIXME, no hardcoded paths, no problem.py stubs left (all 4 explicitly removed), no __pycache__ committed.
Git authors: all 4 commits authored by agent-0bserver07 <agent-0bserver07@users.noreply.github.com>.
GIF sizes: 196 KB to 1.9 MB (all under 2 MB cap).

Per-stub deviations (in each stub's §Deviations)

pipe-symbolic-regression: function set {+,−,*,/} only (paper's Table 1 set; wider {sin,cos,exp,log} doesn't reliably converge in 5-min budget; behind --funcs full).
pipe-6-bit-parity: 4-bit substitutes as faithful demonstration; 6-bit partial at 71.9% (240s cap); paper achieves 6-bit but with substantially more program evaluations than laptop budget allows.
evolino-sines-mackey-glass: whole-genome co-evolution simplified from paper's full ESP; pop=40 / gens=80 (paper used larger budgets); MG NRMSE@84 = 0.29 vs paper's 1.9e-3 — directionally yes, absolutely no.
double-pole-no-velocity: ESP at pop=40 (paper: 200, ~5× cheaper); explicitly permitted by SPEC; works as algorithmic demonstration.

Wave 0 → 8 progress

7 + 5 + 5 + 5 + 4 + 6 + 5 + 4 = 41/50 v1 stubs done (82%). 2 waves remaining = 9 stubs.

agent-0bserver07 (Claude Code) on behalf of Yad

Implements Salustowicz & Schmidhuber 1997 (Evol Comp 5(2)). PPT (probabilistic prototype tree) over instruction set {+,-,*,/,x,R}. Each generation: sample N=100 programs from PPT, score on 20 fitness cases x in linspace(-1,1,20), PBIL update at every elite-visited node toward the elite's symbol with the paper's P_TARGET = P_T + (1-P_T)*lr*(eps+Fit_best)/(eps+Fit_elite) schedule, plus per-symbol mutation P_M/(|I|*sqrt(n_visited)). Headline (seed 3, ~1.3 s on M-series, max-depth 6): discovered ((x + x*x) + ((x*x + x) * x*x)) = x + x^2 + x^3 + x^4 SSE = 1.06e-30, hits = 20/20, solved at gen 60. Cross-seed (20 seeds, 300 gens): 6/20 (30%) Koza-hit-solve, 2/20 exact solve. Wider {+,-,*,/,sin,cos,exp,log} set is available behind --funcs full but does not reliably cross SSE<1e-6 in the v1 budget; the paper itself uses {+,-,*,/} for this exact target (Table 1). Files: - pipe_symbolic_regression.py : algorithm + CLI (deterministic seed) - visualize_pipe_symbolic_regression.py : 6 PNGs to viz/ - make_pipe_symbolic_regression_gif.py : 690 KB GIF (≤2 MB) - README.md : 8 sections - removed problem.py stub Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Pure-numpy double cart-pole (Wieland 1991 EOM, RK4 at dt=0.01s) + recurrent net (Elman, H=5, tanh) evolved by ESP (Enforced Sub-Populations, Gomez 2003): one subpopulation per hidden neuron, networks assembled by combining one neuron from each subpop, fitness propagated back. Headline (seed 0, defaults pop=40, trials=4): - solved at generation 27 (21,600 trials, ~60s wallclock on M-series) - final eval: 20/20 random inits (|theta1_0| <= 4.5 deg) balanced 1000 steps - partial 6-seed sweep: seeds 0-4 reach 20/20, seed 5 reaches 13/20 Files: - double_pole_no_velocity.py - CLI; sim + ESP + eval - visualize_double_pole_no_velocity.py - training curves, rollout, weights - make_double_pole_no_velocity_gif.py - rendered cart-pole animation - README.md - 8 sections (Header/Problem/Files/Running/Results/Viz/Deviations/Open) - viz/training_curves.png, rollout.png, weights.png - double_pole_no_velocity.gif (605 KB) Deviations from paper (full list in README): - ESP rather than full CoSyNE (SPEC permits as v1 simplification) - pop=40 / trials=4 / max_gen=200 (paper: pop=200) for laptop budget - Fixed init theta1=4.5 deg during evolution; random in final eval - Solve threshold = 1000 steps (paper also has 100,000-step "robust") Removed problem.py stub.

… Mackey-Glass Evolutionary outer loop on LSTM hidden weights + closed-form linear readout (Tikhonov-regularised normal equations) inner loop. Fitness is closed-loop free-running MSE, per the Schmidhuber-Wierstra-Gomez 2007 scoring rule. Whole-genome co-evolution with elitism, uniform crossover, gaussian per-gene mutation, and burst mutation on stagnation. Headline (seed=1, hidden=6, pop=40, gens=80, ~140s on M-series laptop): - 3 superimposed sines: free-run MSE 0.18 over 299 steps - Mackey-Glass tau=17: NRMSE@84 = 0.29 Files: evolino_sines_mackey_glass.py (model + train + eval + CLI), visualize_evolino_sines_mackey_glass.py (six PNGs), make_*_gif.py (closed-loop prediction across generations animation, ~1.9 MB). Deviations: whole-genome instead of ESP enforced subpopulations; population/generations shrunk for laptop budget. Documented in README §7.

10-seed sweep at default budget (pop=40, trials=4, max-gen=100): - 10/10 seeds reach 1000-step balance during evolution - 7/10 generalise to 20/20 on the random-init eval - 2/10 partial generalisation (13/20, 15/20) - 1/10 brittle (9/20) - Mean wallclock 58.1s per seed.

Implements Salustowicz & Schmidhuber 1997 (Evol Comp 5(2)) on the canonical hard genetic-programming benchmark. Pure numpy + matplotlib. Algorithm: PPT (probabilistic prototype tree) over instruction set {AND, OR, NOT, IF, x_0..x_{n-1}}. Each generation samples a population from the PPT, evaluates on the full 2^n truth table, applies a single PBIL pull-toward-elite update at every visited node (clamped to [eps, 1-eps]), and per-component mutation along the elite path with prob p_mut/(N_INSTR*sqrt(|elite|)). Multi-start (full PPT reset) on 80-generation stagnation. Bitmask evaluator: each terminal x_i is a 2^n-bit Python int whose j-th bit is x_i's value on input j; AND/OR/NOT/IF map to bitwise ops, so one tree evaluation covers the whole truth table in O(tree_size). ~100x speedup over per-row Python loop, agrees with the slow path on the canonical XOR-chain. Headline runs (deterministic, M-series laptop): - 4-bit, seed 6: solved at gen 258, 2.4 s, 16/16 = 100% accuracy. - 6-bit, seed 0: 240 s budget cap, 46/64 = 71.9% accuracy. The 6-bit gap is documented in §Deviations: Salustowicz & Schmidhuber report PIPE solving 6-bit parity but with substantially more evaluations than 240 s allows. Multi-seed sweep on 4-bit shows 6/11 seeds solve in <=25 s; the 4-bit clean solve substitutes as the in-budget demonstration that the implementation itself is faithful. Files: - pipe_6_bit_parity.py : PPT/sample/update/mutate/multi-start + CLI with --n-bits parametrization (validates on 3-bit, 4-bit, 6-bit) - visualize_pipe_6_bit_parity.py : 7 PNGs to viz/, self-trains inline (no external JSON dependency); --skip-6bit for the fast path - make_pipe_6_bit_parity_gif.py : 196 KB GIF, 4-bit seed 6, two-panel (fitness curve + 4x4 correctness grid evolving green) - README.md : 8 sections including a multi-seed sweep, deviations, and explicit open questions on bridging the 6-bit gap (more compute, ADFs, or restoring the paper's iterative inner-loop update) - removed problem.py stub

Octopus merge of 4 wave-8 stubs per SPEC issue #1. - wave-8-local/pipe-symbolic-regression: PIPE on Koza f(x)=x+x^2+x^3+x^4 (1997) - wave-8-local/pipe-6-bit-parity: PIPE on N-bit even parity (1997) - wave-8-local/evolino-sines-mackey-glass: hybrid neuroevo + linear readout (2007) - wave-8-local/double-pole-no-velocity: ESP co-evolution on double cart-pole (2005) All 4 verified by separate audit subagent: numpy-only, deterministic, branch protocol followed (no wave-8-local on remote), all 8 README sections, evolutionary algorithmic faithfulness confirmed (no gradient descent). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

0bserver07 · 2026-05-07T16:57:52Z

Audit Report — PR #12 wave 8 (4 evolutionary stubs)

Wave 8 verdict: APPROVE.

Independent review by separate Explore subagent. All 4 stubs algorithmically sound, deterministic, numpy-only, branch-protocol-compliant. Zero gradient descent across the wave — exactly the spirit of the evolutionary family.

Per-stub verdicts

Stub	Verdict	Reason
pipe-symbolic-regression	APPROVE	PPT + PBIL verified; finds Koza target exactly at gen 60
pipe-6-bit-parity	APPROVE	Same PIPE machinery on Boolean function set; bitmask trick for O(tree_size) eval
evolino-sines-mackey-glass	APPROVE	Genome evolution + closed-form linear readout; no backprop on hidden weights
double-pole-no-velocity	APPROVE	ESP co-evolution; H=5 subpopulations of neurons; fitness propagates to constituents

Cross-cut findings

Numpy-only (hard pass): All 4 verified. Imports = numpy/matplotlib/PIL/imageio/stdlib only.
Determinism (3 spot-checks): pipe-symbolic-regression, pipe-6-bit-parity, evolino-sines-mackey-glass — all bit-identical across reruns.
Branch protocol: All 4 on local-only wave-8-local/*; zero pushed.
Git authors: All 4 commits by agent-0bserver07.
Cleanliness: zero TODO/FIXME, no hardcoded paths, no problem.py left (all 4 explicitly removed — the wave-6/wave-7 lesson stuck).

Algorithmic faithfulness — verified per stub

pipe-symbolic-regression (pipe_symbolic_regression.py:332-354): PBIL update P(s*) ← P(s*) + lr · P_TARGET · (1 − P(s*)) per Salustowicz & Schmidhuber 1997 §3. Mutation P_M / (NI · √n_visited) per line 357. No gradient.
pipe-6-bit-parity (pipe_6_bit_parity.py:183-214): same PIPE machinery on {AND, OR, NOT, IF, x_0..x_5}. Bitmask program evaluation runs all 64 inputs in O(tree_size) bitwise ops. No gradient.
evolino-sines-mackey-glass (evolino_sines_mackey_glass.py:203-221, 313-379): LSTM genome evolved by population selection + Gaussian mutation + crossover; linear readout via Moore-Penrose pseudoinverse per individual. Hidden weights NOT trained by gradient.
double-pole-no-velocity (double_pole_no_velocity.py:155-163, 240-244): ESP per Gomez 2003 — H=5 subpops, network assembled by stacking one neuron per subpop, fitness propagates back. No policy gradient.

Reproduce results

=== pipe-symbolic-regression seed 0 / seed 0 ===
elite program identical, SSE / hits / size match across runs

=== pipe-6-bit-parity seed 0 / seed 0 ===
elite_hits 32/64 match (4-bit subset solve)

=== evolino-sines-mackey-glass seed 0 / seed 0 ===
sines free_run_mse 0.163041 (identical), mackey NRMSE@84 0.5221 (identical)

All wallclocks well under 5-minute budget.

Honest gaps documented

pipe-6-bit-parity: 6-bit partial (71.9% at 240s); 4-bit substitutes as faithful demo. Paper's full 6-bit solve needs more compute than laptop allows.
evolino-sines-mackey-glass: NRMSE@84 = 0.29 vs paper's 1.9e-3. Directionally yes (network locks onto dynamics, tracks ground truth ~100 free-run steps), absolutely no. Whole-genome simplification of paper's full ESP + smaller budgets.
double-pole-no-velocity: pop=40 vs paper's 200 (~5× cheaper); 7/10 seeds 20/20 generalize.

All gaps in §Deviations and §Open questions per SPEC's methodological caveat.

agent-0bserver07 (Claude Code) on behalf of Yad — wave-8 audit subagent

wave 8: evolutionary methods (4 stubs)

agent-0bserver07 and others added 6 commits May 7, 2026 11:43

0bserver07 mentioned this pull request May 7, 2026

Spec: minimum implementation requirements for Schmidhuber-problem stubs (v1) #1

Closed

10 tasks

0bserver07 merged commit 5dbf83c into main May 8, 2026

0bserver07 deleted the wave/8-evolutionary branch May 8, 2026 15:50

0bserver07 added a commit that referenced this pull request May 8, 2026

Merge pull request #12 from cybertronai/wave/8-evolutionary

62ab107

wave 8: evolutionary methods (4 stubs)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wave 8: evolutionary methods (4 stubs)#12

wave 8: evolutionary methods (4 stubs)#12
0bserver07 merged 6 commits intomainfrom
wave/8-evolutionary

0bserver07 commented May 7, 2026

Uh oh!

0bserver07 commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

0bserver07 commented May 7, 2026

Wave 8 — evolutionary methods (gradient-free)

Audit verdict (separate Explore subagent)

Per-stub deviations (in each stub's §Deviations)

Wave 0 → 8 progress

Uh oh!

0bserver07 commented May 7, 2026

Audit Report — PR #12 wave 8 (4 evolutionary stubs)

Per-stub verdicts

Cross-cut findings

Algorithmic faithfulness — verified per stub

Reproduce results

Honest gaps documented

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant