Finished
WorldRecklessergodice_stuff_fast_unsounddiff8.0+0.08
LLR: -2.33 (-2.25, 2.89) [0.00, 4.00]
Games: 7680 W: 1840 L: 1923 D: 3917
Ptnml(0-2): 36, 974, 1890, 917, 23
PeregrRecklessless-tt-cutdiff8.0+0.08
LLR: -2.31 (-2.25, 2.89) [0.00, 4.00]
Games: 32864 W: 7907 L: 7913 D: 17044
Ptnml(0-2): 132, 3915, 8319, 3959, 107
less ~14%
WorldRecklesslmr-iirdiff8.0+0.08
LLR: -2.29 (-2.25, 2.89) [0.00, 4.00]
Games: 16084 W: 3891 L: 3947 D: 8246
Ptnml(0-2): 79, 1946, 4027, 1932, 58
WorldRecklesslmp-static-evaldiff8.0+0.08
LLR: 3.04 (-2.25, 2.89) [0.00, 4.00]
Games: 22280 W: 5555 L: 5350 D: 11375
Ptnml(0-2): 89, 2592, 5570, 2803, 86
WorldRecklessconthist-spsa-resultsdiff8.0+0.08
LLR: -2.34 (-2.25, 2.89) [0.00, 4.00]
Games: 20550 W: 4890 L: 4934 D: 10726
Ptnml(0-2): 75, 2474, 5219, 2434, 73
WorldRecklessconthist-spsadiff8.0+0.08
Tuning 2 Parameters
3915/4000 Iterations
7830/8000 Games Played
what are the chances spsa works for 2 params?
StyxdoRecklessv25-f71908d1diff40.0+0.40
LLR: 2.89 (-2.25, 2.89) [0.00, 4.00]
Games: 31488 W: 7463 L: 7250 D: 16775
Ptnml(0-2): 28, 3602, 8267, 3823, 24
S1 Higher weight to 640, S2 higher weight to 1536 (S1: 640=1, 1536 =0.5 S2: 640=0.5, 1536=1)
WorldRecklessreset-optimizer-with-warmupdiffN=25000
LLR: -2.44 (-2.25, 2.89) [0.00, 4.00]
Games: 3090 W: 952 L: 1091 D: 1047
Ptnml(0-2): 129, 375, 646, 296, 99
Replace loading the AdamW optimizer state into S2 with warmup batches
PeregrRecklessfp-histdiff8.0+0.08
LLR: -2.01 (-2.25, 2.89) [0.00, 4.00]
Games: 21914 W: 5248 L: 5273 D: 11393
Ptnml(0-2): 81, 2611, 5598, 2586, 81
WorldRecklessv25-f71908d1diffN=25000
LLR: -2.30 (-2.25, 2.89) [0.00, 4.00]
Games: 34034 W: 11320 L: 11323 D: 11391
Ptnml(0-2): 1274, 3725, 7021, 3724, 1273
S1 Higher weight to 640, S2 higher weight to 1536 (S1: 640=1, 1536 =0.5 S2: 640=0.5, 1536=1)
StyxdoRecklessv25-f71908d1diff8.0+0.08
LLR: 2.96 (-2.25, 2.89) [0.00, 4.00]
Games: 11194 W: 2845 L: 2675 D: 5674
Ptnml(0-2): 45, 1291, 2764, 1443, 54
S1 Higher weight to 640, S2 higher weight to 1536 (S1: 640=1, 1536 =0.5 S2: 640=0.5, 1536=1)
WorldRecklesss1_lr_warmupdiffN=25000
LLR: -2.38 (-2.25, 2.89) [0.00, 4.00]
Games: 38400 W: 12811 L: 12801 D: 12788
Ptnml(0-2): 1423, 4195, 7967, 4179, 1436
LR Warmup over 1024 batches
PeregrRecklesssimpl-fpdiff40.0+0.40
LLR: 2.91 (-2.25, 2.89) [-4.00, 0.00]
Games: 31120 W: 7186 L: 7150 D: 16784
Ptnml(0-2): 20, 3594, 8304, 3614, 28
StyxdoRecklesss1_lr_warmupdiff8.0+0.08
LLR: -2.25 (-2.25, 2.89) [0.00, 4.00]
Games: 27434 W: 6827 L: 6847 D: 13760
Ptnml(0-2): 121, 3372, 6770, 3314, 140
LR Warmup over 1024 batches
PeregrRecklesssimpl-fpdiff8.0+0.08
LLR: 2.97 (-2.25, 2.89) [-4.00, 0.00]
Games: 15518 W: 3830 L: 3742 D: 7946
Ptnml(0-2): 63, 1837, 3857, 1953, 49
use the new values, but need to run LTC for this as well
PeregrRecklesssimpl-fpdiff8.0+0.08
LLR: -0.43 (-2.25, 2.89) [-3.00, 0.00]
Games: 5598 W: 1357 L: 1395 D: 2846
Ptnml(0-2): 16, 696, 1421, 642, 24
PeregrRecklesscapt-fut-valuediff8.0+0.08
LLR: 2.90 (-2.25, 2.89) [0.00, 4.00]
Games: 19478 W: 4819 L: 4629 D: 10030
Ptnml(0-2): 68, 2269, 4888, 2433, 81
StyxdoRecklesss1_wdl_warmup_25_5diff8.0+0.08
LLR: -2.25 (-2.25, 2.89) [0.00, 4.00]
Games: 21814 W: 5409 L: 5446 D: 10959
Ptnml(0-2): 91, 2688, 5396, 2631, 101
S1 WDL Warmup, Includes v24 changes.
WorldRecklessconthist-pruning-2diff8.0+0.08
LLR: -2.27 (-2.25, 2.89) [0.00, 4.00]
Games: 13032 W: 3120 L: 3183 D: 6729
Ptnml(0-2): 50, 1573, 3325, 1526, 42
-512 * depth - 768
WorldRecklesss1_wdl_warmup_25_5diffN=25000
LLR: -2.39 (-2.25, 2.89) [0.00, 4.00]
Games: 23266 W: 7697 L: 7750 D: 7819
Ptnml(0-2): 886, 2544, 4778, 2587, 838
S1 WDL Warmup, Includes v24 changes.
WorldRecklessconthist-pruning-1diff8.0+0.08
LLR: -2.26 (-2.25, 2.89) [0.00, 4.00]
Games: 5174 W: 1202 L: 1290 D: 2682
Ptnml(0-2): 27, 667, 1276, 601, 16
-512 * depth - 256
WorldRecklessv24-75b969efdiff40.0+0.40
LLR: 2.92 (-2.25, 2.89) [0.00, 4.00]
Games: 5392 W: 1333 L: 1192 D: 2867
Ptnml(0-2): 7, 559, 1425, 696, 9
WorldRecklessv24-75b969efdiffN=25000
LLR: 3.38 (-2.25, 2.89) [0.00, 4.00]
Games: 60026 W: 20142 L: 19685 D: 20199
Ptnml(0-2): 2117, 6501, 12471, 6656, 2268
WorldRecklessv24-75b969efdiff8.0+0.08
LLR: 2.92 (-2.25, 2.89) [0.00, 4.00]
Games: 30578 W: 7565 L: 7340 D: 15673
Ptnml(0-2): 116, 3621, 7577, 3872, 103
WorldRecklessqs-see-thresholddiff8.0+0.08
LLR: -2.26 (-2.25, 2.89) [0.00, 4.00]
Games: 13438 W: 3279 L: 3341 D: 6818
Ptnml(0-2): 39, 1688, 3339, 1602, 51