Finished | ||||||
---|---|---|---|---|---|---|
Styxdo | Reckless | dataset_dist_test1 | diff | 8.0+0.08 | LLR: -2.26 (-2.25, 2.89) [0.00, 3.00] Games: 17952 W: 4539 L: 4635 D: 8778 Ptnml(0-2): 73, 2213, 4487, 2143, 60 | STC to follow |
Styxdo | Reckless | dataset_dist_test1 | diff | N=25000 | Elo: -3.40 +- 2.32 (95%) [N=40000] Games: 40348 W: 12716 L: 13111 D: 14521 Ptnml(0-2): 1282, 4666, 8551, 4515, 1160 | FN Sanity test |
Styxdo | Reckless | s2-lr-adj2 | diff | N=25000 | Elo: -1.55 +- 4.49 (95%) [N=10000] Games: 10292 W: 3169 L: 3215 D: 3908 Ptnml(0-2): 282, 1205, 2212, 1171, 276 | FN val first |
Styxdo | Reckless | s2-lr-adj1 | diff | 8.0+0.08 | LLR: -2.29 (-2.25, 2.89) [0.00, 3.00] Games: 6770 W: 1614 L: 1733 D: 3423 Ptnml(0-2): 12, 877, 1720, 770, 6 | stc |
Styxdo | Reckless | s2-lr-adj1 | diff | N=25000 | Elo: -2.40 +- 3.19 (95%) [N=20000] Games: 20382 W: 6331 L: 6472 D: 7579 Ptnml(0-2): 568, 2374, 4440, 2249, 560 | fn 1st |
Styxdo | Reckless | see_pruning_threshold | diff | 8.0+0.08 | LLR: -2.28 (-2.25, 2.89) [0.00, 3.00] Games: 80290 W: 19864 L: 19822 D: 40604 Ptnml(0-2): 133, 9691, 20472, 9699, 150 | let's gooooo |
Styxdo | Reckless | dfrc_s1_permute_maybe_fked | diff | 8.0+0.08 | LLR: -2.29 (-2.25, 2.89) [0.00, 3.00] Games: 6222 W: 1535 L: 1661 D: 3026 Ptnml(0-2): 33, 812, 1537, 706, 23 | How fked? |
Styxdo | Reckless | dfrc_s1_permute_maybe_fked | diff | N=25000 | Elo: -10.35 +- 3.24 (95%) [N=20000] Games: 21162 W: 6548 L: 7178 D: 7436 Ptnml(0-2): 764, 2509, 4478, 2253, 577 | Standard, fixed node sanity test first. |
Styxdo | Reckless | eps-test1 | diff | N=25000 | Elo: -6.74 +- 3.16 (95%) [N=20000] Games: 21256 W: 6630 L: 7042 D: 7584 Ptnml(0-2): 673, 2485, 4610, 2301, 559 | FN, permutation may be fked |
Styxdo | Reckless | weight_range_expt1 | diff | 8.0+0.08 | LLR: -1.65 (-2.25, 2.89) [0.00, 3.00] Games: 9926 W: 2440 L: 2516 D: 4970 Ptnml(0-2): 26, 1246, 2495, 1170, 26 | stc to follow up, unless fn is just trash |
Styxdo | Reckless | weight_range_expt1 | diff | N=25000 | Elo: -5.80 +- 4.28 (95%) [N=10000] Games: 11446 W: 3593 L: 3784 D: 4069 Ptnml(0-2): 360, 1308, 2516, 1241, 298 | FN Sanity test |
Styxdo | Reckless | factorizer-tweak-2 | diff | 40.0+0.40 | LLR: 3.01 (-2.25, 2.89) [0.00, 3.00] Games: 38262 W: 9428 L: 9176 D: 19658 Ptnml(0-2): 21, 4379, 10084, 4621, 26 | LTC against prev-main for stats |
Styxdo | Reckless | dfrc-03 | diff | N=25000 | Elo: -0.77 +- 2.47 (95%) [N=40000] Games: 34134 W: 10670 L: 10746 D: 12718 Ptnml(0-2): 897, 4050, 7309, 3854, 957 | dfrc-s2-re-do, check fn standard first as sanity check |
Styxdo | Reckless | eval-scale-500 | diff | 8.0+0.08 | LLR: -1.21 (-2.25, 2.89) [0.00, 3.00] Games: 4608 W: 1142 L: 1204 D: 2262 Ptnml(0-2): 16, 587, 1156, 533, 12 | stc |
Styxdo | Reckless | eval-scale-500 | diff | N=25000 | LLR: -3.68 (-2.94, 2.94) [0.00, 3.00] Games: 14422 W: 4609 L: 4863 D: 4950 Ptnml(0-2): 497, 1705, 3011, 1551, 447 | FN |
Styxdo | Reckless | new-datasets-both-stages | diff | 8.0+0.08 | LLR: -2.28 (-2.25, 2.89) [0.00, 3.00] Games: 12226 W: 3123 L: 3232 D: 5871 Ptnml(0-2): 47, 1492, 3140, 1391, 43 | stc |
Styxdo | Reckless | new-datasets-both-stages | diff | N=25000 | LLR: -3.36 (-2.94, 2.94) [0.00, 3.00] Games: 22068 W: 7112 L: 7316 D: 7640 Ptnml(0-2): 744, 2523, 4645, 2437, 685 | FN |
Styxdo | Reckless | relu | diff | 8.0+0.08 | LLR: -2.26 (-2.25, 2.89) [0.00, 3.00] Games: 5512 W: 1350 L: 1476 D: 2686 Ptnml(0-2): 27, 730, 1365, 610, 24 | STC |
Styxdo | Reckless | relu | diff | N=25000 | LLR: -2.06 (-2.94, 2.94) [0.00, 3.00] Games: 47086 W: 15393 L: 15417 D: 16276 Ptnml(0-2): 1411, 5409, 10059, 5121, 1543 | FN |
Styxdo | Reckless | s2-3-combine | diff | 8.0+0.08 | LLR: -2.26 (-2.25, 2.89) [0.00, 3.00] Games: 21162 W: 5313 L: 5399 D: 10450 Ptnml(0-2): 40, 2578, 5442, 2470, 51 | stc |
Styxdo | Reckless | s2-3-combine | diff | N=25000 | LLR: -2.99 (-2.94, 2.94) [0.00, 3.00] Games: 63518 W: 20249 L: 20298 D: 22971 Ptnml(0-2): 1755, 7400, 13609, 7129, 1866 | FN |
Styxdo | Reckless | ranger-retry | diff | 40.0+0.40 | LLR: -1.19 (-2.25, 2.89) [0.00, 3.00] Games: 3782 W: 878 L: 938 D: 1966 Ptnml(0-2): 5, 477, 983, 425, 1 | In case STC passes |
Styxdo | Reckless | ranger-retry | diff | 8.0+0.08 | LLR: -1.15 (-2.25, 2.89) [0.00, 3.00] Games: 19572 W: 4973 L: 4998 D: 9601 Ptnml(0-2): 58, 2408, 4887, 2367, 66 | Ranger Retry |
Styxdo | Reckless | ranger-retry | diff | N=25000 | LLR: 2.98 (-2.94, 2.94) [0.00, 3.00] Games: 69838 W: 22992 L: 22545 D: 24301 Ptnml(0-2): 2180, 7663, 14855, 7972, 2249 | Ranger Retry |
Styxdo | Reckless | styx-permuted | diff | N=25000 | LLR: -0.08 (-2.94, 2.94) [0.00, 3.00] Games: 1176 W: 383 L: 383 D: 410 Ptnml(0-2): 0, 0, 588, 0, 0 | Did I do it right? - Yes |
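
The Elo figures in the fixed-games rows above can be cross-checked from the listed counts. The following is a minimal sketch, not the testing framework's own code. It assumes the `Ptnml(0-2)` buckets count game pairs scoring 0, 0.5, 1, 1.5 and 2 points, that Elo follows the usual logistic formula, and that the 95% interval is a normal approximation on the per-pair score. The function names are illustrative only.

```python
import math


def elo_from_score(score):
    """Logistic Elo difference implied by an expected score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def wld_score(wins, losses, draws):
    """Average score per game from win/loss/draw counts."""
    return (wins + 0.5 * draws) / (wins + losses + draws)


def pentanomial_elo(ptnml, z=1.96):
    """Return (elo, margin) from five game-pair counts.

    Assumption: the buckets are pairs scoring 0, 0.5, 1, 1.5, 2 points,
    i.e. 0, 0.25, 0.5, 0.75, 1 when normalised to a per-game score.
    """
    pairs = sum(ptnml)
    points = [0.0, 0.25, 0.5, 0.75, 1.0]
    mean = sum(n * x for n, x in zip(ptnml, points)) / pairs
    var = sum(n * (x - mean) ** 2 for n, x in zip(ptnml, points)) / pairs
    # Normal-approximation half-width of the mean score, mapped to Elo.
    half_width = z * math.sqrt(var / pairs)
    lower = elo_from_score(mean - half_width)
    upper = elo_from_score(mean + half_width)
    return elo_from_score(mean), (upper - lower) / 2.0


# Counts copied from the dataset_dist_test1 fixed-node row.
elo, margin = pentanomial_elo([1282, 4666, 8551, 4515, 1160])
print(f"Elo: {elo:.2f} +- {margin:.2f}")
```

Under these assumptions the result should land close to the `-3.40 +- 2.32` reported for that row, and `elo_from_score(wld_score(12716, 13111, 14521))` gives the same point estimate from the W/L/D counts alone. The LLR rows use a sequential test and are not covered by this sketch.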