OpenBench

OpenBench Testing Framework

Finished
Cj5716	Reckless	experiment-patch-2	diff	8.0+0.08	LLR: 3.09 (-2.25, 2.89) [0.00, 3.00] Games: 69956 W: 17938 L: 17592 D: 34426 Ptnml(0-2): 280, 8246, 17645, 8462, 345	ablation test, please give me my sanity back
Cj5716	Reckless	experiment-patch-1b	diff	8.0+0.08	LLR: 2.92 (-2.25, 2.89) [0.00, 3.00] Games: 50348 W: 12899 L: 12606 D: 24843 Ptnml(0-2): 224, 5925, 12611, 6162, 252	Against 1A, NOT MAIN
Cj5716	Reckless	experiment-tuned-5h	diff	8.0+0.08	LLR: 2.91 (-2.25, 2.89) [0.00, 3.00] Games: 91418 W: 23316 L: 22931 D: 45171 Ptnml(0-2): 429, 10724, 23022, 11101, 433	Against 5A, NOT MAIN
Cj5716	Reckless	experiment-tuned-5f	diff	8.0+0.08	LLR: 2.90 (-2.25, 2.89) [0.00, 3.00] Games: 68628 W: 17720 L: 17384 D: 33524 Ptnml(0-2): 336, 8099, 17196, 8259, 424	Against 5A, NOT MAIN
Sapher	Reckless	separate-root-pv	diff	4.0+0.04	LLR: 2.95 (-2.25, 2.89) [0.00, 3.00] Games: 19212 W: 5094 L: 4873 D: 9245 Ptnml(0-2): 123, 2017, 5090, 2268, 108	Take 1
Cj5716	Reckless	shrink-win-stable	diff	40.0+0.40	LLR: 2.97 (-2.25, 2.89) [0.00, 3.00] Games: 42662 W: 10627 L: 10372 D: 21663 Ptnml(0-2): 23, 4740, 11556, 4983, 29
Cj5716	Reckless	shrink-win-stable	diff	8.0+0.08	LLR: 2.99 (-2.25, 2.89) [0.00, 3.00] Games: 16856 W: 4371 L: 4149 D: 8336 Ptnml(0-2): 70, 1947, 4206, 2101, 104
World	Reckless	[Stockfish] dev	diff	N=20000 *	Generated 25000990/25000000 Games Elo: 3.19 +- 0.11 (95%) [N=25000000] Games: 25000990 W: 8224344 L: 7994635 D: 8782011
Cj5716	Reckless	change-bad-cap-threshold	diff	40.0+0.40	LLR: 3.01 (-2.25, 2.89) [0.00, 3.00] Games: 47808 W: 11843 L: 11579 D: 24386 Ptnml(0-2): 18, 5211, 13183, 5473, 19
Cj5716	Reckless	change-bad-cap-threshold	diff	8.0+0.08	LLR: 2.89 (-2.25, 2.89) [0.00, 3.00] Games: 84552 W: 21432 L: 21077 D: 42043 Ptnml(0-2): 264, 9592, 22249, 9867, 304
87	Reckless	fix-beta-bound2	diff	40.0+0.40	LLR: 2.93 (-2.25, 2.89) [-2.75, 0.25] Games: 17986 W: 4471 L: 4343 D: 9172 Ptnml(0-2): 8, 1866, 5115, 1998, 6
87	Reckless	fix-beta-bound2	diff	8.0+0.08	LLR: 2.91 (-2.25, 2.89) [-2.75, 0.25] Games: 58926 W: 14777 L: 14716 D: 29433 Ptnml(0-2): 164, 6346, 16382, 6407, 164
Sapher	Reckless	simplify-quiet-see	diff	40.0+0.40	LLR: 2.98 (-2.25, 2.89) [-2.75, 0.25] Games: 41224 W: 10148 L: 10054 D: 21022 Ptnml(0-2): 18, 4605, 11268, 4707, 14	Take 1
Sapher	Reckless	simplify-quiet-see	diff	8.0+0.08	LLR: 2.90 (-2.25, 2.89) [-2.75, 0.25] Games: 8326 W: 2158 L: 1998 D: 4170 Ptnml(0-2): 31, 933, 2071, 1101, 27	Take 1
87	Reckless	hp-in_check	diff	8.0+0.08	LLR: 2.94 (-2.25, 2.89) [0.00, 3.00] Games: 69400 W: 17538 L: 17209 D: 34653 Ptnml(0-2): 227, 8055, 17834, 8330, 254
Sp00ph	Reckless	fix/null-move-castling	diff	40.0+0.40	LLR: 2.91 (-2.25, 2.89) [-2.75, 0.25] Games: 17832 W: 4442 L: 4310 D: 9080 Ptnml(0-2): 7, 2002, 4764, 2138, 5	on behalf of grepfuldead
Sp00ph	Reckless	fix/null-move-castling	diff	8.0+0.08	LLR: 2.92 (-2.25, 2.89) [-2.75, 0.25] Games: 14716 W: 3733 L: 3587 D: 7396 Ptnml(0-2): 43, 1679, 3779, 1803, 54	on behalf of grepfuldead
World	Reckless	[Stockfish] dev *	diff	N=20000 *	Generated 25000040/25000000 Games Elo: 3.36 +- 0.11 (95%) [N=25000000] Games: 25000040 W: 8230430 L: 7988610 D: 8781000
Sapher	Reckless	ttmove-history	diff	8.0+0.08	LLR: 2.91 (-2.25, 2.89) [0.00, 3.00] Games: 44932 W: 11319 L: 11046 D: 22567 Ptnml(0-2): 135, 5214, 11538, 5401, 178	Take 2
Sapher	Reckless	ttmove-history	diff	8.0+0.08	LLR: 2.90 (-2.25, 2.89) [0.00, 3.00] Games: 29144 W: 7601 L: 7358 D: 14185 Ptnml(0-2): 125, 3396, 7305, 3603, 143	Take 1
Swedis	Reckless	halfprobcut_sanity_check	diff	N=5000	Elo: -0.00 +- 0.00 (95%) [N=1000] Games: 1938 W: 684 L: 684 D: 570 Ptnml(0-2): 0, 0, 969, 0, 0	check if this code does actually affect the search
87	Reckless	iid2	diff	40.0+0.40	LLR: 2.91 (-2.25, 2.89) [0.00, 3.00] Games: 11626 W: 2960 L: 2774 D: 5892 Ptnml(0-2): 5, 1231, 3156, 1415, 6
87	Reckless	iid2	diff	8.0+0.08	LLR: 2.92 (-2.25, 2.89) [0.00, 3.00] Games: 16444 W: 4196 L: 3987 D: 8261 Ptnml(0-2): 50, 1822, 4295, 1979, 76
87	Reckless	negext-malus	diff	8.0+0.08	LLR: 3.03 (-2.25, 2.89) [0.00, 3.00] Games: 27186 W: 6932 L: 6690 D: 13564 Ptnml(0-2): 82, 3156, 6908, 3332, 115
Sapher	Reckless	pawn-hist	diff	40.0+0.40	LLR: 2.89 (-2.25, 2.89) [0.00, 3.00] Games: 40472 W: 10127 L: 9879 D: 20466 Ptnml(0-2): 36, 4531, 10844, 4799, 26	Take 1

1 2 3 86 87 88