CCRL 40/4
Downloads and Statistics
October 25, 2008
Testing summary:
Total: 352'891 games
played by 690 programs
5321 CPU days (X2 4600+)

White wins: 134'426 (38.1%)
Black wins: 106'995 (30.3%)
Draws: 111'470 (31.6%)
White score: 53.9%

Pure list

"Pure" list removes rating distortion

"Pure" list is computed to remove the distortion that may affect the main rating list. Distortion appears when several versions or settings of the same engine are included together in the testing study. Suppose you have engine A and several versions of engine B: B1, B2, B3. Suppose also that A is particularly strong versus any version of B, which often happens in real testing because of some characteristics of those engines. In such case A will have higher rating, comparing to the study where only one version of B is present. Same thing may happen when A is weak versus B, getting lower rating.

To remove that distortion, a separate game database is constructed from games played only by best version in each engine "family". To save some space and time, pure database has all moves stripped out, it contains PGN header and results only. Then the "Pure list" is computed based for that "pure" database using Bayeselo.

Pure lists for all classes of engines

All engines   (32-bit)

1-2-CPU engines   (32-bit)

Single-CPU engines   (32-bit)

Free engines   (32-bit)

Free 1-2-CPU engines   (32-bit)

Free single-CPU engines   (32-bit)

Open source engines   (32-bit)

Open source 1-2-CPU engines   (32-bit)

Open source single-CPU engines   (32-bit)

Pure lists for complete database

Pure database download

To save space, pure database has all moves stripped out, it contains PGN header and results only. This pure database is useful only for rating calculation or similar analysis, it does not have actual games, only the results.

Download pure database, 36'627 games:

CCRL 40/4 Rating List — Pure all engines

Ponder off, General books (up to 12 moves), 3-4-5 piece EGTB
Time control: Equivalent to 40 moves in 4 minutes on Athlon 64 X2 4600+ (2.4 GHz)
Computed on October 25, 2008 with Bayeselo based on 36'627 games
RankNameRatingScoreAverage
Opponent
DrawsGamesLOS
ELO+
1Rybka 3 Dynamic 64-bit 4CPU3251 (−3)+29−2781.8%−231.727.5%545 
100.0%
2Naum 3 64-bit 4CPU3068 (−6)+17−1761.8%−80.336.8%1138
97.8%
3Zappa Mexico 64-bit 4CPU3042 (+5)+20−2060.3%−71.438.8%770
67.6%
4Deep Shredder 11 64-bit 4CPU3036 (+7)+18−1753.6%−23.731.0%1138
83.9%
5Hiarcs 12 4CPU3024 (−3)+16−1651.1%−5.238.4%1244
67.1%
6Deep Sjeng 3.0 64-bit 4CPU3019 (−4)+19−1949.3%+6.037.4%918
52.5%
7Deep Fritz 10.1 4CPU3018 (+5)+19−1952.9%−22.231.9%946
95.0%
8Toga II 1.4.1SE 4CPU2997 (+4)+18−1845.1%+37.135.1%987
53.5%
9Glaurung 2.1 64-bit 4CPU2996 (+7)+19−1951.6%−10.235.9%898
84.9%
10Loop M1-T 64-bit 4CPU2982 (+9)+18−1848.2%+10.438.3%961
99.8%
11Deep Junior 10 4CPU2945 (+23)+20−2043.2%+44.533.5%844
59.5%
12Bright 0.3a 4CPU2942 (0)+19−1943.9%+39.335.1%918
94.7%
13Thinker 5.2I (Passive) 64-bit 2CPU2902 (+1)+37−3673.1%−166.426.0%288
92.5%
14Chessmaster 11 4CPU2861 (+6)+41−4160.8%−74.029.5%200
88.8%
15Scorpio 2.0 32-bit 4CPU2825 (+3)+32−3326.8%+170.127.1%362
51.1%
16Ktulu 8.02824 (+1)+24−2457.8%−58.227.5%619
54.3%
17Spike 1.2 Turin2823 (−3)+13−1352.7%−21.331.9%2103
85.5%
18Delfi 5.4 2CPU2806 (0)+25−2560.4%−68.634.6%511
85.6%
19Chess Tiger 2007.12789 (−26)+19−1956.1%−39.537.0%864
63.8%
20Pharaon 3.5.1 4CPU2781 (−3)+39−3945.6%+28.232.3%217
55.5%
21Deep Frenzee May07 64-bit 2CPU2777 (−12)+31−3057.1%−51.533.0%352
87.0%
22SmarThink 1.00 32-bit2755 (−3)+25−2561.6%−86.330.1%574
55.8%
23Alaric 7072753 (−7)+20−2055.1%−35.928.8%884
97.9%
24Alfil 8.1.1 Optimized2722 (−4)+23−2355.0%−33.832.5%640
54.5%
25Movei 00.8.438(10 10 10)2721 (−12)+19−1952.1%−15.532.1%911
63.5%
26E.T. Chess 13.01.082716 (−16)+19−1951.4%−9.834.3%888
63.0%
27Slow Chess Blitz WV2.12712 (+1)+17−1748.2%+11.033.1%1218
59.6%
28WildCat 82708 (−18)+23−2347.7%+14.032.0%637
55.5%
29SOS 5.12706 (+24)+20−2053.7%−28.030.0%820
54.2%
30Booot 4.14.02704 (−17)+24−2448.4%+8.334.2%546
57.3%
31Pro Deo 1.22700 (−19)+29−2950.6%−7.328.1%405
52.6%
32Colossus 2007d2699 (−12)+19−1949.1%+5.932.3%874
72.8%
33‑34BugChess2 1.6.2 64-bit2690 (−6)+24−2450.4%−3.727.9%603
49.1%
33‑34Gandalf 62690 (−26)+30−3048.7%+10.830.3%376
53.8%
35Sloppy 0.2.0 64-bit2688 (−21)+24−2448.0%+13.233.0%575
54.6%
36GarboChess 2.10 64-bit2686 (−21)+26−2645.8%+28.926.4%493
62.3%
37Pseudo 0.7c2681 (+6)+20−2049.6%+3.432.7%816
55.8%
38Ruffian 1.0.52679 (0)+19−1953.3%−25.034.0%884
52.3%
39Petir 4.392678 (+15)+22−2253.3%−23.033.9%672
52.5%
40Hamsters 0.7 2CPU2677 (−6)+24−2442.3%+51.237.5%544
64.3%
41Aristarch 4.502671 (+4)+20−2051.5%−11.430.5%830
94.4%
42Crafty 21.5 PS 64-bit2647 (+5)+21−2250.6%−5.031.9%721
59.7%
43Jonny 2.83 32-bit2643 (−6)+19−1948.9%+8.326.1%914
63.4%
44Anaconda 2.0.12639 (−1)+18−1853.1%−21.530.3%1043
78.7%
45AnMon 5.602629 (+14)+17−1756.1%−44.627.2%1196
54.8%
46The Baron 2.232626 (−30)+34−3537.8%+85.231.0%287
55.7%
47Ufim 8.022623 (−3)+16−1654.9%−35.826.9%1381
82.6%
48Trace 1.37a2612 (−1)+17−1754.1%−28.326.1%1231
58.1%
49Yace 0.99.872610 (−1)+19−1854.5%−31.930.7%992
62.5%
50Little Goliath Evolution 3.12 (TB On)2605 (−4)+21−2051.8%−12.030.5%799
91.2%
51Arasan 10.4 64-bit 2CPU2581 (−10)+27−2747.7%+13.929.8%457
57.2%
52Tao 5.62578 (+10)+20−2052.9%−22.025.1%924
66.3%
53Green Light Chess 3.002572 (−2)+19−1950.7%−4.930.2%894
71.5%
54Abrok 5.02565 (+6)+17−1851.2%−8.827.9%1124
52.5%
55Twisted Logic 20080404x2564 (+3)+20−2052.0%−15.523.2%928
71.2%
56Amyan 1.5972557 (−1)+16−1647.9%+16.727.8%1321
74.6%
57Matacz 1.3 HT742548 (+6)+20−2051.7%−11.830.2%861
65.6%
58Pepito 1.59 64-bit2543 (+1)+17−1752.3%−17.431.5%1109
54.2%
59SpiderChess 0706032542 (−6)+15−1549.8%+1.432.4%1469
62.6%
60Quark 2.352538 (+11)+19−1950.4%−3.126.3%995
54.1%
61‑62Cerebro 3.03b2535 (−18)+36−3826.6%+173.223.1%286
50.8%
61‑62King of Kings 2.562535 (−1)+20−2049.5%+2.226.0%832
52.4%
63LambChop 10.992534 (+6)+17−1850.6%−3.128.9%1112
82.0%
64Dragon 4.62523 (+10)+17−1748.4%+11.827.0%1305
55.2%
65Kiwi 0.6d2521 (+4)+19−1948.3%+12.126.5%917
54.0%
66‑67Delphil 1.9c2520 (+5)+27−2749.4%+5.233.3%448
48.2%
66‑67Snitch 1.6.2 32-bit2520 (+7)+20−2047.6%+16.426.8%867
82.1%
68‑69KnightDreamer 3.32507 (+1)+21−2147.1%+19.428.2%734
50.7%
68‑69Nejmet 3.072507 (−7)+17−1747.4%+19.126.1%1247
62.0%
70RomiChess P3k 64-bit2502 (−4)+26−2648.2%+12.325.1%510
57.6%
71Patzer 3.80 2CPU2498 (+3)+30−3052.4%−18.023.8%378
52.8%
72Hermann 2.02496 (+10)+24−2445.8%+32.622.9%632
61.3%
73Gosu 0.162491 (+8)+23−2352.7%−20.222.2%675
62.8%
74‑75Amy 0.8.7b DC2486 (+5)+22−2244.5%+39.024.0%736
48.8%
74‑75Gaia 3.5 32-bit2486 (−9)+23−2353.7%−28.819.6%668
57.3%
76Muse 0.899b2484 (+10)+17−1749.7%+0.324.7%1186
64.4%
77Comet B682479 (+3)+17−1743.4%+47.124.1%1234
66.3%
78Der Bringer 1.92474 (−3)+16−1646.6%+25.424.7%1319
71.6%
79Homer 2.012464 (+1)+30−2955.2%−40.618.7%423
67.2%
80Djinn 0.925x2456 (−6)+23−2346.6%+23.122.4%695
59.1%
81Crazy Bishop 00522452 (−1)+24−2440.9%+63.226.2%607
58.7%
82Amateur 2.822448 (−2)+20−2044.7%+38.722.6%946
75.5%
83Cheese 1.22437 (−1)+24−2447.8%+15.025.8%625
56.7%
84Phalanx XXII Reborn JA2434 (+8)+22−2337.8%+87.724.5%703
56.0%
85Arion 1.72431 (+8)+24−2450.1%−1.827.9%606
69.5%
86OliThink 5.1.4 64-bit2420 (+15)+40−4046.0%+27.923.2%224
96.3%
87Asterisk 0.62373 (+11)+31−3145.3%+36.421.9%375
68.7%
88LittleThought 1.03 64-bit 2CPU2363 (+7)+29−3041.2%+63.521.5%414
51.9%
89Anechka 0.082362 (+9)+26−2643.2%+49.520.6%553
58.4%
90Horizon 4.32357 (+3)+33−3439.6%+80.622.9%327
71.7%
91Sage 2.2a2345 (+7)+30−3141.0%+67.621.9%389
66.7%
92Popochin 2.72334 (+10)+37−3838.2%+84.829.7%249
58.9%
93Natwarlal 0.142329 (−19)+27−2836.9%+97.321.6%487
62.6%
94Atlas 2.202320 (+17)+41−4231.4%+132.028.6%210
67.5%
95Latista 1.502307 (+2)+40−4137.9%+91.522.0%223
94.5%
96BlackBishop 0.982263 (+2)+30−3125.7%+190.019.2%453
 

Explanation of the columns

"Rank" — 1 is best, 2 is second best, etc.. It's simple.
"Engine" — Name and version of an engine.
"ELO" — Engine rating computed with Bayeselo. This column has also a number in brackets, which shows the difference between "Pure" rating and rating computed for complete database. For example "2850 (+10)" in the ELO column means that engine's "pure" rating is 2850, which is 10 points higher than its rating in the complete list.
"+" and "−" — 95% confidence intervals. For example, if engine's rating is 2850, "+" is +20 and "−" is −15, it means that there is only 5% estimated probability that engine's "true" rating is outside of the [2850−15 .. 2850+20] range.
"Score" — Number of points scored by an engine, divided by the number of games. Win is 1 point, draw is 1/2 of a point, and loss is 0. Please note that this is computed for "pure" database, so the numbers are different from the main list.
"Average Opponent" — Difference between the rating of engine tested and average of the opponent ratings for all games played by that engine. (Only games from the "pure" database were counted). Positive number means that engine was playing with stronger opponents, averagely. Negative number - weaker opponents.
"Draws" — Percentage of games by an engine, that ended in a draw. (Only games in "pure" database are counted).
"Games" — Total number of games played by an engine. (Only games in the "pure" database are counted).

The detailed explanation how we construct the "pure" list:

1. We have to find the best versions in each engine family. We can't use the "Best versions" list for that, because the "Best versions" list may be affected by distortion which we are trying to remove. To find the true best version in a family of engines we create separate game database, containing only games by engines from that family. Then we compute the ratings for that small database and take the highest rated engine as best, to represent that family in the "pure" list. There is also a requirement that every engine in the "pure" list must have at least 150 games played with other "pure" engines, and it also must be a public release, not beta or private version.

2. After finding a set of "pure" best versions, we exctract all games where both side engines are from that set, and those games form a "pure" database. Pure list is simply a rating list computed for that database using Bayeselo.

Features of the pure list

First thing that you have to realize about the "pure" list is that it is not necessarily more relevant than the big list of all versions. "Pure" list removes one kind of distortion - distortion that may occur from multiple version of same engine. But the price for that is big - the "pure" database is several times smaller than complete database. This results in much larger statistical error, as you can see in the + / - columns. Also, the "pure" list can still have other types of distortion - distortion resulting from too small (including 0) or too large number of games in particular pairs.

So, don't take this list as certainly superior to the "Best versions" list. This list does not substitute the "Best versions" list, but simply provides a different view for those who may be afraid of distortions. It is possible though that in time this list will become clearly superior, when the "pure" database will be large enough.

Please also realize that some engine version being listed in the "Best versions" list does not guarantee that the same version will be listed in the "Pure" list. Most often it will be the case, but theoretically it is possible that different version will turn out to be the best in the "pure" context.


Crosstable for "pure" database

Results matrix

Pure all engines
#NameELO123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596
1Rybka 3 Dynamic 64-bit 4CPU3251   116 − 24
+99−7=34
+31
117 − 29
+95−7=44
−13
100 − 26
+80−6=40
−27
 113 − 20
+97−4=32
+12
                                                                                        
2Naum 3 64-bit 4CPU3068  58.5 − 41.5
+39−22=39
+29
46.5 − 36.5
+30−20=33
+8
57.5 − 42.5
+32−17=51
0
58.5 − 41.5
+37−20=43
+3
111 − 87
+73−49=76
−10
55.5 − 44.5
+37−26=37
−34
57.5 − 42.5
+44−29=27
−20
66 − 31
+50−15=32
+35
39.5 − 20.5
+27−8=25
−21
72 − 28
+55−11=34
+22
  81 − 19
+70−8=22
−7
                                                                                 
3Zappa Mexico 64-bit 4CPU3042 41.5 − 58.5
+22−39=39
−29
 54 − 43
+33−22=42
+28
51 − 49
+24−22=54
−12
 32.5 − 14.5
+27−9=11
+117
13.5 − 16.5
+8−11=11
−77
 57 − 42
+35−20=44
−13
66 − 34
+47−15=38
+7
33.5 − 16.5
+22−5=23
+4
  34.5 − 14.5
+28−8=13
−70
 81 − 17
+69−5=24
+27
                                                                               
4Deep Shredder 11 64-bit 4CPU303624 − 116
+7−99=34
−31
36.5 − 46.5
+20−30=33
−8
43 − 54
+22−33=42
−28
 63.5 − 34.5
+52−23=23
+96
61 − 38
+47−24=28
+61
24.5 − 24.5
+16−16=17
−18
47 − 48
+32−33=30
−42
65 − 34
+50−19=30
+67
42 − 42
+27−27=30
−58
57 − 43
+42−28=30
−42
31.5 − 18.5
+23−10=17
−8
  39 − 8
+34−3=10
+41
 75.5 − 21.5
+61−7=29
−17
                                                                               
5Hiarcs 12 4CPU302429 − 117
+7−95=44
+13
42.5 − 57.5
+17−32=51
0
49 − 51
+22−24=54
+12
34.5 − 63.5
+23−52=23
−96
 53.5 − 46.5
+32−25=43
+15
55.5 − 44.5
+35−24=41
+26
59 − 41
+43−25=32
+34
55 − 45
+36−26=38
+4
58 − 42
+41−25=34
+11
59 − 41
+37−19=44
−23
62 − 38
+42−18=40
−6
    79 − 21
+62−4=34
−1
                                                                               
6Deep Sjeng 3.0 64-bit 4CPU301926 − 100
+6−80=40
+27
41.5 − 58.5
+20−37=43
−3
 38 − 61
+24−47=28
−61
46.5 − 53.5
+25−32=43
−15
 27.5 − 25.5
+19−17=17
+9
61 − 39
+42−20=38
+48
57 − 43
+37−23=40
+20
54.5 − 45.5
+33−24=43
−8
27.5 − 12.5
+20−5=15
+49
     73 − 27
+55−9=36
−42
                                                                               
7Deep Fritz 10.1 4CPU3018 87 − 111
+49−73=76
+10
14.5 − 32.5
+9−27=11
−117
24.5 − 24.5
+16−16=17
+18
44.5 − 55.5
+24−35=41
−26
25.5 − 27.5
+17−19=17
−9
 55 − 44
+40−29=30
+18
23.5 − 26.5
+14−17=19
−41
31 − 19
+24−12=14
+47
29 − 21
+24−16=10
−15
125 − 75
+95−45=60
+10
   40.5 − 9.5
+37−6=7
+60
                                                                                
8Toga II 1.4.1SE 4CPU299720 − 113
+4−97=32
−12
44.5 − 55.5
+26−37=37
+34
16.5 − 13.5
+11−8=11
+77
48 − 47
+33−32=30
+42
41 − 59
+25−43=32
−34
39 − 61
+20−42=38
−48
44 − 55
+29−40=30
−18
 54.5 − 45.5
+37−28=35
+27
45 − 40
+21−16=48
0
31.5 − 22.5
+23−14=17
+3
16 − 15
+8−7=16
−43
  21.5 − 8.5
+17−4=9
−21
 23.5 − 6.5
+18−1=11
+16
                                                                               
9Glaurung 2.1 64-bit 4CPU2996 42.5 − 57.5
+29−44=27
+20
 34 − 65
+19−50=30
−67
45 − 55
+26−36=38
−4
43 − 57
+23−37=40
−20
26.5 − 23.5
+17−14=19
+41
45.5 − 54.5
+28−37=35
−27
 56.5 − 42.5
+36−22=41
+29
31.5 − 18.5
+24−11=15
+36
63.5 − 36.5
+45−18=37
+33
    75 − 25
+55−5=40
−9
                                                                               
10Loop M1-T 64-bit 4CPU2982 31 − 66
+15−50=32
−35
42 − 57
+20−35=44
+13
42 − 42
+27−27=30
+58
42 − 58
+25−41=34
−11
45.5 − 54.5
+24−33=43
+8
19 − 31
+12−24=14
−47
40 − 45
+16−21=48
0
42.5 − 56.5
+22−36=41
−29
 58.5 − 40.5
+45−27=27
+27
30.5 − 19.5
+19−8=23
+27
  35.5 − 12.5
+26−3=19
−1