CCRL 40/4
Downloads and Statistics
August 25, 2008
Testing summary:
Total: 333'259 games
played by 663 programs
5017 CPU days (X2 4600+)

White wins: 126'802 (38.0%)
Black wins: 100'944 (30.3%)
Draws: 105'513 (31.7%)
White score: 53.9%

Pure list

"Pure" list removes rating distortion

"Pure" list is computed to remove the distortion that may affect the main rating list. Distortion appears when several versions or settings of the same engine are included together in the testing study. Suppose you have engine A and several versions of engine B: B1, B2, B3. Suppose also that A is particularly strong versus any version of B, which often happens in real testing because of some characteristics of those engines. In such case A will have higher rating, comparing to the study where only one version of B is present. Same thing may happen when A is weak versus B, getting lower rating.

To remove that distortion, a separate game database is constructed from games played only by best version in each engine "family". To save some space and time, pure database has all moves stripped out, it contains PGN header and results only. Then the "Pure list" is computed based for that "pure" database using Bayeselo.

Pure lists for all classes of engines

All engines   (32-bit)

1-2-CPU engines   (32-bit)

Single-CPU engines   (32-bit)

Free engines   (32-bit)

Free 1-2-CPU engines   (32-bit)

Free single-CPU engines   (32-bit)

Open source engines   (32-bit)

Open source 1-2-CPU engines   (32-bit)

Open source single-CPU engines   (32-bit)

Pure lists for complete database

Pure database download

To save space, pure database has all moves stripped out, it contains PGN header and results only. This pure database is useful only for rating calculation or similar analysis, it does not have actual games, only the results.

Download pure database, 33'916 games:

CCRL 40/4 Rating List — Pure all engines

Ponder off, General books (up to 12 moves), 3-4-5 piece EGTB
Time control: Equivalent to 40 moves in 4 minutes on Athlon 64 X2 4600+ (2.4 GHz)
Computed on August 25, 2008 with Bayeselo based on 33'916 games
RankNameRatingScoreAverage
Opponent
DrawsGamesLOS
ELO+
1Rybka 3 64-bit 4CPU3251 (−4)+22−2183.1%−254.925.1%963 
100.0%
2Naum 3 64-bit 4CPU3072 (−3)+17−1659.0%−60.636.4%1238
98.5%
3Zappa Mexico 64-bit 4CPU3044 (+6)+21−2060.3%−70.738.8%770
73.6%
4Deep Shredder 11 64-bit 4CPU3036 (+5)+18−1854.2%−27.730.8%1108
77.7%
5Hiarcs 12 4CPU3027 (+1)+16−1652.4%−14.638.8%1199
71.1%
6‑7Deep Fritz 10.1 4CPU3021 (+7)+18−1849.5%+1.731.5%1046
47.8%
6‑7Deep Sjeng 3.0 64-bit 4CPU3021 (−2)+19−1950.2%−0.237.8%892
95.8%
8‑9Glaurung 2.1 64-bit 4CPU2999 (+10)+18−1848.1%+15.735.0%998
52.3%
8‑9Toga II 1.4.1SE 4CPU2999 (+5)+19−1946.0%+30.135.2%954
82.5%
10Loop M1-T 64-bit 4CPU2987 (+14)+17−1845.2%+32.237.4%1062
99.9%
11Deep Junior 10 4CPU2947 (+24)+19−1941.4%+59.432.4%894
55.8%
12Bright 0.3a 4CPU2945 (+2)+19−1943.9%+39.135.1%918
100.0%
13Chessmaster 11 4CPU2861 (+5)+38−3764.1%−98.628.5%249
57.6%
14Thinker 5.1e (Passive) 64-bit2856 (+1)+37−3665.8%−110.329.7%259
83.9%
15Scorpio 2.0 32-bit 4CPU2828 (+6)+32−3326.8%+169.927.1%362
55.1%
16Spike 1.2 Turin2825 (−2)+14−1449.2%+8.630.5%2036
54.2%
17Ktulu 8.02824 (0)+24−2457.8%−59.027.5%619
74.8%
18Delfi 5.4 2CPU2811 (+6)+27−2761.0%−72.834.7%447
89.4%
19‑20Chess Tiger 2007.12790 (−26)+19−1954.9%−32.037.1%855
48.7%
19‑20Deep Frenzee May07 64-bit 2CPU2790 (0)+31−3054.5%−30.535.0%343
60.5%
21Pharaon 3.5.1 4CPU2783 (−2)+39−3945.6%+28.432.3%217
76.1%
22SmarThink 1.00 32-bit2765 (+2)+26−2559.9%−75.230.3%532
74.9%
23Alaric 7072754 (−6)+21−2154.2%−29.429.6%780
93.7%
24Movei 00.8.438(10 10 10)2732 (−5)+20−2052.7%−19.932.3%807
58.3%
25Alfil 8.1.1 Optimized2729 (+2)+23−2355.4%−37.731.7%640
53.1%
26E.T. Chess 13.01.082727 (−5)+20−2053.4%−21.835.1%814
62.7%
27WildCat 82722 (−9)+23−2350.3%−4.033.9%605
67.0%
28Booot 4.14.02715 (−6)+27−2746.4%+20.634.8%442
61.0%
29Slow Chess Blitz WV2.12710 (−2)+17−1748.9%+6.232.8%1156
58.9%
30Gandalf 62706 (−8)+31−3145.7%+31.129.1%337
58.6%
31‑33Colossus 2007d2701 (−13)+20−2048.5%+10.632.3%834
52.1%
31‑33Pro Deo 1.22701 (−19)+29−2950.6%−8.428.1%405
49.0%
31‑33Sloppy 0.2.0 64-bit2701 (−7)+24−2446.7%+22.234.2%567
56.0%
34GarboChess 2.10 64-bit2698 (−14)+26−2646.8%+22.426.6%493
52.6%
35SOS 5.12697 (+14)+20−2052.7%−20.330.5%885
72.6%
36BugChess2 1.6.2 64-bit2687 (−7)+24−2448.8%+8.828.2%571
56.7%
37Pseudo 0.7c2685 (+9)+20−2050.5%−3.330.8%881
59.8%
38Ruffian 1.0.52681 (+1)+19−1953.8%−27.434.2%948
54.9%
39Petir 4.392680 (+16)+21−2153.7%−25.733.4%736
71.0%
40Aristarch 4.502672 (+5)+20−2051.6%−12.528.9%892
77.7%
41Hamsters 0.62661 (−1)+18−1851.2%−8.328.8%1073
85.8%
42Crafty 21.5 PS 64-bit2646 (+2)+21−2150.5%−4.031.0%785
71.1%
43Jonny 2.83 32-bit2639 (−12)+19−1948.2%+12.426.1%977
53.1%
44The Baron 1.8.12638 (−8)+20−2048.5%+7.825.5%898
51.8%
45Anaconda 2.0.12637 (−4)+18−1852.4%−16.829.6%1075
85.0%
46Ufim 8.022624 (−4)+17−1753.6%−27.226.5%1284
62.2%
47AnMon 5.602620 (+6)+17−1754.6%−32.727.9%1228
82.1%
48‑49Little Goliath Evolution 3.122609 (0)+17−1754.9%−35.224.7%1169
50.7%
48‑49Trace 1.37a2609 (−4)+17−1752.1%−14.226.3%1167
57.9%
50Yace 0.99.872607 (−4)+18−1853.0%−22.230.1%992
65.3%
51Arasan 10.4 64-bit 2CPU2598 (−8)+34−3544.6%+37.928.6%287
88.7%
52‑53Green Light Chess 3.002574 (−1)+20−2049.7%+0.629.3%894
49.4%
52‑53Tao 5.62574 (+7)+20−2051.0%−8.324.9%893
75.3%
54Abrok 5.02565 (+2)+18−1849.7%+1.827.5%1060
54.9%
55Twisted Logic 20080404x2563 (0)+21−2150.6%−5.024.0%832
71.3%
56Amyan 1.5972555 (−5)+17−1745.8%+32.927.1%1256
72.1%
57Pepito 1.59 64-bit2548 (+3)+20−2051.2%−10.031.9%831
53.8%
58SpiderChess 0706032546 (−6)+16−1649.5%+4.731.5%1268
70.7%
59LambChop 10.992539 (+2)+20−2049.0%+7.727.5%863
54.7%
60Matacz 1.3 HT742538 (+1)+21−2148.4%+10.330.7%734
87.5%
61Kiwi 0.6d2521 (+3)+19−1947.2%+19.426.9%1008
53.4%
62‑63Quark 2.352520 (0)+20−2046.4%+24.926.0%865
50.0%
62‑63Snitch 1.6.2 32-bit2520 (+6)+22−2246.3%+25.626.7%707
70.6%
64Dragon 4.62512 (+4)+18−1846.6%+24.927.1%1113
53.6%
65RomiChess P3k 64-bit2511 (−1)+29−2949.4%+3.423.3%429
62.7%
66Nejmet 3.072505 (−10)+18−1846.7%+24.326.3%1087
80.3%
67Hermann 2.02491 (+4)+23−2345.3%+36.522.8%696
69.1%
68Gosu 0.162483 (−3)+24−2451.6%−12.121.5%643
52.9%
69‑70Gaia 3.5 32-bit2482 (−9)+25−2554.9%−38.219.1%576
50.1%
69‑70Muse 0.899b2482 (+3)+27−2755.9%−45.224.2%491
51.9%
71‑72Amy 0.8.7b DC2481 (+2)+24−2542.6%+52.524.1%576
50.0%
71‑72Delphil 1.82481 (−7)+29−2855.1%−37.426.8%422
69.2%
73Der Bringer 1.92472 (−2)+18−1846.1%+29.325.0%1165
64.6%
74Comet B682467 (−4)+19−1940.5%+68.524.0%984
68.6%
75Homer 2.012458 (−5)+29−2854.5%−35.418.8%452
70.9%
76Amateur 2.822446 (−10)+29−2948.0%+16.721.5%432
58.8%
77Phalanx XXII Reborn JA2442 (+8)+25−2637.7%+87.426.1%544
60.8%
78Djinn 0.925x2436 (−14)+27−2748.7%+10.521.2%496
53.0%
79Cheese 1.22435 (−7)+26−2648.8%+7.224.7%535
59.5%
80Arion 1.72430 (−2)+24−2449.7%+0.229.0%586
99.9%
81Asterisk 0.62366 (+4)+30−3044.4%+43.421.8%400
66.1%
82LittleThought 1.03 64-bit 2CPU2358 (−1)+27−2840.1%+71.121.4%476
57.0%
83Anechka 0.082354 (+1)+27−2744.1%+44.819.3%493
52.6%
84Horizon 4.32353 (−3)+33−3439.6%+79.722.9%327
70.8%
85Sage 2.2a2341 (+2)+30−3141.0%+67.721.9%389
67.5%
86‑87Natwarlal 0.142331 (−18)+27−2737.4%+92.522.1%493
51.0%
86‑87Popochin 2.72331 (+5)+37−3838.2%+83.629.7%249
68.9%
88Atlas 2.202316 (+11)+41−4231.4%+131.628.6%210
66.3%
89Latista 1.502303 (−4)+40−4137.9%+92.422.0%223
94.0%
90BlackBishop 0.982260 (−1)+31−3225.3%+194.019.6%445
 

Explanation of the columns

"Rank" — 1 is best, 2 is second best, etc.. It's simple.
"Engine" — Name and version of an engine.
"ELO" — Engine rating computed with Bayeselo. This column has also a number in brackets, which shows the difference between "Pure" rating and rating computed for complete database. For example "2850 (+10)" in the ELO column means that engine's "pure" rating is 2850, which is 10 points higher than its rating in the complete list.
"+" and "−" — 95% confidence intervals. For example, if engine's rating is 2850, "+" is +20 and "−" is −15, it means that there is only 5% estimated probability that engine's "true" rating is outside of the [2850−15 .. 2850+20] range.
"Score" — Number of points scored by an engine, divided by the number of games. Win is 1 point, draw is 1/2 of a point, and loss is 0. Please note that this is computed for "pure" database, so the numbers are different from the main list.
"Average Opponent" — Difference between the rating of engine tested and average of the opponent ratings for all games played by that engine. (Only games from the "pure" database were counted). Positive number means that engine was playing with stronger opponents, averagely. Negative number - weaker opponents.
"Draws" — Percentage of games by an engine, that ended in a draw. (Only games in "pure" database are counted).
"Games" — Total number of games played by an engine. (Only games in the "pure" database are counted).

The detailed explanation how we construct the "pure" list:

1. We have to find the best versions in each engine family. We can't use the "Best versions" list for that, because the "Best versions" list may be affected by distortion which we are trying to remove. To find the true best version in a family of engines we create separate game database, containing only games by engines from that family. Then we compute the ratings for that small database and take the highest rated engine as best, to represent that family in the "pure" list. There is also a requirement that every engine in the "pure" list must have at least 150 games played with other "pure" engines, and it also must be a public release, not beta or private version.

2. After finding a set of "pure" best versions, we exctract all games where both side engines are from that set, and those games form a "pure" database. Pure list is simply a rating list computed for that database using Bayeselo.

Features of the pure list

First thing that you have to realize about the "pure" list is that it is not necessarily more relevant than the big list of all versions. "Pure" list removes one kind of distortion - distortion that may occur from multiple version of same engine. But the price for that is big - the "pure" database is several times smaller than complete database. This results in much larger statistical error, as you can see in the + / - columns. Also, the "pure" list can still have other types of distortion - distortion resulting from too small (including 0) or too large number of games in particular pairs.

So, don't take this list as certainly superior to the "Best versions" list. This list does not substitute the "Best versions" list, but simply provides a different view for those who may be afraid of distortions. It is possible though that in time this list will become clearly superior, when the "pure" database will be large enough.

Please also realize that some engine version being listed in the "Best versions" list does not guarantee that the same version will be listed in the "Pure" list. Most often it will be the case, but theoretically it is possible that different version will turn out to be the best in the "pure" context.


Crosstable for "pure" database

Results matrix

Pure all engines
#NameELO123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990
1Rybka 3 64-bit 4CPU3251 73 − 27
+57−11=32
−21
 95 − 15
+84−4=22
+74
79.5 − 21.5
+64−6=31
−23
82 − 18
+68−4=28
+3
79 − 21
+62−4=34
−30
82.5 − 17.5
+69−4=27
−13
86 − 14
+75−3=22
+29
83.5 − 17.5
+69−3=29
−25
44.5 − 5.5
+41−2=7
+34
    95 − 6
+90−1=10
+7
                                                                          
2Naum 3 64-bit 4CPU307227 − 73
+11−57=32
+21
 58.5 − 41.5
+39−22=39
+27
46.5 − 36.5
+30−20=33
+4
57.5 − 42.5
+32−17=51
−1
111 − 87
+73−49=76
−11
58.5 − 41.5
+37−20=43
+1
57.5 − 42.5
+44−29=27
−21
55.5 − 44.5
+37−26=37
−36
66 − 31
+50−15=32
+36
39.5 − 20.5
+27−8=25
−23
72 − 28
+55−11=34
+21
  81 − 19
+70−8=22
−8
                                                                           
3Zappa Mexico 64-bit 4CPU3044 41.5 − 58.5
+22−39=39
−27
 54 − 43
+33−22=42
+26
51 − 49
+24−22=54
−11
32.5 − 14.5
+27−9=11
+118
  13.5 − 16.5
+8−11=11
−77
57 − 42
+35−20=44
−10
66 − 34
+47−15=38
+7
33.5 − 16.5
+22−5=23
+5
  34.5 − 14.5
+28−8=13
−69
81 − 17
+69−5=24
+27
                                                                          
4Deep Shredder 11 64-bit 4CPU303615 − 95
+4−84=22
−74
36.5 − 46.5
+20−30=33
−4
43 − 54
+22−33=42
−26
 63.5 − 34.5
+52−23=23
+99
24.5 − 24.5
+16−16=17
−15
61 − 38
+47−24=28
+63
65 − 34
+50−19=30
+70
47 − 48
+32−33=30
−40
42 − 42
+27−27=30
−53
57 − 43
+42−28=30
−40
31.5 − 18.5
+23−10=17
−5
  39 − 8
+34−3=10
+44
75.5 − 21.5
+61−7=29
−15
                                                                          
5Hiarcs 12 4CPU302721.5 − 79.5
+6−64=31
+23
42.5 − 57.5
+17−32=51
+1
49 − 51
+22−24=54
+11
34.5 − 63.5
+23−52=23
−99
 55.5 − 44.5
+35−24=41
+26
53.5 − 46.5
+32−25=43
+14
55 − 45
+36−26=38
+4
59 − 41
+43−25=32
+33
58 − 42
+41−25=34
+13
59 − 41
+37−19=44
−24
62 − 38
+42−18=40
−6
   79 − 21
+62−4=34
−2
                                                                          
6Deep Fritz 10.1 4CPU302118 − 82
+4−68=28
−3
87 − 111
+49−73=76
+11
14.5 − 32.5
+9−27=11
−118
24.5 − 24.5
+16−16=17
+15
44.5 − 55.5
+24−35=41
−26
 25.5 − 27.5
+17−19=17
−10
23.5 − 26.5
+14−17=19
−41
55 − 44
+40−29=30
+17
31 − 19
+24−12=14
+50
29 − 21
+24−16=10
−16
125 − 75
+95−45=60
+10
    40.5 − 9.5
+37−6=7
+57
                                                                         
7Deep Sjeng 3.0 64-bit 4CPU302121 − 79
+4−62=34
+30
41.5 − 58.5
+20−37=43
−1
 38 − 61
+24−47=28
−63
46.5 − 53.5
+25−32=43
−14
27.5 − 25.5
+19−17=17
+10
 57 − 43
+37−23=40
+21
61 − 39
+42−20=38
+48
54.5 − 45.5
+33−24=43
−5
27.5 − 12.5
+20−5=15
+49
    73 − 27
+55−9=36
−42
                                                                          
8Glaurung 2.1 64-bit 4CPU299917.5 − 82.5
+4−69=27
+13
42.5 − 57.5
+29−44=27
+21
 34 − 65
+19−50=30
−70
45 − 55
+26−36=38
−4
26.5 − 23.5
+17−14=19
+41
43 − 57
+23−37=40
−21
 45.5 − 54.5
+28−37=35
−28
56.5 − 42.5
+36−22=41
+31
31.5 − 18.5
+24−11=15
+35
63.5 − 36.5
+45−18=37
+33
   75 − 25
+55−5=40
−10
                                                                          
9Toga II 1.4.1SE 4CPU299914 − 86
+3−75=22
−29
44.5 − 55.5
+26−37=37
+36
16.5 − 13.5
+11−8=11
+77
48 − 47
+33−32=30
+40
41 − 59
+25−43=32
−33
44 − 55
+29−40=30
−17
39 − 61
+20−42=38
−48
54.5 − 45.5
+37−28=35
+28
 45 − 40
+21−16=48
+2
31.5 − 22.5
+23−14=17
+3
16 − 15
+8−7=16
−42
  21.5 − 8.5
+17−4=9
−20
23.5 − 6.5
+18−1=11
+16
                                                                          
10Loop M1-T 64-bit 4CPU298717.5 − 83.5
+3−69=29
+25
31 − 66
+15−50=32
−36
42 − 57
+20−35=44
+10
42 − 42
+27−27=30
+53
42 − 58
+25−41=34
−13
19 − 31
+12−24=14
−50
45.5 − 54.5
+24−33=43
+5
42.5 − 56.5
+22−36=41
−31
40 − 45
+16−21=48
−2
 58.5 − 40.5
+45−27=27
+24
30.5 − 19.5
+19−8=23
+25
  35.5 − 12.5
+26−3=19
−3
34.5 − 15.5
+28−9=13
−26
                                                                          
11Deep Junior 10 4CPU29475.5 − 44.5
+2−41=7
−34
20.5 − 39.5
+8−27=25
+23
34 − 66
+15−47=38
−7
43 − 57
+28−42=30
+40
41 − 59
+19−37=44
+24
21 − 29
+16−24=10
+16
12.5 − 27.5
+5−20=15
−49
18.5 − 31.5
+11−24=15
−35
22.5 − 31.5
+14−23=17
−3
40.5 − 58.5
+27−45=27
−24
 45 − 48
+29−32=32
−10
   32.5 − 17.5
+24−9=17
−21
33.5 − 14.5
+27−8=13
+17
                                                                         
12Bright 0.3a 4CPU2945 28 − 72
+11−55=34
−21
16.5 − 33.5
+5−22=23
−5
1