(Edit: I wrote the following post in Microsoft Word, and pasted it here. The nice formatting of course didn’t transfer when I pasted the text below. I should have seen that coming, but I don’t have time to fix it. I have to go to a Chess tournament that starts at noon!)
(Edit again: Tried reformatting. Didn’t work. The forum software seems to think white space is a waste of bits. Also, I don’t know if anyone else has this problem, but when I try to edit after about 15 lines, it’s darned near impossible. It seems to scroll all over the place, but not where I’m typing. Is there some setting somewhere?)
I’ve been working with my program. I think I got the color allocation right now.
I’m very pleased with the results so far. I continued working with an 18 player, 5 round, tournament, using randomly selected players from the USCF database, and assuming no unrated players, no byes, no upsets, and no draws. That’s pretty unrealistic, but it’s just a baseline analysis. It took a long time to do stats on this tournament. As time goes on, I’ll be building in automatic statistical calculations and a “simulation” mode that will randomly do pairs and have a random game outcome generation based on ELO ratings, but that’s a lot of work. For now, I just ran through the pairings on my program, and on WinTD set to Standard Swiss, and wrote things down and exported to Excel and did some stats.
So, how did it do? Pretty darned good. In some ways better than the standard/WinTD method. Mostly, it conformed to my expectations. The rest of this message is a lengthy explanation of findings, so if you aren’t interested in that sort of thing, you can stop reading now. The exact pairings are shown at the end of this message. First I’ll give the analysis.
Comparing overall pairings, as expected, my method produced gradually closer games as the day went on, with the final games happening in the final round. WinTD shoved the games in more rapidly, so that the closest overall games were in round 4, with round 5 being less competitive. Overall, WinTD had less ratings variation. The round by round ratings variations are as follows. Surprisingly, the standard deviations of ratings variation was generally higher with my program than with WinTD. I’m going to have to see if that’s a bug, or a statistical fluke due to ratings distribution. I would expect my program to have less variation between games in a given round.
1 2 3 4 5 Overall
WinTD 1204 546 602 335 568 604
Lame 1204 746 369 498 347 679
Color allocation:
How often did someone not get due color? Define an “alternating color fault” as one in which a player had the right number of colors assignments, but did not alternate. (e.g. BWW) and an “equalizing color fault” as one in which the wrong number of colors were played. (e.g. WBWW). My program produced two alternating color faults and two equalizing color faults. WinTD produced 8 alternating color faults and one equalizing color fault. At the end of the tournament, all players had played 3 of one color and 2 of another color for both programs.
Intermediate standings:
In a tournament with no upsets, you would expect players to be ranked consistently throughout. With my method, there is never a time in which a player with a higher rating had a lower score. With WinTD, such situations begin in round 3 when the top player in the losers’ bracket from round 1 also lost in round 2. I haven’t done a full analysis including tiebreakers.
Overall conclusions: I like my method, at least for tournaments with enough rounds to do a complete “sort” of the players. I’ll have to do a lot more analysis, not to mention just making sure I didn’t have a data entry error, but it’s looking good. The one thing I didn’t like was that overall, the games weren’t as competitive, and I’ll have to take a look at why. I’ll definitely have to try all sorts of variations, with byes and draws and unrated players and what happens when players’ “true ability” is higher or lower than the rating. Nevertheless, I think it’s promising.
The players below are listed by their ratings.
Lame Method WinTD
673 2225 673 2225
2114 586 2114 586
457 2062 457 2062
1901 397 1901 397
285 1305 285 1305
1229 212 1229 212
114 1179 114 1179
900 112 900 112
105 863 105 863
2225 1229 2225 1229
1179 2114 1305 2114
2062 900 2062 900
863 1901 1179 1901
1305 114 863 673
212 673 586 285
586 285 112 457
112 457 397 114
397 105 212 105
1901 2225 2062 2225
2114 1305 2114 1901
673 2062 1305 863
1229 586 1229 586
457 1179 457 1179
900 397 900 397
114 863 673 212
285 112 285 112
105 212 114 105
2225 2062 2225 2114
863 2114 1179 2062
1179 1901 1901 1305
1305 900 863 1229
397 1229 673 900
285 673 586 114
586 105 212 457
212 457 397 285
112 114 105 112
2114 2225 1901 2225
2062 1305 2114 900
1901 1229 1229 2062
673 1179 1305 586
900 586 397 1179
457 863 457 863
212 397 114 673
114 285 285 105
105 112 112 212