Combined USCF and FIDE rating databases

NERD ALERT *** NERD ALERT *** NERD ALERT :slight_smile:

For anyone who wants to see how I generate the combined database, I have made a git repository available at http://www.sudburyriverchess.com/git/ratings. You can see the scripts behind the curtain by creating a local copy of the repository:

git clone http://www.sudburyriverchess.com/git/ratings

(Clicking on the URL directly will just give you a blank page.)

If this message makes about as much sense to you as it would if it were written in Aramaic, well, I put the ā€œNERD ALERTā€ header at the top for a reason. :smiling_imp:

I’m happy to announce a new feature coming to the combined database!

Currently, the combined database contains data for exactly those US Chess members whose ID appears in the corresponding golden database. However, members do join or renew after the cutoff for the golden database. The custom rating lists I use to get the members’ full names also includes expiration dates and ID numbers as of the request for the custom rating list.

Starting with the September file, the combined database will now include up-to-date member expiration dates and will also include (almost) all US Chess ID numbers. (I have to fudge that last statement. The custom rating lists I use include all fifty states plus PR, ON, AE, AP, QC, BC, VI, and GU (the most common ā€œstatesā€ in the golden database that are not actually US states). New members in other ā€œstatesā€ will not appear in the combined database until they appear in the golden database.)

US Chess has been working on a new format for rating supplements which includes some FIDE information as well as some other useful fields. The format being tested is a JSON data file.

However, at this time there are no plans for it to contain FIDE ratings. That’s because the release cycle for US Chess ratings lists and the FIDE rating lists are not compatible, so either the US Chess information will be out of date relative to the FIDE data or vice-versa. US Chess’s official position is that organizers running FIDE events should go to FIDE for up-to-date ratings data. US Chess tries to maintain accurate linkages between US Chess member IDs and FIDE IDS.

Testing on this new format is proceeding, though slowly, and there is as of yet no commitment that this format will ever become an officially supported format.

Should the format change, I will continue providing this service with a combined database in the new format. As I have previously stated, the source for the FIDE ratings in my database is the FIDE rating list (FRL) that FIDE officially announces the last day of each month (to take effect the following day). It is true that I have seen ratings change during the month (and the players_list.zip file updated), but I don’t know why that happens. In any case, I feel safe using the official published ratings effective the first of the month.

Something like this really needs to be a regular thing. It doesn’t need the FIDE ratings. That’s easy enough to merge together for those who need it. Have there been any changes or an update on this file? I sent you some notes on the file you gave me a while back. Did you see them by any chance?

To be honest, I don’t remember, Bill. I’ve been sort of busy with family matters lately.

Thanks a lot for doing this each month Ken. I use this database every month since it has players full name and the FIDE information is also nice for the occasional FIDE rated tournament I help out with. Unlike the version of the database provided by US Chess, your version of the database includes duplicate and deceased players ID’s. Is there any way those can be taken out of the database?

If we take deceased players out of the database completely, what would happen to the name displayed in crosstables for those players? (Answer: It shows up as an error, there are around 500 IDs deleted from our database back in the 90’s for whom this already happens, because we no longer have records for those IDs.)

It doesn’t happen often, but sometimes we have to reactivate a deceased player so that an event that person was competing in can be rated. After the event is rated, we change the status of that player back to deceased.

Unfortunately, there is no ā€˜status’ field in either the monthly or golden master rating supplement formats, probably because it did not occur to the people designing those formats many years ago to include one, and adding one would break at least one of the pairing programs which utilize those files.

I wasn’t suggesting we take deceased players out of the database completely.

What about duplicate IDs? Is it possible to take those out completely? Could the crosstables including the duplicate IDs be updated with the correct ID?

There are over 500 IDs flagged as duplicates that are still in the crosstable system, which probably means that the ratings for those IDs were never merged together.

They might need to be reviewed on a case-by-case basis, to prevent updating them without losing track of what the currently are in those events and without causing a large number of players to see a change in their rating after what would almost certainly be a massive rerate would be complicated.

Changing the player ID in an event before the rerate window (events initially rated after 1/1/2004) causes the pre and post event ratings for the changed players in those events to show as blanks.

When we looked at it a few years ago, we did not not see a reasonable solution to these issues.

In other words, it is complicated enough that doing anything might best be done as part of the re-engineered database and MSA system.

I don’t think we want to change what’s currently shown on MSA for duplicated IDs.

TDs who are using current ratings (as opposed to official published ratings) are looking those players up on MSA anyway, aren’t they? So they’ll see the duplicate ID.

One possibility might be to change the name field in the rating supplement files to show something like ā€œDUP,SEE 99999999ā€ instead of the player’s name. That fits in the 17 characters currently allocated for the name field in the supplement files, but it might cause other problems so it would need further study, including consultation with TDs and the pairing program authors. And there are people who use the supplement files for things other than looking up ratings, it might interfere with that.

The August url link is not active yet. The ā€œlatestā€ link still points to July.

I’m sorry for the failure. I have an automated procedure that produces the combined database, uploads it to the web site, and posts the announcement in this thread. (No, I’m not up at 3:30 AM EDT generating the database!) Apparently something went wrong and my script did not catch the error.

In any case, the August combined database is now available. I regret any inconvenience to the users of this database. As I am at the U.S. Open, I can’t investigate what went wrong until I get back home.

I’m doing some cleanup on my web hosting server. I have old combined databases going back to 1/2014. (Wow, I’ve been doing this almost six years!) Those old databases are taking up a fair bit of space, and I really doubt anyone needs them any longer. So, I’m purging all the combined databases before 2019.

If you were counting on these being available … oops, sorry.

Thanks, Ken, for doing these. It has been a real help. How you scrape the email download for the longer names is neat. But, what a necessary ugly hack. It’d be nice if there was a better way. I know there currently isn’t.

If there is a new supplement/golden master format defined at some point, it could include a longer name field. (17 characters is kind of small, we have hundreds of members whose last name is longer than that. But that spec was defined long before I was doing IT work for the Federation.)

Be aware, though, that even in a longer name field truncation is sometimes necessary, because we limit the combined length of the prefix/title, first name, last name and suffix fields to 30 characters.

If? We’ve been asking for a better format for a long time now. In the meantime we will continue to use Ken’s version of the database as it is superior to the one offered by US Chess (has players full name, has all three OTB US Chess ratings).

This is very cool; thank you! :slight_smile:

The July 2020 combined database is now available. The link to the latest version of the database will now return the July 2020 database. (There is also a direct link to this database: https://www.sudburyriverchess.com/ratings/uscf-fide_2020-07.zip.)

There is a SwissSys database configuration file for use with the combined database at https://www.sudburyriverchess.com/ratings/uscf-fide.dbp. If you haven’t already configured SwissSys to use the combined database, you can download this file, launch SwissSys, choose ā€œDatabase Setupā€ under the ā€œDatabaseā€ menu, and click the ā€œLoad Settingsā€¦ā€ button. Just change the ā€œDatabase fileā€ field in the ā€œDatabase Setupā€ dialog to point to the directory where you have placed the uscf-fide.dbf file and click the ā€œOK - Applyā€ button. If you’ve already configured SwissSys to work with the database, you don’t need the uscf-fide.dbp file.

The August 2020 combined database is now available. The link to the latest version of the database will now return the August 2020 database. (There is also a direct link to this database: https://www.sudburyriverchess.com/ratings/uscf-fide_2020-08.zip.)

There is a SwissSys database configuration file for use with the combined database at https://www.sudburyriverchess.com/ratings/uscf-fide.dbp. If you haven’t already configured SwissSys to use the combined database, you can download this file, launch SwissSys, choose ā€œDatabase Setupā€ under the ā€œDatabaseā€ menu, and click the ā€œLoad Settingsā€¦ā€ button. Just change the ā€œDatabase fileā€ field in the ā€œDatabase Setupā€ dialog to point to the directory where you have placed the uscf-fide.dbf file and click the ā€œOK - Applyā€ button. If you’ve already configured SwissSys to work with the database, you don’t need the uscf-fide.dbp file.