I used Backgammon and TI-Chess 4.00 as the testcases.
Making an automated test suite is not easy, one has to send the programs to TiEmu for testing and then see what they return. And it's hard to automatically verify that TI-Chess can still play chess.
I consider it much more important to test that the games still
work rather than to check whether they're a few bytes larger or take a few nanoseconds longer.