hrothgar, on 2024-November-22, 09:29, said:
Thanks very much for posting how well GIB / Ben do in the various events
If possible, it would be interesting to see the Standard Deviation in addition to the means.
To me, the most interesting result is how much worse GIB does in the Zenith daylong than any other event...
Are the conditions of contest significantly different?
If possible, it would be interesting to see the Standard Deviation in addition to the means.
To me, the most interesting result is how much worse GIB does in the Zenith daylong than any other event...
Are the conditions of contest significantly different?
Zenith has 120 instances.
If we look at how the 120 MP-percentages of the instances are distributed, the standard deviation is about 7.
The score of an instance could be anywhere between 40% and 70%, but is mostly in the low 50s. And the distribution is a bit skewed towards higher scores.
The reason why GIB is a bit weaker on Zenith is because GIB's strength is declarer play and GIB's weakness is bidding, especially competitive bidding.
In Zenith there is less declaring and more competitive bidding (because the boards are random, not best hand) and this disadvantages GIB.
In my opinion, when users get an impression of how robots play, subjectively declarer play doesn't count as much. Because in a typical best hand tournament the robot rarely declares.
What the user is most exposed to is bidding with the robot, and some defense. With bidding being the weak spot (very much for BasicGIB), the impression is that the robot is much worse than it is.