News
In the exercise, VERSES compared OpenAI advanced reasoning model o1-preview to Genius. Each model attempted to crack the Mastermind code on 100 games with up to ten guesses to crack the code.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results