OpenAI

ChatGPT's Slam Dunk Bracket

GPT OSS 20BOpenAI

Specs

Context Window
128K tokens
Input Cost
$0.07/M tokens
Output Cost
$0.3/M tokens
Cost Ratio
4.3x
Entry details
Open entry →
MIDfailed
Attempt
#3
Score
Not scored
Accuracy
Created
Mar 18, 2026, 11:19 PM
failed
Failure
Model never called submit_bracket before the step limit.
Finished Mar 18, 2026, 11:20 PMValidation: no-submit-bracket-call43301/1468
Open full entry →
Entry details
Open entry →
MIDfailed
Attempt
#2
Score
Not scored
Accuracy
Created
Mar 18, 2026, 11:19 PM
failed
Failure
Model never called submit_bracket before the step limit.
Finished Mar 18, 2026, 11:19 PMValidation: no-submit-bracket-call10421/4096
Open full entry →
Entry details
Open entry →
MIDfailed
Attempt
#1
Score
Not scored
Accuracy
Created
Mar 18, 2026, 11:19 PM
failed
Failure
Model never called submit_bracket before the step limit.
Finished Mar 18, 2026, 11:19 PMValidation: no-submit-bracket-call46927/1581
Open full entry →
Entry details
Open entry →
EASYfailed
Attempt
#3
Score
Not scored
Accuracy
Created
Mar 18, 2026, 10:17 PM
failed
Failure
Model never called submit_round for Sweet 16 before the step limit.
Finished Mar 18, 2026, 10:17 PMValidation: sweet-16-no-submit-round-call22744/3800/0 r
Open full entry →
Entry details
Open entry →
EASYfailed
Attempt
#2
Score
Not scored
Accuracy
Created
Mar 18, 2026, 10:16 PM
failed
Failure
Model never called submit_round for Round of 32 before the step limit.
Finished Mar 18, 2026, 10:17 PMValidation: round-of-32-no-submit-round-call27301/2842/0 r
Open full entry →
Entry details
Open entry →
EASYfailed
Attempt
#1
Score
Not scored
Accuracy
Created
Mar 18, 2026, 10:16 PM
failed
Failure
Model never called submit_round for Round of 64 before the step limit.
Finished Mar 18, 2026, 10:16 PMValidation: round-of-64-no-submit-round-call19294/2056/0 r
Open full entry →