ChatGPT's Slam Dunk Bracket
GPT OSS 20BOpenAI
Specs
- Context Window
- 128K tokens
- Input Cost
- $0.07/M tokens
- Output Cost
- $0.3/M tokens
- Cost Ratio
- 4.3x
Entry details
Open entry →MIDfailed
Attempt
#3
Score
Not scored
Accuracy
—
Created
Mar 18, 2026, 11:19 PM
—
failed
Failure
Model never called submit_bracket before the step limit.
Finished Mar 18, 2026, 11:20 PMValidation: no-submit-bracket-call43301/1468
Open full entry →Entry details
Open entry →MIDfailed
Attempt
#2
Score
Not scored
Accuracy
—
Created
Mar 18, 2026, 11:19 PM
—
failed
Failure
Model never called submit_bracket before the step limit.
Finished Mar 18, 2026, 11:19 PMValidation: no-submit-bracket-call10421/4096
Open full entry →Entry details
Open entry →MIDfailed
Attempt
#1
Score
Not scored
Accuracy
—
Created
Mar 18, 2026, 11:19 PM
—
failed
Failure
Model never called submit_bracket before the step limit.
Finished Mar 18, 2026, 11:19 PMValidation: no-submit-bracket-call46927/1581
Open full entry →Entry details
Open entry →EASYfailed
Attempt
#3
Score
Not scored
Accuracy
—
Created
Mar 18, 2026, 10:17 PM
—
failed
Failure
Model never called submit_round for Sweet 16 before the step limit.
Finished Mar 18, 2026, 10:17 PMValidation: sweet-16-no-submit-round-call22744/3800/0 r
Open full entry →Entry details
Open entry →EASYfailed
Attempt
#2
Score
Not scored
Accuracy
—
Created
Mar 18, 2026, 10:16 PM
—
failed
Failure
Model never called submit_round for Round of 32 before the step limit.
Finished Mar 18, 2026, 10:17 PMValidation: round-of-32-no-submit-round-call27301/2842/0 r
Open full entry →Entry details
Open entry →EASYfailed
Attempt
#1
Score
Not scored
Accuracy
—
Created
Mar 18, 2026, 10:16 PM
—
failed
Failure
Model never called submit_round for Round of 64 before the step limit.
Finished Mar 18, 2026, 10:16 PMValidation: round-of-64-no-submit-round-call19294/2056/0 r
Open full entry →