
OpenAI GPT score on FrontierMath Benchmark by June 30?

45%+: >99%

50%+: >99%

60%+: 99%

70%+: 99%

$42,032 Vol.

Jun 30, 2026

60%+: $29K Vol.

99%

53%

70%+: $14K Vol.

99%

79%

Rules

This market will resolve to "Yes" if any OpenAI GPT model achieves the listed score or greater on the FrontierMath Exam by February 28, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to Epoch AI's FrontierMath benchmarking leaderboard (https://epoch.ai/frontiermath) for Tiers 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from Epoch AI; however, a consensus of credible reporting may also be used.


TrashGirl Papermarket 6h ago

Well... typical Polymarket: the rules don't cover this situation at all. Basically, all models got +42% for free due to errors in the tasks. That doesn't reflect the spirit of the market, where the models were supposed to improve, not the benchmark. There is now a published v1, on which everything will resolve to No, and a v2, on which it will resolve to Yes. The rules say nothing about two versions of the benchmark. I think it would be fair to resolve 50-50 in these circumstances, but I won't insist.

iusedtowritepoetryforaliving Papermarket 5h ago

Should just resolve on v1 imo

TrashGirl Papermarket 5h ago

When 42% of the tasks are changed, it is literally a new benchmark, not a minor upgrade. One could appeal to the part of the rules that states "Studies which are not included in the leaderboard will not be considered", but I guess the other side will find arguments too. The rules really are incomplete for this situation, so I think 50-50 is the fairest option.

StopKillingBlackTransgenderKids Papermarket 5h ago

No way, the guy who bought hundreds of dollars of shares doesn't want to lose! FYI, this error was known for months, and they said they would fix it soon.