TechCrunch

Technology

OpenAI's o3 AI model scores lower on a benchmark than the company initially implied | TechCrunch

Summary
Nutrition label

85% Informative

First- and third-party benchmark results for OpenAI's o3 AI model are raising questions about the company's transparency and model testing practices.

When OpenAI unveiled o3 in December, the company claimed the model could answer just over a fourth of questions on FrontierMath, a challenging set of math problems.

That figure was likely an upper bound, achieved by a version of o3 with more computing power.

VR Score: 88

Informative language: 87

Neutral language: 70

Article tone: formal

Language: English

Language complexity: 50

Offensive language: not offensive

Hate speech: not hateful

Attention-grabbing headline: not detected

Known propaganda techniques: not detected

Time-value: short-lived

Affiliate links: no affiliate links