welcome
TechCrunch

TechCrunch

Technology

Technology

Meta exec denies the company artificially boosted Llama 4's benchmark scores | TechCrunch

TechCrunch
Summary
Nutrition label

78% Informative

A Meta exec on Monday denied a rumor that the company trained its new AI models to present well on benchmarks while concealing the models’ weaknesses.

In AI benchmarks, test sets are collections of data used to evaluate the performance of a model after it’s been trained.

An unsubstantiated rumor that Meta artificially boosted its new models' benchmark results began circulating on X and Reddit .

VR Score

77

Informative language

74

Neutral language

49

Article tone

formal

Language

English

Language complexity

60

Offensive language

not offensive

Hate speech

not hateful

Attention-grabbing headline

not detected

Known propaganda techniques

not detected

Time-value

short-lived

Source diversity

2

Affiliate links

no affiliate links