Nature

Has your paper been used to train an AI model? Almost certainly

Summary
Nutrition label

75% Informative

Academic publishers are selling access to research papers to technology firms to train artificial-intelligence (AI) models.

Some researchers have reacted with dismay that such deals are happening without authors being consulted.

Experts say that, if a research paper hasn’t yet been used to train a large language model (LLM), it probably will be soon.

Even if it were possible to prove that an LLM has been trained on a certain text, it is not clear what happens next.

Publishers maintain that, if developers use copyrighted text in training and have not sought a licence, that counts as infringement.

Many academics are happy to have their work included in LLM training data.

VR Score: 86
Informative language: 92
Neutral language: 44
Article tone: informal
Language: English
Language complexity: 51
Offensive language: not offensive
Hate speech: not hateful
Attention-grabbing headline: not detected
Known propaganda techniques: not detected
Time-value: long-living
External references: no external sources
Source diversity: no sources
Affiliate links: no affiliate links