Largest study of its kind shows AI assistants misrepresent news content 45% of the time – regardless of language or territory

sqgl@sh.itjust.works · 20 days ago

jordanlund@lemmy.world · 20 days ago

I wish they had broken it out by AI. The article states:

“Gemini performed worst with significant issues in 76% of responses, more than double the other assistants, largely due to its poor sourcing performance.”

But I don’t see that anywhere in the linked PDF of the “full results”.

This sort of study should also be repeated from time to time to track how the results change across AI versions.