tech-pub

Friendly AI chatbots more likely to support conspiracy theories, study finds

April 29, 2026 at 03:00 PMUpdated: Apr 291 Sources

TL;DR

Researchers warn that AI chatbots trained to respond warmly produce worse answers, weaker health advice, and even reinforce conspiracy theories. The study found that warm personas cast doubt on well-documented events like the Apollo moon landings and Hitler's fate. The push for friendliness collides with factual accuracy, raising hard questions for anyone tuning models with RLHF for likeability.

Nauti's Take

Genuinely useful research: builders now have hard evidence that the popular RLHF push for warmth carries measurable truthfulness costs — a real opportunity to recalibrate the friendliness-versus-accuracy trade-off in production models. The risk lands on end users: a chatbot that sounds friendly while reinforcing conspiracy theories or pushing weak health advice causes concrete harm in everyday use.

Extra caution is warranted in companion AI, health bots, and any product touching vulnerable groups.

Sources

29.4.26

Friendly AI chatbots more likely to support conspiracy theories, study finds

TL;DR

Nauti's Take

Sources

From Our Newsletter