There's quite a bit on that lately. Like, who didn't see this coming. I wonder, why is there so much focus on owning and controlling social media, and rich and powerful people put so much worth on it?
The truth is whatever they want it to be. The mind is not yours to wonder or wander.
Chatbots were generally bad at declining to answer questions they couldn’t answer accurately, offering incorrect or speculative answers instead. Premium chatbots provided more confidently incorrect answers than their free counterparts. Multiple chatbots seemed to bypass Robot Exclusion Protocol preferences. Generative search tools fabricated links and cited syndicated and copied versions of articles. Content licensing deals with news sources provided no guarantee of accurate citation in chatbot responses.
Our findings were consistent with our previous study, proving that our observations are not just a ChatGPT problem, but rather recur across all the prominent generative search tools that we tested.
The results were collated to their respective LLM, producing the visual chart that displays a lot more red and pink than shades of green — displaying a puzzling bias toward inaccuracies. According to the chart below, Perplexity and Perplexity Pro returned the most accurate results, while it seems that both Grok models tested and Gemini struggled to return the correct answers. CJR accuracy block graph1-3.18.25
The use of AI chatbots for fact-checking is on the rise. However, Grok's unsolicited claims about "white genocide" show that the responses of Grok, ChatGPT, Meta AI and other chatbots are not always reliable.
"Hey, @Grok, is this true?" Ever since Elon Musk's xAI launched its generative artificial intelligence chatbot Grok in November 2023, and especially since it was rolled out to all non-premium users in December 2024, thousands of X (formerly Twitter) users have been asking this question to carry out rapid fact checks on information they see on the platform.
A recent survey carried out by a British online technology publication TechRadar
found that 27% of Americans had used artificial intelligence tools such as OpenAI's ChatGPT, Meta's Meta AI, Google's Gemini, Microsoft's Copilot or apps like Perplexity instead of traditional search engines like Google or Yahoo.
But how accurate and reliable are the chatbots' responses? Many people have asked themselves this question in the face of Grok's recent statements about "white genocide" in South Africa. Apart from Grok's problematic stance on the topic, X users were also irritated about the fact that the bot started to talk about the issue when it was asked about completely different topics, like in the following example: A screenshot of a post on X shows how a user asked Grok about HBO and unwantedly received information about "white genocide" in South Africa.