Does ChatGPT Have A Blocklist For Certain News Websites?

ChatGPT appears to be testing allowlists and blocklists for news websites, according to the tech news site, TestingCatalog.

The data, according to the publication which used reverse-engineering efforts to obtain the data, was seen as a list of different domains in five categories in the latest web release of OpenAI’s large language model ChatGPT. 

Testing on ChatGPT 4, they found that asking for recent information (e.g. “What is the latest news on [news site]?”) works for the domains listed in the allowlist and does not for those on the blocklist.

Via TestingCatalog

“It is not fully clear if this list is experimental or if it is overriding another internal instruction but it makes one point visible — not all domains will be treated equally by OpenAI,” they wrote.

According to ChatGPT, the company’s “approach to curating sources and information within ChatGPT involves a careful selection process aimed at providing accurate, reliable, and non-misleading information to users.” They claim that these decisions are influenced by a number of factors including reliability and accuracy, bias and objectivity, content policies (meaning they block sites that “promote hate speech, misinformation, or other harmful content”), and quality control.

Curiously, the allowlist appears to be very short: wikipedia.org, reuters.com, aljazeera.com, politico.com, foxnews.com, foxsports.com, bleacherreport.com, sportingnews.com, foxsports.com.au, indiatoday.in, zeenews.india.com. It also includes Fox News, which paid Dominion Voting Systems $787.7 million to settle a defamation case that alleged the network spread election disinformation.

The blocklist, on the other hand, is a long one and it includes the other mainstream media sites as well as smaller outfits like Axios, Ars Technica, Business Insider, Tech Radar, Rolling Stone, and Vox. You can see the full lists in the TestingCatalog article here.

The blocklist may also be a response to major media outlets blocking ChatGPT from crawling and training on their content. In August, it was reported that a growing number of sites have started to block OpenAI’s web crawler, GPTBot.


Information for this story was found via the sources and companies mentioned. The author has no securities or affiliations related to the organizations discussed. Not a recommendation to buy or sell. Always do additional research and consult a professional before purchasing a security. The author holds no licenses.

Video Articles

Soma Gold: Q3 Earnings Impacted By Labour Strike

Thesis Gold: The Multi-Billion Dollar Lawyers-Ranch PFS

Why Canada Has So Few Projects That Can Be Built Before 2030 | Dan Wilton – First Mining

Recommended

Northern Superior Shareholders Set To Receive Shares Of ONGold Resources Friday

Goliath Resources Sees Rob McEwen Increase Ownership Interest

Related News

Robot Lawyer DoNotPay Claims It Will Use GPT-4 for ‘One-Click Lawsuits,’ But GPT-4 Has A Dissenting Opinion

DoNotPay, Inc, the New York-based startup behind the app that claims to be “the world’s...

Thursday, March 16, 2023, 03:01:00 PM

China Arrests Man Over Release of ChatGPT-Generated Fake News

Today in that’s-probably-not-how-you-use-artificial-intelligence, a man in China has been arrested for reportedly using ChatGPT to...

Thursday, May 11, 2023, 06:14:00 AM

Under Pressure, OpenAI Scraps For-Profit Restructuring

OpenAI has abandoned its controversial plan to convert into a for-profit company and will instead...

Tuesday, May 6, 2025, 02:16:00 PM

OpenAI’s GPT-4o-Powered ChatGPT Is Now More (Terrifyingly) Conversational and Life-Like

OpenAI on Monday announced GPT-4o, a new flagship generative AI model that expands on the...

Tuesday, May 14, 2024, 05:32:00 PM

Canadian News Giants Sues OpenAI For Exploiting Journalism for Profit

A coalition of Canadian news organizations—including The Canadian Press, Torstar, The Globe and Mail, Postmedia,...

Monday, December 2, 2024, 02:54:00 PM