BEST OF THE WEB

News organizations block AI crawler bots amid copyright concerns

20 per cent of the world’s top 1000 websites are now blocking AI crawler bots, which are used to collect web data for AI services, according to Originality.AI.

The move comes after OpenAI introduced its GPTBot crawler in early August. The GPTBot was designed to collect data from the web to improve future AI models. However, many websites, including the New York Times, Reuters, and CNN, blocked the GPTBot, concerned that it could be used to scrape their content without permission.

The blocking of AI crawler bots is a sign of the growing tension between websites and AI companies over the use of data. While AI companies argue that they need to collect data to train their models, websites are concerned about protecting their content and intellectual property.

According to Originality.AI, the percentage of the top 1000 websites blocking OpenAI’s ChatGPT bot surged from 9.1 per cent on August 22 to 12 per cent on August 29. The situation is further complicated by the lack of clear legal guidelines governing the use of AI crawler bots. As a result, websites are taking matters into their own hands by blocking these bots.

The sources for this piece include an article in Axios.

IT World Canada Staff
IT World Canada Staffhttp://www.itworldcanada.com/
The online resource for Canadian Information Technology professionals.

Would you recommend this article?

Share

Thanks for taking the time to let us know what you think of this article!
We'd love to hear your opinion about this or any other story you read in our publication.


Jim Love, Chief Content Officer, IT World Canada

Featured Download

ITW in your inbox

Our experienced team of journalists and bloggers bring you engaging in-depth interviews, videos and content targeted to IT professionals and line-of-business executives.

More Best of The Web