Close search
 
Home | News | COUNTER’s bot repository

COUNTER's bot repository

01 July 2026

For many years, COUNTER’s list of bots and spiders was hosted in a GitHub repository that we didn’t control. As part of the fix-and-feature Release 5.1.1 of the Code of Practice, we’ve moved and expanded the repository. COUNTER Bots is now part of our own GitHub account.

Some things haven’t changed

  • We’re still requiring crawler and bot activity to be stripped out so that only genuine, user-driven usage is included in COUNTER reports.
  • Our repository is still a non-exclusive list that is open for extension. WYou can add a bot or crawler to the list by making a new pull request with your additions.
  • And we still recommend case-insensitive matching, so that ‘bot’ will match ‘BOT’, ‘Bot’, ‘BoT’ etc.

So what’s new?

We’ve pulled in a much more comprehensive list of bots from the ai.robots.txt repository. The repo includes AI-related crawlers of all types, regardless of purpose. The longer list can be used to exclude AI usage from COUNTER reports, and to help identify AI usage in line with the best practice on Generative and Agentic AI usage metrics.

This website uses cookies
This site uses cookies to enhance your browsing experience. We use necessary cookies to make sure that our website works. We’d also like to set analytics cookies that help us make improvements by measuring how you use the site. By clicking “Allow All”, you agree to the storing of cookies on your device to enhance site navigation, analyse site usage, and assist in our marketing efforts.
These cookies are required for basic functionalities such as accessing secure areas of the website, remembering previous actions and facilitating the proper display of the website. Necessary cookies are often exempt from requiring user consent as they do not collect personal data and are crucial for the website to perform its core functions.
A “preferences” cookie is used to remember user preferences and settings on a website. These cookies enhance the user experience by allowing the website to remember choices such as language preferences, font size, layout customization, and other similar settings. Preference cookies are not strictly necessary for the basic functioning of the website but contribute to a more personalised and convenient browsing experience for users.
A “statistics” cookie typically refers to cookies that are used to collect anonymous data about how visitors interact with a website. These cookies help website owners understand how users navigate their site, which pages are most frequently visited, how long users spend on each page, and similar metrics. The data collected by statistics cookies is aggregated and anonymized, meaning it does not contain personally identifiable information (PII).
Marketing cookies are used to track user behaviour across websites, allowing advertisers to deliver targeted advertisements based on the user’s interests and preferences. These cookies collect data such as browsing history and interactions with ads to create user profiles. While essential for effective online advertising, obtaining user consent is crucial to comply with privacy regulations.