GPTBot

OpenAI · training

name
GPTBot
operator
OpenAI
purpose
training
ua_substring
GPTBot
robots_token
GPTBot
respects_robots
yes
verify
published IP ranges at openai.com/gptbot-ranges.json
notes
Crawls content that may be used to train OpenAI models.
canonical_name
GPTBot
user_agent_token
GPTBot
ua_full
Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.3; +https://openai.com/gptbot source
bot_type
training
bot_type_extension
opt_out_mechanism
robots.txt disallow (User-agent: GPTBot)
published_ip_range_url
https://openai.com/gptbot.json
asn
— verify-against-primary-at-build ↗ https://openai.com/gptbot.json
reverse_dns_suffix
— verify-against-primary-at-build ↗ https://developers.openai.com/api/docs/bots
supports_web_bot_auth
— verify-against-primary-at-build ↗ https://developers.openai.com/api/docs/bots
signature_agent_domain
— verify-against-primary-at-build ↗ https://developers.openai.com/api/docs/bots
jwks_url
— verify-against-primary-at-build ↗ https://developers.openai.com/api/docs/bots
verification_methods
published-IP-range
crawl_traffic_share
11.48% source
targeted_content_type
HTML, text
documentation_url
https://developers.openai.com/api/docs/bots
first_seen_date
— verify-against-primary-at-build ↗ https://openai.com/index/gptbot/
last_verified_date
2026-06-15
block_vs_allow_recommendation
conditional — training crawler; allow to be represented in OpenAI model knowledge, block via robots.txt to opt out of training. No direct referral.
citation_referral_value
low (training; does not itself cite or refer)
cloudflare_verified_category
— verify-against-primary-at-build ↗ https://radar.cloudflare.com/bots/directory/gptbot
status
active
triples
["GPTBot","operated_by","OpenAI"] ["GPTBot","has_bot_type","training"] ["GPTBot","verified_via","published-IP-range"] ["GPTBot","has_crawl_share","11.48% (Radar 2026-05)"]
attribute_sources
{"claims":["ua_full","user_agent_token","robots_token","published_ip_range_url","documentation_url"],"source":"https://developers.openai.com/api/docs/bots","last_verified":"2026-06-15"} {"claims":["crawl_traffic_share"],"source":"https://radar.cloudflare.com/bots","last_verified":"2026-06-15"}

← all The AI Crawler Registry · .md · JSON