Diffbot-User
Diffbot · inference
- name
- Diffbot-User
- operator
- Diffbot
- purpose
- inference
- ua_substring
- Diffbot-User
- robots_token
- Diffbot-User
- respects_robots
- yes
- verify
- no operator-published authoritative IP-range file confirmed; verify by user-agent + edge controls. Diffbot documents the token for on-behalf-of fetches.
- notes
- Used for requests made on behalf of human users browsing URLs through Diffbot software, as distinct from Diffbot's proactive Crawlbot. Diffbot documents both 'Diffbot' and 'Diffbot-User' as robots.txt user-agents.
- canonical_name
- Diffbot-User
- user_agent_token
- Diffbot-User
- ua_full
- — verify-against-primary-at-build ↗ https://docs.diffbot.com/docs/does-crawl-respect-robotstxt
- bot_type
- user-action-fetcher
- bot_type_extension
- —
- opt_out_mechanism
- robots.txt disallow (User-agent: Diffbot-User)
- published_ip_range_url
- — verify-against-primary-at-build ↗ https://docs.diffbot.com/docs/does-crawl-respect-robotstxt
- asn
- — verify-against-primary-at-build ↗ https://docs.diffbot.com/
- reverse_dns_suffix
- — verify-against-primary-at-build ↗ https://docs.diffbot.com/
- supports_web_bot_auth
- — verify-against-primary-at-build ↗ https://docs.diffbot.com/
- signature_agent_domain
- — verify-against-primary-at-build ↗ https://docs.diffbot.com/
- jwks_url
- — verify-against-primary-at-build ↗ https://docs.diffbot.com/
- verification_methods
user-agent-match- crawl_traffic_share
- — verify-against-primary-at-build ↗ https://radar.cloudflare.com/bots
- targeted_content_type
- HTML, text
- documentation_url
- https://docs.diffbot.com/docs/does-crawl-respect-robotstxt
- first_seen_date
- — verify-against-primary-at-build ↗ https://docs.diffbot.com/
- last_verified_date
- 2026-06-15
- block_vs_allow_recommendation
- allow (default) — user-initiated fetch on a human's behalf through Diffbot software; respects robots.txt. Blocking degrades that user's task.
- citation_referral_value
- medium (fetches a specific page for a user; can surface it to them)
- cloudflare_verified_category
- — verify-against-primary-at-build ↗ https://radar.cloudflare.com/bots/directory/diffbot-user
- status
- active
- triples
["Diffbot-User","operated_by","Diffbot"]["Diffbot-User","has_bot_type","user-action-fetcher"]["Diffbot-User","verified_via","user-agent-match"]- attribute_sources
{"claims":["user_agent_token","robots_token","respects_robots","documentation_url","opt_out_mechanism"],"source":"https://docs.diffbot.com/docs/does-crawl-respect-robotstxt","last_verified":"2026-06-15"}