What Squarespace actually does with AI crawlers by default
A new Squarespace site ships with the 'Block known artificial intelligence crawlers' checkbox unchecked. That means AI bots are allowed to crawl by default, and Squarespace's own help docs describe this as the discoverability-friendly setting. The fear-trigger problem is not the default; it is the 2024-era advice that told owners to toggle the box on without explaining the consequences.
The 26 bots Squarespace's AI checkbox actually disallows
The Squarespace help center publishes the exact list. There are 26 named user-agents, grouped by purpose: training crawlers from OpenAI, Anthropic, Google, Meta, Apple, ByteDance, and Cohere; bulk web-corpus crawlers like CCBot; and a handful of smaller AI training agents from AI2, You.com, Quora, and others. The list is not the same as the list of bots that decide AI citations.
Training crawlers and retrieval crawlers do different jobs
A training crawler scrapes your site to feed the next version of an AI model; a retrieval crawler fetches your page in real time when a user asks the AI a question. Block a training crawler and you protect future scraping; block a retrieval crawler and you remove yourself from live citations. The Squarespace AI checkbox does not separate the two.
Settings > Crawlers, step by step
The Squarespace path is short: Settings, then Crawlers. Two checkboxes appear. The top one controls search-engine crawling and should stay on for anyone who wants to be indexed by Google or Bing. The bottom one controls the 26-bot AI list. The fix takes about ninety seconds, but the consequences depend on which mode you intend the site to live in.
What you cannot do in Squarespace's robots.txt
Squarespace generates robots.txt automatically and does not provide a way to edit it directly. The Crawlers panel updates the file behind the scenes for the 26-bot AI list and the search-engine toggle, but you cannot add a single per-bot Disallow rule, you cannot allow one path and block another, and you cannot append a Sitemap directive that differs from the default. Squarespace's forum has multiple threads on this; the answer is consistently the same.
A five-minute self-check on your current state
Five files and panels tell you exactly what your site is doing right now. Check robots.txt for the AI Disallow block, check the Crawlers panel for the checkbox state, check the SEO tab on your most important page for noindex, check the header for X-Robots-Tag, and run the free crawler-check tool. Anyone can complete the pass in under five minutes.
Frequently asked questions
Five questions Squarespace owners send us about the Crawlers panel every week, answered in the format AI engines prefer.