GEO audit · May 14, 2026

Monsieur Parfait

https://monsieurparfait.be

Export CSV

AI search visibility

41/100

Decent foundation, real gaps. Address the items below.

Quick Wins

Address these this week. Direct impact on visibility, low effort.

  • Fix

    FAQ schema on key pages

    FAQ schema markup on pages that genuinely answer customer questions. Don't fake it — Google penalises FAQ stuffing, and AI engines verify against page content.

    No FAQPage schema found on the homepage.

    What to do: If you answer common customer questions anywhere on your site, mark them up with FAQPage JSON-LD. Don't fake it — Google penalises FAQ stuffing.

  • Fix

    LocalBusiness schema

    LocalBusiness (or specific subtype like Restaurant, Store) schema with address, hours, geo coordinates, price range, and aggregated review rating.

    No LocalBusiness schema (or subtype) found.

    What to do: If you have a physical location, add LocalBusiness JSON-LD with address, geo coordinates, hours, priceRange, and aggregateRating. Use a specific subtype (Restaurant, Store, Dentist, etc) when applicable.

  • Fix

    Organization schema

    JSON-LD Organization schema on the homepage with name, logo, URL, sameAs links to social profiles, address, and contact info. This is the single most-cited structured data block by AI.

    No JSON-LD Organization schema found on the homepage.

    What to do: Add a JSON-LD Organization block on the homepage with name, logo, URL, sameAs (social links), address, and contact info. This is the most-cited structured data block by AI.

  • Fix

    RFC 8288 Link headers for discovery

    HTTP Link response headers point agents to your well-known resources (mcp-server, api-catalog, agent-skills, sitemap, llms.txt) without parsing HTML. Agents using HEAD requests or Markdown negotiation only see headers, not <link> tags.

    No HTTP Link headers found on the homepage.

    What to do: Add a Link response header pointing to your well-known resources (llms.txt, sitemap.xml, mcp.json, etc). Agents using HEAD requests or markdown negotiation never see <link> tags in HTML — only headers.

  • Fix

    Sitemap.xml present and submitted

    A complete sitemap.xml exists, lists all important pages, is referenced in robots.txt, and is submitted to Google Search Console and Bing Webmaster Tools.

    No sitemap found at /sitemap.xml and no Sitemap: directive in robots.txt.

    What to do: Generate a sitemap.xml listing all important pages and submit it to Google Search Console and Bing Webmaster Tools.

  • Fix

    llms.txt file published

    An llms.txt at the site root tells AI agents what your site is about and points to the most authoritative pages. Less than 1% of sites have one — a free differentiator.

    No llms.txt found at /llms.txt.

    What to do: Publish an llms.txt at the site root summarising what the site is about and pointing AI agents to your most authoritative pages. See llmstxt.org for the format. Fewer than 1% of sites have one.

  • Fix

    sameAs linking to social and directories

    Organization schema includes sameAs URLs to your LinkedIn, Wikipedia, Crunchbase, industry directories — all the entities AI uses to build a trustworthy picture of you.

    No sameAs URLs found in any JSON-LD block.

    What to do: Add sameAs URLs to your Organization schema pointing to your LinkedIn, Wikipedia, Crunchbase, and industry directories. AI uses these to build a trustworthy picture of your entity.

  • Improve

    Canonical tags set correctly

    Every page declares a canonical URL, with no conflicting or self-referential errors. Critical for e-commerce with faceted navigation.

    No <link rel='canonical'> tag found on the homepage.

    What to do: Add a canonical tag pointing to the preferred URL of each page. Critical when you have URL parameters or faceted navigation.

  • Improve

    Content Signals declared in robots.txt

    Use the Content-Signal directive (format: search=yes, ai-input=yes, ai-train=no) to declare how AI may use your content. Distinct from Allow/Disallow — silence is increasingly interpreted as restrictive.

    robots.txt exists but doesn't declare Content-Signal directives.

    What to do: Add 'Content-Signal: search=yes, ai-input=yes, ai-train=no' (adjust to your preference) under your User-agent blocks. Silence is increasingly interpreted as restrictive by AI platforms.

  • Improve

    Faq Buyer Questions

    No FAQ section detected on the homepage.

    What to do: Add an FAQ section answering 5-10 common buyer questions in clear, quotable language. AI engines quote FAQ blocks heavily.

  • Improve

    Menu Or Pricing Visible

    No pricing or menu link detected on the homepage.

    What to do: Mainly for restaurants and local shops — menus and prices should be visible HTML, not PDF or login-walled. AI cannot quote what it cannot read.

  • Manual review

    Article schema on content pages

    Blog posts and case studies use Article schema with author, datePublished, and dateModified. AI weighs freshness signals.

    No Article schema on the homepage. Typically lives on blog post and case study pages.

    What to do: Blog posts and case studies should use Article (or BlogPosting) schema with author, datePublished, and dateModified.

  • Manual review

    BreadcrumbList schema

    BreadcrumbList schema on category and product pages helps AI understand your site hierarchy.

    No BreadcrumbList schema on the homepage. More relevant on deep pages.

    What to do: Add BreadcrumbList JSON-LD on category and product pages to help AI understand your site hierarchy.

  • Manual review

    City and neighborhood named on the site

    Your city and neighborhood appear naturally in the homepage copy, About, and footer. AI cross-references these against the place names in user queries.

    Local signal — requires Google Business Profile API or manual verification.

    What to do: Make sure your city and neighborhood appear naturally in homepage copy, About, and footer.

  • Manual review

    Cloudflare WAF doesn't block AI bots

    Cloudflare's 'Block AI Scrapers and Crawlers' setting and default Bot Fight Mode can silently block AI crawlers. Verify the rules don't apply to GPTBot, ClaudeBot, etc.

    Cloudflare is not detected in front of this site, so the 'Block AI Scrapers' setting doesn't apply here.

  • Manual review

    Consistent NAP (name, address, phone)

    NAP information matches exactly across every directory and platform. A single typo in your address can cost local visibility.

    NAP consistency requires cross-referencing your site with directories.

    What to do: Verify your name, address, and phone number match exactly on website, Google Business, LinkedIn, social profiles, and directories.

  • Manual review

    Google Business & maps presence

    Your presence across Google Business Profile, Google reviews, and the wider maps ecosystem (Apple Maps, Bing Places).

    • Manual review Google Business Profile fully filled

      Local signal — requires Google Business Profile API or manual verification.

      What to do: Fill every field of your Google Business Profile — categories, services, attributes, photos, hours, posts.

    • Manual review Recent photos on Google Business

      Local signal — requires Google Business Profile API or manual verification.

      What to do: Upload fresh photos monthly: interior, products, team, events. Drives ranking and click-through.

    • Manual review Google reviews — volume and recency

      Local signal — requires Google Business Profile API or manual verification.

      What to do: Build a steady inflow of reviews with proprietor responses on most.

    • Manual review Apple Maps and Bing Places presence

      Local signal — requires Google Business Profile API or manual verification.

      What to do: List on Apple Maps and Bing Places, both increasingly cited by AI.

  • Manual review

    No Intrusive Interstitials

    Interstitial detection needs computed rendering, deferred to Lighthouse integration.

    What to do: Avoid full-screen popups or cookie banners that block content on mobile. Google demotes pages with intrusive interstitials.

  • Manual review

    Product schema with reviews

    Product schema on every product page with brand, GTIN/SKU, price, availability, and aggregated reviews. Drives both rich results in Google and citations in shopping AI agents.

    No Product schema on the homepage. Only relevant for e-commerce.

    What to do: If you sell products, ensure each product page has Product JSON-LD with brand, GTIN/SKU, price, availability, and aggregateRating. The homepage typically doesn't need it.

  • Manual review

    Readable text without zooming on mobile

    Body text is at least 16px on mobile. Small text triggers Google's mobile usability warnings.

    Mobile font size detection needs computed rendering, deferred to Lighthouse integration.

    What to do: Ensure body text is at least 16px on mobile. Small text triggers Google mobile usability warnings.

  • Manual review

    Tap targets large enough on mobile

    Buttons and links are at least 44px and not too close to each other. Failing this hurts both mobile UX and Core Web Vitals.

    Tap target size needs computed rendering, deferred to Lighthouse integration.

    What to do: Ensure buttons and links are at least 44px and not too close to each other.

  • Good

    AI citations — Chatgpt

    87 citation(s) of your site across chatgpt responses.

  • Good

    AI citations — Claude

    11 citation(s) of your site across claude responses.

  • Good

    AI citations — Gemini

    39 citation(s) of your site across gemini responses.

  • Good

    AI citations — Perplexity

    130 citation(s) of your site across perplexity responses.

  • Good

    Consistent Business Name

    Business name 'Monsieur Parfait' appears on the homepage.

  • Good

    Full HTTPS, no mixed content

    Every page loads over HTTPS with a valid certificate, and no internal links or assets fall back to HTTP.

    Site loads over HTTPS with no detected mixed content.

  • Good

    Proper mobile viewport meta tag

    viewport meta tag is set correctly so the site adapts to phone screens. Should be standard, but is still missing on older sites.

    Viewport meta tag is properly set: width=device-width, initial-scale=1.0

  • Good

    robots.txt allows AI crawlers

    Your robots.txt explicitly allows GPTBot, ClaudeBot, PerplexityBot, CCBot, and Google-Extended. Blocking any of them — even by accident — makes you invisible to that platform.

    robots.txt does not block any known AI crawler.

Strategic Moves

Address these this month. Structural improvements to how AI parses your site.

  • Fix

    Agent Skills index published

    A /.well-known/agent-skills/index.json file (Agent Skills Discovery RFC v0.2.0) lists callable skills agents can invoke. Each skill has a name, type, description, URL and sha256 digest. Cloudflare-led standard, increasingly required.

    No Agent Skills index found at /.well-known/agent-skills/index.json.

    What to do: Publish an Agent Skills index (Cloudflare's RFC v0.2.0) listing callable skills agents can invoke. Each skill needs a name, type, description, URL and sha256 digest.

  • Fix

    Business mention — Chatgpt

    Your business name was NOT mentioned in any chatgpt response.

    What to do: Focus on brand entity clarity and off-site references so chatgpt learns to recognise and name you.

  • Fix

    Business mention — Claude

    Your business name was NOT mentioned in any claude response.

    What to do: Focus on brand entity clarity and off-site references so claude learns to recognise and name you.

  • Fix

    Business mention — Gemini

    Your business name was NOT mentioned in any gemini response.

    What to do: Focus on brand entity clarity and off-site references so gemini learns to recognise and name you.

  • Fix

    Business mention — Perplexity

    Your business name was NOT mentioned in any perplexity response.

    What to do: Focus on brand entity clarity and off-site references so perplexity learns to recognise and name you.

  • Fix

    Long-form About page

    Substantive About page (1,000+ words) covering history, philosophy, team, and concrete differentiators. The single page AI cites most when answering 'who is X' queries.

    Homepage has only ~331 words — likely too thin for AI to characterize the business.

    What to do: Build a long-form About page (1,000+ words). The single page AI cites most when answering 'who is X' queries.

  • Fix

    Markdown content negotiation supported

    When an AI agent sends 'Accept: text/markdown', your server responds with a clean Markdown version of the page. Saves tokens for the agent — agents are more likely to cite low-token sources. Less than 4% of sites do this.

    Asked for markdown but got Content-Type: text/html.

    What to do: Set up your server to return text/markdown when requested. Either pre-generate .md twins of your key pages or run a lightweight HTML→Markdown conversion middleware.

  • Improve

    Concrete Numbers

    Homepage has only 3 specific numeric claims. AI rewards concrete metrics over vague adjectives.

    What to do: Add concrete numbers — 'X projects since YYYY', 'OEE improved Z%', 'serving N+ clients'. AI rewards specificity with citations.

  • Manual review

    Case studies in crawlable HTML

    Each case study is a public HTML page with client name (where permitted), problem, solution, technologies used, and measurable outcomes. Not just a PDF download.

    Case studies presence and quality need a multi-page crawl.

    What to do: Each case study should be a public HTML page with client, problem, solution, technologies, and measurable outcomes.

  • Manual review

    Critical content is not PDF-only

    Case studies, product specs, and white papers exist as crawlable HTML pages, not just downloadable PDFs. AI cites HTML pages roughly 10x more often than PDFs.

    Multi-page crawl needed to detect PDF-only content.

    What to do: Ensure case studies, product specs, and white papers exist as crawlable HTML, not only PDFs. AI cites HTML pages 10x more than PDFs.

  • Manual review

    Cumulative Layout Shift (CLS) under 0.1

    Your layout doesn't jump around as it loads. Frequent CLS issues come from images without dimensions and late-loading ads.

    Cumulative Layout Shift requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Ensure layout doesn't jump as it loads. Set explicit dimensions on images and reserve space for ads.

  • Manual review

    Detailed Product Specs

    Product page completeness needs a multi-page crawl.

    What to do: Product pages should include full specs, dimensions, materials, compatibility, and use-case descriptions.

  • Manual review

    Forms readable and submittable by agents

    Forms use semantic HTML (proper labels, autocomplete attributes, aria-required), don't rely on heavy JS for validation, and accept standard form-encoded POST. An agent can fill them without computer-vision tricks.

    No <form> found on the homepage.

    What to do: If you have forms on other pages (contact, signup, search), ensure they use semantic HTML — proper labels, autocomplete attributes, and standard form-encoded POST. Agents struggle with custom JS form components.

  • Manual review

    Industry-vertical pages

    Dedicated pages for each vertical you serve, with the language, use cases, and outcomes specific to that industry. AI excels at recommending vertical-specific service providers.

    Industry-vertical pages need a multi-page crawl to evaluate.

    What to do: Build dedicated pages for each vertical you serve, with industry-specific language and use cases.

  • Manual review

    Interaction to Next Paint (INP) under 200ms

    Your site responds instantly to clicks and taps. INP replaced FID as a Core Web Vital in 2024.

    Interaction to Next Paint requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Ensure your site responds to clicks under 200ms. Often fixed by reducing JavaScript execution time.

  • Manual review

    Largest Contentful Paint (LCP) under 2.5s

    Your main content loads quickly. LCP is one of three Core Web Vitals Google uses as a ranking signal.

    Largest Contentful Paint requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Ensure your main content loads under 2.5s. Often fixed by optimizing the hero image and inlining critical CSS.

  • Manual review

    Modern image formats and compression

    Images served as WebP or AVIF, properly sized for the device, with lazy loading on below-the-fold content.

    image format and compression requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Serve images as WebP or AVIF, properly sized, with lazy loading on below-the-fold content.

  • Manual review

    No render-blocking resources

    Critical CSS is inlined. Non-critical CSS and JS are deferred or async-loaded. Especially important for ad-heavy or analytics-heavy sites.

    render-blocking resources requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Inline critical CSS; defer non-critical CSS and JS.

  • Manual review

    Person schema for leadership

    Founders and key staff have Person schema with role, credentials, and sameAs links to LinkedIn. Helps AI characterize who runs the business.

    No Person schema for leadership on the homepage.

    What to do: Founders and key staff should have Person schema with role, credentials, and sameAs links to LinkedIn. Typically on the About or Team pages.

  • Manual review

    Service schema per core offering

    Each main service has its own page with Service schema, describing what it is, who it's for, and areas served.

    No Service schema on the homepage. Typically lives on individual service pages.

    What to do: Each main service should have its own page with Service JSON-LD describing what it is, who it's for, and areas served.

  • Manual review

    Technology / platform pages

    Explicit pages naming every major technology you work with (e.g., Siemens, Rockwell, Shopify, Salesforce). AI looks for exact-match technology terms when matching buyer intent.

    Technology/platform pages need a multi-page crawl to evaluate.

    What to do: Build explicit pages naming every major technology you work with (Siemens, Salesforce, etc).

  • Good

    Clear Positioning

    Homepage has a clear positioning: H1 ('Des sites quiconvertissent.Des outils quiautomatisent....') and a meta description (163 chars).

  • Good

    Content rendered server-side

    Your important content is in the initial HTML response, not rendered by JavaScript afterwards. Most AI crawlers don't execute JS, so client-rendered content is invisible to them.

    Homepage HTML contains ~331 words of visible text — AI crawlers can read it without JavaScript execution.

Long Game

Address these this quarter. Authority and ecosystem-level work.

  • Fix

    MCP Server Card available

    A /.well-known/mcp.json file declares which actions agents can take on your site (book, submit, query). Emerging standard backed by Anthropic and adopted by Claude Desktop and ChatGPT. Fewer than 100 sites worldwide have one.

    No MCP server card found at /.well-known/mcp.json.

    What to do: Publish an MCP server card declaring actions agents can take on your site. Emerging standard backed by Anthropic. Fewer than 100 sites worldwide have one — strong differentiator.

  • Manual review

    Active Linkedin

    Off-site signal — requires manual research or external API.

    What to do: Maintain a complete LinkedIn company page with 1-2 posts per week and meaningful follower count.

  • Manual review

    Founder Linkedin

    Off-site signal — requires manual research or external API.

    What to do: Founders should publish technical or strategic posts regularly.

  • Manual review

    Industry directory listings

    Listings in directories specific to your sector, with complete profiles and consistent information.

    Off-site signal — requires manual research or external API.

    What to do: List in directories specific to your sector with complete, consistent profiles.

  • Manual review

    Natural anchor text profile

    Backlinks to your site use varied, natural anchor text. Over-optimised anchor text (same keywords repeatedly) triggers spam signals.

    Backlink data — requires Ahrefs/Moz API, deferred to a later pack.

    What to do: Keep backlink anchor text varied and natural; over-optimisation triggers spam signals.

  • Manual review

    No Toxic Backlinks

    Backlink data — requires Ahrefs/Moz API, deferred to a later pack.

    What to do: Audit and disavow paid-link networks and irrelevant directories if necessary.

  • Manual review

    OAuth Protected Resource Metadata

    If you offer protected APIs, publish /.well-known/oauth-protected-resource (RFC 9728) declaring the authorization servers that can issue tokens for it and the supported scopes. Lets agents discover how to authenticate before attempting access.

    Manual review required — only relevant if you offer protected APIs.

    What to do: If you offer protected APIs, publish /.well-known/oauth-protected-resource (RFC 9728) declaring authorization servers and supported scopes.

  • Manual review

    Oauth Discovery Metadata

    Manual review required — only relevant if your site has authenticated areas.

    What to do: If you have user accounts or a dashboard, expose /.well-known/openid-configuration or /.well-known/oauth-authorization-server so agents can authenticate programmatically.

  • Manual review

    Original research or data publication

    Published reports based on data only you have (industry surveys, methodology papers, internal benchmarks). The highest-leverage long-game move — gets cited for years.

    Off-site signal — requires manual research or external API.

    What to do: Publish reports based on data only you have. Highest-leverage long-game move — gets cited for years.

  • Manual review

    Press Media Coverage

    Off-site signal — requires manual research or external API.

    What to do: Get mentions in sector or regional press. Even a single feature in a respected outlet moves the needle.

  • Manual review

    Public API catalog (RFC 9727)

    /.well-known/api-catalog (application/linkset+json) lists your public APIs with service-desc (OpenAPI), service-doc and status endpoints. Agents discover and integrate without scraping.

    No API catalog found at /.well-known/api-catalog. Only relevant if you offer public APIs.

    What to do: If you offer public APIs, publish /.well-known/api-catalog (RFC 9727) listing them with service-desc (OpenAPI), service-doc and status endpoints.

  • Manual review

    Quality Backlinks

    Backlink data — requires Ahrefs/Moz API, deferred to a later pack.

    What to do: Build inbound links from authoritative domains — sector publications, partner companies, gov/edu sites.

  • Manual review

    Speaking engagements and panel slots

    Conference talks, panels, and podcast appearances. Each one produces durable content that AI later cites.

    Off-site signal — requires manual research or external API.

    What to do: Speak at conferences, on panels, podcast appearances. Each produces durable content AI later cites.

  • Manual review

    Strong internal linking structure

    Important pages are linked from many other pages on your site, with descriptive anchor text. Helps AI understand your site hierarchy and what matters most.

    Backlink data — requires Ahrefs/Moz API, deferred to a later pack.

    What to do: Link important pages from many places on your site with descriptive anchor text.

  • Manual review

    Third-party reviews on trusted sites

    Reviews on Google, TripAdvisor, Trustpilot, G2, Capterra, or sector-specific directories. AI weighs aggregated review presence as a credibility signal.

    Off-site signal — requires manual research or external API.

    What to do: Build review presence on Google, TripAdvisor, Trustpilot, G2, Capterra, or sector-specific directories.

  • Manual review

    Vs Comparison Pages

    Comparison/'vs' page presence needs a multi-page crawl.

    What to do: Build honest comparison pages versus competitors. AI surfaces these for high-intent comparison queries.

  • Manual review

    Webmcp Registration

    Manual review required — checks for navigator.modelContext.provideContext() calls in your JS.

    What to do: Call navigator.modelContext.provideContext() with tool definitions so AI-equipped browsers (Chrome with Gemini, Edge with Copilot) can invoke your site's actions client-side. Feature-detected — safe to add.

  • Manual review

    Wikipedia mention or page

    A Wikipedia page (where notability allows) or mentions in relevant Wikipedia articles. Wikipedia is one of the most heavily weighted citation sources by AI engines.

    Wikipedia presence requires manual research.

    What to do: If notability allows, get a Wikipedia page or relevant mentions. Wikipedia is one of the most heavily weighted citation sources by AI.

How AI platforms cite and name your business right now.

  • Fix

    Business mention — Chatgpt

    Your business name was NOT mentioned in any chatgpt response.

    What to do: Focus on brand entity clarity and off-site references so chatgpt learns to recognise and name you.

  • Fix

    Business mention — Claude

    Your business name was NOT mentioned in any claude response.

    What to do: Focus on brand entity clarity and off-site references so claude learns to recognise and name you.

  • Fix

    Business mention — Gemini

    Your business name was NOT mentioned in any gemini response.

    What to do: Focus on brand entity clarity and off-site references so gemini learns to recognise and name you.

  • Fix

    Business mention — Perplexity

    Your business name was NOT mentioned in any perplexity response.

    What to do: Focus on brand entity clarity and off-site references so perplexity learns to recognise and name you.

  • Good

    AI citations — Chatgpt

    87 citation(s) of your site across chatgpt responses.

  • Good

    AI citations — Claude

    11 citation(s) of your site across claude responses.

  • Good

    AI citations — Gemini

    39 citation(s) of your site across gemini responses.

  • Good

    AI citations — Perplexity

    130 citation(s) of your site across perplexity responses.

Can AI engines actually read your site?

  • Fix

    Sitemap.xml present and submitted

    A complete sitemap.xml exists, lists all important pages, is referenced in robots.txt, and is submitted to Google Search Console and Bing Webmaster Tools.

    No sitemap found at /sitemap.xml and no Sitemap: directive in robots.txt.

    What to do: Generate a sitemap.xml listing all important pages and submit it to Google Search Console and Bing Webmaster Tools.

  • Improve

    Canonical tags set correctly

    Every page declares a canonical URL, with no conflicting or self-referential errors. Critical for e-commerce with faceted navigation.

    No <link rel='canonical'> tag found on the homepage.

    What to do: Add a canonical tag pointing to the preferred URL of each page. Critical when you have URL parameters or faceted navigation.

  • Manual review

    Cloudflare WAF doesn't block AI bots

    Cloudflare's 'Block AI Scrapers and Crawlers' setting and default Bot Fight Mode can silently block AI crawlers. Verify the rules don't apply to GPTBot, ClaudeBot, etc.

    Cloudflare is not detected in front of this site, so the 'Block AI Scrapers' setting doesn't apply here.

  • Manual review

    Critical content is not PDF-only

    Case studies, product specs, and white papers exist as crawlable HTML pages, not just downloadable PDFs. AI cites HTML pages roughly 10x more often than PDFs.

    Multi-page crawl needed to detect PDF-only content.

    What to do: Ensure case studies, product specs, and white papers exist as crawlable HTML, not only PDFs. AI cites HTML pages 10x more than PDFs.

  • Good

    Content rendered server-side

    Your important content is in the initial HTML response, not rendered by JavaScript afterwards. Most AI crawlers don't execute JS, so client-rendered content is invisible to them.

    Homepage HTML contains ~331 words of visible text — AI crawlers can read it without JavaScript execution.

  • Good

    Full HTTPS, no mixed content

    Every page loads over HTTPS with a valid certificate, and no internal links or assets fall back to HTTP.

    Site loads over HTTPS with no detected mixed content.

  • Good

    robots.txt allows AI crawlers

    Your robots.txt explicitly allows GPTBot, ClaudeBot, PerplexityBot, CCBot, and Google-Extended. Blocking any of them — even by accident — makes you invisible to that platform.

    robots.txt does not block any known AI crawler.

Can AI agents not just read, but act on your site?

  • Fix

    Agent Skills index published

    A /.well-known/agent-skills/index.json file (Agent Skills Discovery RFC v0.2.0) lists callable skills agents can invoke. Each skill has a name, type, description, URL and sha256 digest. Cloudflare-led standard, increasingly required.

    No Agent Skills index found at /.well-known/agent-skills/index.json.

    What to do: Publish an Agent Skills index (Cloudflare's RFC v0.2.0) listing callable skills agents can invoke. Each skill needs a name, type, description, URL and sha256 digest.

  • Fix

    MCP Server Card available

    A /.well-known/mcp.json file declares which actions agents can take on your site (book, submit, query). Emerging standard backed by Anthropic and adopted by Claude Desktop and ChatGPT. Fewer than 100 sites worldwide have one.

    No MCP server card found at /.well-known/mcp.json.

    What to do: Publish an MCP server card declaring actions agents can take on your site. Emerging standard backed by Anthropic. Fewer than 100 sites worldwide have one — strong differentiator.

  • Fix

    Markdown content negotiation supported

    When an AI agent sends 'Accept: text/markdown', your server responds with a clean Markdown version of the page. Saves tokens for the agent — agents are more likely to cite low-token sources. Less than 4% of sites do this.

    Asked for markdown but got Content-Type: text/html.

    What to do: Set up your server to return text/markdown when requested. Either pre-generate .md twins of your key pages or run a lightweight HTML→Markdown conversion middleware.

  • Fix

    RFC 8288 Link headers for discovery

    HTTP Link response headers point agents to your well-known resources (mcp-server, api-catalog, agent-skills, sitemap, llms.txt) without parsing HTML. Agents using HEAD requests or Markdown negotiation only see headers, not <link> tags.

    No HTTP Link headers found on the homepage.

    What to do: Add a Link response header pointing to your well-known resources (llms.txt, sitemap.xml, mcp.json, etc). Agents using HEAD requests or markdown negotiation never see <link> tags in HTML — only headers.

  • Fix

    llms.txt file published

    An llms.txt at the site root tells AI agents what your site is about and points to the most authoritative pages. Less than 1% of sites have one — a free differentiator.

    No llms.txt found at /llms.txt.

    What to do: Publish an llms.txt at the site root summarising what the site is about and pointing AI agents to your most authoritative pages. See llmstxt.org for the format. Fewer than 1% of sites have one.

  • Improve

    Content Signals declared in robots.txt

    Use the Content-Signal directive (format: search=yes, ai-input=yes, ai-train=no) to declare how AI may use your content. Distinct from Allow/Disallow — silence is increasingly interpreted as restrictive.

    robots.txt exists but doesn't declare Content-Signal directives.

    What to do: Add 'Content-Signal: search=yes, ai-input=yes, ai-train=no' (adjust to your preference) under your User-agent blocks. Silence is increasingly interpreted as restrictive by AI platforms.

  • Manual review

    Forms readable and submittable by agents

    Forms use semantic HTML (proper labels, autocomplete attributes, aria-required), don't rely on heavy JS for validation, and accept standard form-encoded POST. An agent can fill them without computer-vision tricks.

    No <form> found on the homepage.

    What to do: If you have forms on other pages (contact, signup, search), ensure they use semantic HTML — proper labels, autocomplete attributes, and standard form-encoded POST. Agents struggle with custom JS form components.

  • Manual review

    OAuth Protected Resource Metadata

    If you offer protected APIs, publish /.well-known/oauth-protected-resource (RFC 9728) declaring the authorization servers that can issue tokens for it and the supported scopes. Lets agents discover how to authenticate before attempting access.

    Manual review required — only relevant if you offer protected APIs.

    What to do: If you offer protected APIs, publish /.well-known/oauth-protected-resource (RFC 9728) declaring authorization servers and supported scopes.

  • Manual review

    Public API catalog (RFC 9727)

    /.well-known/api-catalog (application/linkset+json) lists your public APIs with service-desc (OpenAPI), service-doc and status endpoints. Agents discover and integrate without scraping.

    No API catalog found at /.well-known/api-catalog. Only relevant if you offer public APIs.

    What to do: If you offer public APIs, publish /.well-known/api-catalog (RFC 9727) listing them with service-desc (OpenAPI), service-doc and status endpoints.

How clearly machines can understand who you are.

  • Fix

    FAQ schema on key pages

    FAQ schema markup on pages that genuinely answer customer questions. Don't fake it — Google penalises FAQ stuffing, and AI engines verify against page content.

    No FAQPage schema found on the homepage.

    What to do: If you answer common customer questions anywhere on your site, mark them up with FAQPage JSON-LD. Don't fake it — Google penalises FAQ stuffing.

  • Fix

    LocalBusiness schema

    LocalBusiness (or specific subtype like Restaurant, Store) schema with address, hours, geo coordinates, price range, and aggregated review rating.

    No LocalBusiness schema (or subtype) found.

    What to do: If you have a physical location, add LocalBusiness JSON-LD with address, geo coordinates, hours, priceRange, and aggregateRating. Use a specific subtype (Restaurant, Store, Dentist, etc) when applicable.

  • Fix

    Organization schema

    JSON-LD Organization schema on the homepage with name, logo, URL, sameAs links to social profiles, address, and contact info. This is the single most-cited structured data block by AI.

    No JSON-LD Organization schema found on the homepage.

    What to do: Add a JSON-LD Organization block on the homepage with name, logo, URL, sameAs (social links), address, and contact info. This is the most-cited structured data block by AI.

  • Fix

    sameAs linking to social and directories

    Organization schema includes sameAs URLs to your LinkedIn, Wikipedia, Crunchbase, industry directories — all the entities AI uses to build a trustworthy picture of you.

    No sameAs URLs found in any JSON-LD block.

    What to do: Add sameAs URLs to your Organization schema pointing to your LinkedIn, Wikipedia, Crunchbase, and industry directories. AI uses these to build a trustworthy picture of your entity.

  • Manual review

    Article schema on content pages

    Blog posts and case studies use Article schema with author, datePublished, and dateModified. AI weighs freshness signals.

    No Article schema on the homepage. Typically lives on blog post and case study pages.

    What to do: Blog posts and case studies should use Article (or BlogPosting) schema with author, datePublished, and dateModified.

  • Manual review

    BreadcrumbList schema

    BreadcrumbList schema on category and product pages helps AI understand your site hierarchy.

    No BreadcrumbList schema on the homepage. More relevant on deep pages.

    What to do: Add BreadcrumbList JSON-LD on category and product pages to help AI understand your site hierarchy.

  • Manual review

    Person schema for leadership

    Founders and key staff have Person schema with role, credentials, and sameAs links to LinkedIn. Helps AI characterize who runs the business.

    No Person schema for leadership on the homepage.

    What to do: Founders and key staff should have Person schema with role, credentials, and sameAs links to LinkedIn. Typically on the About or Team pages.

  • Manual review

    Product schema with reviews

    Product schema on every product page with brand, GTIN/SKU, price, availability, and aggregated reviews. Drives both rich results in Google and citations in shopping AI agents.

    No Product schema on the homepage. Only relevant for e-commerce.

    What to do: If you sell products, ensure each product page has Product JSON-LD with brand, GTIN/SKU, price, availability, and aggregateRating. The homepage typically doesn't need it.

  • Manual review

    Service schema per core offering

    Each main service has its own page with Service schema, describing what it is, who it's for, and areas served.

    No Service schema on the homepage. Typically lives on individual service pages.

    What to do: Each main service should have its own page with Service JSON-LD describing what it is, who it's for, and areas served.

How easy it is for AI to quote you.

  • Fix

    Long-form About page

    Substantive About page (1,000+ words) covering history, philosophy, team, and concrete differentiators. The single page AI cites most when answering 'who is X' queries.

    Homepage has only ~331 words — likely too thin for AI to characterize the business.

    What to do: Build a long-form About page (1,000+ words). The single page AI cites most when answering 'who is X' queries.

  • Manual review

    Case studies in crawlable HTML

    Each case study is a public HTML page with client name (where permitted), problem, solution, technologies used, and measurable outcomes. Not just a PDF download.

    Case studies presence and quality need a multi-page crawl.

    What to do: Each case study should be a public HTML page with client, problem, solution, technologies, and measurable outcomes.

  • Manual review

    Industry-vertical pages

    Dedicated pages for each vertical you serve, with the language, use cases, and outcomes specific to that industry. AI excels at recommending vertical-specific service providers.

    Industry-vertical pages need a multi-page crawl to evaluate.

    What to do: Build dedicated pages for each vertical you serve, with industry-specific language and use cases.

  • Manual review

    Technology / platform pages

    Explicit pages naming every major technology you work with (e.g., Siemens, Rockwell, Shopify, Salesforce). AI looks for exact-match technology terms when matching buyer intent.

    Technology/platform pages need a multi-page crawl to evaluate.

    What to do: Build explicit pages naming every major technology you work with (Siemens, Salesforce, etc).

Whether AI can describe who you are in two sentences.

  • Manual review

    Consistent NAP (name, address, phone)

    NAP information matches exactly across every directory and platform. A single typo in your address can cost local visibility.

    NAP consistency requires cross-referencing your site with directories.

    What to do: Verify your name, address, and phone number match exactly on website, Google Business, LinkedIn, social profiles, and directories.

  • Manual review

    Wikipedia mention or page

    A Wikipedia page (where notability allows) or mentions in relevant Wikipedia articles. Wikipedia is one of the most heavily weighted citation sources by AI engines.

    Wikipedia presence requires manual research.

    What to do: If notability allows, get a Wikipedia page or relevant mentions. Wikipedia is one of the most heavily weighted citation sources by AI.

What outside sources back you up.

  • Manual review

    Industry directory listings

    Listings in directories specific to your sector, with complete profiles and consistent information.

    Off-site signal — requires manual research or external API.

    What to do: List in directories specific to your sector with complete, consistent profiles.

  • Manual review

    Original research or data publication

    Published reports based on data only you have (industry surveys, methodology papers, internal benchmarks). The highest-leverage long-game move — gets cited for years.

    Off-site signal — requires manual research or external API.

    What to do: Publish reports based on data only you have. Highest-leverage long-game move — gets cited for years.

  • Manual review

    Speaking engagements and panel slots

    Conference talks, panels, and podcast appearances. Each one produces durable content that AI later cites.

    Off-site signal — requires manual research or external API.

    What to do: Speak at conferences, on panels, podcast appearances. Each produces durable content AI later cites.

  • Manual review

    Third-party reviews on trusted sites

    Reviews on Google, TripAdvisor, Trustpilot, G2, Capterra, or sector-specific directories. AI weighs aggregated review presence as a credibility signal.

    Off-site signal — requires manual research or external API.

    What to do: Build review presence on Google, TripAdvisor, Trustpilot, G2, Capterra, or sector-specific directories.

Visibility for queries with a place attached.

  • Manual review

    City and neighborhood named on the site

    Your city and neighborhood appear naturally in the homepage copy, About, and footer. AI cross-references these against the place names in user queries.

    Local signal — requires Google Business Profile API or manual verification.

    What to do: Make sure your city and neighborhood appear naturally in homepage copy, About, and footer.

  • Manual review

    Google Business & maps presence

    Your presence across Google Business Profile, Google reviews, and the wider maps ecosystem (Apple Maps, Bing Places).

    • Manual review Google Business Profile fully filled

      Local signal — requires Google Business Profile API or manual verification.

      What to do: Fill every field of your Google Business Profile — categories, services, attributes, photos, hours, posts.

    • Manual review Recent photos on Google Business

      Local signal — requires Google Business Profile API or manual verification.

      What to do: Upload fresh photos monthly: interior, products, team, events. Drives ranking and click-through.

    • Manual review Google reviews — volume and recency

      Local signal — requires Google Business Profile API or manual verification.

      What to do: Build a steady inflow of reviews with proprietor responses on most.

    • Manual review Apple Maps and Bing Places presence

      Local signal — requires Google Business Profile API or manual verification.

      What to do: List on Apple Maps and Bing Places, both increasingly cited by AI.

Speed and stability metrics that gate ranking.

  • Manual review

    Cumulative Layout Shift (CLS) under 0.1

    Your layout doesn't jump around as it loads. Frequent CLS issues come from images without dimensions and late-loading ads.

    Cumulative Layout Shift requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Ensure layout doesn't jump as it loads. Set explicit dimensions on images and reserve space for ads.

  • Manual review

    Interaction to Next Paint (INP) under 200ms

    Your site responds instantly to clicks and taps. INP replaced FID as a Core Web Vital in 2024.

    Interaction to Next Paint requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Ensure your site responds to clicks under 200ms. Often fixed by reducing JavaScript execution time.

  • Manual review

    Largest Contentful Paint (LCP) under 2.5s

    Your main content loads quickly. LCP is one of three Core Web Vitals Google uses as a ranking signal.

    Largest Contentful Paint requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Ensure your main content loads under 2.5s. Often fixed by optimizing the hero image and inlining critical CSS.

  • Manual review

    Modern image formats and compression

    Images served as WebP or AVIF, properly sized for the device, with lazy loading on below-the-fold content.

    image format and compression requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Serve images as WebP or AVIF, properly sized, with lazy loading on below-the-fold content.

  • Manual review

    No render-blocking resources

    Critical CSS is inlined. Non-critical CSS and JS are deferred or async-loaded. Especially important for ad-heavy or analytics-heavy sites.

    render-blocking resources requires the Google PageSpeed Insights API — deferred to a later pack.

    What to do: Inline critical CSS; defer non-critical CSS and JS.

Does your site hold up on a phone?

  • Manual review

    Readable text without zooming on mobile

    Body text is at least 16px on mobile. Small text triggers Google's mobile usability warnings.

    Mobile font size detection needs computed rendering, deferred to Lighthouse integration.

    What to do: Ensure body text is at least 16px on mobile. Small text triggers Google mobile usability warnings.

  • Manual review

    Tap targets large enough on mobile

    Buttons and links are at least 44px and not too close to each other. Failing this hurts both mobile UX and Core Web Vitals.

    Tap target size needs computed rendering, deferred to Lighthouse integration.

    What to do: Ensure buttons and links are at least 44px and not too close to each other.

  • Good

    Proper mobile viewport meta tag

    viewport meta tag is set correctly so the site adapts to phone screens. Should be standard, but is still missing on older sites.

    Viewport meta tag is properly set: width=device-width, initial-scale=1.0

Who links to you, and from where.

  • Manual review

    Natural anchor text profile

    Backlinks to your site use varied, natural anchor text. Over-optimised anchor text (same keywords repeatedly) triggers spam signals.

    Backlink data — requires Ahrefs/Moz API, deferred to a later pack.

    What to do: Keep backlink anchor text varied and natural; over-optimisation triggers spam signals.

  • Manual review

    Strong internal linking structure

    Important pages are linked from many other pages on your site, with descriptive anchor text. Helps AI understand your site hierarchy and what matters most.

    Backlink data — requires Ahrefs/Moz API, deferred to a later pack.

    What to do: Link important pages from many places on your site with descriptive anchor text.

Speak to an expert

Book your 30-minute strategic debrief. We'll walk through your report and prioritise the highest-impact moves for your business.

Book your call →