In September 2024, Jeremy Howard published a short proposal on the Answer.AI blog. The idea was elegant: create a lightweight markdown file at your website root that gives large language models a concise, structured guide to your most important content. Call it llms.txt. The concept spread fast through developer communities and SEO circles.

By October 2025, over 844,000 websites had implemented it according to BuiltWith's tracking. Anthropic, Cloudflare, and Stripe all added llms.txt files to their own sites. Yoast SEO automated its generation. An entire cottage industry of llms.txt generators, validators, and directories emerged within months.

There is just one uncomfortable piece of data sitting underneath all of this activity. Not a single major AI platform has officially confirmed that they read or use llms.txt files in any meaningful way.

This post is the honest version of the llms.txt story: what it is, what it actually does, what the data shows, and what the expert disagreement tells us about the broader question of AI search visibility in 2026.

844K+ websites had implemented llms.txt as of October 2025, per BuiltWith tracking
0.1% of AI bot requests went to llms.txt files in a 90-day study by OtterlyAI across 62,100 requests
0 major AI platforms (Google, OpenAI, Anthropic, Perplexity) have officially confirmed using llms.txt

What llms.txt Actually Is (The Technical Reality)

Before evaluating whether llms.txt works, it helps to understand precisely what it does and does not do. The confusion in most coverage stems from conflating what the file was designed to accomplish with what people hope it accomplishes.

llms.txt is a plain text file written in Markdown, placed at the root of your website at yoursite.com/llms.txt. It follows a specific structure proposed by Jeremy Howard. Here is what a correctly formatted file looks like:

# Example llms.txt file structure (from llms-txt.org spec)


 

# Your Company Name


 

> One-paragraph summary of what your site or product does.
> Keep it concise and factual. LLMs use this as context.


 

Additional background the LLM might need at inference time.


 

## Core Documentation


 

## Optional

The file was explicitly designed for one specific purpose: inference-time use, meaning when an AI model is actively answering a question and needs to fetch information from your site in real time. The idea is that instead of parsing your full HTML, with navigation, ads, cookie banners, and JavaScript clutter, the model can read a clean, structured summary and decide which of your pages are worth fetching.

Critically, the proposal also suggests adding .md versions of individual pages at the same URL. So yoursite.com/docs.html.md would be a clean Markdown version of your documentation that AI models can read without parsing HTML.

What llms.txt Is NOT Designed to Do

The original proposal from Jeremy Howard is explicit on this point: llms.txt is not a training opt-out mechanism, not a ranking signal, and not a replacement for good content quality. It is specifically framed as an inference-time helper, not as an AI SEO lever. Most of the hype around it misrepresents its intended scope.

The Case For llms.txt (What Proponents Actually Have Evidence For)

The story is not purely negative. There are real, documented cases where llms.txt provides genuine value, and dismissing the concept entirely would be as inaccurate as overselling it.

AI coding tools and developer agents read it

This is the strongest evidence for llms.txt's practical utility. If your audience includes developers using Cursor, Claude Code, GitHub Copilot, or similar AI-assisted coding tools, those agents actively fetch and read documentation files. When a developer asks an AI coding assistant to help them implement your API, the agent looks for structured documentation. An llms.txt file pointing to clean Markdown versions of your API docs measurably improves how those agents reason about your product.

This is why Anthropic, Stripe, and Cloudflare all implemented it. Their primary audience is developers. Their content is technical documentation. llms.txt is genuinely useful in that specific context.

Anthropic specifically requested it from Mintlify

"Anthropic, creator of Claude, specifically asked Mintlify to implement llms.txt and llms-full.txt for their documentation. This request demonstrates a clear commitment to these standards from one of the leading AI companies."
Source: Mintlify · "The Value of llms.txt: Hype or Real?" · May 2025

AI bots are crawling llms.txt files

Data from Profound, a GEO tracking company, shows that bots from Microsoft and OpenAI are actively fetching both llms.txt and llms-full.txt files. This is confirmed by server log data across multiple domains. The crawling is real. The question is whether that crawling translates into meaningful usage, and on that question, evidence is absent.

The exercise itself has value

Writing an llms.txt file forces you to answer a question that most content teams have never explicitly addressed: what would you want an AI to cite about your product? Curating a one-paragraph summary and a 30-link index of your most important pages requires editorial decisions that clarify your own content hierarchy. That clarity has value independent of whether AI models read the file.

The Case Against the Hype (What the Data Actually Shows)

This is where the research gets uncomfortable for the llms.txt enthusiast community.

"OtterlyAI spent 90 days measuring what happens when you provide a correctly implemented llms.txt. The result: out of 62,100 AI bot requests, exactly 84 went to the llms.txt. That is 0.1 percent. The file performed three times worse than an average content page on the same domain."
Source: Kai Spriestersbach · "The llms.txt is Dead. More Precisely: A Dud." · Medium · February 2026
AI Bot Requests by Destination (OtterlyAI Study, 62,100 requests, 90 days)
Average content page
 
~99%
llms.txt file
 
0.1%
Crawled by BuiltWith only
 
~95%
Note: Most llms.txt crawling is from BuiltWith, a technology detection service that catalogs which files exist. This is not an evidence of AI usage, it is an inventory check.
"No AI system currently uses llms.txt."
Source: John Mueller, Google · Statement on Reddit and Bluesky · 2025

Mueller's statement is direct. Google, which operates both traditional search and AI-powered search features including AI Overviews, has not integrated llms.txt into any of its systems. He did not say "we might use it eventually." He said no AI system currently uses it.

OpenAI has not announced that ChatGPT or GPTBot parses or acts on llms.txt content. Anthropic has not confirmed that Claude's systems reference it during conversations, despite publishing their own llms.txt for their documentation. Perplexity and Meta have maintained silence.

The Crawling vs. Using Distinction

The fact that OpenAI and Microsoft bots are crawling llms.txt files does not mean those files influence AI responses. Search bots crawl millions of pages they never rank. Crawling is inventory. Usage is a separate decision made at inference time by a completely different system. The evidence for crawling is real. The evidence for usage in responses is absent.

Facts vs. Fiction: The Full Breakdown

✓ Facts: What Is True
Proposed by Jeremy Howard of Answer.AI in September 2024
Genuinely useful for AI coding tools like Cursor and Claude Code
Anthropic, Stripe, and Cloudflare have implemented it
AI bots from Microsoft and OpenAI crawl these files
Low effort to implement (10-30 minutes)
Valuable editorial exercise regardless of AI usage
Creates no harm if unused by AI systems
✗ Fiction: What Is Not True
Any major AI platform officially uses it in responses
It boosts your visibility in ChatGPT or Perplexity answers
It functions like robots.txt for AI crawlers (different mechanism)
It is a training data opt-out or opt-in mechanism
It replaces the need for good content quality
Google confirmed any interest in adopting it
0.1% crawl rate proves meaningful AI engagement

How llms.txt Compares to Existing Standards

Part of the confusion around llms.txt comes from drawing imprecise analogies to existing web standards. Here is how the comparison actually holds up.

Featurerobots.txtsitemap.xmlllms.txt
Official standardYes (W3C)Yes (sitemaps.org)No (community proposal)
Confirmed AI platform usageYes (GPTBot, ClaudeBot honor it)Yes (crawl guidance)Not confirmed
Controls crawling behaviorYesPartiallyNo
Designed for training dataCan be used this wayNoNo (inference-time only)
FormatPlain text, directivesXMLMarkdown
Best immediate use caseBlock or allow AI crawlersImprove page indexingDeveloper-facing AI tools

The robots.txt comparison is the most misleading analogy in the current llms.txt discourse. robots.txt works because search engines and AI crawlers are programmed to look for it, respect it, and act on its directives. That programmatic relationship does not currently exist for llms.txt. No AI platform has committed to treating it as authoritative guidance in the way bots treat robots.txt.

What Actually Moves the AI Visibility Needle Today

If llms.txt is not the lever, what is? This question matters more than the llms.txt debate itself, because AI-generated answers are now a meaningful traffic source that businesses cannot afford to ignore.

"The strongest skeptics just point out the lack of measurable benefit from llms.txt. They are not arguing against AI visibility optimization. They are redirecting attention to what actually works."

robots.txt with explicit AI bot rules

This is the lever that provably moves outcomes today. AI crawlers including GPTBot (OpenAI), ClaudeBot (Anthropic), ChatGPT-User, Claude-User, PerplexityBot, and Google-Extended all respect robots.txt directives. If you want to control whether these bots can crawl and potentially cite your content, robots.txt is the mechanism that actually works. Here is an example of a thoughtful 2026 robots.txt setup:

# robots.txt: AI bot rules (the lever that actually works)


 

# Block training crawlers, allow inference crawlers
User-agent: GPTBot
Disallow: /private/
Allow: /


 

User-agent: ChatGPT-User
Allow: /


 

User-agent: ClaudeBot
Allow: /


 

User-agent: PerplexityBot
Allow: /


 

# Block Google's AI training crawler if desired
User-agent: Google-Extended
Disallow: /

Content quality and specificity

AI models cite sources that answer questions precisely. Broad, generic content does not get cited. Specific, authoritative answers to clearly defined questions do. The most durable AI visibility strategy is writing content that is the best available answer to a specific query, not content optimized for a technical file that AI systems may or may not read.

Entity coverage and consistent brand mentions

Generative Engine Optimization (GEO) research consistently shows that entities mentioned more frequently and consistently across high-authority sources appear more often in AI-generated responses. Building your brand's presence through PR, citations, and third-party mentions is more reliably effective than file-level optimizations.

Structured data and schema markup

Google has explicitly confirmed that structured data helps AI Overviews understand and present your content accurately. This is an evidence-based lever. Schema markup on FAQs, products, how-tos, and organization entities demonstrably affects how AI-assisted search surfaces your content.

Should You Add an llms.txt File? A Decision Framework

  • Your site is developer-focused with technical documentation. This is the clear yes case. Developers using AI coding tools will benefit from clean Markdown documentation linked from llms.txt. Stripe, Anthropic, and Cloudflare added it precisely for this reason.

  • You want to future-proof against AI platforms adopting the standard. Implementation takes 10 to 30 minutes and causes zero harm if unused. As a low-effort hedge, it is reasonable. If major AI platforms eventually adopt it, you are already positioned.

  • You want the editorial clarity that comes from writing it. Curating your most important content into a structured llms.txt file is a useful exercise regardless of whether AI models read it. It forces content prioritization decisions that benefit your overall content strategy.

  • ~

    Your site is a content blog or marketing site. The benefit is speculative and the data does not show measurable impact. Adding it is harmless, but it should not take priority over improving content quality, building backlinks, or improving page speed.

  • You expect it to boost your ChatGPT or Perplexity citation frequency. The data does not support this expectation. OtterlyAI's 90-day study found 0.1% of AI bot requests reached the llms.txt file. If AI citation frequency is your goal, focus on content quality and entity coverage instead.

  • You are using it as a training data opt-out. llms.txt has no mechanism for this and was not designed for this purpose. robots.txt with specific User-Agent rules for AI training crawlers is the correct tool for controlling what gets used in training data.

The Broader Context: What GEO Actually Means in 2026

The llms.txt debate is a microcosm of a larger conversation about Generative Engine Optimization (GEO), the emerging practice of optimizing for AI-generated search and answer systems rather than traditional search result pages.

GEO is real. AI-assisted search interfaces from Google, Perplexity, Bing, and others now drive meaningful traffic. Being cited in an AI-generated answer has measurable value. The question is which signals actually influence those citations.

Current research suggests the most reliable factors are: content that directly answers specific questions with authority, consistent entity mentions across third-party sources, technical accessibility for AI crawlers via properly configured robots.txt, and structured data that helps AI systems correctly attribute and represent your content.

llms.txt is a technically sensible idea that may eventually fit into that ecosystem. It is not currently one of the measurably effective factors. The honest position is to implement it if it is low effort for your site architecture, treat it as infrastructure rather than a strategy, and focus your primary AI visibility effort on the factors that have demonstrated impact.

Frequently Asked Questions

Does adding llms.txt improve my visibility in ChatGPT or Perplexity?

There is no confirmed evidence that it does. A 90-day study by OtterlyAI across 62,100 AI bot requests found that only 0.1% went to llms.txt files, performing three times worse than average content pages. Neither OpenAI, Google, Anthropic, nor Perplexity has officially confirmed that their response generation systems use llms.txt content. Google's John Mueller stated explicitly that no AI system currently uses llms.txt.

Who actually benefits from implementing llms.txt?

The clearest beneficiaries are sites with developer-facing technical documentation. AI coding tools like Cursor, Claude Code, and GitHub Copilot actively fetch structured documentation at inference time. For these use cases, llms.txt and clean Markdown versions of documentation pages measurably improve how AI agents reason about your product or API. This is why Anthropic, Stripe, and Cloudflare implemented it. For general marketing or content sites targeting non-developer audiences, the benefit is speculative.

Is llms.txt the same as robots.txt for AI?

No. robots.txt works because all major search engines and AI crawlers are programmed to look for it, respect it, and act on its directives. This programmatic relationship is established, documented, and consistently honored. llms.txt has no equivalent commitment from AI platforms. It is a community proposal, not an official standard. AI bots that crawl your llms.txt are not necessarily using its contents to guide their responses. robots.txt with explicit User-Agent rules for GPTBot, ClaudeBot, and PerplexityBot is the mechanism that actually controls AI crawler behavior today.

What actually improves AI visibility if llms.txt does not?

The factors with documented evidence include: content that precisely and authoritatively answers specific questions, consistent brand and entity mentions across third-party sources and publications, proper robots.txt configuration allowing AI inference crawlers to access your content, structured data and schema markup that helps AI systems correctly understand and attribute your content, and overall content quality and specificity rather than broad generic coverage. These are the levers GEO practitioners have identified as measurably effective.

Should I still add llms.txt to my website?

Probably yes, for the right reasons. If your site is developer-focused, the benefit is real and immediate through AI coding tools. For any site, implementation takes 10 to 30 minutes and causes zero harm if AI systems ignore it. It is a reasonable hedge on future AI platform behavior. The wrong reason to add it is expecting it to boost your ChatGPT or Perplexity citation frequency. The data does not support that expectation, and the time spent on llms.txt is likely better spent on content quality improvements if citation frequency is your goal.

Ahsan Zaidi
Marketing Writer, Trimrly

Ahsan researches and writes about AI search visibility, link management, and digital marketing strategy. This article draws on the original llms.txt proposal from Jeremy Howard at Answer.AI, data from OtterlyAI's 90-day bot request study, statements from Google's John Mueller, analysis from Mintlify, Symphonic Digital, and Am I Cited, and adoption data from BuiltWith as of late 2025.