What Is llms.txt? A Practical Guide for AI Search Visibility
llms.txt is a plain-text Markdown
file placed at the root of your domain that gives AI systems a concise,
curated map of your most important content. It lists your key pages with
short descriptions so language models can orient themselves quickly. It
is a guidance file for large language models, not a confirmed ranking
factor or a guarantee of citations.
Published June 2026 · 9 min read
What is llms.txt?
llms.txt is a Markdown file you
publish at https://yourdomain.com/llms.txt.
Its job is to hand a large language model a clean, human-readable
summary of your site: what the site is about, which pages matter, and
where the canonical information lives. Instead of crawling and guessing,
a model that reads the file gets a curated index written in plain
language.
The format is deliberately simple. It starts with an
H1 title, an optional
blockquote summary, and then a set of
H2 sections containing Markdown
links to your important pages. There is no XML, no schema validator, no
required tooling — just Markdown a person could read and an LLM can
parse cheaply.
The proposal emerged in 2024 as a community convention, similar in
spirit to how robots.txt and
sitemap.xml became de facto
standards. It is not part of an official HTTP specification and is not
mandated by any AI vendor. Adoption is voluntary on both sides: you
choose to publish it, and a given AI tool chooses whether to read it.
Why llms.txt matters for AI SEO
Large language models and AI answer engines work with limited context.
When a tool needs to understand your site quickly, a curated file is
far more efficient than crawling hundreds of pages full of navigation,
cookie banners, and boilerplate. llms.txt
lets you front-load the signal and strip out the noise.
For AI SEO, the value is about clarity and orientation rather than ranking:
- Disambiguation — you state, in your own words, what your site and brand are about, reducing the chance a model misclassifies you.
- Page prioritization — you point to the canonical version of important content instead of leaving a model to find it among duplicates and thin pages.
- Context efficiency — a short, structured file fits inside a model's context window, where a full crawl might not.
- Editorial control — you decide which pages represent you, in what order, with what framing.
Set expectations honestly. llms.txt
is an orientation file, not a ranking factor. No AI vendor has confirmed
that publishing it improves how often you are cited or recommended, and
it cannot guarantee citations. The realistic benefit is that systems
which do read it can understand you faster and more accurately — a
foundation for visibility, not a lever that forces it. It complements
broader
generative engine optimization
work; it does not replace it.
llms.txt vs robots.txt
The two files are often confused, but they do opposite jobs.
robots.txt is about
permission: it tells crawlers what they may and may not fetch.
llms.txt is about
orientation: it tells AI systems what your site means and where
the important content is.
- Purpose —
robots.txtgrants or denies access;llms.txtsummarizes and points to content. - Format —
robots.txtusesUser-agentandDisallowdirectives;llms.txtuses Markdown headings and links. - Audience —
robots.txtspeaks to crawlers likeGooglebotandGPTBot;llms.txtspeaks to LLM tools that read content directly. - Enforcement — well-behaved crawlers respect
robots.txtrules;llms.txtis purely advisory and grants no access by itself.
They are complementary, not interchangeable. If your
robots.txt blocks an AI crawler
such as GPTBot or
PerplexityBot, publishing an
llms.txt will not override that
block — the crawler still cannot fetch the pages you reference. Get
permission right in robots.txt
first, then use llms.txt to guide
the systems you have allowed in.
What to include in llms.txt
A good file is short, curated, and honest. Resist the urge to dump your whole sitemap into it — the point is to highlight what matters, not to mirror your navigation. Include the following:
- An H1 title — the name of your site or brand, so the model knows whose file this is.
- A blockquote summary — one or two sentences (a Markdown
>blockquote) describing what your site does and who it serves. - Sectioned links —
H2sections such as Docs, Guides, Product, or About, each containing Markdown links to canonical pages with a short note after each link. - Canonical URLs only — link the authoritative version of each page, using the same trailing-slash convention as your site.
- An optional details section — secondary links you consider lower priority, often placed under a final section like Optional.
Keep descriptions factual. Each link should answer "what will a reader
find here?" in plain language. Avoid marketing superlatives and avoid
inventing claims — a model that reads inflated copy is more likely to
misrepresent you, not less. Some sites also publish an
llms-full.txt with the full text
of key pages inlined, but the standard
llms.txt is enough to start.
A simple llms.txt example
Here is a minimal, valid llms.txt.
Note the structure: an H1 title,
a blockquote summary, then H2
sections of annotated Markdown links.
# Acme Analytics
> Acme Analytics is a privacy-first web analytics tool for small teams.
> This file points AI systems to our most important documentation and guides.
## Docs
- [Getting started](https://acme.example/docs/start): install and send your first event
- [API reference](https://acme.example/docs/api): endpoints, auth, and rate limits
## Guides
- [Self-hosting Acme](https://acme.example/guides/self-host): deploy on your own server
- [GDPR and privacy](https://acme.example/guides/privacy): how Acme handles data
## About
- [Pricing](https://acme.example/pricing): plans and limits
- [Company](https://acme.example/about): who we are and what we believe
## Optional
- [Changelog](https://acme.example/changelog): release notes and version history
You can see a real file in production at geoready.dev/llms.txt — it uses the same title, blockquote, and sectioned-link structure described here. For a platform-specific walkthrough, see our llms.txt for WordPress guide, which covers both manual and plugin-based implementation.
Common mistakes
Most llms.txt problems come from
treating it like a sitemap or a marketing page. Watch for these:
- Dumping every URL — a 400-link file defeats the purpose. Curate the pages that actually represent your site.
- Wrong location or content type — it must live at the domain root (
/llms.txt) and be served as plain text, not HTML. - Linking non-canonical or redirecting URLs — point to the final, canonical address with the correct trailing slash.
- Blocking the referenced pages in robots.txt — guiding a crawler to pages it is not allowed to fetch is self-defeating.
- Letting it go stale — if your important pages change, update the file. A snapshot from a year ago can point models at dead links.
- Treating it as a ranking lever — it is an orientation file. Expecting guaranteed citations or rankings from it leads to disappointment.
How to check whether your site has llms.txt
Checking is quick. You can do it three ways:
- Visit the URL directly — open
https://yourdomain.com/llms.txtin a browser. If you see Markdown text, the file exists. A404means it is missing. - Use the command line — run
curl -I https://yourdomain.com/llms.txtand confirm a200status and atext/plainortext/markdowncontent type. - Run an AI visibility audit — an automated check confirms the file exists, validates its structure (title, blockquote, sections, links), and flags issues alongside your other AI SEO signals.
If the file is missing, malformed, or out of date, an audit will tell you exactly what to fix before you publish. Don't have one yet? Use the free llms.txt generator to build a starter file from your sitemap in seconds.
Check your llms.txt and AI visibility
The free audit checks whether your site has a valid
llms.txt, validates its structure,
and scores your site across eight AI visibility categories — robots.txt,
schema, content signals, and more. No account needed.
A GeoReady account lets you save reports, track changes over time, and unlock the full audit. Create an account.
FAQ
Is llms.txt a ranking factor?
No. No AI vendor has confirmed that llms.txt
influences rankings or citation frequency, and it does not guarantee
either. It is an orientation file that helps AI systems understand your
site faster and more accurately. Treat it as a clarity signal, not a
lever that forces visibility.
Where do I put the llms.txt file?
At the root of your domain, reachable at
https://yourdomain.com/llms.txt,
served as plain text. It belongs in the same location as
robots.txt and
sitemap.xml, not in a subfolder.
Is llms.txt the same as robots.txt?
No. robots.txt controls crawler
access — what bots may fetch.
llms.txt describes and indexes
your content for AI systems. They are complementary: an
llms.txt cannot override a block
set in robots.txt.
Do AI systems actually read llms.txt?
Adoption is voluntary and varies by tool. Some LLM-powered tools and agents read it when they fetch a site; others do not. Because it is a community convention rather than a mandated standard, you should publish it as a low-effort best practice without expecting universal support.
What format does llms.txt use?
Plain Markdown: an H1 title, an
optional > blockquote summary,
and H2 sections containing
Markdown links with short descriptions. There is no XML and no required
validator — readability is the point.
How is llms.txt different from JSON-LD structured data?
JSON-LD structured data lives
inside individual pages and describes entities and relationships for
search engines and AI systems. llms.txt
is a single site-level file that curates which pages matter. Use both:
schema for per-page meaning, llms.txt
for site-level orientation.
Further reading
- Free llms.txt generator — build a starter llms.txt from your sitemap.
- llms.txt for WordPress and AI visibility — manual and plugin-based implementation.
- Generative Engine Optimization — the broader practice llms.txt fits into.
- GEO vs SEO — what changes when AI engines become the interface.
- AI SEO — how to make your site visible to AI answer engines.
- The signals behind these guides: our research foundation.