Turn Any Docs Site into a Single LLM-Ready Markdown File
Paste a docs URL. SiteToMarkdown crawls every page under that path, strips out the noise, and gives you one clean Markdown file—perfect for NotebookLM, ChatGPT, GitHub Copilot, and RAG pipelines.
- Full-site context – Crawl entire /docs/ paths
- Clean Markdown – No HTML, nav, or ads
- Works on tricky sites – Handles JS & bot-blocking
Convert a Site to Markdown
Login with Google in seconds. No password required.
Works great with
Stop feeding your LLM a single URL. Give it your entire docs site as one structured source.
Why Pasting URLs into LLMs Doesn't Work Well
LLMs only see a slice
Pasting one URL into ChatGPT or NotebookLM gives shallow coverage. Most docs live across dozens or hundreds of pages.
Bots get blocked
Many docs sites block generic bots or rely on heavy JavaScript. Your AI tool never sees the real content.
Manual copy-paste fails
Adding pages to NotebookLM or RAG context one by one is slow, error-prone, and hard to keep updated.
SiteToMarkdown crawls the entire path, renders JavaScript, bypasses bot blocking, and merges everything into one Markdown file.
From Docs URL to LLM Source in Under 5 Minutes
Paste your docs URL
Enter https://yourdomain.com/docs/ or any docs/help path on the homepage.
We crawl & clean
We find all pages, render JS, bypass blocking, and extract only the main content.
Download Markdown
Upload it to NotebookLM, ChatGPT, Copilot, or your RAG app as a single source.
Built for the Way You Use LLMs
For NotebookLM
Turn entire docs sites into a single NotebookLM source for deep product questions.
Learn more →
For ChatGPT
Upload one Markdown file instead of pasting URLs. Get accurate, docs-grounded answers.
Learn more →For GitHub Copilot
Keep a Markdown file in your repo. Copilot uses it to suggest up-to-date API calls.
Learn more →For RAG & Agents
Give your agents a clean, portable knowledge source instead of building custom scrapers.
Learn more →Simple Pay-As-You-Go Pricing
No subscriptions. Just pay for what you convert.
- Full site/path crawl
- Instant download if cached, with a 25% discount
- JS rendering
- Advanced bot bypass
- Clean Markdown: No ads, navigation, or irrelevant content
Try It Free with Popular Docs Sites
Frequently Asked Questions
Does it only work with NotebookLM?
No! The Markdown output is standard and works perfectly with ChatGPT, Claude, GitHub Copilot, Cursor, and any RAG pipeline.
What types of URLs work best?
Documentation sites, help centers, and knowledge bases work best. Point us to the root of the docs (e.g., /docs/ or /help/) and we'll crawl everything underneath.
How do you handle JS-rendered docs?
We use advanced headless browsers to render JavaScript before extracting content, ensuring we capture data from Single Page Applications (SPAs) like React or Vue sites.
Do I need to install anything?
No. SiteToMarkdown is entirely web-based. You just download the final file.