A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
Updated Aug 13, 2024 - Python
A Python web crawler that extracts website content and converts it to Markdown. Useful for analyzing how search engines, AI agents, and bots perceive your website's structure. It also works on localhost.