5 releases (1 stable)
Uses new Rust 2024
| Version | Released |
|---|---|
| 1.0.0 | Aug 3, 2025 |
| 0.2.0 | Aug 1, 2025 |
| 0.1.2 | Jul 27, 2025 |
| 0.1.1 | Jul 27, 2025 |
| 0.1.0 | Jul 27, 2025 |
#1601 in Text processing
202 downloads per month
30KB
357 lines
llmtext 🤖🔗📝
Turn any website into a single, clean Markdown file, ready for your LLM prompts.
Language models have a knowledge cut-off date. llmtext works around this by scraping a webpage and its relevant internal links, then converting the content into a single Markdown file. You can give this file to your LLM as up-to-date context.
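To make the "single Markdown file" idea concrete, here is a minimal sketch of how already-converted pages could be stitched together. It is not llmtext's actual implementation: the `Page` struct, the `combine_pages` function, the source-URL comment, and the `---` separator are all assumptions chosen for illustration.

```rust
/// One scraped page, already converted to Markdown (hypothetical type).
struct Page {
    url: String,
    markdown: String,
}

/// Joins pages into one Markdown document.
/// Assumption: each section is labelled with its source URL in an HTML
/// comment and sections are separated by a thematic break (`---`).
fn combine_pages(pages: &[Page]) -> String {
    let sections: Vec<String> = pages
        .iter()
        .map(|p| format!("<!-- source: {} -->\n\n{}", p.url, p.markdown.trim()))
        .collect();
    let mut out = sections.join("\n\n---\n\n");
    out.push('\n'); // end the file with a single newline
    out
}
```

Labelling each section with its URL keeps the origin of every passage visible when the combined file is pasted into a prompt.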
✨ Features
- Intelligent Scraping: Give it one URL, and it follows that page's relevant internal links and scrapes those pages too.
- Flexible Input: Scrape a single page (`--single`) or a list of URLs from a file (`--urls`).
- Fast & Efficient: Built in Rust for maximum speed with parallel downloads.
- Clipboard Integration: Instantly copy the full Markdown output to your clipboard (`--clipboard`).
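The link discovery and parallel downloads described above can be sketched roughly as follows. This is an illustration under stated assumptions, not llmtext's real logic: it assumes "related" means "a URL under the starting URL's path", it uses one thread per URL, and the names `is_internal`, `select_internal_links`, and `fetch_all` are hypothetical. The `fetch` parameter stands in for a real HTTP client.

```rust
use std::collections::HashSet;
use std::thread;

/// Returns true when `link` sits at or below `base`.
/// Assumption: "internal" means the link shares the base URL as a prefix
/// that ends on a path, query, or fragment boundary.
fn is_internal(base: &str, link: &str) -> bool {
    let base = base.trim_end_matches('/');
    match link.strip_prefix(base) {
        Some(rest) => rest.is_empty() || rest.starts_with(|c| matches!(c, '/' | '?' | '#')),
        None => false,
    }
}

/// Keeps internal links, strips fragments and trailing slashes,
/// and removes duplicates while preserving first-seen order.
fn select_internal_links(base: &str, links: &[&str]) -> Vec<String> {
    let mut seen = HashSet::new();
    let mut out = Vec::new();
    for &link in links {
        if !is_internal(base, link) {
            continue;
        }
        let without_fragment = link.split('#').next().unwrap_or(link);
        let normalized = without_fragment.trim_end_matches('/').to_string();
        if seen.insert(normalized.clone()) {
            out.push(normalized);
        }
    }
    out
}

/// Fetches every URL on its own scoped thread and returns the bodies
/// in the same order as `urls`. `fetch` is a placeholder for a real client.
fn fetch_all<F>(urls: &[String], fetch: F) -> Vec<String>
where
    F: Fn(&str) -> String + Sync,
{
    thread::scope(|scope| {
        // Spawn all threads first so the downloads overlap, then join in order.
        let handles: Vec<_> = urls
            .iter()
            .map(|url| {
                let fetch = &fetch;
                scope.spawn(move || fetch(url.as_str()))
            })
            .collect();
        handles
            .into_iter()
            .map(|h| h.join().expect("fetch thread panicked"))
            .collect()
    })
}
```

The boundary check in `is_internal` keeps a sibling path such as `/reference/react-dom` from matching a base of `/reference/react`. A real crawler would likely cap concurrency instead of spawning one thread per URL.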
📦 Installation
# From Crates.io (Recommended)
cargo install llmtext
🚀 Usage
# Scrape a site's documentation section
llmtext https://react.dev/reference/react
# Scrape just a single page
llmtext --single https://react.dev/reference/react/useState
# Scrape a list of URLs from a file and save to a custom output file
llmtext --urls my_links.txt --output react-subset.md
# Scrape and copy directly to clipboard, skipping confirmation
llmtext https://vuejs.org/guide/introduction.html --clipboard --yes
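The format of the file passed to `--urls` is not documented here. As a sketch, assuming it is plain text with one URL per line, a reader for such a file might look like this. The function name `parse_url_list` and the rule of skipping anything that is not an `http://` or `https://` URL are assumptions, not documented llmtext behavior.

```rust
/// Extracts URLs from the contents of a link file.
/// Assumption: one URL per line; blank lines, comments, and any other
/// line that does not start with http:// or https:// are ignored.
fn parse_url_list(text: &str) -> Vec<String> {
    text.lines()
        .map(str::trim)
        .filter(|line| line.starts_with("http://") || line.starts_with("https://"))
        .map(String::from)
        .collect()
}
```

The contents of a file such as `my_links.txt` could be read with `std::fs::read_to_string` and passed to this function.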
License
This project is licensed under the MIT License. See the LICENSE file for details.
Dependencies
~24–48MB
~752K SLoC