5 releases (1 stable)

Uses new Rust 2024

1.0.0 Aug 3, 2025
0.2.0 Aug 1, 2025
0.1.2 Jul 27, 2025
0.1.1 Jul 27, 2025
0.1.0 Jul 27, 2025

#1601 in Text processing

Download history 4/week @ 2025-09-29

202 downloads per month

MIT and GPL-3.0+

30KB
357 lines

llmtext 🤖🔗📝

Turn any website into a single, clean Markdown file, ready for your LLM prompts.

Language models have a knowledge cut-off date. llmtext solves this by scraping a webpage and all its relevant internal links, converting the content into a single Markdown file. You can then use this file to give your LLM the up-to-date context it needs.

Crates.io LICENSE


https://github.com/user-attachments/assets/a558970e-4d75-4b41-8c2c-3bb3d5705e13

✨ Features

  • Intelligent Scraping: Give it one URL, and it smartly finds and scrapes all related pages.
  • Flexible Input: Scrape a single page (--single) or a list of URLs from a file (--urls).
  • Fast & Efficient: Built in Rust for maximum speed with parallel downloads.
  • Clipboard Integration: Instantly copy the full Markdown output to your clipboard (--clipboard).

📦 Installation

# From Crates.io (Recommended)
cargo install llmtext

🚀 Usage

# Scrape a site's documentation section
llmtext https://react.dev/reference/react

# Scrape just a single page
llmtext --single https://react.dev/reference/react/useState

# Scrape a list of URLs from a file and save to a custom output file
llmtext --urls my_links.txt --output react-subset.md

# Scrape and copy directly to clipboard, skipping confirmation
llmtext https://vuejs.org/guide/introduction.html --clipboard --yes

License

This project is licensed under the MIT License. See the LICENSE file for details.

Dependencies

~24–48MB
~752K SLoC