Skip to content

Modern Markdown formatter with smart typography, line wrapping, and tag support (Rust port)

License

Notifications You must be signed in to change notification settings

jlevy/flowmark-rs

Repository files navigation

flowmark

Follow @ojoshe on X CI crates.io docs.rs MSRV

Rust Port of Python Flowmark

Tip

This is a 100% agent-written, auto-synced Rust port of Python Flowmark, the original reference implementation.

This Rust port has carefully tested identical CLI usage and formatting behavior, while giving 50x+ faster performance processing large numbers of files. So it is now the recommended version for CLI and IDE usage.

Last sync: 2026-02-19 against Python v0.6.4

Why the Rust Version?

  • Single binary: install via Cargo, Cargo binstall, or Homebrew.
  • Fast CLI: good for large repos and CI pipelines.
  • Library crate: embed in Rust tooling via docs.rs/flowmark.

Performance

TLDR:

  • If you use it for auto-save in your IDE, it feels instant.
  • If you run it on 1000 documents over and over in your build system, it only takes milliseconds.

Flowmark is now arguably the most sophisticated Markdown autoformatter, given its advanced wrapping and typographic rules. But because it was pure Python, it was never highly performant.

Now flowmark-rs has identical functionality and in a rough benchmark is the #1 fastest Markdown formatter for repeated runs of large numbers of documents, the #2 fastest on new documents, and 50X or more faster than other TypeScript or Python formatters.

Fresh-run cross-formatter ranking (profiled benchmark suite, 928 files / 8.8 MB):

Rank Formatter Mean (fresh) Relative speed
1 dprint 0.37 s 1.0x
2 flowmark-rs 0.73 s 2.0x
3 markdownfmt 0.95 s 2.6x
4 prettier 38.0 s 103x
5 flowmark-py ~48 s ~130x
6 mdformat 72.9 s 197x

Cached second run (unchanged files, warm cache):

Formatter Mean (cached) Relative speed
flowmark-rs (--auto) 0.023 s 1.0x
dprint (fmt) 0.031 s 1.3x

So on the same corpus flowmark-rs is roughly 60-70x faster than flowmark-py.

Rust-Only Features

The only exception to the exact parity of the port of Python Flowmark are these Rust-only performance features:

  • incremental cache (--no-cache, --cache-dir, --incremental, --show-cache, --clear-cache)
  • stage-level performance stats (--perf-stats)

See docs/rust-only-features.md for a concise feature matrix and docs/cache.md for cache behavior details.

See benchmarks/REPORT.md for full profiling details and methodology.

Installing Rust Flowmark

Cargo (source build)

cargo install flowmark

Cargo binstall (prebuilt binary)

cargo binstall flowmark

PyPI (via uv or pip)

uvx flowmark-rs          # run on demand (no install needed)
uv tool install flowmark-rs  # persistent install
pip install flowmark-rs      # classic pip

Homebrew (macOS)

brew tap jlevy/flowmark
brew install jlevy/flowmark/flowmark
"$(brew --prefix)/bin/flowmark" --version

Note on the flowmark command name: The PyPI package flowmark-rs provides both flowmark and flowmark-rs commands. If you only want the CLI tool, just install flowmark-rs — you don’t need the Python flowmark package. If you have both packages installed as uv tools, uv tool install flowmark-rs --force will let the Rust version take precedence for the flowmark command.


Why Use Flowmark?

Flowmark is a Markdown auto-formatter, written in Python with an auto-synced Rust port, designed for better LLM workflows, clean git diffs, and flexible use from CLI, from IDEs, or as a library.

With AI tools increasingly using Markdown, having consistent, diff-friendly formatting has become essential for modern writing, editing, and document processing workflows. Normalizing Markdown formatting greatly improves collaborative editing and LLM workflows, especially when committing documents to git repositories.

You can use Flowmark as a CLI, as an autoformatter in your IDE, or as a Python library.

Comparison With Other Formatters

Flowmark supports both CommonMark and GitHub-Flavored Markdown (GFM) via Marko.

The key differences from other Markdown formatters:

  • Carefully chosen default formatting rules that are effective for use in editors/IDEs, in LLM pipelines, and also when paging through docs in a terminal. It parses and normalizes standard links and special characters, headings, tables, footnotes, and horizontal rules and performing Markdown-aware line wrapping.

  • “Just works” support for GFM-style tables, footnotes, YAML frontmatter, template tags (Markdoc, Jinja, Nunjucks), and inline HTML comments.

  • Advanced and customizable line-wrapping capabilities, including semantic line breaks, a feature that is especially helpful in allowing collaborative edits on a Markdown document while avoiding git conflicts.

  • Optional automatic smart quotes for professional-looking typography.

General philosophy:

  • Be conservative about changes so that it is safe to run automatically on save or after any stage of a document pipeline.

  • Be opinionated about sensible defaults but not dogmatic by preventing customization. You can adjust or disable most settings. And if you are using it as a library, you can fully control anything you want (including more complex things like custom line wrapping for HTML).

  • Be as small and simple as possible, with few dependencies: marko, pathspec, regex, and strif.

Use Cases

The main ways to use Flowmark are:

  • To autoformat Markdown on save in VSCode/Cursor or any other editor that supports running a command on save. See below for recommended VSCode/Cursor setup.

  • As a command line formatter to format text or Markdown files using the flowmark command.

  • As a library to autoformat Markdown from document pipelines. For example, it is great to normalize the outputs from LLMs to be consistent, or to run on the inputs and outputs of LLM transformations that edit text, so that the resulting diffs are clean.

  • As a more powerful drop-in replacement library for Python’s default textwrap but with more options. It simplifies and generalizes that library, offering better control over initial and subsequent indentation and when to split words and lines, e.g. using a word splitter that won’t break lines within HTML tags, template tags ({% %}, {# #}, {{ }}), Markdown links (including links with multi-word text), inline code spans (`code with spaces`), or HTML comments. See wrap_paragraph_lines.

Semantic Line Breaks

Tip

For an example of what an auto-formatted Markdown doc looks with semantic line breaks looks like, see the Markdown source of this readme file.

Some Markdown auto-formatters never wrap lines, while others wrap at a fixed width. Flowmark supports both, via the --width option.

Default line wrapping behavior is 88 columns. The “90-ish columns” compromise was popularized by Black and also works well for Markdown.

However, in addition, unlike traditional formatters, Flowmark also offers the option to use a heuristic that prefers line breaks at sentence boundaries. This is a small change that can dramatically improve diff readability when collaborating or working with AI tools.

This idea of semantic line breaks, which is breaking lines in ways that make sense logically when possible (much like with code) is an old one. But it usually requires people to agree on how to break lines, which is both difficult and sometimes controversial.

However, now we are using versioned Markdown more than ever, it’s a good time to revisit this idea, as it can make diffs in git much more readable. The change may seem subtle but avoids having paragraphs reflow for very small edits, which does a lot to minimize merge conflicts.

This is my own refinement of traditional semantic line breaks. Instead of just allowing you to break lines as you wish, it auto-applies fixed conventions about likely sentence boundaries in a conservative and reasonable way. It uses simple and fast regex-based sentence splitting. While not perfect, this works well for these purposes (and is much faster and simpler than a proper sentence parser like SpaCy). It should work fine for English and many other Latin/Cyrillic languages, but hasn’t been tested on CJK. You can see some old discussion of this idea with the markdownfmt author.

While this approach to line wrapping may not be familiar, I suggest you just try flowmark --auto on a document and you will begin to see the benefits as you edit/commit documents.

This feature is enabled with the --semantic flag or the --auto convenience flag.

Typographic Cleanups

Smart Quote Support

Flowmark offers optional automatic smart quotes to convert "non-oriented quotes" to “oriented quotes” and apostrophes intelligently.

This is a robust way to ensure Markdown text can be converted directly to HTML with professional-looking typography.

Smart quotes are applied conservatively and won’t affect code blocks, so they don’t break code snippets. It only applies them within single paragraphs of text, and only applies to ' and " quote marks around regular text.

This feature is enabled with the --smartquotes flag or the --auto convenience flag.

Ellipsis Support

There is a similar feature for converting ... to an ellipsis character when it appears to be appropriate (i.e., not in code blocks and when adjacent to words or punctuation).

This feature is enabled with the --ellipses flag or the --auto convenience flag.

Frontmatter Support

Because YAML frontmatter is common on Markdown files, any YAML frontmatter (content between --- delimiters at the front of a file) is always preserved exactly. YAML is not normalized.

Tip

See the frontmatter format repo for more discussion of YAML frontmatter and its benefits.

Usage

Flowmark can be used as a library or as a CLI.

Quick Start

# Format all Markdown files in current directory recursively
flowmark --auto .

# Format a single file in-place with all auto-formatting options
flowmark --auto README.md

# List files that would be formatted (without formatting)
flowmark --list-files .

# Format to stdout
flowmark README.md

# Format from stdin (use '-' explicitly)
echo "Some text" | flowmark -

Batch Formatting

The simplest way to format all Markdown in a project:

flowmark --auto .

This recursively discovers all .md files, skips common non-content directories (node_modules, .venv, build, etc.), respects .gitignore, and formats everything in-place with semantic line breaks, smart quotes, ellipses, and cleanups.

For a legacy alternative (pre-v1.0 behavior):

find . -name "*.md" -exec flowmark --auto {} \;

CLI Reference

The main flags:

Flag Description
-o, --output FILE Output file (use - for stdout)
-w, --width WIDTH Line width (default: 88, 0 = disable wrapping)
-p, --plaintext Process as plaintext (no Markdown parsing)
-s, --semantic Semantic (sentence-based) line breaks
-c, --cleanups Safe cleanups (unbold headings, etc.)
--smartquotes Convert straight quotes to typographic quotes
--ellipses Convert ... to
--list-spacing Control list spacing: preserve, loose, tight
-i, --inplace Edit in place
--nobackup Skip .orig backup with --inplace
--auto All auto-formatting: --inplace --nobackup --semantic --cleanups --smartquotes --ellipses. Requires file/directory args (use . for current directory)

File discovery flags:

Flag Description
--list-files Print resolved file paths, don’t format
--extend-include PATTERN Additional file patterns (e.g., *.mdx)
--exclude PATTERN Replace all default exclusions
--extend-exclude PATTERN Add to default exclusions (e.g., drafts/)
--no-respect-gitignore Disable .gitignore integration
--force-exclude Apply exclusions to explicitly-named files
--files-max-size BYTES Skip files larger than this (default: 1 MiB)

File Discovery

When you pass a directory to Flowmark (e.g., flowmark --auto .), it recursively discovers files using a smart filter pipeline:

  1. Default includes: Only *.md files by default. Use --extend-include "*.mdx" to add patterns.

  2. Default exclusions: ~45 directories are automatically skipped, including .git, node_modules, .venv, venv, __pycache__, build, dist, .tox, .nox, .idea, .vscode, vendor, third_party, and more. These directories are pruned during traversal for performance.

  3. .gitignore integration: Enabled by default. Reads .gitignore at every directory level during traversal. Disable with --no-respect-gitignore.

  4. .flowmarkignore: A tool-specific ignore file using gitignore syntax. Place it in your project root to exclude paths specific to Flowmark formatting.

  5. Max file size: Files over 1 MiB are skipped by default. Change with --files-max-size (0 = no limit).

Customizing Includes and Excludes

# Also format .mdx files
flowmark --auto --extend-include "*.mdx" .

# Skip a specific directory
flowmark --auto --extend-exclude "drafts/" .

# Replace ALL default exclusions with your own
flowmark --auto --exclude "my_custom_dir/" .

# Debug: see exactly which files would be formatted
flowmark --list-files .

Glob Patterns

When passing glob patterns as arguments, always quote them so Flowmark can handle expansion internally:

# Correct: Flowmark expands the glob (** works for recursive matching)
flowmark --auto 'docs/**/*.md'

# Risky: shell may expand ** incorrectly if globstar is off (the default in bash)
flowmark --auto docs/**/*.md

Without quoting, the shell may expand ** as a single * (matching only one directory level) or pass nothing if there are no matches. Flowmark uses Python’s pathlib.Path.glob() internally, which always supports ** for recursive matching regardless of shell settings.

Note: The --extend-include and --extend-exclude flags use gitignore-style patterns (e.g., *.mdx, drafts/), not shell globs.

Symlinks

During recursive directory traversal, symlinks are not followed. This prevents infinite loops from circular symlinks and avoids accidentally formatting files outside the project tree.

However, if you pass a symlink explicitly as an argument (e.g., flowmark --auto link-to-readme.md), the symlink is resolved and the target file is processed.

Configuration

Flowmark supports TOML-based configuration files. It searches for config files in this order (first match wins, walking up directories):

  1. .flowmark.toml
  2. flowmark.toml
  3. pyproject.toml (only if it has a [tool.flowmark] section)

Example Config

# flowmark.toml (or .flowmark.toml)

[formatting]
width = 100
semantic = true
smartquotes = true
ellipses = true
list-spacing = "preserve"

[file-discovery]
extend-include = ["*.mdx", "*.markdown"]
extend-exclude = ["drafts/", "archive/"]
files-max-size = 2097152  # 2 MiB

Or in pyproject.toml:

[tool.flowmark]
width = 100
semantic = true
extend-exclude = ["drafts/"]

Config vs --auto

The --auto flag is a fixed formatting preset that always enables --semantic, --cleanups, --smartquotes, and --ellipses. It ignores formatting settings from config files.

However, width and file discovery settings (excludes, max size, etc.) are always read from config regardless of --auto.

When not using --auto, all formatting settings can be configured via the config file and overridden by explicit CLI flags.

Use in VSCode/Cursor

You can use Flowmark to auto-format Markdown on save in VSCode or Cursor. Install the “Run on Save” (emeraldwalk.runonsave) extension. Then add to your settings.json:

  "emeraldwalk.runonsave": {
    "commands": [
        {
            "match": "(\\.md|\\.md\\.jinja|\\.mdc)$",
            "cmd": "flowmark --auto ${file}"
        }
    ]
  }

The --auto option is just the same as --inplace --nobackup --semantic --cleanups --smartquotes --ellipses.

For batch formatting an entire project, use flowmark --auto . from the terminal.

Agent Use (Claude Code and Other AI Coding Agents)

Flowmark can be installed as a skill for Claude Code and other AI coding agents, enabling automatic Markdown formatting in agent workflows.

Install the Skill

# Install globally (available to all projects)
flowmark --install-skill

# Or install to current project only
flowmark --install-skill --agent-base ./.claude

After installation, Claude Code will automatically recognize when to use Flowmark for Markdown formatting tasks.

Agent Skill Options

Flag Description
--skill Print skill instructions (SKILL.md content)
--install-skill Install Claude Code skill for flowmark
--agent-base DIR Agent config directory (default: ~/.claude)
--docs Print full documentation

Manual Usage in Agents

If you prefer to use Flowmark manually within agent sessions:

# Format with all auto-formatting options
flowmark --auto README.md

# Preview formatted output
flowmark README.md

# Format LLM output (use '-' for stdin)
echo "$llm_output" | flowmark --semantic -

Why Another Markdown Formatter?

There are several other Markdown auto-formatters:

  • markdownfmt is one of the oldest and most popular Markdown formatters and works well for basic formatting.

  • mdformat is probably the closest alternative to Flowmark and it also uses Python. It preserves line breaks in order to support semantic line breaks, but does not auto-apply them as Flowmark does and has somewhat different features.

  • Prettier is the ubiquitous Node formatter that handles Markdown/MDX

  • dprint-plugin-markdown is a Markdown plugin for dprint, the fast Rust/WASM engine

  • Rule-based linters like markdownlint-cli2 catch violations or sometimes fix, but tend to be far too clumsy in my experience.

  • Finally, the remark ecosystem is by far the most powerful library ecosystem for building your own Markdown tooling in JavaScript/TypeScript. You can build auto-formatters with it but there isn’t one that’s broadly used as a CLI tool.

All of these are worth looking at, but none offer the more advanced line breaking features of Flowmark or seemed to have the “just works” CLI defaults and library usage I found most useful.

Project Docs

For development workflows, see development.md.

Rust-specific docs:

About

Modern Markdown formatter with smart typography, line wrapping, and tag support (Rust port)

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors