-
unicode-width
Determine displayed width of char and str types according to Unicode Standard Annex #11 rules
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
ropey
A fast and robust text rope for Rust
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
icu_segmenter
Unicode line breaking and text segmentation algorithms for text boundaries analysis
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
fuzzy-matcher
Fuzzy Matching Library
-
unicode-script
exposes the Unicode Script and Script_Extension properties from UAX #24
-
unicode-truncate
Unicode-aware algorithm to pad or truncate str in terms of displayed width
-
unicode-bidi
Unicode Bidirectional Algorithm
-
unicode-properties
Query character Unicode properties according to UAX #44 and UTR #51
-
text-size
Newtypes for text offsets
-
os_display
Display strings in a safe platform-appropriate way
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
pdf-extract
extract content from pdfs
-
prettydiff
Side-by-side diff for two files
-
cow-utils
Copy-on-write string utilities for Rust
-
str_indices
Count and convert between indexing schemes on string slices
-
bstr
A string type that is not required to be valid UTF-8
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
sscanf
(inverse of format!()) Macro based on Regex
-
stfu8
Sorta Text Format in UTF-8
-
embedded-text
TextBox for embedded-graphics
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
textsurf
Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without loading everything into memory
-
strck
Checked owned and borrowed strings
-
sprintf
Clone of C s(n)printf in Rust
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
hck
A sharp cut(1) clone
-
mmdflux
Render Mermaid diagrams as Unicode text, ASCII, SVG, and MMDS JSON
-
difflib
Port of Python's difflib library to Rust
-
fasttext
binding
-
daachorse
Double-Array Aho-Corasick
-
byteyarn
hyper-compact strings
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
finetype-cli
CLI for FineType semantic type classification
-
resb
reading and writing ICU resource bundle files
-
glyph_brush_layout
Text layout for ab_glyph
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
wordcut-engine
Word segmentation/breaking library
-
abre
Shorten repetitive text for display. Pipe lines in, get disambiguated lines out.
-
quick-quiz-generator
generate interactive quizzes blazingly fast, right in your terminal
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
hi-doc-jumprope
fast rope (fancy string) library built on top of Skiplists - hi-doc fork
-
whitespace-sifter
Sift duplicate whitespaces away!
-
ncount
A word count tool intended to derive useful stats from markdown
-
meow-cli
Print ASCII cats to your terminal
-
scrolling_window_pattern_matcher
A unified pattern matching library with optional context support and extractor-driven architecture for streaming data processing
-
aki-xtee
copy standard input to each file and to standard output
-
colx
Extract the specified columns from FILES or stdin
-
aki-gsub
substitute text command, replace via regex
-
swappy
An anagram generator
-
aki-mcolor
mark up text with color
-
svgbob
Transform your ascii diagrams into happy little SVG
-
aki-mcycle
mark up text with cycling color
-
biip
Redact sensitive information from text!
-
perg
A fast, feature-rich text search tool similar to grep, written in Rust
-
lexxor
A fast, extensible, greedy, single-pass text tokenizer for Rust
-
content_inspector
Fast inspection of binary buffers to guess/determine the encoding
-
slugomatic
A simple CLI tool to slugify and unslugify text, perfect for branch names and URLs
-
nils-memo-cli
CLI crate for nils-memo-cli in the nils-cli workspace
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
herolib-core
Core utilities including text processing, networking, and HeroScript configuration language
-
sed-rs
A GNU-compatible sed implementation in Rust, powered by sd's regex engine
-
aki-stats
output the statistics of text, like the Linux wc command
-
fuzzy-muff
Fuzzy Matching Library
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
tamarind
TUI editor for ASCII/ANSI banner art. Tamarind (tamr) is an application for creating, previewing, and exporting colorful ASCII art for use in the terminal
-
aki-unbody
output first or last n lines, like the Linux head and tail commands
-
nu_plugin_emoji
a nushell plugin called emoji
-
aki-mline
match line, regex text filter like the Linux grep command
-
celo
BMP Steganography tool
-
zaz
cross-platform textual UI toolkit with bindings for Rust, C++, Zig, etc.
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
unicode-charname
functions for retrieving Unicode character name properties as described in Unicode Standard Annex #44
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
rstring
A comprehensive set of string manipulation utilities inspired by Apache Commons Lang3 StringUtils
-
kiru
Fast text chunking for Rust
-
niblits
Token-aware, multi-format text chunking library with language-aware semantic splitting
-
xim-ctext
compound text en/decoder
-
osclip
small utility to copy text from terminal to system clipboard
-
fontique
Font enumeration and fallback
-
threeway_merge
Git-style 3-way string merging using proven algorithms from libgit2/xdiff. Statically links xdiff (LGPL-2.1+).
-
wgpu_glyph
A fast text renderer for wgpu, powered by glyph_brush
-
jumprope
fast rope (fancy string) library built on top of Skiplists
-
kugiri
Marker-based block editing CLI
-
prose
Reformat text pleasantly
-
recase
Changes the convention case of input text
-
pdf_tables
Scrape text from tables in PDF files
-
cql2
Parse, validate, and convert Common Query Language (CQL2) text and JSON
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
tui-shimmer
Shimmer text effect for terminal UIs
-
whitespace-rs
Tools for fixing whitespace problems in text and source code files
-
inboxbot
A telegram bot to save messages to a file
-
pastebinit-rs
Just Paste It! A simple CLI tool to paste text to various pastebin services
-
assert-text
the testing macro tools
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
simplematch
Fast wildcard pattern matching for strings and bytes with a simple api
-
spellabet
Convert characters into spelling alphabet code words
-
booky
analyze English text
-
sapling-renderdag
Render a DAG into ASCII or Unicode text
-
ttlint
Small, fast utility to lint text
-
dec_from_char
Small library for converting unicode decimal into numbers
-
shifter
A CLI tool to rotate lines from stdin by n positions
-
difference-rs
text diffing and assertion library
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
plsfix
Text cleaner upper
-
dec-sixbit
DEC SIXBIT encoding
-
bevy_text_animation
text animation library for Bevy
-
ansi-align
Text alignment library with ANSI escape sequence and Unicode support
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
niho
A command-line tool for converting romanized Japanese text to Japanese characters
-
finetype-core
Core taxonomy and data generation for FineType
-
ephemeral_hub
Ephemeral Hub is a temporary, no-login-required hub for text, files, and collaborative brainstorming. It provides a digital equivalent of a scrap piece of paper or a temporary whiteboard…
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
text-editing
string with utilities for editing
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
srt2txt
Convert SRT subtitle files into clean plain text (strip timestamps, tags, merge lines)
-
subslay
Text → emoji, powered by Rust
-
thai-number-text
Convert numbers to Thai text with Baht currency, accounting style, and scientific notation support
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
line_cutter
quickly derive structs that de/serialize positionally encoded text data
-
freya-edit
Text Editing APIs for Freya
-
xquo
Quote null-split lines for the Bash command line
-
stylish
implementing colorized text
-
syllabize-es
Syllabize Spanish text, and much more
-
tui-dialog
A widget for entering a single line of text in a dialog for Ratatui
-
newline_normalizer
Zero-copy newline normalization to \n or \r\n with SIMD acceleration
-
finetype-model
Candle-based transformer model for FineType
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
french-numbers
Represent numbers in French language
-
text-tokenizer
Custom text tokenizer
-
allegro_font
Allegro 5 font addon Rust wrapper
-
slugrs
A fast, locale-aware slugify library for Rust
-
voidio
VOID I/O - High-performance Cross-platform I/O for Rust
-
text-scatters
A cut-up technique generator from text and ebook files in the terminal
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
genpdf-json
PDF generator using JSON data
-
protextinator
Text management, made simple
-
txlib
text based epub library
-
plainllm
A plain & simple LLM client
-
awful_book_sanitizer
CLI to clean up OCR-mangled book excerpts into readable text using OpenAI-compatible APIs
-
grapheme
Abstractions for working with extended Unicode grapheme clusters
-
mdbook-replace
mdBook preprocessor that simply replaces text
-
eunicode
A text processing CLI and library that helps sanitize text by removing the naughty bits to make strings good and safe
-
maybe_xml
XML processing library
-
askalono-cli
detect the contents of license files
-
penmanship
A Unicode character lookup library for converting text patterns to Unicode characters
-
topiary-queries
tree-sitter query files compatible with Topiary
-
aozora2
Aozora Bunko format converter CLI
-
extractous
fast and efficient way to extract content from all kinds of file formats including PDF, Word, Excel, CSV, Email etc... Internally it uses a natively compiled Apache Tika for formats that are not supported natively by the Rust…
-
bevy_ecs_typewriter
ECS typewriter system for Bevy engine
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
waterui-str
String utilities for WaterUI
-
wordcloud-rs
A word-cloud image generation crate
-
chonkie
Chonkie, now in Rust: No-nonsense, ultra-fast, ultra-light chunking library
-
tokenizations
alignments library
-
rehuman
Unicode-safe text cleaning & typographic normalization for Rust
-
read_input
Ask for user input until the data inputted is valid
-
trxx
pack and unpack text files
-
nib
static site generator
-
nixel
Parser for the Nix Expressions Language
-
rs-tail1
Prints the last line of the input
-
interpolator
runtime format strings, fully compatible with std's macros
-
microslop
Turn your text into beautifully chaotic, glitchy, Wandoze-level slop
-
text-similarity-metrics
A high-performance Rust library for computing text similarity using multiple algorithms
-
lowcharts
draw low-resolution graphs in terminal
-
argsplitter
A light-weight command line argument parser
-
japanese-text
Japanese text normalization library: full-width/half-width conversion, hiragana/katakana conversion
-
rslug
fast, and configurable library to create URL-friendly slugs from strings
-
clipcount
Counting words from the clipboard content
-
recoco-splitters
Text splitters/parsers for ReCoco, an all-Rust fork of CocoIndex with greater flexibility
-
mini_markdown
Small markdown renderer
-
sixbit
Small packed strings
-
patcher
generating and applying Git-style unified diff patches
-
clip-sanitize
Meta-library for robust text sanitization, repair, and normalization
-
project-finder
Fast project finder for developers
-
up2code
check code listings in Markdown files against a GitHub repository
-
alphabetic
featuring a struct that represents a letter in Latin-script alphabet
-
egui_keyboard
virtual keyboard widget for egui
-
skimple
interface for the skim fuzzy-matcher
-
wordnet-db
Memory-mapped reader for prebuilt WordNet database files
-
xa
Execute Anything via LLM - A CLI tool for arbitrary text processing using LLMs
-
clipin
get text from clipboard or stdin
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
spyne
All-in-one toolkit
-
grapheme_machine
Grapheme cluster text segmentation (UAX #29) state machine for streaming input
-
fpas
command-line software that helps you create a password from some text
-
editdistancek
Fast algorithm for computing edit distance
-
clearhead_cli
A CLI tool and library for working with the clearhead framework
-
flat
Project multi-dimensional data onto the flat textual plane
-
aozora2text
Convert Aozora Bunko format to plain text
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
reda-geometry
geometry library
-
cfasttext-sys
fastText ffi binding
-
printerator
simply print iterators without collecting them
-
image-anonymizer
A command-line tool to detect and mask sensitive content in images
-
md-parser-kma
parser that converts basic Markdown syntax (headers, bold, italic, lists) into corresponding HTML tags
-
nib-cli
A CLI for Nib, yet another static site generator
-
simple-summarize
Fast extractive text summarizer in Rust (with 30-70% compression)
-
lavendeux-parser
Extensible parsing engine for expression evaluation
-
matchy-extractor
Fast extraction of IPs, domains, emails, hashes from text (internal)
-
thediff
Difference between 2 files in percentages
-
rust-logger
logger
-
tiny-grep
grep-like text search utility written in Rust
-
wordnet-types
Shared types and part-of-speech enums for WordNet data
-
string-view
Work with views into string slices. Safely extend, reduce without losing parent string size. Modify str slices in-place to avoid allocations.
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
grink
find and check web links in text files
-
rustdoc-text
A lightweight library and CLI tool to view Rust documentation as plain text
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
intuicio-framework-text
Text types framework module for Intuicio scripting platform
-
file-action
file operations including reading/writing text, CSV, and JSON files
-
fexpr
filter query language parser so that you can build SQL, Elasticsearch, etc. queries safely from user input.
-
santiago
A lexing and parsing toolkit for Rust
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
utf8-io
Traits and types for UTF-8 I/O
-
egui_text_animation
text animation using egui
-
easy-sgr
An easy-to-use Select Graphic Rendition (SGR) library
-
syllarust
quickly counting syllables
-
colored-print
Concise syntax for colors and styles in a format string literal
-
genpdfi
User-friendly PDF generator written in pure Rust
-
corn
a socket server for free chat and a web server to display files
-
crawdad-rkyv
Crawdad: ChaRActer-Wise Double-Array Dictionary with rkyv support
-
extract-frontmatter
that allows a user to extract an arbitrary number of lines of 'front-matter' from the start of any string
-
content-canonical
Content canonicalization and text normalization library
-
text-span
Struct covering areas of text
-
slugi
slugify text (e.g., 'Café au lait' → 'cafe_au_lait')
-
mi_primera_lib
Basic math and text utilities for Rust
-
duat-jump-list
A jump list for Duat, meant to be used by modes
-
line_diff
compare single lines by tokenizing them into chunks
-
btt
Binary to text encodings
-
translitrs
Transliteration utility for Serbian language
-
suffix
arrays
-
quot
A fast and flexible command-line tool that converts text input into escaped string literals
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
sayit
String replacements using regex
-
zen-colour
The most minimal terminal colour and style crate
-
psource
CLI tool to pretty print source code to stdout or directly to the clipboard
-
chronofold
A conflict-free replicated data structure (a.k.a CRDT) for versioned text
-
text-grid
create formatted plain-text tables
-
crawdad
ChaRActer-Wise Double-Array Dictionary