-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
png
PNG decoding and encoding library in pure Rust
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
bytecount
count occurrences of a given byte, or the number of UTF-8 code points, in a byte slice, fast
-
skrifa
Metadata reader and glyph scaler for OpenType fonts
-
yrs
High performance implementation of the Yjs CRDT
-
daachorse
Double-Array Aho-Corasick
-
lsd
An ls command with a lot of pretty colors and some other stuff
-
stringprep
algorithm
-
swc_ecma_utils
swc ecmascript ast nodes
-
garde
Validation library
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
shell-quote
shell-quoting strings, e.g. for interpolating into a Bash script.
-
grex
grex generates regular expressions from user-provided test cases
-
icu_collections
Collection of API for use in ICU libraries
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
bstr
A string type that is not required to be valid UTF-8
-
oem_cp
that handles OEM code pages (e.g. CP{437,737,850}) for single byte character sets
-
swc_ecma_parser
Feature-complete es2019 parser
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
raylib
Safe Rust bindings for Raylib
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
utf8-decode
UTF-8 incremental decoding iterators
-
allsorts
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
crop
A pretty fast text rope
-
stringsext
find multi-byte-encoded strings in binary data
-
gix-path
gitoxide project dealing paths and their conversions
-
ason
data serialization format that evolved from JSON, featuring strong numeric typing and native support for enumeration types
-
hstr
A string type which is optimized for hash operations
-
textdistance
Lots of algorithms to compare how similar two sequences are
-
stfu8
Sorta Text Format in UTF-8
-
bebytes
serialization and deserialization of network structs
-
zenpixels
Pixel format interchange types for zen* codecs
-
obeli-sk-boa-string
Boa is a Javascript lexer, parser and compiler written in Rust. Currently, it has support for some of the language.
-
uwc
Counts things in unicode text files
-
rylai
Generate Python .pyi stub files from pyo3-annotated Rust source code statically without compilation
-
binary-layout
allows type-safe, inplace, zero-copy access to structured binary data. You define a custom data layout and give it a slice of binary data, and it will allow you to read and write the…
-
hdds-samples-types
HDDS Type samples demonstrating DDS type serialization
-
swc_experimental_ecma_parser
An extensible Rust-based platform for the next generation of fast developer tools
-
headson
Budget‑constrained JSON preview renderer
-
utf8-zero
Zero-copy, incremental UTF-8 decoding with error handling
-
nu_plugin_regex
nu plugin to search text with regex
-
precis-core
PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings in Application Protocols as defined in rfc8264
-
jonesy
here to help you not panic!
-
rushdown
A 100% CommonMark-compatible GitHub Flavored Markdown parser and renderer
-
rxml_validation
Plumbing crate for rxml and rxml_proc crates
-
ts_query_ls
An LSP implementation for Tree-sitter's query files
-
mp4ra-rust
Types and associated constants representing code points managed by the MPEG4 Registration Authority
-
whichlang
A blazingly fast and lightweight language detection library for Rust
-
feruca
Unicode Collation Algorithm
-
versatiles_glyphs
generating SDF glyphs from fonts
-
shaperglot
Test font files for OpenType language support
-
arf-strings
Encoding and decoding for ARF strings
-
bies
Helpers for dealing with BIES vectors with text segmentation applications
-
japanese-codepoints
A high-performance Rust library for Japanese character validation and code point handling based on JIS standards
-
tpuf_icu_properties_211
Definitions for Unicode properties
-
libyaff
Font Format (YAFF) - A compact bitmap font format with kerning support
-
kamo
assist in the creation of an interpreter or compiler and its associated runtime
-
plain_trie
Classic trie implementation capable of mapping any T to char iterator
-
futf
Handling fragments of UTF-8
-
urandom
Produce and consume randomness, to convert them to useful types and distributions, and some randomness-related algorithms
-
wary
validation and transformation library
-
sweeten
sweetenyour dailyicedbrew -
font-test-data
Test data for the fontations crates
-
json-schema-rs
generate Rust structs from JSON Schema
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
dom-content-extraction
Content extraction via text density paper
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
versed
generating DTOs and their migrations from schema descriptions
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
libyay
YAY (Yet Another YAML) parser - a human readable, writable, and diffable data format
-
ruma-identifiers-validation
Validation logic for ruma-common and ruma-macros
-
skera
Subsetting a font file according to provided input
-
core-json
A non-allocating JSON deserializer
-
gws-builder
Build-time codegen from Google API Discovery: Rust types, method metadata, and serde helpers for Workspace agents
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
eunicode
A text processing CLI and library that helps sanitize text by removing the naughty bits to make strings good and safe
-
material-icons
Strongly-typed codepoint mappings for the Google Material Icons font
-
identifier_safety
Unicode confusable character detection and canonicalization
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
bbpe
Binary byte pair encoding (BPE) trainer and CLI compatible with Hugging Face tokenizers
-
tpuf_icu_collections_211
Collection of API for use in ICU libraries
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
fontspector-checkapi
developing checks for the fontspector font QA tool
-
google-fonts-glyphsets
Google Fonts font axis support data
-
google-fonts-languages
Google Fonts script and language support data
-
ezemoji
Catigoryized Emoji's
-
icu_codepointtrie_builder
Runtime builder for CodePointTrie
-
kiri-kotoba
Input text processing for Kiri Japanese morphological analyzer
-
peasytext
Rust client for PeasyText — text tools, glossary, and guides API
-
iconflow
Unified icon library for Rust GUI apps with 10+ embedded icon packs (Bootstrap, Heroicons, Phosphor, Lucide, Tabler). Type-safe API for egui, iced and more
-
ucd-parse
parsing data files in the Unicode character database
-
television-nucleo-matcher
plug and play high performance fuzzy matcher
-
atuin-nucleo-matcher
plug and play high performance fuzzy matcher
-
google-fonts-subsets
Google Fonts subset definitions
-
incremental-font-transfer
Client side implementation of the Incremental Font Transfer standard (https://w3c.github.io/IFT/Overview.html)
-
ucd-trie
A trie for storing Unicode codepoint sets and maps
-
iftree
Include many files in your Rust code for self-contained binaries
-
ignite-v2-client
Apache Ignite v2 Client
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
nerdle
A macro-powered compile-time nerd-font code point resolver
-
ecmascript_atomics
ECMAScript and Java memory model racy atomics
-
servo-base
A component of the servo web-engine
-
string-mumu
String functions and tools plugin for the Lava / Mumu language
-
unicode-charname
functions for retrieving Unicode character name properties as described in Unicode Standard Annex #44
-
string-offsets
Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines
-
dbxcase
Dropbox-compatible case-folding algorithm
-
sanitise-file-name
An unusually flexible and efficient file name sanitiser
-
bhc-types
Type system, type inference, and type representations for BHC
-
fontcull-skrifa
Metadata reader and glyph scaler for OpenType fonts. (Vendored fork for fontcull)
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
ai-png
A no_std fork of png
-
avr-progmem
Progmem utilities for the AVR architectures
-
lsp-positions
LSP-compatible character positions
-
sgx_tstd
Rust SGX SDK provides the ability to write Intel SGX applications in Rust Programming Language
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
typg-core
Core search/discovery engine for typg (made by FontLab https://www.fontlab.com/)
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
tpuf_icu_properties
Definitions for Unicode properties
-
uresamp
HIFI uresamp delivers ultrasonic-fidelity text resampling via adaptive 64-bit floating-point spectral mapping, preserving Unicode 32-bit codepoint integrity with zero-phase distortion
-
unicode-id-trie-rle
Unicode identifier (UAX #31) checker using a deduplicated trie with run-length encoded leaves
-
harfbuzz_rs_now
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
zeph-scheduler
Cron-based periodic task scheduler with SQLite persistence for Zeph
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
base256u
mapping between bytes and Unicode codepoints
-
utf8proc
Rust bindings to the utf8proc library
-
bevy_emoji
Color emoji rendering for Bevy — bypasses swash's broken CBDT support by extracting bitmap glyphs directly via ttf-parser
-
hebrew_accents
finding, filtering, and displaying Hebrew accents, specifically focusing on the Tiberian accent system as documented by the Masoretes
-
ratex-font
Font metrics and symbol tables for RaTeX
-
string_manipulation_utf8
String manipulation functions using character indexing (UTF-8) instead of bytes
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
steam-mail
Minimal SMTP server for extracting Steam Guard codes from incoming emails
-
crlf-to-lf-inplace
Fast in-place CRLF to LF line ending conversion for Rust strings. Uses memchr for good performance without custom SIMD.
-
arnak
BoardGameGeek XML API
-
zhtw
Traditional Chinese converter for Taiwan — Rust SDK
-
univiz
A command-line tool for analyzing Unicode strings, providing detailed information about graphemes, code points, and UTF-8 byte sequences
-
unicode-language
detect language coverage given a list of codepoints
-
dangerous
Safely and explicitly parse untrusted / dangerous data
-
bparse
parsing bytes
-
ai-stringprep
A no_std fork of stringprep
-
tpuf_collections_211
Collection of API for use in ICU libraries
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
cesu8str
Convert to and from CESU-8 or Modified UTF-8 encodings, only when necessary
-
rustpython-wtf8
WTF-8 for use in RustPython
-
chars
A commandline tool to display information about unicode characters
-
packtab-macro
Proc-macro for packtab compile-time table generation
-
unicop
scanning source code for potentially malicious unicode code points. Helps prevent Trojan source bidi attacks, homoglyph attacks, invisible character attacks etc. Intended to run manually…
-
fontcull-font-test-data
Test data for the fontations crates. (Vendored fork for fontcull)
-
east-asian-width
Determine the display width of Unicode characters in East Asian contexts
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
staart
tail-like library and binary crate
-
unicodefyi
Unicode character lookup with 17 encodings — async API client for unicodefyi.com
-
uv-globfilter
internal component crate of uv
-
ucd-general-category-ranges
Unicode character ranges by general category
-
lexical-sort
Sort Unicode strings lexically
-
whatwg-infra
Tiny Rust-based implementation of the WHATWG Infra Standard
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
gofmt
implemention
-
wiwi
Stuff™
-
unicode-writing-script
The package provides detection of writing scripts from Unicode codepoints
-
fontcull-klippa
Subsetting a font file according to provided input. (Vendored fork for fontcull)
-
jismesh
handling and converting JIS X0410 area mesh codes
-
uwl
A management stream for bytes and characters
-
blurry
SDF font asset generation
-
twitter_text_config
Configuration for twitter-text in Rust
-
ratex-types
Shared types for the RaTeX math typesetting engine
-
fmtbuf
formatting to a fixed-size buffer
-
diff3proof
A font proofing tool that knows about languages
-
rfc9839-rs
RFC9839 to test for problematic Unicode code points
-
collect-rs
Special and custom Collections and Containers (for Rust)
-
ming
Minimalist pedantic command line parser
-
libghostty-vt-sys
Raw FFI bindings for libghostty-vt, the Ghostty terminal emulation library
-
mkwebfont_hb-subset
A wrapper for HarfBuzz font subsetting API
-
freetypegl
Rust build helpers and bindings for freetype-gl
-
aparte
XMPP console client written in Rust and inspired by Profanity
-
diffenator3-lib
A font comparison tool that will not stop until your fonts are exhaustively compared
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
plainjson
simply provides low-level access and JSONPath way to query or set JSON values
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
residua-cesu8
converting between CESU-8 and UTF-8
-
basehan
A data encryption method using Chinese characters. Kind of like base64.
-
typg-python
PyO3 bindings for typg-core (made by FontLab https://www.fontlab.com/)
-
rfc9839
RFC 9839 specification
-
character-set
High performance
set.contains(char) -
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
mandarin-to-pinyin
converting Mandarin Chinese to Pinyin
-
char_reader
Safely read wild streams as chars or lines
-
residua-mutf8
converting between MUTF-8 and UTF-8
-
codes-iso-4217
This package contains an implementation of the ISO 4217 Currency Codes specification
-
mr-mime
A no_std MIME type library for Rust
-
icu_unicodeset_parse
API to parse unicode sets as defined in UTS35
-
wtf8-rs
WTF-8 encoding
-
li-http-types
Common types for HTTP operations
-
raster-fonts
Bitmap font creation tool and accompanying metadata deserialization library
-
var_byte_str
Variable byte encoding string by using gap instead of code point
-
i-dunno
RFC 8771 Internationalized Deliberately Unreadable Network Notation
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
opstr
‘Operate on strings’ command line utility
-
owned_str
Provide a stack allocated String for no-std or const environement
-
hfs_nfd
Handle Apple's unique NFD-like Unicode normalization, which is used in HFS+, in Rust
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
unicode-id-trie-rle-derived-core-properties
A parser for the unicode database DerivedCoreProperties.txt file used in the unicode-id-trie-rle crate
-
utf8proc-sys
Unsafe rust bindings to the utf8proc library
-
hb-subset
A wrapper for HarfBuzz font subsetting API
-
utf8-fix
Fix invalid UTF-8 sequences in-place while preserving buffer size - useful for fuzzing and mutation testing
-
libbaremetal-unifont
accessing GNU Unifont glyphs in bare-metal environments
-
base256emoji
A multibase base256emoji implementation
-
lesbar
String types that must encode legible text
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
martin-core
Basic building blocks of MapLibre's Martin tile server
-
bdf2
BDF format handling
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
utf8char
that supplies a utf8 encoded char
-
known-values
Blockchain Commons Known Values
-
unifont-bitmap
Get bitmap data for GNU Unifont. Embedded in compressed form, light on memory.
-
kittenmoji
encode and decode kittenmoji
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
u-plus
Pretty Unicode code point literals: U+12345 instead of '\u{12345}'
-
piper-tts-rs-sys
Piper-TTS raw implementation in Rust
-
classreader
that reads, writes, and parses Java Virtula Machine class files
-
embedded-mogeefont
Pixel font for embedded-graphics
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
utf8toipv4
Convert UTF-8 to ipv4 addresses and vice versa
-
base131072
encoding
-
font-awesome
A typed manifest of constants of Font Awesome unicode code points
-
lucide-rs
Provide lucide icon for rust
-
syntax-rs
parsing programming languages
-
font-map-core
Core font-parsing capabilities for font-map
-
difftastic_mod_for_windows
A diff that understands syntax (fork which fixes terminal size for windows)
-
pattern-3
Needle API (née Pattern API 3.0), generalization of
std::str::pattern -
pretok
A string pre-tokenizer for C-like syntaxes
-
inf-rs
INF file parse library
-
blanket-script
BlanketScript is a simple script language inspired by Rust that transpiles to JavaScript
-
palpo-identifiers-validation
Validation logic for palpo-core and palpo-macros
-
triangle-from-scratch
An educational series on how to do stuff yourself
-
rscenes-raylib-connector
Raylib connector for Rscenes 2
-
vga-rs
Low level VGA(0xB8000) library in freestanding Rust
-
encoding-next-index-singlebyte
Index tables for various single-byte character encodings
-
boxy
Declarative builder for Unicode box-drawing characters
-
is-vowel
Heuristically test whether a character is a vowel letter
Try searching with DuckDuckGo or on crates.io.