-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
unicode-width
Determine displayed width of
charandstrtypes according to Unicode Standard Annex #11 rules -
ropey
A fast and robust text rope for Rust
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
similar
A diff library for Rust
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
const_format
Compile-time string formatting
-
convert_case
Convert strings into any case
-
lazy-regex
lazy static regular expressions checked at compile time
-
tabled
An easy to use library for pretty print tables of Rust
structs andenums -
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
pulldown-cmark
A pull parser for CommonMark
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
heck
case conversion library
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
html2text
Render HTML as plain text
-
lopdf
PDF document manipulation
-
ansi-to-tui
Convert ANSI color and style codes into Ratatui Text
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
mime_guess
detection of a file's MIME type by its extension
-
sqlformat
Formats whitespace in a SQL string to make it easier to read
-
prometheus-client
Open Metrics client library allowing users to natively instrument applications
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16andu32string types are provided, including support for UTF-16 and UTF-32… -
termimad
Markdown Renderer for the Terminal
-
fuzzy-matcher
Fuzzy Matching Library
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
printpdf
reading and writing PDF files
-
unicase
A case-insensitive wrapper around strings
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
diff
An LCS based slice and string diffing implementation
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
lindera
A morphological analysis library
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
grok
popular Java & Ruby grok library which allows easy text and log file processing with composable patterns
-
const-str
compile-time string operations
-
roff
ROFF (man page format) generation library
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
strin terms of displayed width -
protobuf
Protocol Buffers - Google's data interchange format
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
chardetng
A character encoding detector for legacy Web content
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
garde
Validation library
-
charabia
detect the language, tokenize the text and normalize the tokens
-
unicode-script
exposes the Unicode
ScriptandScript_Extensionproperties from UAX #24 -
pdf-extract
extract content from pdfs
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
regress
A regular expression engine targeting EcmaScript syntax
-
diffy
Tools for finding and manipulating differences between files
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
synoptic
low-level, syntax highlighting library with unicode support
-
os_display
Display strings in a safe platform-appropriate way
-
regex-syntax
A regular expression parser
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
focaccia
no_std implementation of Unicode case folding comparisons
-
line-index
Maps flat
TextSizeoffsets to/from(line, column)representation -
harfrust
A complete HarfBuzz shaping algorithm port to Rust
-
prettydiff
Side-by-side diff for two files
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
cow-utils
Copy-on-write string utilities for Rust
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
unicode-reverse
Unicode-aware in-place string reversal
-
heatseeker
A fast, robust, and portable fuzzy finder
-
stop-words
Common stop words in many languages
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
mupdf
Safe Rust wrapper to MuPDF
-
stringzilla
Search, hash, sort, fingerprint, and fuzzy-match strings faster via SWAR, SIMD, and GPGPU
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
shiguredo_http11
HTTP/1.1 Library
-
epub-builder
generating EPUB files
-
typstyle
The CLI for Typstyle
-
edit
Open a file in the default text editor
-
localgpt
A local device focused AI assistant with persistent markdown memory, autonomous heartbeat tasks, and semantic search. Single binary, no runtime dependencies.
-
any_ascii
Unicode to ASCII transliteration
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
emojic
Emoji constants
-
html2md
binary to convert simple html documents into markdown
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
ascii
ASCII-only equivalents to
char,strandString -
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
stfu8
Sorta Text Format in UTF-8
-
regex-cursor
regex fork that can search discontiguous haystacks
-
str_indices
Count and convert between indexing schemes on string slices
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
unicode-properties
Query character Unicode properties according to UAX #44 and UTR #51
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
htmd
A turndown.js inspired HTML to Markdown converter
-
crop
A pretty fast text rope
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
igrepper
The interactive grepper
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
textsurf
Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without everything into memory
-
hypher
separates words into syllables
-
zpl_toolchain_cli
Command-line interface for parsing, validating, formatting, and printing ZPL II label code (part of the zpl-toolchain project)
-
icu_pattern
ICU pattern utilities
-
rich_rust
port of Python's Rich library for beautiful terminal output
-
textdistance
Lots of algorithms to compare how similar two sequences are
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
panache
An LSP, formatter, and linter for Pandoc markdown, Quarto, and RMarkdown
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
marmite
easiest static site generator
-
diff-match-patch-rs
The fastest implementation of Myer's diff algorithm to perform the operations required for synchronizing plain text
-
mdvault
CLI tool for managing markdown vaults with structured notes, validation, and search
-
mmdflux
Render Mermaid diagrams as Unicode text, ASCII, SVG, and MMDS JSON
-
difflib
Port of Python's difflib library to Rust
-
htop
HTML to PDF converter
-
hgrep
grep tool with human-friendly search output. This is similar to
-Coption ofgrepcommand, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
frizbee
Fast typo-resistant fuzzy matching via SIMD smith waterman, similar algorithm to FZF/FZY
-
deno_media_type
Media type used in Deno
-
hck
A sharp cut(1) clone
-
kak-lsp
Kakoune Language Server Protocol Client
-
treelog
A highly customizable, optimized, and modular tree rendering library
-
lsp-textdocument
A LSP text documents manager that map of text document
-
unicode_categories
Query Unicode category membership for chars
-
rumdl
A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
arrow-string
String kernels for arrow arrays
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
terraphim_rolegraph
Terraphim rolegraph module, which provides role handling for Terraphim AI
-
olpc-cjson
serde_json Formatter to serialize as OLPC-style canonical JSON
-
font-types
Scalar types used in fonts
-
codebook
A code-aware spell checker library (dependency for codebook-lsp)
-
markdown_timesheet
processing markdown files to extract and format timesheet data
-
dwrote
Lightweight binding to DirectWrite
-
glob-match
An extremely fast glob matcher
-
kbremap
Custom keyboard layouts for windows
-
zhconv
Traditional, Simplified and regional Chinese variants converter powered by MediaWiki & OpenCC rulesets and the Aho-Corasick algorithm 中文简繁及地區詞轉換
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
esed
Easy sed
-
fontcull
Pure Rust font subsetting library
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
entities
raw data needed to convert to and from HTML entities
-
sdml-cli
Rust CLI for Simple Domain Modeling Language (SDML)
-
uwc
Counts things in unicode text files
-
papers-cli
CLI for academic paper search, management, and local RAG
-
arborium-c
C grammar for arborium (tree-sitter bindings)
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
matchers
Regex matching on character and byte streams
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
mime-infer
detection of a file's MIME type by its extension
-
boxen
creating styled terminal boxes around text with performance optimizations
-
mdbook-yapp
mdBook preprocessor for simple text replacements
-
giallo
A code highlighter giving the same output as VSCode
-
serpl
terminal UI for search and replace, ala VS Code
-
repgrep
An interactive command line replacer for
ripgrep -
html-to-markdown-cli
Command-line interface for html-to-markdown - high-performance HTML to Markdown converter
-
languagetool-rust
LanguageTool API bindings in Rust
-
mktoc
Generate Table of Contents from Markdown files
-
qpdf
Rust bindings to QPDF C++ library
-
xan
The CSV magician
-
tiefdownconverter
A CLI tool to manage and convert Markdown-based projects
-
rst_parser
a reStructuredText parser
-
dug
An exhaustive name resolution aggregator
-
hyperlink
Very fast link checker for CI
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
quamina
Fast pattern-matching library for filtering JSON events
-
at-commands
AT Commands builder and parser for Rust #![no_std]
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
savvy
R extension interface
-
finetype-cli
CLI for FineType semantic type classification
-
fasttext
binding
-
line-ending
Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.
-
usage-lib
working with usage specs
-
rschess
chess library with the aim to be as feature-rich as possible
-
srgn
A grep-like tool which understands source code syntax and allows for manipulation in addition to search
-
chewing
(酷音) intelligent Zhuyin input method
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
collclean
Clean up collaboration commands in LaTeX files
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
markdown-tool
A CLI utility for converting Markdown into AST and vice versa
-
mdbook-epub
An EPUB renderer for mdbook
-
chat-gpt-lib-rs
interacting with OpenAI's ChatGPT API, providing a simple interface to make API requests and handle responses
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
attack-data
Request Mitre ATTACK data offline
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
mq-lang
Core language implementation for mq query language
-
kas-text
Text layout and font management
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
ferroni
Pure-Rust Oniguruma regex engine with SIMD-accelerated search
-
deno_doc
doc generation for deno
-
buup
Core transformation library with zero dependencies
-
sapling-streampager
streampager is a pager for command output or large files
-
presenterm
A terminal slideshow presentation tool
-
mdr
A lightweight Markdown viewer with live reload and multiple rendering backends
-
uncomment
A CLI tool to remove comments from code using tree-sitter for accurate parsing
-
treegrep
regex pattern matcher that displays results in a tree structure with an interface to jump to matched text
-
dptran
run DeepL translations on command line written by Rust
-
simdnbt
an unnecessarily fast nbt decoder
-
todo_lib
Collection of utilities for todo.txt format
-
inlinable_string
inlinable_stringcrate provides theInlinableStringtype – an owned, grow-able UTF-8 string that stores small strings inline and avoids heap-allocation – and theStringExttrait… -
decasify
A CLI utility and library to cast strings to title-case according to locale specific style guides including Turkish support
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
jx
An interactive JSON explorer for the command line
-
fop-cli
Command-line interface for Apache FOP - XSL-FO to PDF converter
-
yggdrasil-cli
Yggdrasil is a project flattener and diff engine that turns any subset of your codebase into a single AI-ready codex (index + contents), or compares snapshots with annotated diffs
-
rdfless
A colorful pretty printer for RDF (Turtle/TriG/N-Triples/N-Quads/PROV-N) with ANSI colors
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
awabi
A morphological analyzer using mecab dictionary
-
rapidfuzz
rapid fuzzy string matching library
-
termdiff
Write a diff with color codes to a string
-
turbovault-parser
Obsidian Flavored Markdown (OFM) parser
-
md-tui
A terminal markdown viewer
-
patchkit
parsing and manipulating patch files
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
typst-kit
Common utilities for Typst tooling
-
ferris-says
flavored replacement for the classic cowsay
-
stylin
Convert markdown to pandoc markdown with custom styles
-
kazoe
Fast wc replacement
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
madato
command line tool for reading and writing tabular data (XLS, ODS, CSV, YAML), and Markdown
-
norad
Read and write Unified Font Object files
-
harper-core
The language checker for developers
-
vaporetto
pointwise prediction based tokenizer
-
minimizer
Minimize files to find minimal test case
-
askalono
detect the contents of license files
-
supermarkdown-cli
CLI for supermarkdown HTML to Markdown conversion
-
epub
support the reading of epub files
-
flickzeug
A fork of diffy: diff, patch, and merge library featuring Myers' algorithm, unified diff format parsing, fuzzy patch application, and three-way merge with conflict detection
-
wit_owo
interacting with the Wit.ai API
-
iepub
epub、mobi电子书读写
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
hermes-tool
CLI tools for Hermes - index management, simhash, sorting, and data processing
-
nu_plugin_regex
nu plugin to search text with regex
-
hongdown
A Markdown formatter that enforces Hong Minhee's Markdown style conventions
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
mkdlint
A style checker and lint tool for Markdown/CommonMark files, written in Rust
-
blockwatch
Language agnostic linter that keeps your code and documentation in sync and valid
-
red-sed
An experimental drop-in replacement for GNU sed, written in Rust
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
wordcut-engine
Word segmentation/breaking library
-
rsrpp
project for research paper pdf
-
ul/kak-lsp
Kakoune Language Server Protocol Client
-
bochi
A CLI tool to interact with Android UI elements with CSS-like selectors
-
stringdex
A suffixtree search system for static sites
-
rake
Rapid Automatic Keyword Extraction (RAKE) algorithm
-
trueno-rag
Pure-Rust Retrieval-Augmented Generation pipeline built on Trueno
-
inlyne
Introducing Inlyne, a GPU powered yet browserless tool to help you quickly view markdown files in the blink of an eye
-
oranda
🎁 generate beautiful landing pages for your projects
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
inflection-rs
Inflection is a string transformation library. It singularizes and pluralizes English words, and transforms strings from CamelCase to underscored string.
-
apisnip
A terminal user interface (TUI) tool for trimming OpenAPI specifications down to size ✂️
-
icy_sauce
handling SAUCE – Standard Architecture for Universal Comment Extensions
-
ctj
A command-line tool to convert CSV to JSON written in Rust
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
arf-strings
Encoding and decoding for ARF strings
-
mad-useful
A multi-tool utility for file operations and text processing
-
mad
A fast Markdown terminal renderer with syntax highlighting
-
quickmark-cli
Lightning-fast Markdown/CommonMark linter CLI tool with tree-sitter based parsing
-
unicode-casing
Titlecase helper function on characters
-
phs
Runtime for Phlow Script, PHS
-
shiguredo_toml
TOML Library
-
topo-score
BM25F, heuristic, structural, and RRF fusion scoring
-
typub-html
HTML processing utilities for typub (AST types, parsing, serialization, SVG handling, link resolution)
-
sift-search
Sift — a DSL for agents to search codebases in faster, deeper, and more sophisticated ways
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
crabular-cli
A CLI tool for generating ASCII tables
-
cli-pdf-extract
Fast Rust CLI wrapper around pdf_oxide for LLM-friendly PDF extraction
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
koji
An interactive CLI for creating conventional commits
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
wildcard
matching
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
textra
type less, write more
-
task-heap
stack tasks and pop them randomly
-
sara-cli
CLI for Sara - Requirements Knowledge Graph
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
cirru_parser
Parser for Cirru text syntax
-
llmvm-core
The core application for llmvm
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
rsxxd
hex-dump-type utility rsxxd
-
mime_guess2
detection of a file's MIME type by its extension
-
memory-indexer
An in-memory full-text fuzzy search indexer
-
ib-matcher
A multilingual, flexible and fast string, glob and regex matcher. Support 拼音匹配 (Chinese pinyin match) and ローマ字検索 (Japanese romaji match).
-
flowmark
A Markdown auto-formatter for clean diffs and semantic line breaks
-
pprint
Flexible and lightweight pretty printing library for Rust
-
yore-cli
Fast document indexer for finding duplicates and searching content
-
twilight-mention
working with mentions in the Twilight ecosystem
-
rhai-autodocs
Custom documentation generator for the Rhai scripting language
-
skyspell
Fast and handy spell checker for the command line
-
mdbook-plotly
An mdbook preprocessor that renders plot code blocks (e.g., ```plot) into interactive or static charts during book build
-
retrofont-cli
CLI for retrofont: Render and convert retro ASCII/ANSI art fonts
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
obsidian-logging
A journaling/logging CLI that stores logs in Obsidian markdown files
-
aico-cli
Scriptable control over LLMs from the terminal
-
readable-name-generator
Generate a readable name for throwaway infrastructure
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
ascfix
Automatic ASCII diagram repair tool for Markdown files
-
uroman
A self-contained Rust reimplementation of the uroman universal romanizer
-
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
qj
A fast, jq-compatible JSON processor powered by simdjson
-
pasta_shiori
SHIORI DLL interface for pasta script engine
-
fetchkit
AI-friendly web content fetching and HTML-to-Markdown conversion library
-
sentencex
Sentence segmentation library with wide language support optimized for speed and utility
-
pomsky
A new regular expression language
-
mdv
Terminal Markdown Viewer
-
assay-mcp-server
MCP server integration for Assay
-
swappy
An anagram generator
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
str-utils
some traits to extend
[u8],strandCow<str> -
litho-book
Litho Book is a modern web documentation reader specifically designed for the Litho (deepwiki-rs) documentation generation engine. It provides…
-
jsongrep
A JSONPath-inspired query language for JSON documents
-
ncount
A word count tool intended to derive useful stats from markdown
-
semtools
Semantic search and document parsing tools for the command line
-
wayland-clipboard-listener
impl wlr-data-control-unstable-v1, listen for clipboard
-
datafusion-functions
Function packages for the DataFusion query engine
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
mdbook-inline-highlighting
mdBook preprocessor that enables support for inline highlighting
-
markdown-harvest
designed to extract, clean, and convert web content from URLs found in text messages into clean Markdown format. Originally created as an auxiliary component for Retrieval-Augmented Generation (RAG)…
-
bulletty
a pretty TUI feed reader (RSS+ATOM) that stores articles locally as Markdown files
-
unindent
Remove a column of leading whitespace from a string
-
asimov-prompt
ASIMOV Software Development Kit (SDK) for Rust
-
fuzzy-aho-corasick
Aho–Corasick automaton with fuzzy matching
-
anda_db_hnsw
A high-performance vector search library in Rust
-
near-facsimile
Find similar or identical text files in a directory
-
mdbook-preprocessor
assist implementing an mdBook preprocessor
-
ferrous-opencc
A pure Rust implementation of Open Chinese Convert (OpenCC), for fast and reliable conversion between Traditional and Simplified Chinese
-
dmos
Djot HTML renderer with advanced features
-
wsl-clip
High-performance clipboard bridge for WSL2
-
vibrato-rkyv
Vibrato: viterbi-based accelerated tokenizer with rkyv support for fast dictionary loading
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
tantivy-common
common traits and utility functions used by multiple tantivy subcrates
-
measured
A better way to measure your application statistics
-
bareun_rs
an unofficial Rust library for Bareun, a Korean morphological analyzer
-
ocr-rs
A lightweight and efficient OCR library based on PaddleOCR models, using the MNN inference framework for high-performance text detection and recognition
-
braillify
Rust 기반 크로스플랫폼 한국어 점역 라이브러리
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
guardrails
Enforce architectural decisions AI coding tools keep ignoring
-
rustdoc-stripper
manipulate rustdoc comments
-
netform_cli
CLI for diffing lossless network configuration IR documents
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
recase
Changes the convention case of input text
-
yangon
A high-performance, stack-allocated string type for Rust with fixed capacity and zero heap allocations
-
kiwi-rs
Ergonomic Rust bindings for the Kiwi Korean morphological analyzer C API
-
cicero-sophia
High-performance NLU (natural language understanding) engine built in Rust for speed, accuracy, and privacy
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
gilt
A fast, rich terminal formatting library — Rust port of Python's rich
-
news-flash
Base library for a modern feed reader
-
terraphim_automata
Automata for searching and processing knowledge graphs
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
type-safe-id
A type-safe, K-sortable, globally unique identifier
-
seeyou-cub
reading and writing the SeeYou CUB binary file format, which stores airspace data for flight navigation software
-
galm
pattern matching library
-
regex-literal
delimited regular expression literals
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
slugomatic
🐌 A simple CLI tool to slugify and unslugify text, perfect for branch names and URLs
-
src2md
Turn source code into a Markdown document with syntax highlighting, or extract it back
-
sbnf
A BNF-style language for writing sublime-syntax files
-
codeix
Fast semantic code search for AI agents — find symbols, references, and callers across any codebase
-
awful_dataset_builder
Build LLM-ready Q/A datasets from reference text-to-question mappings produced by Awful Knowledge Synthesizer
-
pdfrs
A CLI tool to read/write PDFs and convert to/from markdown
-
semantic-edit-mcp
MCP server for semantic code editing with tree-sitter
-
kdex
A fast CLI for indexing and searching code repositories and knowledge bases for AI-powered workflows
-
japanese-codepoints
A high-performance Rust library for Japanese character validation and code point handling based on JIS standards
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
agentroot-mcp
Model Context Protocol server for agentroot - AI assistant integration
-
xrusty
Parse documents and transform using χrust
-
base-d
Universal base encoder: Encode binary data to 33+ dictionaries including RFC standards, hieroglyphs, emoji, and more
-
nuch
A CLI to manage Markdown content and images for Nuxt Content sites
-
unbom
Remove UTF-8 BOM from files
-
rgx-cli
A terminal regex debugger with real-time matching, capture group highlighting, and plain-English explanations
-
minspan
a package for determining the minimum span of one vector within another
-
howmany
A blazingly fast, intelligent code analysis tool with parallel processing, caching, and beautiful visualizations
-
opserve
Opinionated page server
-
giallo-kak
Kakoune syntax highlighter using TextMate grammars
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
md-wiki
Markdown based static site generator for minimal wikis
-
sd
An intuitive find & replace CLI
-
annatto
Converts linguistic data formats based on the graphANNIS data model as intermediate representation and can apply consistency tests
-
cmx
Rust Spectral Color Management Library
-
sed-rs
A GNU-compatible sed implementation in Rust, powered by sd's regex engine
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
alfrusco
building Alfred workflows with Rust
-
svgbob
Transform your ascii diagrams into happy little SVG
-
atlassian-cli
Unified CLI for Atlassian Cloud products
-
unreal-doc
generating documentation from Unreal C++ sources
-
okh-tool
A CLI tool to deal with Open Know-How (OKH) data files. Its main functionalities are: validation of and conversion between the different formats
-
koicore
core KoiLang module
-
isbn
handling ISBNs
-
lipilekhika
A transliteration library for Indian Brahmic scripts
-
unidown
Convert Markdown to Unicode
-
acdc-parser
AsciiDocparser using PEG grammars -
derivre
A derivative-based regular expression engine
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
sile
Simon’s Improved Layout Engine
-
codex
Human-friendly notation for Unicode symbols
-
blurhash
A pure Rust implementation of blurhash
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
asposepdf
Aspose.PDF for Rust via C++ is a powerful toolkit that allows developers to manipulate PDF files directly and helps do various tasks for PDF. Contains unique features for converting PDF to other formats.
-
mandown
Markdown to groff (man page) converter
-
zspell
Native Rust library for spellchecking
-
claude-list
CLI tool for viewing installed plugins, skills, agents, and MCP servers in Claude Code
-
fitsio-pure
Pure Rust FITS file reader and writer
-
patto
🪽 Yet another plain text format for quick note taking and task management
-
word-tally
Output a tally of the number of times unique words appear in source input
-
gulagcleaner_rs
Ad removal tool for PDFs
-
drova_plugins
Main plugins for drova
-
oyster-md
Static site generator for Markdown with bidirectional links and HTML rendering
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
panfix
parsing: linear time parsing of multifix operators
-
whatwg_streams
whatwg_streams for rust
-
tmenu
TUI fuzzy finder
-
ripgrep_all
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
zepub-mini
Minimal crate for writing epubs (in-memory)
-
esri_ascii_grid
reading ESRI Ascii Grid .asc files
-
littrs-ruff-source-file
Vendored ruff_source_file for littrs (from github.com/astral-sh/ruff)
-
reconcile-text
Intelligent 3-way text merging with automated conflict resolution
-
readable-rs
A native Rust port of Mozilla's Readability algorithm for extracting readable content from HTML pages
-
mdcat
cat for markdown: Show markdown documents in terminals
-
cmark-writer
A CommonMark writer implementation in Rust for serializing AST nodes to CommonMark format
-
udataframe_rs
A pure Rust library for data frame operations, particularly useful for processing data extracted from PDF files or OCR recognize
-
kreuzberg-cli
Command-line interface for Kreuzberg document intelligence
-
simsearch
lightweight fuzzy search engine that works in memory, searching for similar strings (a pun here)
-
merge-code
A CLI tool for merging multiple code files into a signle file
-
rhema_contracts_chirho
Shared type-level contracts, newtypes, DTOs, and trait definitions for the Rhema Chirho engine
-
vi
An input method library for vietnamese IME
-
opentalk-types-common-identifiers
Common identifier types for OpenTalk crates
-
blocklet
A cross-platform CLI tool that generates ASCII art using Unicode block characters, similar to figlet but with beautiful solid Unicode blocks instead of outlines or hash symbols
-
etradeTaxReturnHelper
Parses etrade and revolut financial documents for transaction details (income, tax paid, cost basis) and compute total income and total tax paid according to chosen tax residency (currency)
-
asciisavers
A small collection of ascii screensavers
-
blinc_layout
Blinc layout engine - Flexbox layout powered by Taffy
-
ttypr
terminal typing practice
-
txt_to_md
Command converting from a txt file to a markdown file
-
mkulid
A command-line ULID generator — like uuidgen, but for ULIDs
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
mdbook-repl
based mdbook preprocessor that allows you to execute code in your mdbook without any server. Python, Typescript, Javascript etc.
-
terraphim-cli
CLI tool for semantic knowledge graph search with JSON output for automation
-
embeddenator-workspace
Workspace management utilities for embeddenator development
-
syslog_fmt
A formatter for the 5425 syslog protocol
-
reword
some utility functions for human-readable formatting of words
-
hawkeye-fmt
The formatter library for hawkeye cli
-
quixote
Quizzes and tests in Markdown
-
table-grep
A grep-like tool for searching CSV and Parquet table files
-
richrs
port of the Rich Python library for beautiful terminal output
-
shibuichi
zsh prompt preprocessor to add git integration
-
spellcode-cli
Minimal CLI frontend for codebook
-
mdbook-linkcheck2
A backend for
mdbookwhich will check your links for you -
makepad-widgets
Makepad widgets
-
latkerlo-jvotci
Tools for creating and decomposing Lojban lujvo
-
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
instant-segment
Fast English word segmentation
-
smart-patcher
Patcher based on rules
-
zettel-cli
cli app for Luhmann-style Zettelkasten management
-
mdbook-quiz
Interactive quizzes for your mdBook
-
model2vec-rs
Official Rust Implementation of Model2Vec
-
bbd
Binary Braille Dump
-
gatekpr-patterns
Regex pattern registry and pre-built pattern sets for Shopify validation
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
regexr
A high-performance regex engine built from scratch with JIT compilation and SIMD acceleration
-
eloran
Comics and Ebook web library written in rust, with reading, search, reading status, bookmarks
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
mdbook-footnote
mdbook preprocessor for footnotes
-
bloodtree
A hierarchical note-taking system with focus on relationships between nodes
-
tagged-urn
Tagged URN - Flat tag-based identifier system
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
atoxide-export
Export formats for the Ato electronics compiler (netlist, BOM)
-
edgequake-pdf2md
Convert PDF documents to Markdown using Vision Language Models — CLI and library
-
marqant
Quantum-compressed markdown format for AI consumption with 90% token reduction
-
anno
Information extraction for Rust: Named Entity Recognition (NER), coreference resolution, and structured extraction
-
asciidork-backend
Asciidork backend
-
underthesea_core
Underthesea Core
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
kiru
Fast text chunking for Rust
-
mdvalidate
Markdown schema validation engine
-
ssfmt
Excel-compatible ECMA-376 number format codes
-
bmfont_rs
Load/ save/ manipulate BMFont files
-
wrap-ansi
A high-performance, Unicode-aware Rust library for intelligently wrapping text while preserving ANSI escape sequences, colors, styles, and hyperlinks
-
y-octo
High-performance and thread-safe CRDT implementation compatible with Yjs
-
monster-regex
A custom regex spec
-
fop-layout
Layout engine for Apache FOP Rust implementation
-
erg_common
A common components library of Erg
-
deeprl
DeepL client library with all the things (blocking)
-
Inscribe
A markdown preprocessor that executes code fences and embeds their output
-
notion2prompt
CLI tool that converts Notion pages and databases into structured prompts for AI models
-
lo_
A modern Rust utility library delivering modularity, performance & extras ported from JavaScript Lodash
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
fits-io
A pure-Rust FITS file handling library inspired by CFITSIO, focused on safety, clarity, and performance
-
aptu-cli
CLI for Aptu - Gamified OSS issue triage with AI assistance
-
arborium-cli
Command-line syntax highlighter powered by arborium
-
turndown-cdp
Convert CDP-style DOM nodes to Markdown
-
antex
Styled text and tree in terminal
-
rmeow
A command line tool that aims to be a replacement for cat/bat with better highlighting
-
hy-mt
A lightweight machine translation inference library for Tencent Hunyuan MT models
-
lychee-lib
A fast, async link checker
-
unidoc
Unite all Markdown
-
tokstream-cli
CLI token stream simulator using Hugging Face tokenizers
-
markdown-org-extract
CLI utility for extracting tasks from markdown files with Emacs Org-mode support
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
cwe-data
Request CWE data offline
-
roe
Unicode case conversion
-
shaperglot
Test font files for OpenType language support
-
no-crlf
A CLI tool to convert CRLF line endings to LF in text files
-
string_pipeline
A flexible, template-driven string transformation pipeline for Rust
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
opml
OPML library for Rust
-
gibberish-or-not
Figure out if text is gibberish or not
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
zp
Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program
-
yaml-include
A lib and a CLI for recursively parsing "!include" data in yaml files
-
asimov-imap-module
ASIMOV module for IMAP email import
-
fdn
uniformly change file or directory names and support rollback these operations
-
writ
A hybrid markdown editor combining raw text editing with live inline rendering
-
pretty-console
A fluent, zero-cost API for styling terminal text with colors and attributes
-
ruma-events
Serializable types for the events in the Matrix specification
-
pdf_tables
Scrape text from tables in PDF files
-
rdocx-oxml
WordprocessingML XML element types for OOXML
-
kanpyo
Japanese Morphological Analyzer
-
mdbook-angular
mdbook renderer to run angular code samples
-
docanvil
A Rust-based static documentation generator that converts Markdown into HTML sites
-
dom-content-extraction
Content extraction via text density paper
-
my-notes
note taking app for taking hierarchical notes in markdown
-
secular
No Diacr!
-
pathmut
Command line utility for manipulating path strings
-
typub-ir
Semantic IR types for typub
-
claude-kb-cli
CLI for generating, validating, and searching Markdown knowledge bases under a .claude/ hierarchy
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
gazenot
Gaze Not Into The Abyss, Lest You Become A Release Engineer
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
threeway_merge
Git-style 3-way string merging using proven algorithms from libgit2/xdiff. Statically links xdiff (LGPL-2.1+).
-
libphext
A rust-native implementation of phext
-
rustidy-ast-literal
Rustidy formatter
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
hanja_hangul
that converts Chinese characters to Korean characters. That is, convert hanja to hangul
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
lgtmeow
🐾 —— 「本喵觉得很不错~」
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
leptos-sync-components
Leptos components for synchronization UI
-
xim-ctext
compound text en/decoder
-
search-text
A fast and flexible command-line tool to recursively search for text or regex patterns in files under a directory
-
ascii-img2-cli
ASCII image generation CLI
-
tendril
Compact buffer/string type for zero-copy parsing
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
swc-plugin-inferno
SWC plugin for InfernoJS
-
rust_metrics
Incremental evaluation metrics for various machine learning pipelines
-
pure-tui
A modern terminal-based word processor for Markdown and other structured text documents
-
ADA_Standards
help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity
-
text_io
really simple to use panicking input functions
-
quickctx
A bidirectional file content aggregator and extractor that converts between files and markdown-formatted representations
-
fuzzytail
A modern, colorful tail replacement with split-pane log monitoring
-
ascii-fmt
CLI tool to fix and align ASCII diagrams generated by AI agents
-
wdl-doc
Documentation generator for Workflow Description Language (WDL) documents
-
capns
Core cap URN and definition system for FGND plugins
-
lazy-grep
A high-performance, line-oriented command-line tool for searching text with regular expressions
-
forbidden-bands
8-bit string handling library
-
tu
CLI tool to convert a natural language date/time string to UTC
-
academic-journals
managing journal abbreviations and full names
-
vibequest
A vibe-coded scripting language focused on developer happiness with a REPL implementation in Rust
-
laurus
Unified search library for lexical, vector, and semantic retrieval
-
ndg-commonmark
Flavored CommonMark processor for Nix-related projects, with support for CommonMark, GFM, and Nixpkgs extensions
-
omry-search
Search-related abstractions for the Omry project
-
go22dos
go to todos
-
latexmk-diff-head
LaTeX compilation tool that generates diff PDFs against Git commits
-
artificial
Typed, provider-agnostic prompt-engineering SDK for Rust
-
linestats
Group similar text lines and compute numeric statistics
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
aki-xtee
copy standard input to each files and standard output
-
swc_ecma_transformer
Compatibility layer for the ECMAScript standard
-
aasvg
Convert ASCII art diagrams to SVG with automatic light/dark mode support
-
gllm
Pure Rust library for local embeddings, reranking, and text generation with MoE-optimized inference and aggressive performance tuning
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
embed-src
Embed source files into any text file
-
rep-grep
wgrep/write-grep CLI
-
wikipedia_prosesize
Count Wikipedia prose size
-
rst
a reStructuredText parser and renderer for the command line
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
cssbox-test-harness
WPT test runner for cssbox layout engine
-
kataru
YAML-based dialogue engine for game development
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
quickmd
Quickly preview a markdown file
-
cascii-core-view
Core frame display and animation library for ASCII art viewers
-
easymark
Lightweight Markdown rendering utility that just works
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
kiri-engine
Core Rust engine for Kiri Japanese morphological analyzer
-
wetext-rs
Text normalization library for TTS, Rust implementation of WeText
-
vlazba
Lojban words generator and analyzer
-
feedyourai
combine text files for AI processing with flexible filtering options
-
constellate
Rust-powered CLI + live editor for curated markdown workspaces (requirements, docs, ADRs, audits, support, status-driven tasks) that build/serve/CRUD a themeable knowledge portal from a single binary
-
torudo
A terminal-based todo.txt viewer and manager with TUI interface
-
spellabet
Convert characters into spelling alphabet code words
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
noil
file explorer using text buffers
-
identifier_safety
Unicode confusable character detection and canonicalization
-
blitztext
fast keyword extraction and replacement in strings
-
dspy-rs
A DSPy rewrite(not port) to Rust
-
pyohwa-search
Search index builder for Pyohwa static site generator
-
text-to-ascii-art
program to convert text to ASCII art
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath. -
booky
analyze English text
-
skimtoken
Fast token count estimation library
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
whitespace-sifter
Sift duplicate whitespaces away!
-
nanofts
High-performance full-text search engine in Rust
-
uvie
Ultra fast Vietnamese input method engine (Telex, VNI)
-
ntcip
National Transportation Communications for ITS Protocol
-
mistral_ocr_gui
GUI tool for Mistral OCR - convert documents to Markdown using Mistral AI
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
pipefog
Stream-structured data obfuscator for JSON/YAML/CSS
-
regect
A cli tool to quickly test regular expressions
-
kfst-rs
Fast and portable HFST-compatible finite-state transducers
-
opencc-fmmseg
High-performance OpenCC-based Chinese conversion using FMM (Forward Maximum Matching) segmentation
-
commitbot
A CLI assistant that generates commit and PR messages from your diffs using LLMs
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
synaptic-e2b
E2B code execution sandbox for Synaptic
-
kdl-xml
XML<->KDL conversion
-
docbox-storage
Docbox storage layer abstraction
-
fsqlite-ext-json
JSON1 functions and virtual tables
-
greppy-cli
Sub-millisecond semantic code search and trace with AI reranking (Claude/Gemini/Ollama)
-
ccase
Command line interface to convert strings into any case
-
quant1x
Cross-language standard library for quantitative trading
-
facet-singularize
Fast, no-regex English singularization for the facet ecosystem
-
adc-lang
Array-oriented reimagining of dc, a terse RPN esolang
-
ultra-nlp
A NLP library
-
mq-hir
High-level Internal Representation (HIR) for mq query language
-
grift_unicode
Unicode character operations for the Grift Scheme language
-
catbus
A Wayland IME for multilingual text input
-
bear-query
A read-only Rust library for querying the Bear note-taking app's SQLite database with minimal interference
-
opusmeta
reading and writing metadata to opus files
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
plsfix
Text cleaner upper
-
trpl
A support crate for The Rust Programming Language book
-
md-crdt
Conflict-free replicated data types for collaborative markdown editing
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactorsub-command. Compile BibTeX files to legacythebibliographyTeX code using thecompilesub-command… -
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
flatpage
file system based markdown flat page
-
strs_tools
Tools to manipulate strings
-
datafusion-spark
DataFusion expressions that emulate Apache Spark's behavior
-
ucp-cli
Command-line interface for Unified Content Protocol
-
mdbook-wordcount
Word count for mdbook, inspired by the mdbook tutorial
-
distrs
PDF, CDF, and percent-point/quantile functions for the normal and Student’s t distributions
-
linkup
Automatically add links to Markdown files
-
re-x
AI-native regex CLI — Test, validate, explain. Built for coding agents.
-
mdka
HTML to Markdown converter
-
lopdf-parang
A fork of lopdf optimized for PDF text extraction — lazy streams, O(1) object slicing, zlib-rs
-
rustpress
增量编译倒分页无后端 Rust 纯静态博客程序
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
oxford_join
Join string slices with Oxford Commas!
-
ggemtext
Glib-oriented Gemtext API
-
redact-ner
Named Entity Recognition for PII detection using ONNX Runtime
-
dw2md
Crawl a DeepWiki repository and compile all pages into a single, LLM-friendly markdown file
-
string_wizard
manipulate string like a wizard
-
airs-memspec
Streamline AI-assisted development with Multi-Project Memory Bank management and GitHub Copilot integration
-
blogr-cli
A CLI static site generator for blogs
-
nu_plugin_emoji
a nushell plugin called emoji
-
mdbook-exercises
An mdBook preprocessor for interactive exercises with hints, solutions, and test execution
-
chunk
The fastest semantic text chunking library — up to 1TB/s chunking throughput
-
office2pdf
Convert DOCX, XLSX, and PPTX files to PDF using pure Rust
-
markex
Fast, non-validating markup element extractor (Tag Element, MdRef, MdCodeBlock, MDSection)
-
iregex
Intermediate representation for Regular Expressions
-
dbxcase
Dropbox-compatible case-folding algorithm
-
mask-pii
A lightweight library to mask PII (Personally Identifiable Information) like emails and phone numbers
-
rwkv-tokenizer
A fast RWKV Tokenizer
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
htmd-cli
The command line tool for htmd
-
tre-regex
Rust safe bindings to the TRE regex module
-
hub-codegen
Multi-language code generator for Hub plugins from Synapse IR
-
xml-3dm-cli
3DM XML Tree Differencing and Merging Tool CLI
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
pdfvec
High-performance PDF text extraction library for vectorization pipelines
-
sara-core
Core library for Sara - Requirements Knowledge Graph CLI
-
typst-count
Count words and characters in Typst documents
-
hexout
A compact and dependency-free, flexible and customizable hex dump library for Rust that provides beautiful, configurable binary data visualization
-
bangumi-api
An api implementation for Bangumi website
-
url_encor
A lightweight library to encode and decode special characters in urls
-
runmunch
hunspell's unmunch tool for expanding dictionary words using affix files
-
readability
Port of arc90's readability project to rust
-
mago-casing
converting strings between various common casing styles
-
atog
ascii to greek - prints greek letters given latin alphabets as input
-
twincat
ADS tools
-
cosmic-text-tessera-fork
Pure Rust multi-line text handling
-
lumin
searching and displaying local files
-
css_lexer
A spec-compliant CSS tokenizer with zero-copy cursors and optional feature gates
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
rust-canto
Convert Chinese characters to Jyutping (粵拼)
-
simplematch
Fast wildcard pattern matching for strings and bytes with a simple api
-
mdbook-qr
An mdBook preprocessor that generates a QR code using fast_qr
-
mdpdf
A fast CLI tool to convert Markdown files to PDF
-
rob_test_sagebox_integration_001
Internal test crate for validating Sagebox packaging and README rendering. Not intended for public use.
-
ai-translator
基于 AI 的多语言文本翻译工具,支持自定义提示词
-
newsfresh
CLI and library for querying, filtering, and analyzing GDELT Global Knowledge Graph (GKG) v2.1 data — the world's largest open news event dataset
-
unimorph
Command-line interface for UniMorph morphological data
-
sedx
A safe, modern replacement for GNU sed with automatic backups, preview mode, and rollback
-
litedoc-cli
Command-line tool for parsing and validating LiteDoc documents
-
fast_symspell
Spelling correction & Fuzzy search
-
rosetta-aisp
Bidirectional prose ↔ AISP symbolic notation conversion based on the Rosetta Stone mappings
-
telegram-markdown-v2
Transform regular Markdown into Telegram MarkdownV2 (parse_mode = MarkdownV2)
-
engish
A language utility for sampling and building words
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
streplace
A tiny library for matching and replacing in strings and slices with user-defined functions
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
fonts
High-performance font parsing and analysis library for Grida Canvas
-
pii
PII detection and anonymization with deterministic, capability-aware NLP pipelines
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
lexir
Lexical IR (BM25/TF-IDF) on top of postings lists
-
zipcodes
Query US zipcodes without SQLite
-
worf-launcher
Wayland application launcher inspired by wofi, rofi, and walker. Written in Rust with GTK4, supporting multiple modes (math, drun, file, ssh, run, emoji, search, auto), modern theming, and high performance.
-
zalo
A code highlighter giving the same output as VSCode
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
newline_normalizer
Zero-copy newline normalization to \n or \r\n with SIMD acceleration
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
mds
A skim-based
*.mdexplore and surf note-taking tool -
md-scatter
split up and reassemble markdown files
-
nu-command
Nushell's built-in commands
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
phd
an esoteric gopher server
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
fkys-rs
F*cking Kill Yourself lang interpreter written in Rust
-
vidyut-prakriya
A Sanskrit word generator
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
zh_num
Convert ASCII numbers and zh words
-
bibleref
Structures and functions for managing Bible references
-
cro_stem
A lightning-fast, zero-dependency Croatian stemming library written in Rust
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
runefix-core
Unicode character display width engine supporting CJK, emoji, and grapheme clusters
-
r2md
Entire codebase to single markdown or pdf file
-
document_tree
reStructuredText’s DocumentTree representation
-
case
A set of letter case string helpers
-
bhc-lexer
Lexical analysis for BHC
-
ragit
git-like rag pipeline
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
gitfluff
Commit message linting tool with presets, custom formats, and cleanup automation
-
adobe-cmap-parser
parse Adobe CMap files
-
hayro-syntax
A low-level crate for reading PDF files
-
indent_write
Write adapters to add line indentation
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
genedex
A small and fast FM-Index implementation
-
diagnostic
Pretty diagnostic report
-
autosurgeon
working with data in automerge documents
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
plot3d
reading, writing, and manipulating NASA PLOT3D structured grids
-
byteutils
that provides a collection of frequently used utility functions for working with bytes, strings, and vectors. It includes common tasks such as converting between strings and byte arrays…
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}in text with their respective value. It can also be used as a rust library -
codabase
Polyglot development tool for markdown-defined data types
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
md-formatter
A fast, opinionated Markdown formatter
-
rsword_chirho
Core SWORD module library in pure Rust
-
el_roi
simplify reading user input
-
tphrase
A translatable phrase generator
-
wordcutw
A C-interface wrapper for Wordcut - a Lao/Thai word segmentation/breaking library
-
fm
Non-backtracking fuzzy text matcher
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
connected-papers
client for Connected Papers integrated with Semantic Scholar utilities
-
yosina
Japanese text transliteration library
-
mnemoria
Persistent, git-friendly memory storage for AI agents with hybrid semantic + full-text search
-
docx_mcp_rust
A Rust-based MCP (Model Context Protocol) server for creating and manipulating DOCX files
-
postcode_extractor
extract and identify postcodes
-
swc_xml_visit
Visitor for xml
-
mantra-miner
your software recite mantras while it runs
-
tsuki
Lua 5.4 ported to Rust
-
mdbook-assets-hash
mdbook preprocessor that adds content-based cache-busting hashes to asset filenames
-
markon
A lightweight Markdown renderer with GitHub styling and Medium-like annotations
-
litsea
extreamely compact word segmentation and model training tool implemented in Rust
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
liblevenshtein
Levenshtein/Universal Automata for approximate string matching using various dictionary backends
-
unicode-ccc
Unicode Canonical Combining Class detection
-
autotex
Continuously compile TeX and LaTeX
-
srt2txt
Convert SRT subtitle files into clean plain text (strip timestamps, tags, merge lines)
-
herring-automata
Automata construction for Herring
-
hanconv
Convert between Chinese characters variants
-
zeitgrep
Find frecent results in git repositories using regular expressions
-
misaki-rs
A self-contained, POS-aware Grapheme-to-Phoneme (G2P) engine for Rust, optimized for TTS models like Kokoro
-
xhtml_parser
Non-validating XHTML Tree-based parser
-
omg-idl-code-gen
OMG IDL to Rust code generator library
-
mdbook-mermaid-ssr
mdbook preprocessor to add mermaid support with server-side rendering
-
kaff_sso
Small-buffer-optimized generic buffer and UTF-8 string type
-
fencecat
Walkdir cat with markdown fenced code output
-
seonbi
Korean text arrow/quote/punctuation processor
-
csvpretty
A command-line tool that formats CSV input into tables with Unicode box-drawing characters
-
slugrs
A fast, locale-aware slugify library for Rust
-
furigana
Map furigana to a word given its reading
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
quranize
Encoding transliterations into Quran forms
-
rust-port-kill
A cross-platform command-line tool to manage listening ports and processes
-
rsxiv
Tools for working with arXiv and the arXiv API
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
syllabize-es
Syllabize Spanish text, and much more
-
synaptic-slack
Slack message loader for Synaptic
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
socorro-cli
CLI tool for querying Mozilla's Socorro crash reporting system, optimized for LLM coding agents
-
zpl_toolchain_core
Core parser, emitter, and validator for ZPL II label code (part of the zpl-toolchain project)
-
kiroku-tui
terminal-based personal journaling and note-taking tool
-
copyit
A cross-platform clipboard tool similar to pbcopy/pbpaste
-
fabryk-fts
Full-text search infrastructure for Fabryk (Tantivy backend)
-
ankit-builder
TOML-based Anki deck builder with .apkg generation and AnkiConnect import
-
mq-markdown
Markdown parsing and manipulation utilities for mq
-
rsigma-eval
Evaluator for Sigma detection and correlation rules — match rules against events
-
streamdown
A streaming markdown renderer for modern terminals (Rust port of Streamdown)
-
neo4j_cypher
A flexible and intuitive query builder for Neo4j and Cypher
-
mdsh
Markdown shell pre-processor
-
ib-pinyin
一个高性能拼音查询、匹配库
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
typdiff
A diff tool for Typst documents, similar to latexdiff
-
comically
fast manga & comic optimizer for e-readers
-
deencode
Reverse engineer encoding errors
-
genpdf
User-friendly PDF generator written in pure Rust
-
armnod
random string generator
-
lil-tabby
A macro-based library for creating visually appealing tables with automatic column spanning
-
legalis-eu
European Union jurisdiction support for Legalis-RS (GDPR, Consumer Rights, Competition, Treaties)
-
mini-calc
A Fully-Featured Configurable (mini) Rust Calculator
-
encoding-next
Character encoding support for Rust
-
pgf2json
Application Programming Interface to load and interpret grammars compiled in Portable Grammar Format (PGF). The PGF format is produced as a final output from the GF compiler. The library…
-
teip
Masking tape to help commands "do one thing well"
-
glyphana
Quickly find, inspect & collect unicode glyps
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
dwg-core
Deterministic Writing Guard core analysis engine for spotting AI-styled prose
-
picodiff
Tiny GUI app to compare text easily
-
mdbook-numbering
A mdBook preprocessor that adds numbers to headings and code block lines (for mdbook 0.5.0 and above)
-
kmdbook
Creates a book from markdown files
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
substring
method for string types
-
normalized-line-endings
Line endings normalizer
-
qrcode2pdf
Render barcodes (QR Codes, Aztec, Data Matrix, etc) using rxing into a krilla Surface (PDF)
-
loc
Count lines of code (cloc) fast
-
lethe-core-rust
High-performance hybrid retrieval engine combining BM25 lexical search with vector similarity using z-score fusion. Features hero configuration for optimal parity with splade baseline…
-
recursive-file-loader
recursively load files via references in the files
-
basic-text
Basic Text strings and I/O streams
-
langram
Natural language detection library
-
cron_clock
A cron expression parser and schedule explorer. Rich documentation and case studies and related upper-level libraries are available.
-
anaso_site_api_models
API models for Ana.so
-
autonomo-arabic-reshaper
Arabic text shaper + visual RTL reverser tailored for game modding (RimWorld, Unity LTR UIs). Handles tags, escapes, entities, and common UI artifacts.
-
tibco_ems
A high level API for the Tibco EMS
-
codefmt
a markdown code block formatter
-
opentalk-types-signaling-meeting-notes
Signaling types the OpenTalk meeting-notes module
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
ere
A compile-time alternative for POSIX extended regular expressions
-
rust-regex-dsl
Regular expression DSL
-
aqp3
Congress.gov legislation text query syntax parser
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
datadriven
Rewritable table-driven testing
-
edit_core
Dependency-free text editing core for terminal and GUI editors
-
rascii_art
Advanced ASCII Art Generator
-
mdbook-termlink
mdBook preprocessor that auto-links glossary terms throughout documentation
-
serb_stem
A high-performance Serbian stemming library supporting both Cyrillic and Latin scripts (Ekavica)
-
mdbook-git
Insert git commit files and diffs into mdbook
-
zepub
epub、mobi电子书读写
-
mdbook-bibtex
Adds support for bibliographies to mdbook. Backed by hayagriva, supporting all CSL citation styles.
-
rspack_plugin_case_sensitive
rspack case sensitive plugin
-
xid
Globally unique sortable id generator. A Rust port of https://github.com/rs/xid.
-
netform_diff
Deterministic diff engine and reporting primitives for lossless network config IR
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ,ŋandê. -
codesearch
A fast, intelligent CLI tool with multiple search modes (regex, fuzzy, semantic), code analysis, and dead code detection for popular programming languages
-
cpf_cnpj
Validador de CPF e CNPJ para Rust
-
async-utf8-decoder
Convert AsyncRead to incremental UTF8 string stream
-
atomic-plus
type extensions for the atomic standard library
-
lumen-cli
The AI-Native Programming Language
-
date_time_parser
Rust NLP library for parsing English natural language into dates and times
-
tars-bin
A small, fast, static site generator
-
p2panda-discovery
Confidential topic and node discovery protocol
-
dnd-character
A Dungeons and Dragons character generator
-
ht32-panel-daemon
Daemon with web UI for HT32 panel control
-
autofoam
related tools
-
parsanol
High-performance PEG parser library with packrat memoization and arena allocation
-
spel-right
A fast and lightweight spell checker and suggester
-
caco3
common lib
-
unicode-matching
match Unicode open/close brackets
-
nobom
remove UTF-8 BOM (Byte Order Mark) from stdin and write to stdout
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
codebook-lsp
A code-aware spell checker with language server implementation, installable via cargo install
-
patiencediff
algorithm
-
neofile
linux file command for windows. PATCH 4.
-
rustdoc-md
Convert Rust documentation JSON into clean, organized Markdown files
-
mdbook-indexing
mdbook preprocessor for index generation
-
text-similarity-metrics
A high-performance Rust library for computing text similarity using multiple algorithms
-
dprint-development
Helper functions for testing dprint plugins
-
searcher_txt
A copy of grep that I made to show that im bad at rust
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
fsqlite-ext-fts5
FTS5 full-text search extension
-
arxiv-cli
CLI to download papers from arXiv
-
normy
Ultra-fast, zero-copy text normalization for Rust NLP pipelines & tokenizers
-
sk-skimmer
Fuzzy Finder in rust!
-
steelkilt
Draft 0.4 RPG rule set
-
oak-pretty-print
Syntax highlighter supporting multiple programming languages
-
clima
A minimal Markdown reader in the terminal
-
ttf_word_wrap
Wraps text based on character width
-
palpad
A really simple static site generator
-
printwell-cli
Command-line tool for HTML to PDF conversion
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
text_trees
textual output for tree-like structures
-
rstring
A comprehensive set of string manipulation utilities inspired by Apache Commons Lang3 StringUtils
-
mdiew
A lightweight macOS markdown viewer with live reload
-
laser-pdf
programmatic PDF generation with precise, predictable layout control
-
string-offsets
Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines
-
markless
A terminal markdown viewer with image support
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
agentai
designed to simplify the creation of AI agents
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
erebus
A CLI message generation library
-
lister-cli
Lister: Navigate Markdown Lists
-
human_regex
A regex library for humans
-
sff
SemanticFileFinder (sff): Fast semantic file finder using sentence embeddings. Searches .txt, .md, .mdx files.
-
mdtrans
Markdown parser and transformer using
pest.rs, focused on flexibility to a project’s needs -
papers-core
CLI and MCP server for academic paper search, management, and local RAG
-
rag-cli
Local-first RAG CLI powered by candle for semantic search over your files
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
chatpack-cli
CLI tool for parsing and converting chat exports into LLM-friendly formats
-
latinga
High-performance, Zero-Copy Uzbek Cyrillic-Latin transliterator
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
pdflens-mcp
An MCP server for reading PDFs, coded by human, designed for AI
-
pdf2pwg
Single purpose A4 page renderer rendering PDF using pdfium to PWG/URF
-
fileslug
Filename-aware slug generator — slugifies file and directory names while preserving extensions, dotfiles, and version numbers
-
renderdag
An ASCII or Unicode renderer for directed acyclic graphs
-
markdown-translator
A translation library with DeepLX API integration, rate limiting, and smart text chunking
-
tbll
tbll outputs data in tabular format
-
memchr-rs
Fast memchr and memchr2 implementations in Rust
-
madoru
markdown task runner
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
syntaxfmt
A derive macro-based library for flexible syntax tree formatting with pretty printing support
-
saytify
greeting and farewell messages
-
xifetch
neofetch alternative written in Rust
-
graphrag-cli
Modern Terminal User Interface (TUI) for GraphRAG operations
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
unicode-language
detect language coverage given a list of codepoints
-
tortilla
Somewhat syntax-aware text wrapping for source code and plain text documents
-
serpscraper
A CLI tool to fetch and convert search results into Markdown
-
acroform
High-level PDF form manipulation library using lopdf
-
mdref
Markdown Reference finding and migration tool
-
zen-rs
generating non-interactive content like cards or files
-
onig-regset
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
ucm-engine
Transformation engine for the Unified Content Model
-
brk_string_wizard
manipulate string like a wizard
-
ttlint
Small, fast utility to lint text
-
ttf2woff2
A Pure Rust library and CLI for compressing TTF fonts to WOFF2 format
-
filenamify
Convert a string to a valid filename
-
redactor
Secure PDF redaction library with Type3 font support using MuPDF
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen, giscus and many other apps
-
mdsql
SQL queries for markdown tables
-
sleipnir
Memory safe font operations for Google Fonts
-
rascii_art_img
Advanced ASCII Art Generator. Fork for imgii.
-
fast-slice-utils
Highly optimized slice utilities using SIMD instructions when available
-
uv-requirements-txt
internal component crate of uv
-
omnix-common
Common functionality for omnix frontends
-
mdtablefix
mdtablefixunb0rks and reflows Markdown tables so that each column has a uniform width. When the--wrapoption is used, it also wraps paragraphs and list items to 80 columns. -
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
ungoliant
The pipeline for the OSCAR corpus
-
caseify
A CLI tool to convert strings between different cases
-
easyeditor
Easy Markdown Editor
-
dedoc
Terminal-based viewer for DevDocs documentation
-
typub-core
Shared capability types and content model for typub
-
CompactPrefixRadix
a minimalistic but efficient radix tree implementation with extra prefix support
-
anthropic-text-editor
A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool
-
likewise
A diff library for Rust (fork of similar)
-
ansic
does ansi parsing in a dynamic DSL and at compile time for efficient and zero cost ansi styling
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
iati-types
Core data types for IATI Activity v2.03: Activity, Transaction, Money, TxType, etc
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
ncp-matcher
plug and play high performance fuzzy matcher
-
codemem-mcp
MCP server for Codemem (JSON-RPC 2.0 over stdio)
-
markov_strings
A simplistic Markov chain text generator
-
spyne
All-in-one toolkit
-
zeroten-denote
Handle denote name scheme
-
COXave
Instruments for codings
-
grapheme_machine
Grapheme cluster text segmentation (UAX #29) state machine for streaming input
-
synaptic-together
Together AI integration for Synaptic
-
colonnade
format tabular data for display
-
odtgen
Flat ODT writer
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
stenotype
Machine stenography primitives
-
genpdf-json
PDF generator using JSON data
-
chord3
Create pdf songbooks from chopro source
-
shoco
port to Rust, a fast compressor for short strings
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
pyohwa-core
Core engine for Pyohwa static site generator — config, markdown, rendering, and build pipeline
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
linkcheck2
extracting and validating links
-
difference-rs
text diffing and assertion library
-
mdbook-replace
mdBook preprocessor that simply replaces text
-
koto_test_utils
Testing utilities for the Koto programming language
-
dodo-zh
operate on pinyin and cedict.u8 file
-
himmelblau_red_asn1
A little library to encode/decode ASN1 DER
-
madskills
The toolchain for madskilling: lint, format, and wrangle Agent Skills like you mean it
-
doryen-rs
Pure rust OpenGL accelerated roguelike console API with native/wasm support
-
monochora
gif to ascii art converter written in rust
-
json_to_table
pretty print JSON as a table
-
agentic-tools-registry
Unified tool registry aggregating all agentic-tools domain registries
-
matchr
A fast fuzzy matcher library written in Rust for use in CLI tools and TUI apps
-
sourcefile
Retain mapping information when concatenating source files, to make error messages more useful
-
wdl-lint
Lint rules for Workflow Description Language (WDL) documents
-
dec_from_char
Small library for converting unicode decimal into numbers
-
alphabet_detector
Natural language alphabet detection library
-
iregex-automata
Finite automata definitions for the
iregexcrate -
caseless
Unicode caseless matching
-
docxide-pdf
CLI and Library for converting DOCX files to PDF, matching Microsoft Word's output as closely as possible
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
orgflow
managing documents with support for tasks and notes
-
pdfgen
PDF rendering library
-
memcached-async
Asynchronous memcached protocol parser
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
blame-rs
Line-by-line authorship tracking for revisioned text
-
sigrs
Interactive grep (for streaming)
-
yamake
yet another make tool
-
colourss
parsing CSS color strings into RGB values
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
ident_case
applying case rules to Rust identifiers
-
hemoglobin-search
Hemoglobin search utilities
-
dictionary_bitmap_trie
Bitmap trie backed autocomplete and search
-
streamdown-parser
Streaming markdown parser for streamdown
-
ystd
An opinionated and batteries included
stdmirror for convenient, correct code and pleasant error messages -
pelagic
Small command parsing primitives for CLI tools and text interfaces
-
rblog
Blog app
-
soon
Predict your next shell command based on history — like shell autocomplete, but MORE stupid
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
anycase
a case conversion library for Rust
-
qmd
Lightweight SOTA local search engine for AI agents in Rust
-
invisible-characters
A list of invisible characters
-
seam
Symbolic Expressions As Markup
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
treebender
An HDPSG inspired symbolic NLP library for Rust
-
mdbook-findrep
mdBook find / replace preprocessor
-
agentroot
Fast local semantic search for codebases and knowledge bases with AI-powered features
-
sedregex
Sed-like regex library
-
textprep
Text preprocessing primitives: normalization, tokenization, and fast keyword matching
-
aki-gsub
substitude text command, replace via regex
-
sigye-background
Background animations for sigye terminal clock
-
e9571_str_lib
string processing, sorting, counting, and hashing
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
escaping
configurable string escaping and unescaping
-
ezemoji
Catigoryized Emoji's
-
finetype-core
Core taxonomy and data generation for FineType
-
proper-sort
Small crate for natural sorting of strings that include number and size data
-
pandoc_types
Rust port of pandoc-types
-
zz-data
Data structures for Zanzarah apis
-
rlex
A cursor-based, utf-8 Vec<char> lexer
-
sema-stdlib
Standard library (350+ native functions) for the Sema programming language
-
latentdirichletallocation
Latent Dirichlet Allocation (LDA) using collapsed Gibbs sampling for topic modeling
-
emoji-remover
A fast command-line tool to remove emojis from source code files
-
ox_content_search
Full-text search engine for Ox Content
-
textcon
Template text files with file/directory references for AI/LLM consumption
-
pdfium-auto
Auto-download and cache PDFium binaries — zero-friction setup for pdfium-render
-
aurora-semantic
Local embedded semantic search engine for source code, designed for IDE integration
-
reinhardt-forms
Form handling and validation
-
bullet_stream
Bulletproof printing for bullet point text
-
univiz
A command-line tool for analyzing Unicode strings, providing detailed information about graphemes, code points, and UTF-8 byte sequences
-
efr
efr (EFile Rust)
-
lib-bcsv-jmap
reading and writing BCSV/JMap format used for Wii and GC games, including Super Mario Galaxy
-
untangle
Module-level dependency graph analyzer for Python, Ruby, Go, and Rust
-
mudssky_utils
A comprehensive Rust utility library providing common functionality for everyday programming tasks
-
ohos-ime
Bindings to the
inputmethodAPI of OpenHarmony -
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
lumis
Syntax highlighter powered by Tree-sitter and Neovim themes
-
markdown-readtime
estimate reading time for Markdown content
-
mdbook-langtabs
An mdbook preprocessor that adds language tabs for code blocks
-
mdbook-environment
A preprocessor for MdBook for working with environment variables
-
unsafe-tools-mimic
Size and alignment matched opaque types
-
ucp-llm
LLM-focused utilities for the Unified Content Protocol
-
xerg
Ultra-fast grep implementation in Rust - built for maximum speed with direct output and parallel processing
-
udpipe-rs
Rust bindings for UDPipe - a trainable pipeline for tokenization, tagging, lemmatization and dependency parsing of CoNLL-U files
-
bump-bin
Increments version with semver specification
-
unicode-writing-script
The package provides detection of writing scripts from Unicode codepoints
-
streampager
pager for command output or large files
-
ucfirst
Uppercase the first letter of a string
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
snips
Keep code snippets in markdown files in sync
-
codebase-to-prompt
bundling text files like code to single file
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
pukram2html
converting Pukram-formatted text to HTML
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
crlf-to-lf-inplace
Fast in-place CRLF to LF line ending conversion for Rust strings. Uses memchr for good performance without custom SIMD.
-
typed-oid
Typed Object IDs
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
devek
CLI for copying HTML to clipboard
-
wcount
CLI word counting tool
-
scrunch
full-text-searching compression
-
wkhtmlapp
Convert html to pdf or image
-
gitbook2text
A CLI tool to download GitBook pages and convert them to markdown and text
-
kreuzberg-paddle-ocr
PaddleOCR via ONNX Runtime for Kreuzberg - high-performance text recognition
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
paperless-api-client
Paperless-ngx API client
-
ascii_table_rs
Elegant ASCII table renderer for Rust CLI and terminal apps
-
bogrep
Full-text search for bookmarks from multiple browsers
-
ucf
A universal code formatter
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
pspp
Statistical analysis software
-
husk-lexer
Lexer for the Husk programming language
-
trump
Spawn a background thread that prints a Donald Trump quote every 10 seconds
-
ruchydbg
ML-powered debugger for Ruchy with SBFL fault localization
-
sqlite-english-stemmer
auxiliary library used for normalizing English words and identifying some special characters
-
cosmic-text
Pure Rust multi-line text handling
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
deucalion
High-performance Windows library for capturing decoded FFXIV packets
-
illuminate-string
A comprehensive Rust library for advanced string manipulation and processing
-
polished_scancodes
handling and mapping keyboard scancodes in Rust
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
reggy
friendly, resumable regular expressions for text analytics
-
vectradb-chunkers
Chunking utilities for VectraDB in Rust
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
metatron
core library
-
goose-eggs
in writing Goose load tests
-
gspell
Rust bindings for gspell
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
uresamp
HIFI uresamp delivers ultrasonic-fidelity text resampling via adaptive 64-bit floating-point spectral mapping, preserving Unicode 32-bit codepoint integrity with zero-phase distortion
-
pager2
pipe your output through an external pager
-
typub-passes
Semantic IR passes for typub
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
typwriter
compiling, formatting, and watching Typst documents, with PDF metadata and permission management
-
cockpitctl-render
Deterministic markdown and annotation rendering for cockpitctl reports
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
wasm-word-char-count
Count word and chars in TXT/MD/PDF in the frontend very fast
-
ripdoc
Query Rust docs and crate API from the command line
-
tetratto-markdown
Markdown rendering for Tetratto
-
rustybook-utilities
lib for RustyBook
-
liber
creating (sync/async) EPUB files
-
luau-lexer
A lexer for the luau language
-
tarzi
Rust-native lite search for AI applications
-
fastui-cosmic
Pure Rust multi-line text handling
-
kiters
timestamps, request IDs, and external IDs
-
frawk
an efficient Awk-like language
-
mdbook-mermaid-mmdr
A mdbook preprocessor that renders mermaid diagrams using mermaid-rs-renderer
-
numaelis-rckive-genpdf
User-friendly PDF generator written in pure Rust
-
maddi-recipe
parsing and scaling markdown recipes
-
awk-rs
A 100% POSIX-compatible AWK implementation in Rust
-
gremlh
A CLI tool to find and fix invisible 'gremlin' characters (homoglyphs, zero-width spaces, Bidi overrides) in source code
-
spellchk
A blazingly fast spellchecker CLI for any text file
-
upid
Universally Unique Prefixed Lexicographically Sortable Identifier
-
flo_rope
An attributed and streaming implementation of the rope data structure
-
content-canonical
Content canonicalization and text normalization library
-
nova-cite
Smart citation management with CrossRef/Zotero integration
-
docbox-secrets
Docbox secret management abstraction
-
ansi-width
Calculate the width of a string when printed to the terminal
-
opentalk-types-signaling-legal-vote
Signaling types for the OpenTalk legal vote module
-
yy1
Tiny utility to convert KiCad centroid files into Neoden YY1 pick and place machine format
-
matchy-paraglob
Glob pattern matching with Aho-Corasick for matchy (internal)
-
hexdump
Easy hexdump to stdout or as an iterator
-
duca
Search and read Dante's Divine Comedy from your terminal
-
vader-sentimental
A faster Rust version from the original Python VaderSentiment analysis tool
-
retrogress
Progress bars with a thin API around complex features
-
css_recess_order
Recess-based sort order for CSS properties
-
asimov-brightdata-module
ASIMOV module for data import powered by the Bright Data web data platform
-
learner
learning stuff
-
dm-scan
Document tree scanner for markdown files
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
folderwalk
Folder walking tool
-
commit_crafter
AI powered tool for Git commit message generator
-
traverze
CLI for full-text search built on Tantivy and Lindera
-
okh-scraper
A scraper of Open Source Hardware (OSH) projects. based on the Open Know-How (OKH) standard
-
libappindicator-zbus
zbus implement for libappindicator
-
xarray
version of the XArray with copy-on-write capabilities
-
semantic-query
AI-powered schema validation with automatic JSON generation for type-safe responses
-
readability-js
wrapper for Mozilla's Readability.js library
-
ansi-align
Text alignment library with ANSI escape sequence and Unicode support
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
zen-expression
Zen Expression Language
-
yara-x-fmt
A code-formatting library for YARA rules
-
inlet_manifold
A general purpose highlighting library
-
scraps_libs
Scraps is a portable CLI knowledge hub for managing interconnected Markdown documentation with Wiki-link notation
-
merge-engine
A non-LLM merge conflict resolver using structured merge, Version Space Algebra, and search-based techniques
-
rustflexstack
ETSI C-ITS GeoNetworking and BTP protocol stack in Rust
-
jdpub
Annotate source documents with Japanese readings and definitions
-
skyspell_kak
skyspell - kakoune integration
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
quot
A fast and flexible command-line tool that converts text input into escaped string literals
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
artifacts-rs
Rust client for Artifacts
-
grapheme-cluster-utils
Unicode-safe String manipulation utilities by extended grapheme cluster (UAX #29)
-
sketchir
Sketching primitives for IR: minhash/simhash/LSH-style signatures
-
indent
Functions for indenting multiline strings
-
sortuniq
Find or count unique values in an input stream
-
unicount
Alphabetic counter supporting unicode
-
xee-xpath
XPath 3.1 library API
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
jsrmx
command-line tool to manipulate JSON files. It can split large single-object JSON files into many files; merge multiple JSON files into one large JSON file; bundle multiple JSON files into one NDJSON file…
-
nib
static site generator
-
exine
Universal Markdown extraction engine. 37+ formats, zero external dependencies, 10-96× faster than Pandoc.
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
gazetta-render-ext
A static site generator framework. Extra render code.
-
amdm
Rust client for amdm.ru with Russian lyrics stress marking and meter analysis
-
waken_snowball
Snowball stemming algorithms for 33 languages
-
ticker-sniffer
extracting multiple stock ticker symbols from a text document
-
yuuang-test-napi
N-API bindings
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
aclneko
caitsith policy abstract
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
mono
Mono repository automation toolkit
-
ygrep-core
Core library for ygrep – fast, local, indexed code search
-
tergo-formatter
Formatter for tergo
-
hlight
dedicated to delivering exceptional syntax highlighting capabilities
-
like
A SQL like style pattern matching
-
basalt-tui
Basalt TUI application for Obsidian notes
-
hemoglobin
Bloodless
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
kathoey
text feminization using open corpus linguistics data
-
nonsense
Lorem ipsum placeholder text generator with clipboard integration
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
trailfix
Trim trailing whitespace and ensure single newline at EOF
-
brk_rolldown_plugin_vite_html
brk fork of rolldown_plugin_vite_html
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
nsys-curses-utils
Rust *curses utilities
-
domrs
Document builder and serializer
-
aster-a2ui
A2UI (Agent-to-User Interface) protocol implementation for Aster
-
langextract-rust
extracting structured and grounded information from text using LLMs
-
smoltok-core
Byte-Pair Encoding tokenizer implementation in Rust
-
mdbook-markdown
Markdown processing used in mdBook
-
flxy
Full-text searching and scoring of strings
-
ebg
Eric's Blog Generator, a simple static site generator
-
hexxy
all over the place!
-
large-text-core
Core library for handling large text files search and replace efficiently
-
journey-cli
A CLI-based journal application with automatic timestamping, vault management, and Obsidian integration
-
dm-index
Index and changelog generator for documentation trees
-
gh-emoji
Convert
:emoji:to Unicode using GitHub’s emoji names -
markdown-peek
Markdown previewer in browser and terminal
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
rfgrep
Advanced recursive file grep utility with comprehensive file type classification - search, list, and analyze 153+ file formats with intelligent filtering and safety policies
-
chonkie
🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library
-
mdbook-typst-pdf
mdbook typst pdf backend
-
modeling
tools to analysis different languages by Ctags
-
egui_text_animation
text animation using egui
-
drova_sdk
Sdk for absolute converter of formats for dalet
-
shift_or_euc
Detects among the Japanese legacy encodings
-
parserst
A recursive-descent reST parser and renderer
-
llm-utl
Convert code repositories into LLM-friendly prompts with smart chunking and filtering
-
logappend
Execute child process, read from stdin and stderr, emit into files, truncate at given total content sizes
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
regex-charclass
Manipulate and convert regex character classes
-
reason-shell
Reason: A Shell for Research Papers
-
case_clause
case clause macro for rust
-
jawk
JSON AWK
-
skillsrs-registry
Registry management for skills.rs - MCP server skill registration and discovery
-
anda_nexus
Anda -- an AI Nexus powered by the Knowledge Interaction Protocol (KIP)
-
bitutils2
A package of tools for bit manipulations, including bit indexing, bitfields, and a variation of regular expressions for binary data
-
utf16_lit
macro_rules to make utf-16 literals
-
moonwave
generating documentation from comments in Lua source code
-
yamth
Markdown To HTML, A fast Markdown to HTML converter with live reload
-
qmd-cli
CLI for qmd - lightweight SOTA local search engine for AI agents
-
memvid-ask-model
LLM inference module for Memvid Q&A with local and cloud model support
-
fmd
Find Markdown files by metadata - Search by tags, frontmatter, and custom fields
-
komito
A fast, reliable semantic commit message validator and version bumper with gitmoji support
-
easy-regex
Make long regular expressions like pseudocodes
-
rtl_typepad
A minimal RTL typepad GUI for Persian/Arabic text with automatic Unicode bidi isolate injection for technical tokens
-
spanned
string processing with file/line/col information and the regular rust
strAPI -
mongodb-gridfs
Mongo GridFS
-
mdbook-renderer
assist implementing an mdBook renderer
-
bash-builtins
implement loadable builtins for bash
-
legalis-fr
French jurisdiction support for Legalis-RS (Code civil, Code de commerce, Code du travail)
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
fontheight
Find out the vertical extents your font reaches on shaped words
-
legalis-in
India jurisdiction support for Legalis-RS - comprehensive modeling of Indian law
-
ps-hash
Generates 64-byte ascii hashes with 256 bits of security
-
twitter_text_config
Configuration for twitter-text in Rust
-
sphinx-rustdocgen
Executable to extract rustdoc comments for Sphinx
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
afrim-translator
Manage the predication system of the afrim input method
-
linebreak
breaking a given text into lines within a specified width
-
krilla-rxing
Render barcodes (QR Codes, Aztec, Data Matrix, etc) using rxing into a krilla Surface (PDF)
-
riimut
Transform latin letters to runes & vice versa
-
timeblok
A language for event scheduling in plain text
-
mdbook-jupyter
enables using jupyter notebooks in mdbook
-
subslay
Text → emoji 💅🏻 Powered by Rust
-
luciferous-case-converter
A CLI tool to convert text between different cases
-
asciidork-eval
Asciidork eval
-
mecrab
A high-performance, thread-safe morphological analyzer compatible with MeCab, written in pure Rust
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
fast-cat
An ergonomic macro for efficient string concatenation with a single memory allocation
-
unicode-charname
functions for retrieving Unicode character name properties as described in Unicode Standard Annex #44
-
md_check
Check markdown link validity
-
hyphertool
Hypertool is a command-line tool for syllabification and hyphenisation
-
crate2bib
Create BibLaTeX entries for crates hosted on crates.io
-
capec-data
Request CAPEC data offline
-
emoji
Every emoji, their metadata, and localized annotations
-
edit-without-waiting
Open a file in the default text editor
-
companies-house-api
consuming various APIs from Companies House
-
beautiful-md
A CLI tool to format and beautify Markdown files
-
doxx
Terminal document viewer for .docx files
-
ik-rs
chinese segment, ik-analyzer for rust
-
pretty_regex
🧶 Elegant and readable way of writing regular expressions
-
nerdfetch-rs
A minimal, fast system fetch tool - rewritten in Rust for speed, clarity, and configurability
-
thulp-skill-files
SKILL.md file parsing and loading for Thulp
-
eco
reasoning about breaking changes in Rust ecosystems
-
aho-corasick
Fast multiple substring searching
-
clipboard-stream
Async stream of clipboard change events
-
docloom
Programmatically compose documents and render them to Markdown or styled terminal output
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
rsticle
Treat source files as articles / narrative documentation
-
regextrie
High-performance Rust library designed for matching a large number of regular expressions against a given input string
-
gramdex
k-gram / trigram indexing primitives for approximate string matching
-
mecab-ko-dict
한국어 형태소 사전 관리 - 바이너리 포맷, FST 검색, 연접 비용
-
mdbookkit
Support library for mdBook preprocessors in the mdbookkit project
-
tokmd-cockpit
Cockpit PR metrics computation and rendering for tokmd
-
spider_agent_html
HTML processing utilities for spider_agent — cleaning, content analysis, and diffing
-
awful_news_vibes
Daily news meta-analysis pipeline with AI-powered clustering and D3 visualizations
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
ezstr
A String wrapper supporting negative indexing with grapheme indexing for slices and Regex::find_iter and Regex::find
-
basen
Convert binary data to ASCII with a variety of supported bases
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
kiri-kotoba
Input text processing for Kiri Japanese morphological analyzer
-
ponsic-winsafe
The dependency of the
ponsiccrate -
markdown-to-ansi
Render Markdown as ANSI-formatted terminal text
-
alass-util
convenience API for subtitle synchronization with alass-core
-
fontcull-font-types
Scalar types used in fonts. (Vendored fork for fontcull)
-
mdi
markdown include
-
text-editing
string with utilities for editing
-
thesaurus
An offline thesaurus library for Rust
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
yara-x-parser
A parsing library for YARA rules
-
md-to-incodoc
Convert markdown to incodoc
-
terraphim_hooks
Unified hooks infrastructure for Terraphim AI - knowledge graph-based text replacement and validation
-
typing_engine
A typing game engine for Japanese and English
-
markov
A generic markov chain implementation in Rust
-
md2pdf-rs
A CLI tool to convert Markdown to PDF using Typst
-
charx
A replacement for char::is_ascii*
-
dm-meta
YAML frontmatter parser and validator for technical documentation
-
sonai_metrics
Text metrics for sonai
-
vibrato
viterbi-based accelerated tokenizer
-
cleanpy
Python 3.12+ 最佳实践知识库 - MCP 服务器
-
rawgrep
Grep at the speed of raw disk
-
finding
command line finding tool
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
vidyut-lipi
A Sanskrit transliterator
-
dirdocs
Generate terse, one-sentence documentation for files; plus Nushell-style directory viewers (dls, dtree) that surface .dirdocs.nu descriptions
-
img2epub
Convert images to EPUB
-
rrename
" Opinionated tool to rename files in batch. Match regular expression, replace some characters I consider noise to kebab case
-
term_grid
formatting strings into a grid layout
-
rheo
A typesetting and static site engine based on Typst
-
sniffer-rs
that simplifies fuzzy string matching in rust
-
shifter
A CLI tool to rotate lines from stdin by n positions
-
rtranslate
dependency-free Rust wrapper for Google Translate public web API
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
cin
that simplifies command-line input in Rust, especially mimicking C++-style input
-
gemini-map
A command-line tool to run files in parallel through Google Gemini
-
speedreader
A command-line speed reading tool
-
files-to-text-cli
Files to text cli
-
sesters
💱 Fast, offline currency converter 💴 💷 💶 💵
-
cfd16-assembler
An assembler backend for the CFD-16 ISA
-
csvpp
Compile csv++ source code to a target spreadsheet format
-
pups_text
Pretty Understandable Parsers
-
diffutils
A CLI app for generating diff files
-
keep-a-changelog
generating and manipulating CHANGELOG.md files that use the Keep A Changelog format
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
gxter
A parsing library for creating and reading GTA 3/VC/SA GXT (text string) files
-
unimorph-cli
Command-line interface for UniMorph morphological data
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
yozuk
Chatbot for Programmers
-
okane-golden
supporting Golden Testing
-
rzozowski
A regex crate using Brzozowski derivatives
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
joyful
Generate delightful, random word combinations - Rust port of the joyful TypeScript library
-
cloc
Count, or compute differences of, lines of source code and comments
-
anno-lib
Information extraction for Rust: Named Entity Recognition (NER), coreference resolution, and structured extraction
-
markdown-toc
Markdown Table of Contents generator
-
kd-rust
A crystal clear command-line dictionary
-
bpe-match
A pattern matching library for BPE tokenization, intended to replace regex-based approaches
-
include-preprocessor
Tooling for C preprocessor style include directives
-
mdriver
Streaming markdown printer for the terminal with syntax highlighting
-
gtk4cmark
A GTK4 widget for rendering CommonMark content
-
enma
serving anime and manga information 📦
-
tetratto-shared
Shared stuff for Tetratto
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
tdoc
assorted CLI tools for working with FTML (Formatted Text Markup Language) documents
-
bm25-vectorizer
A minimal Rust library for creating sparse vector representations (embeddings) using the BM25 algorithm for information retrieval
-
enum-ts
TypeScript Enum pattern matcher codegen
-
docki
cli for building and publishing documentation using asciidoctor
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
diffy-imara
Tools for finding and manipulating differences between files
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
mdbook-fs-summary
Summary generator for mdbook
-
fmtt
A diff-friendly text formatter that breaks lines on sensible punctuations and words to fit a line width
-
lindera-ko-dic
A Korean morphological dictionary for Ko-Dic
-
ojcmp
online judge comparer
-
pdfcat
Concatenate PDF files into a single document
-
lexical-sort
Sort Unicode strings lexically
-
skp-validator-rules
Built-in validation rules for skp-validator
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
colored_text
adding colors and styles to terminal text
-
diamond-types
The world's fastest text CRDT
-
text-document
Text document structure and management
-
dash-em
Enterprise-Grade Em-Dash Removal Library — SIMD-Accelerated String Processing
-
miku_best_friend_printer
Contains a MikuPrinter that can produce an ASCII art of Miku - your best friend - from the music video 'Monitoring (Best Friend Remix)' by DECO*27
-
ucd-general-category-ranges
Unicode character ranges by general category
-
mdbook-linkcheck
A backend for
mdbookwhich will check your links for you -
rsnltk
Rust-based Natural Language Toolkit
-
ipa-translate
translating between IPA and ASCII text
-
tectonic
A modernized, complete, embeddable TeX/LaTeX engine. Tectonic is forked from the XeTeX extension to the classic “Web2C” implementation of TeX and uses the TeXLive distribution of support files.
-
grep-app-cli
CLI for grep.app — search code across 1M+ public GitHub repos
-
cistring
A string type that preserve case, but compares insensitiveley
-
catfx
Apply effects to a text stream
-
chatter
Translate a human-readable dialogue script into Rust
-
mdbook
Creates a book from markdown files
-
paltoquet
rule-based general-purpose tokenizers
-
grapheme
Abstractions for working with extended Unicode grapheme clusters
-
simple-xlsx-writer
memory efficient XLSX writer
-
gpui_rn_util
structs and functions (fork for React Native GPUI)
-
cn-font-split
划时代的字体切割工具,CJK与任何字符!支持 otf、ttf、woff2 字体多线程切割,完美地细颗粒度地进行包大小控制。A revolutionary font subetter that supports CJK and any characters…
-
regex-specificity
A heuristic-based crate to calculate the specificity of a regular expression pattern against a specific string
-
byteforge
A next-generation byte-level transformer with multi-signal patching and SIMD optimization
-
tuicam
Terminal-based camera with switchable modes
-
wg-ragsmith
Semantic chunking and RAG utilities for document processing and retrieval-augmented generation
-
directwrite
A safe abstraction for interacting with DirectWrite, intended initially to be used with direct2d for easy text rendering
-
sansaccent
Convertit les chaînes françaises en slugs URL-friendly en supprimant accents et caractères spéciaux
-
rustkernel-orderbook
RustKernels Orderbook domain kernels
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
vincent_minigrep
Project from the rust book
-
retrofont
Retro terminal font toolkit: TDF/FIGlet parsing, rendering, conversion
-
dingtalk
Robot Util, Send text/markdown/link messages using DingTalk robot, 钉钉机器人
-
regex_generate
Use regular expressions to generate text
-
re_view_text_log
A view that shows text entries in a table and scrolls with the active time
-
toolstr
formatting data into String's
-
text-parsing
Hierarchical text processing preserving char position info
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
tag_index
searchable index over key-value tags
-
nstd_core
NSTD core crate
-
cgp-dispatch
Extensible data type dispatchers for CGP handlers
-
basic64
A straightforward implementation of base64
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
skill-tree
generate graphviz files to show roadmaps
-
memchunk
The fastest semantic text chunking library — up to 1TB/s chunking throughput
-
smallgrep
A Lite version of a CLI tool grep made with rust
-
mdbook-tiny
Use mdbook to generate tiny and fast static sites
-
unicode-width-16
Determine displayed width of
charandstrtypes according to Unicode Standard Annex #11 rules -
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
mdplayscript
An extension of Markdown for play scripts
-
terraphim-repl
Offline-capable REPL for semantic knowledge graph search
-
viddy
A modern watch command
-
fsays
flavored replacement for the classic cowsay
-
aki-mcolor
mark up text with color
-
magic-regexp
creating regular expressions with ease
-
hunspell-rs
Rust bindings to the Hunspell library
-
ucp-translator-markdown
Markdown translator for UCP
-
r-matrix
Rust port of cmatrix
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
xgrammar-rs
Efficient, Flexible and Portable Structured Generation for Rust - Rust bindings for XGrammar
-
ilyvion-util
Collection of utility functions and types for use in my personal projects
-
typos-cli
Source Code Spelling Correction
-
minigrep_pack
A small clip clone of minigrep
-
ya
YAML parser in pure Rust
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
vtashkov-bf
Brainfuck interpreter
-
rustling
A blazingly fast library for computational linguistics
-
prometheus-32bitfix
Prometheus instrumentation library for Rust applications
-
mdbook-selfpath
A preprocessor for mdbook that allows self-referencing paths
-
zed-collections
Standard collection type re-exports used by Zed and GPUI
-
colored-hexdump
Create beautifuly colored hexdumps
-
htmls
parsing HTML and extracting HTML elements or text
-
genpdfi
User-friendly PDF generator written in pure Rust
-
subtitles
A command line programme to generate SRT format subtitles for use in video players
-
mdbook-image-attrs
An mdbook preprocessor for adding attributes to images
-
matrix-ui-serializable
Opinionated abstraction of the matrix-sdk crate with serializable structs
-
lsp-ty
type definitons for LSP
-
filecheck
writing tests for utilities that read text files and produce text output
-
glow_glyph
A fast text renderer for glow, powered by glyph_brush
-
merge3
merge tool for three-way merges
-
rustpython-parser-vendored
RustPython parser vendored third-party crates
-
is_printable
Determine whether a given text-based value is printable
-
vn-settings
Various settings intended to simulate visual novels
-
dictator-frontmatter
Markdown frontmatter decree for Dictator structural linter
-
lex-analysis
Semantic analysis for the lex format
-
doxygen-bindgen
Converts Doxygen comments into Rustdoc markdown
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
synaptic-middleware
Middleware system for Synaptic agents: AgentMiddleware trait and built-in middlewares
-
notidium
Developer-focused, local-first note-taking with semantic search and MCP integration
-
patchlib
Tooling for working with patch files
-
u8lit
Custom literal to convert strings to UTF-8 bytes
-
gdnative-doc
Documentation tool for gdnative
-
ranked-searcher
Search inside text files using tf-idf formula, showing the most relevant search at the top
-
webdog
static site generator fit for a dog
-
bstr
A string type that is not required to be valid UTF-8
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
yekdast
normalizing and cleaning up Persian (Farsi) text
-
minislug
A tiny, dependency-free slugifier that turns any &str/String into a safe cross-platform filename
-
sbom-diff
diff engine and cli for sbom comparison
-
scan-rules
some macros for quickly parsing values out of text. Roughly speaking, it does the inverse of the print!/format! macros; or, in other words, a similar job to scanf from C.
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
popgetter-py
Python bindings for popgetter
-
opentalk-report-generation
OpenTalk report generation functionality
-
mpdf
writing PDF files
-
markdown2json
Reads a markdown file or directory of markdown documents and emits a structured JSON
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
ised
An interactive tool for find-and-replace across many files
-
latex
An ergonomic library for programatically generating LaTeX documents and reports
-
oak-highlight
A lightweight syntax highlighter for Rust with support for multiple programming languages and customizable themes
-
animated-emojis-rs
Noto Animated emojis
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
hangeul
Korean alphabet manipulation library
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
cadi-scraper
CADI Scraper/Chunker utility for converting source code repos and file data into reusable CADI chunks
-
fortune-rs
classic BSD fortune program
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
rag-cli-cuda
CUDA-accelerated build of rag-cli — local semantic search powered by candle + NVIDIA GPU
-
feedr
feature-rich terminal-based RSS/Atom feed reader written in Rust
-
biometrics
provide the vitals of a process in the form of counters, gauges, moments, and T-digests
-
next-plaid-cli
Semantic code search powered by ColBERT
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
mdbook-godbolt
A preprocessor for mdbook to add runnable code snippets via Godbolt
-
getch-rs
getchfor Windows and Unix -
opstr
‘Operate on strings’ command line utility
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
futf
Handling fragments of UTF-8
-
taboc
A table of contents generator for markdown documents
-
matchy-literal-hash
O(1) exact string matching via memory-mapped hash tables (internal)
-
askalono-cli
detect the contents of license files
-
json_keyquotes_convert
convert JSON from and to JSON without key-quotes
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
mdbook-lint-rulesets
Modular rulesets for mdbook-lint - standard and mdBook-specific linting rules
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
ngram_rs
Facilitate creating ngrams in Rust to be used in the polars plugin
-
cli-boxes
Unicode box drawing characters for creating beautiful CLI interfaces
-
savagestr
SAVAGE string encoder/decoder. If can encode or decode by specifying the code page or the encoding name, it works, else it uses the savage way to encode/decode by using
String::from_utf8_lossy() -
unicode-vo
Unicode vertical orientation detection
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
chaf
A CLI filter tool that removes lines based on logical conditions using AND, OR, NOT, and parentheses
-
streaming_serde_json
streaming alternative to serde_json for processing JSON data sources that are too large to fit in memory
-
heavy-pdf-parser
Extract text from PDF files with support for multiple output formats
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
human-time-cli
A command-line tool for converting time durations to human-readable formats, built using the
human-timecrate -
tracery
Text-expansion library
-
ada-idna
Fast IDNA (Internationalized Domain Names in Applications) implementation in Rust
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
qpprint
console printing/formatting
-
swc-formatjs-visitor
formatjs custom transform visitor for SWC
-
attuned-infer
Fast, transparent inference of human state axes from natural language
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
numeronym-cli
A CLI tool that generates numeronym abbreviations from text
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
rhema_module_chirho
Self-contained SQLite module format (.rhema) for distributing Bible modules
-
oyster-lib
Markdown processing library with querying, bidirectional links, and HTML rendering
-
ansi-escape-sequences
High-performance Rust library for detecting, matching, and processing ANSI escape sequences in terminal text with zero-allocation static regex patterns
-
purlu
A full-text search engine
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
bmo-to-md
Dump a bugzilla bug to markdown, with attachments unziped
-
screenplay-doc-parser-rs
Tools to parse Screenplay-formatted documents into semantically-typed structs
-
artificial-openai
OpenAI backend adapter for the Artificial prompt-engineering SDK
-
vestalia
Vestaboard API wrapper
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
duvet
A requirements traceability tool
-
unobtanium-segmenter
A text segmentation toolbox for search applications inspired by charabia and tantivy
-
prettychars
Unicode text styling and named glyph lookup with zero runtime overhead
-
imperative
Check for imperative mood in text
-
waterui-text
Text and typography components for WaterUI
-
neo-mime
Strongly Typed Mimes
-
justify
plaintext while handling Unicode gracefully
-
msbwt2
multi-string BWT query library
-
cai
User friendly CLI tool for AI tasks
-
onig_sys
onig_syscrate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
webgrep
grep the web: a full-browser-spec search-focused ultra-simple way to read the web without having to leave the terminal
-
codump
A straightforward and flexible code/comment dump tool
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
izihawa-tantivy-stacker
term hashmap used for indexing
-
doc_loader
A comprehensive toolkit for extracting and processing documentation from multiple file formats (PDF, TXT, JSON, CSV, DOCX) with Python bindings
-
yara-x-cli
A command-line interface for YARA-X
-
text-scatters
A cut-up technique generator from text and ebook files in the terminal
-
bk-tree
A Rust BK-tree implementation
-
almanaculum
Core types and traits for analysis
-
seams
High-throughput sentence extractor for Project Gutenberg texts with dialog-aware detection
-
coverio
Better code coverage reporting for Rust crates
-
frankensearch-lexical
Tantivy BM25 full-text search integration for frankensearch
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
jrl
Journaling terminal app that prompts you questions from time to time when opening a new terminal and allows you to rate, describe and take notes of your day, as well as view past entries
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
char-ranges
Iterate chars and their start and end byte positions
-
utf-64
The next-generation text encoding standard using 64 bits per character
-
mat-o-viewer
A modern terminal file viewer combining cat, less, and grep with syntax highlighting and markdown rendering
-
libharu
Rust binding for libharu(http://libharu.org/) PDF library
-
viks
vim-like key crate
-
adabraka_util
A collection of utility structs and functions for Adabraka GPUI (originally from Zed - github.com/zed-industries/zed)
-
choco
markup language for dialogue systems
-
maytrix-value
Core value types for Maytrix
-
utf8next
function for getting the next character and its length in bytes from a string
-
fuse-rust
Fuse is a super lightweight library which provides a simple way to do fuzzy searching. Fuse-Rust is a port of Fuse-Swift, written purely in rust
-
zine
opinionated tool to build your own magazine
-
pullup
Convert between markup formats
-
justcsv
CSV-file reader/writer
-
eggbug
Bot library for cohost.org
-
mailidator
A lightweight Rust library for checking email address misspellings
-
markdown-extract
Extract sections of a markdown file
-
artificial-types
Reusable prompt fragments and helper types for the Artificial prompt-engineering SDK
-
mdless
A terminal-based markdown file viewer
-
talos_messenger_actions
Messenger actions for Talos
-
mq-run
Command-line interface for mq Markdown processing tool
-
mupdf-basic-text-extractor
Basic structured text extraction using mupdf-rs
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
icondata_zi
Zi icon set for the icondata family of crates
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
libretranslate
A wrapper for the LibreTranslate web API
-
rexify
trait based matcher alternative to Regex
-
codebase-to-markdown
convert codebase to markdown format
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
mcat
a powerfull extended cat command, to cat all the things you couldn't before
-
scanlex
lexical scanner for parsing text into tokens
-
turndown-core
Core Markdown AST and serialization for turndown
-
tidyvcf
command-line tool to convert VCF files to tab/comma separated tables
-
ari
ari. the extras library.
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
wdpe
WebDynpro Parse Engine
-
memkb
A local AI in memory RAG for markdown files in a folder exposed as MCP
-
expression_parser
minimal Rust parser and evaluator for arithmetic expressions with variables, built using the pest parsing library. It converts a text formula into an abstract syntax tree (AST) and…
-
opencc-jieba-rs
High-performance Chinese text conversion and segmentation using Jieba and OpenCC-style dictionaries
-
livre
type-safe PDF parser
-
tergo-tokenizer
R language tokenizer
-
streamdown-plugin
Plugin system for streamdown extensibility
-
stylish-style
Internal implementation details of
stylish-core -
did-toolkit
spec-compliant implementation of did-core, W3C's 'Decentralized Identity Documents'
-
rpdfium-text
Text extraction for rpdfium
-
assemblage_db
Distributed Document/Graph DB for Connected and Overlapping Pages
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
create_broken_files
Create broken files from other ones
-
jq-rs
Run jq programs to extract data from json strings
-
lre
Rust bindings for the regular expression library from QuickJS
-
codetypo
Source Code Spelling Correction
-
ob
A Blog and RSS system written in Rust
-
carlotk
The main library for Carlo, a simple interpreted programming language
-
winsfs-cli
Site frequency spectrum estimation based on window expectation-maximisation algorithm
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
tmpltr
Template-based document generation CLI
-
gigagei
random quote fetching console utility
-
rustyink
Blazing fast static site generator
-
penmanship
A Unicode character lookup library for converting text patterns to Unicode characters
-
phaier_markdown
A markdown parser and renderer
-
swc_ecma_regexp_ast
AST definitions of ECMAScript regular expressions
-
prettyless
Wadler-style pretty-printing combinators in Rust with more features
-
ironstorm_lookup
Lightning fast lookup table for auto completion, type ahead, suggestion engines
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
topfew
CLI to find high frequency occurrences in structured text files
-
defuddle-rs
extracting main content and metadata from HTML web pages
-
aneurysm
Brainf**k interpreter
-
mioffice-pdf-utils
Lightweight PDF utilities — merge, split, extract pages, read metadata. Pure Rust, no native dependencies.
-
filename-refactor
Command to refactor file names
-
pinyin2ch
converting Chinese Pinyin to Chinese characters with various levels of detail
-
rjot
A minimalist, command-line jotting utility that's fast, private, and git-friendly
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
md2adf
Markdown to Atlassian Document Format converter
-
mdbook-blox
A preprocessor for mdbook to add figures and admonishment-like blox
-
papers-mcp
MCP server for academic paper search, management, and local RAG
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
rivus-core
rivus tools
-
man
Generate structured man pages
-
bbcode-tagger
BBCode tree parser and tagger
-
arborium-theme
Theme support for arborium syntax highlighting
-
hns
Human numeric sorting program — does what
sort -his supposed to do! -
quake_text
Utils for Quake strings and characters
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
roff-cli
Skillful man page to JSON/Markdown converter - human readable, AI-friendly
-
arabic_pdf_to_text
A CLI tool to convert Arabic PDFs to text using Google's Gemini API
-
unicode-rs
A comprehensive Unicode character library for Rust applications with theme support
-
path2regex
Express style path to RegExp utility
-
xee-ir
Xee intermediate representation and compilation to bytecode
-
semire_core
An extension to my former semire_read crate now with more functionality
-
mdbook-translator
A translation preprocessor plugin for mdBook that automatically translates documents using the DeepSeek API
-
safe-regex-compiler
Regex compiler for the safe-regex crate
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
lightgrep
A fast, ergonomic grep-like tool in Rust
-
see-cat
A cute cat(1)
-
minigrep_mm
minigrep app from rust-book with public lib methods
-
bbx
A robust, performant BBCode pull parser
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
binyl
A bitwise UTF-8 string inspection tool
-
tessera-embeddings
Multi-paradigm embedding library: ColBERT, dense, sparse, vision-language, and time series models
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
ik-mini-epub
Minimal async IK to EPUB downloader | Extremely minimal
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
gpui_util
A collection of utility structs and functions used by Zed and GPUI
-
spcfile
.spc file format reading and writing
-
mdbook-driver
High-level library for running mdBook
-
ripmors
encoding and decoding international Morse code and several variants
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
microslop
Turn your text into beautifully chaotic, glitchy, Wandoze-level slop
-
smt-str
working with SMT-LIB strings in Rust
-
cwc
A word counter utility that properly handles CJK and Unicode text
-
furze
finite state transducers (fst) writen in rust
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
typope
Pedantic source code checker for orthotypography mistakes and other typographical errors
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
puppet-fmt
Automatic code formatter for puppet manifests
-
inkline
Display colorized ASCII art and images directly in the terminal
-
markdowndown
acquiring markdown from URLs with smart handling
-
simple_peg
A command-line peg parser implemented by Rust
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
opendal-util
OpenDAL
-
finetype-model
Candle-based transformer model for FineType
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
dossiers
home for your specs, policies, and process docs
-
ib-unicode
Unicode utils
-
lzy-codec
一種變長文本編解碼方案,支持對Unicode進行編解碼。編解碼效率、存儲空間全面優於UTF-8,未來會替代UTF-8成為新的世界通用編解碼標準。
-
leptos_forge_ui_components
Components you can use in your leptos_forge application
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
pinzh
converting Pinyin to Zhuyin (Bopomofo)
-
unified-diff
GNU unified diff format
-
static-lang-word-lists
Runtime decompressed statically-included word lists
-
undoc-cli
CLI for undoc - Microsoft Office document extraction
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
mdbook-plugin-utils
mdBook plugins
-
mdvalidate-utils
functions for mdvalidate
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
ricat
A Rust-Based implemenation of classic UNIX
catcommand -
agnostic-levenshtein
Levenshtein distance for ASCII or Unicode strings
-
ethan-rs-wc
The ethan-rs-ws(erwc) is word, line, character, and byte count. Like wc command but not just wc command, more accurate and faster. Text can also be read from standard input for statistics.
-
copier
A bidirectional file content aggregator and extractor that converts between files and markdown-formatted representations
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
mdzk
Plain text Zettelkasten based on mdBook
-
markdown-live-preview
A live Markdown preview server with WebSocket and HTML rendering
-
reedy
A terminal-based RSS reader with a clean TUI interface
-
grapheme-utils
Handy utils for working with utf-8 [unicode] Extended Grapheme Clusters
-
hmd
Custom Markdown Engine for my personal blog
-
hina
:]
-
ranting
Linguistic formatting placeholder extensions for rust
-
rawk-core
Core library for the AWK interpreter
-
topiary-queries
tree-sitter query files compatible with Topiary
-
html-index
Generate an HTML index
-
detone
Decompose Vietnamese tone marks
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
lookbook
Component preview framework for Dioxus
-
akiaki
A good old fashioned wiki engine with a flat-file database
-
enso-lazy-reader
An efficient buffered reader
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
code-to-pdf
Generates a syntax-highlighted PDF of your source code
-
sayit
String replacements using regex
-
rulet
figlet implementation
-
block-id
generating opaque, unique, and short string values from (unsigned) integers
-
textgridde-rs
dealing with Praat TextGrid files. MIT licensed.
-
regex-chunker
Iterate over the data in a
Readtype in a regular-expression-delimited way -
squishyid
Shorten and obfuscate IDs
-
xml1
sane, non compliant xml parser without allocations
-
truecase
Restore correct letter casings in arbitrary text using a statistical model
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
kreuzberg-tesseract
Rust bindings for Tesseract OCR with cross-compilation, C++17, and caching improvements
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
pulldown_typst
A pull parser for Typst markup
-
compression-prompt
Fast statistical compression for LLM prompts - 50% token reduction with 91% quality retention
-
tiefdownlib
manage and convert TiefDown projects
-
atrium-api
API library for AT Protocol (Bluesky)
-
lindera-dictionary
A morphological dictionary library
-
detect-newline-style
Determine a string's preferred newline character
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
ox_content_parser
High-performance Markdown parser for Ox Content
-
controlled-option
Custom Option type with explicit control over niches and memory layout
-
raw-string
RawStr/RawString are like str/String, but without any guaranteed encoding
-
calculator-tui
A command-line calculator with symbolic math support
-
wrapr
wrap your code for ai
-
rustruut
Text-to-IPA converter and phonetic translator for Rust, powered by the Goruut phonemization engine
-
parlib
A barebones parser combinator library, inspired by Haskells parsec
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
redact-core
Core PII detection and anonymization engine - Presidio replacement
-
catalog-of-markdown
Generate the catalog of markdown file
-
quillmark-typst
Typst backend for Quillmark
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
arabic-script
An expressive API for the characters of the Arabic script
-
asciidoctor-client
A kludge to improve the performance of static site generators that use asciidoc through its cli
-
rustme
Automatically generate Rusty READMEs
-
wildcard-trie
A space-efficient router / radix-trie that supports wildcards
-
kyte
Extensible mechanics for operational transformation in Rust that are generic with respect to their value (not constrained to text), wire-compatible with Quill and fully fuzzed
-
asciifolding
ascii folding library
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
dprint-plugin-markdown
Markdown formatter for dprint
-
quickner-core
A fast and simple NER tool
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
rialo-s-spl-memo
SPL Memo v3 (Rialo adaptation)
-
findcar
An extensible CLI tool that filters, sorts, and aggregates search results from car websites. Because of course you want to search for cars from the command line.
-
checkstream-policy
YAML-based policy engine for LLM safety rules, triggers, and actions
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
legalis-us
United States jurisdiction support for Legalis-RS (Common Law)
-
opentalk-types-signaling-meeting-report
Signaling types the OpenTalk meeting-report module
-
shellvetica
Bring your shell output to the web
-
asimov-serpapi-module
ASIMOV module for data import powered by the SerpApi search data platform
-
tiktokenx
A high-performance Rust implementation of OpenAI's tiktoken library
-
tg_bot_models
Telegram Bot API types (for API 4.3)
-
scanix
search a text or pattern in files. A fast and lightwight text tool.
-
fop
FOP (Formatting Objects Processor) — Apache FOP-compatible XSL-FO processor in pure Rust
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
grammateus
facilitate working with Ancient Greek words
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
sakurs-cli
Command-line interface for Sakurs sentence boundary detection
-
rslug
fast, and configurable library to create URL-friendly slugs from strings
-
BBSGlib11
num BBSGlib11
-
mdloc
command-line tool for processing image links in Markdown files. Download remote images and convert them to Base64 embedded format or local file references.
-
tectonic_xetex_format
Tectonic/XeTeX engine data structures and their expression in TeX "format" files
-
natord-plus-plus
Natural ordering for Rust
-
gender_guesser
that guesses the gender of a given name
-
libruskel
Generates skeletonized outlines of Rust crates
-
hitori
Generic compile-time regular expressions
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
holy-carpet
customizable blog creator
-
kind-tree
Syntatic trees for Kind compiler
-
tantivy-tokenizer-api
Tokenizer API of tantivy
-
synaptic-xai
xAI Grok integration for Synaptic
-
east-asian-width
Determine the display width of Unicode characters in East Asian contexts
-
gregex
Regex solver utilizing NFA
-
textnonce
Text based random nonce generator
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
film_parser
application for parsing detailed film data from raw text, including information such as title, release year, director, screenwriter, cast, genre, and runtime, into a structured Film…
-
json-carver
Digital forensics tool that reads (carves) JSON strings from a dump. Think of it as a more accurate and faster replacement for the strings(1) utility.
-
chardet
rust version of chardet
-
ansimake
Quickly convert pixel images of ANSI art created with AI to actual ANSI art
-
etch
Not just a text formatter, don't mark it down, etch it
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
natural-sort-rs
sorting strings based on Natural sort order
-
cursive_hexview
hexviewer for cursive
-
ptero-cli
A text steganography CLI tool for Social Media
-
aki-mcycle
mark up text with cycling color
-
meztide
SDK for lotide
-
aozora2
Aozora Bunko format converter CLI
-
pgn-traits
Traits for chess variations that can be annotated with Portable Game Notation
-
parangi
PDF text extraction library — Rust port of Apache PDFBox
-
sik
A fast and concurrent command-line tool for searching patterns in files
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
char_reader
Safely read wild streams as chars or lines
-
serenity_utils
provide additional utilies for Discord bots created with serenity
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
utf8proc-sys
Unsafe rust bindings to the utf8proc library
-
bat
A cat(1) clone with wings
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
fx-mistral
leverage the Mistral API for OCR and data extraction from PDFs
-
slugify
Macro for flexible slug generation
-
trees-rs
tree-rs is a drop-in replacement for the tree utility (ASCII tree view of folder directory), but written in Rust. The primary goal of this project is to provide a non-bloated and more…
-
mullama
Comprehensive Rust bindings for llama.cpp with memory-safe API and advanced features
-
anagrambot
find anagrams of words
-
rtl_isolate_fix
Plain-text RTL/LTR technical-token isolation using Unicode bidi isolates (LRI/PDI), with URL/email exclusions
-
hebrew_accents
finding, filtering, and displaying Hebrew accents, specifically focusing on the Tiberian accent system as documented by the Masoretes
-
Contact_Book
contact book
-
subject-classifier
classifying a commit by it's subject
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
liwe
IWE core library
-
linkcheck
extracting and validating links
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
sevenmark_utils
Shared utilities for SevenMark crates
-
dtools
A collection of tools for working with Commodore disk images
-
stam-tools
Command-line tools for working with stand-off annotations on text (STAM)
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
synaptic-runnables
LCEL composition: Runnable trait, pipe operator, Lambda, Parallel, Branch, Fallbacks
-
obsidian-cli-inspector
Local-first CLI/TUI for indexing and querying Obsidian vaults
-
nfa_regex
NFA regex engine for text processing
-
genpdf-chinese
User-friendly PDF generator written in pure Rust
-
pillar
small tool to format lines into columns
-
anno-metrics
Shared evaluation/analysis primitives for anno (metrics + cluster encoders)
-
cgp-monad
Monadic constructs for CGP computation
-
mdxport
Markdown to PDF via Typst — comrak AST, in-process compilation, LaTeX math support
-
tkrar
Count frequency of words in a file or a directory
-
text-search
easy to use plug and play wrapper around tantivy for simple search scenarios
-
record-query
doing record analysis and transformation
-
rakugaki
rendering TTF/OTF font characters as ASCII art in the terminal
-
prototty_common
Definition of common ui elements and views for use with prototty
-
p4d-mdproof
Markdown to PDF converter
-
dictutils
Dictionary utilities for Mdict and other formats
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
pdfcr
render a codebase to a pdf
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
overlap-chunk
splitting text into chunks of specified size with adjustable overlap percentage
-
lucide-dioxus
Dioxus port of Lucide
-
case-conv
Faster case conversion crate
-
text-tokenizer
Custom text tokenizer
-
convert-to-spaces
Convert tabs to spaces in a string
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
quickmark-core
Lightning-fast Markdown/CommonMark linter core library with tree-sitter based parsing
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
character-set
High performance
set.contains(char) -
markup_engine
Zero-dependency, trait-based, ultra-lightweight markup renderer (Markdown + HTML + HTML + pluggable). Born from LeadSheetML, useful anywhere.
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
lowcharts
draw low-resolution graphs in terminal
-
sqdj
sqdj shortens delimited data
-
minigrep_wise
minigrep project from the rust book project
-
kvu
The simplest command line tool to manage key-value pair lines
-
lex_sleuther
Sleuth out script languages by lexing
-
quillmark-cli
Minimal CLI for testing Quillmark packages locally
-
translitrs
Transliteration utility for Serbian language
-
terminal_tools
Power-Terminal TUI - fuzzy file and text finder, process manager, git browser, and more
-
clipcount
Counting words from the clipboard content
-
slugify-core
Fast, Unicode-aware slug generation library with multi-language bindings
-
mdbook-org
A preprocessor plugin for mdBook that enables org-mode file support
-
skills-ref-rs
agentskills library for validating, parsing, and managing Agent Skills
-
bocu1
BOCU-1 compressed unicode encoding
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
wtf8-rs
WTF-8 encoding
-
wagyan
CLI tool to convert text into extruded ASCII STL meshes (TTF/OTF supported)
-
rust-functions
A collection of Rust utility functions (starting with format_number)
-
niho
A command-line tool for converting romanized Japanese text to Japanese characters
-
rust_file_encode_mode_convert
这是一个rust的库,用于检测文件的编码格式。 支持GBK,GBK2312 , UTF8, UTF16LE, UTF16BE, UTF8+BOM,UTF32 等多种编码格式。
-
mtf
Markdown Table Formatter
-
codebook_downloader
Dictionary downloading utility for the Codebook spell checker
-
diaryx_core
Core library for Diaryx - a tool to manage markdown files with YAML frontmatter
-
kyu-parser
openCypher parser for KyuGraph with hand-written lexer and chumsky combinators
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
words-count
Count the words and characters, with or without whitespaces
-
lasr
Live Action Search and Replace
-
random-zh
generating random Chinese characters
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
textwrap-cli
Command line interface for textwrap
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
pdf_composer_definitions
PDF Composer definitions crate
-
streamdown-syntax
Syntax highlighting for streamdown via syntect
-
ext-fts
Full-text search extension for KyuGraph powered by Tantivy
-
lindera-wasm
A morphological analysis library for WebAssembly
-
asoiaf-api
Rust wrapper for the api of ice and fire
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
rumage
framework for making simple markdown sites
-
opentalk-roomserver-report-generation
OpenTalk RoomServer report generation
-
minigrep_tuhin
A mini grep tool implemented in Rust
-
argus-core
AI code review platform — your coding agent shouldn't grade its own homework
-
term-gpt
A fast, colorful ChatGPT CLI for your terminal!
-
usage-cli
CLI for working with usage-based CLIs
-
nib-cli
A cli for a yet another static site generator Nib
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
alpha-counter
Alphabetic counter
-
ascii_tree
generates ascii trees
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
hanconv-cli
Convert between Chinese characters variants
-
text_lines
Information about lines of text in a string
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
somedoc
A very simple document model and markup generator
-
mecab-ko-dict-builder
한국어 형태소 사전 빌더 - CSV에서 바이너리 사전 생성
-
orly
Download O'Reilly books as EPUB
-
markdown-code-runner
Automatically update Markdown files with code block output
-
mupdf-sys
Rust FFI binding to MuPDF
-
cfasttext-sys
fastText ffi binding
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
xj_scanf
Safe reimplementation of
scanf() -
swimos_model
SwimOS Generic Representation of the SwimOS Serialization Model
-
nxfetch
A minimal, fast and batteries included fetcher!
-
tree-sitter-stack-graphs-python
Stack graphs definition for Python using tree-sitter-python
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
slop-guard
Detect AI slop patterns in prose — scores text 0-100 for ~80 regex-based rules targeting LLM writing tics
-
fontheight-cli
Find out the vertical extents your font reaches on shaped words
-
vesti
A preprocessor that compiles into LaTeX
-
ucp-translator-html
HTML to UCM document translator
-
ix-embeddings
Shared embedding infrastructure for Ixchel with pluggable providers
-
mintyml-cli
Creates HTML from MinTyML, a minialist alternative syntax to HTML
-
yagenerator
Application that uses tinytemplate engine to generate text files. If you have a set of structured data, and need to generated a bunch of arbitrary types of files from it, this tool can help you to save some time.
-
grift_util
Shared utilities for the Grift Scheme implementation
-
html-linter
An HTML linting library for checking HTML structure and semantics
-
mecab-ko-hangul
한글 처리 유틸리티 - 자모 분리/결합, 음절 처리, 정규화
-
re2
Wrapper for the re2 C++ regex library
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
rbook-cli
Experimental command-line interface for rbook
-
libcommand
creating ascii command interpreters with builtin argument-splitting
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
eml2md
Convert EML files to Markdown
-
file-editor
Clean, elegant API for creating and editing text files
-
arinamcnulty-markdown-parser
Markdown parser - university project
-
repoyank
Interactively traverse your repository, select files/directories, and quickly prepare structured snippets for LLM interactions
-
waterui-str
String utilities for WaterUI
-
real_time_note_taker
A terminal UI tool to take time stamped notes in real time
-
r4d
Text oriented macro processor
-
fastn-builtins
fastn: Full-stack Web Development Made Easy
-
m_lexer
extensible regular expressions based lexer
-
tabulate-rs
Pretty-print tabular data. Port of python-tabulate to Rust.
-
llm-coding-tools-core
Lightweight, high-performance core types and utilities for coding tools - framework agnostic
-
workstyle
Workspaces with style! This program will dynamically rename your workspaces to indicate which programs are running in each workspace. It uses the i3 ipc protocol, which makes it compatible with sway and i3…
-
cuteness
Cute static site (+ server) generator with a bunch of plugins :3
-
pdf_form
programatically filling out pdf forms
-
extract-strings
Extract ascii strings from files
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
text_layout
Text layout algorithms
-
superfold
A multilingual Rust library and CLI to process UTF-8 strings to exclude diacritics and fold non-phonetic graphemes into their phonetic ASCII representation
-
reddish
A Rust utility library
-
correct_word
A No brainer 'did you mean' library for Rust
-
count-md
configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
rusttyper
Basic text layout, using rusttype
-
man_parser
roff parser for converting man pages to JSON/Markdown
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
corpus-preproc
A preprocessor for text and HTML corpora
-
transmutation
High-performance document conversion engine for AI/LLM embeddings - 27 formats supported
-
fusefiles
Concatenate a directory full of files into a single prompt for use with LLMs
-
ogam
A markup language for story writers
-
rhema_testkit_chirho
Shared test fixtures, generators, golden harnesses, differential runners
-
anslatortray
translate from English to Pig Latin!
-
streamdown-config
Configuration loading and management for streamdown
-
strval
Parse strings into values
-
diff-man
diff utility lib
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
flatlake
In the works
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
tagsearch
Filter plaintext files based on @keyword tags
-
sbert
Sentence Bert (SBert)
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
arborium-highlight
Unified syntax highlighting for arborium - works with both static Rust grammars and WASM plugins
-
deepl-openapi
The DeepL API provides programmatic access to DeepL’s machine translation technology
-
createqrcode
A CLI tool to convert data into QR codes
-
regex_ansi
ANSI escape sequence regular expression (Rust port of ansi-regex behavior)
-
fast_trie
A memory efficient trie library
-
meme_generator_utils
Meme generator utils
-
upodesh
generating Bengali word suggestions based on a given input string
-
codetypo-dict
Source Code Spelling Correction
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
rosetta-aisp-llm
LLM fallback for AISP conversion using Claude SDK - extends rosetta-aisp with AI-powered conversion
-
lingua-turkish-language-model
The Turkish language model for Lingua, an accurate natural language detection library
-
sublime-syntaxes
Precompiled Sublime Text syntax definitions for languages not in syntect's defaults
-
stylometry-analyzer
Minimal CLI tool that combines one or more
.txtfiles, extracts user-authored text, and enforces a minimum size. Hash-embeds text chunks and queries a local vector DB to classify writing style… -
mdlib
A beautiful markdown note-taking application
-
colgrep
Semantic code search powered by ColBERT
-
bin-rs
Binary Reader from multi source
-
n_gram
training n-gram language models
-
typship
A cli for typst packages
-
emoji-search
Fast fuzzy emoji searcher and picker for the terminal
-
qmd-mcp
MCP server for qmd - lightweight SOTA local search engine for AI agents
-
a3s-privacy
Privacy classification and data protection for A3S
-
turbo-json-checker
A pushdown automaton low memory JSON bytes stream checker returning the JSON root-type followed by its start and end index in the Reader
-
rgon
A command-line tool written in Rust that searches for a query string within a file
-
legalis-au
Australia jurisdiction support for Legalis-RS (Commonwealth Constitution, ACL, Fair Work, Mabo)
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
perm-text
curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents
-
bashdoc
generating documentation/help menu for user defined bash functions
-
escrit
learning languages by reading texts
-
text-block-permutation-optimizer
If TSP would meet Text processing
-
interslavic
in rust
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
aimds-detection
Fast-path detection layer for AIMDS with pattern matching and anomaly detection
-
zettel-core
Core library for Luhmann-style Zettelkasten management
-
scfmt
formatting code with structured comments
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
timug
It has been created for personal blog creation purpose. Timus has its limits, but it fulfills the purposes for which it was created.
-
gemini-tokenizer
Authoritative Gemini tokenizer for Rust, ported from the official Google Python GenAI SDK
-
zenpatch
A robust library for applying text-based patches, designed for AI coding agents with backtracking algorithm
-
deindent
A command line utility and Rust library to format overly-indented text
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
clparse
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format
-
refac
Transform some text given a generic natural language prompt
-
asciimath-unicode
Convert asciimath to unicode
-
blackboxmc_java
BlackboxMC bindings for java.util
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
kicad-text-injector
A tiny CLI tool that replaces variables of the style
${KEY}within KiCad PCB (pcbnew) files -
solr_post
command line tool and library to post files in a directory to a Solr server
-
trie-match
Fast match macro
-
owlz
"Owlz" ascii emojis, created randomly or by design
-
mdbook-glossary
mdBook preprocessor to generate glossary tables
-
boykisser
A collection of compile time ASCII art and images, especially boykisser
-
hangeul_jamo_rs
A high-performance Korean Hangul syllable and jamo manipulation library. included Python bindings.
-
font-map
Macros and utilities for parsing font files
-
badascii
Backend rendering library for BadASCII diagrams. Block diagrams in ASCII.
-
thank
Shows info about crates used in your projects so you know who to thank for them and where to do the thanking
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
matchy-match-mode
Shared MatchMode enum for matchy workspace (internal)
-
two_five_six
Two-five-six provides high-entropy, human-friendly identifiers
-
slabs
Text chunking for RAG: fixed, sentence, recursive, and semantic strategies
-
ricecoder-research
Research and analysis utilities for code understanding
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
fetchkit-cli
Command line interface for FetchKit web content fetching tool
-
rust_tokenizers
High performance tokenizers for Rust
-
git2prompt
command-line tool that takes a GitHub repository URL, downloads its contents, and generates a single text file optimized for use as input to AI tools
-
uclanr
A random word picker that gives you actually useful words
-
wikidump
parsing Mediawiki XML dumps
-
yar_markdown
Markdown handling for yar
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
regexml
XPath compatible regex engine
-
crawdad-rkyv
Crawdad: ChaRActer-Wise Double-Array Dictionary with rkyv support
-
small-bwt
BWT construction in small space
-
yuto51942-servant
cli
-
synaptic-parsers
Output parsers: String, JSON, Structured, List, Enum — all composable as Runnable
-
comment-strip
Remove comments out of text files
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
a3s-common
Shared types for A3S: privacy classification, tool definitions, transport protocol
-
clip-sanitize
Meta-library for robust text sanitization, repair, and normalization
-
utf8-io
Traits and types for UTF-8 I/O
-
lucide-icon-name
Lucide icon names
-
ragtime
Easy Retrieval Augmented Generation
-
elicitor-doc-latex
Backend for elicitor: generates fillable LaTeX forms from SurveyDefinition
-
fuzzy-datetime
Detects, completes and normalises fuzzy date and date-time strings for interoperability with chrono or direct output as ISO-8601-compatible strings
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
caser
Change text between PascalCase, camelCase, and snake_case
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
clippers
Cross-platform clipboard management library
-
tectonic_engine_bibtex
The
bibtexprogram as a reusable crate -
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
legalis-ca
Canada jurisdiction support for Legalis-RS (Charter of Rights, Federal/Provincial Law, Quebec Civil Law)
-
ftrace
trace files and paths
-
human-names
A random generator for human first and last names
-
skimple
interface for the skim fuzzy-matcher
-
ftb
A fast CLI tool to format and align Markdown tables
-
xml-string
String types for XML
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
constr
Constant string generics
-
good-pc
Just a silly little tool to pet your pc and display some ASCII art
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingualcmdline tool. -
ascii_converter
converting between different ascii representations
-
ncase
Enforce a case style
-
csvsc
Build processing chains for CSV files
-
ascii_help
help you quickly convert ASCII codes
-
supermarkdown
High-performance HTML to Markdown conversion for LLMs
-
esc
Escape characters in strings
-
mecab-ko
한국어 형태소 분석기 - MeCab-Ko의 순수 Rust 구현
-
lipgloss-tree
A tree component for terminal user interfaces, styled with Lip Gloss
-
amdb
Turn your codebase into AI context. A high-performance context generator for LLMs (Cursor, Claude) using Tree-sitter and Vector Search.
-
to_markdown_table
An easy way to format any data structure into a Markdown table
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
libabbs
aosc-os-abbs maintenance
-
find_unicode
Find Unicode characters, the easy way!
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the
mantraframework -
mdbook-dtmo
Creates a book from markdown files with added plugins
-
axonml-text
Text processing utilities for the Axonml ML framework
-
docket
markdown to HTML documentation rendering
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
cliche
Dead simple static site generator
-
fea-rs
Tools for working with Adobe OpenType Feature files
-
asciirend
no_std compatible ascii rendering core
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
uwl
A management stream for bytes and characters
-
awful_knowledge_synthesizer
Generate LLM-powered exam questions from YAML books, manpages, mdbooks, tealdeer pages, and code
-
crustword
Crusty Crosswords
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
biodiff-wfa2-sys
Rust bindings for the WFA2 library
-
zw
encoding and decoding text using zero-width characters
-
text-replacer
Takes a String, or Bytes and replaces each word found with a same word from the provided dictionary
-
foundation_nostd
Foundational nostd core implementation for the ewe_platform project
-
veltox
A fast, themeable static site generator written in Rust — ideal for documentation and blogs
-
mq-repl
Read-Eval-Print Loop (REPL) for mq query language
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
pad
padding strings at runtime
-
text_utils_s
edit array. Example delete duplicate in array. Clear string
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
charmap
one-to-(none/one/many) character mapping
-
roan-engine
The core engine for the Roan project
-
artificial-prompt
Fluent builders and helpers for composing markdown prompt fragments
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
microformats-types
A representation of the known objects of Microformats
-
flashtext2
The FlashText algorithm implemented in Rust
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
rdocx-cli
CLI tool for inspecting, converting, and manipulating DOCX files
-
rpdf
PDF command-line utils written in Rust
-
gchemol-parser
Text parsing made simple
-
hayro-font
A parser for CFF and Type1 fonts
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
asimov-openai-module
ASIMOV OpenAI module
-
md_ol_util
Transform markdown unordered lists to ordered lists with current position expressions
-
miktik
A unified, multi-backend tokenizer library for LLMs
-
bible-io
working with Bible text data structures
-
mdbook-typstpdf
An mdBook backend that generates PDF output using Typst
-
inslice
A command-line utility for filtering text input by columns and rows
-
harfshapedfa
Glue functionality between skrifa & harfrust
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
sakurs-core
High-performance sentence boundary detection using Delta-Stack Monoid algorithm
-
kiosk-core
Core library for kiosk — tmux session manager with worktree support
-
runestr-pancjkv
rune-based Pan-CJKV support -
tgrep
Toy grep that honors .gitignore
-
ctf-brute
Brute-force utilities for Rust
-
dr
Command-line data file processing in Rust
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
mkweb
A markdown to html converter and server
-
vregex
Regular Expression engine
-
pulldown-cmark-fork
A pull parser for CommonMark
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
shimmytok
Pure Rust tokenizer for GGUF models with llama.cpp compatibility (SentencePiece + BPE + WPM + UGM + RWKV)
-
mdbook-morsels
Morsels plugin for Mdbook
-
scribe-patterns
Advanced pattern matching and search algorithms for Scribe
-
macro_colors
colorful printing macros
-
text_manipulation_rs
generating random placeholder text in different languages
-
legalis-th
Thailand jurisdiction support for Legalis-RS - Thai legal system with Buddhist Era calendar, FBA, BOI, PDPA, Labor law
-
slugify-rs
generate slugs from strings
-
jfmt
command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.
-
jtool-grep
notebook-specific grep tool for jtool
-
hydroper_source_text
Source text containing line locations
-
assemblage_view
Linearized View Model and Bindings for AssemblageDB
-
aqp3-cli
Congress.gov legislation text search query syntax validator
-
nano_banana_pro_prompt
High-quality integration for https://supermaker.ai/blog/nano-banana-pro-prompt-use-cases-ready-to-copy-paste/
-
rustwemoji
Twemoji for Rust
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
syntaxdot-encoders
Encoders for linguistic features
-
bard
Creates PDF and HTML songbooks out of easy-to-write Markdown sources
-
sauron-md
parsing markdown into sauron node
-
par-term-fonts
Font management and text shaping for par-term terminal emulator
-
streamparser
Adapt simple stream parsers for different types of streams
-
LitePhoton
A blazingly fast text file/csv file/etc scanner
-
align
aligning text
-
thoth-note
note-taking app written in Rust
-
deliminator
Universal code documentation generator
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
jmemo
CUI tool for maintaining memos
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
turndown
An opionated Rust port of Turndown.js
-
replacer-cli
A flexible cli to replace strings in files or a directory
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
encoding-next-index-tradchinese
Index tables for traditional Chinese character encodings
-
indeed
Append lines to a file with no shell bullshit
-
mdast_util_to_markdown
Markdown to AST
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
lithe
A Slim template engine by using Pest
-
bubble-bath
Small and quick HTML sanitizer
-
rst_renderer
a reStructuredText renderer
-
epub2mdbook
convert EPUB files to MDBook format
-
book2pdf
CLI utility to turn published code documentation into PDFs
-
charclass
define and modify unicode character classes
-
stringsext
find multi-byte-encoded strings in binary data
-
utf8-bytes
bytes::Bytes, but UTF-8
-
committed
Nitpicking commit history since beabf39
-
typos-dict
Source Code Spelling Correction
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
incredimo
just another font for your terminal
-
ramp-text
Pure Rust
-
persian-tools-cli
cli for rust-persian-tools crate
-
asimov-module-cli
ASIMOV Module Command-Line Interface (CLI)
-
jira-mcp-server
AI-friendly JIRA MCP server with semantic search capabilities
-
minigrep_sensitive
search for a querry case sensitive or not, in a file
-
cur
that will hunt for your regular expression
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
flux-tui
Fast and lightweight Terminal UI drawing library
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
koruma-collection
A collection of common validators using koruma
-
gix-glob
gitoxide project dealing with pattern matching
-
pulldown_mdbook
A pull parser for mdBook
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
htmlsnob_rules
HTML validator, formatter and autofixer
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
indoc
Indented document literals
-
rjoin
joining CSV data on command line
-
typub-adapter-ghost
Ghost platform adapter for typub
-
noise_search
Nested Object Inverted Search Engine
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
url-cleaner-engine
The engine behind URL Cleaner
-
libopenlipc-sys
Wrapper around liblipc to interact with Kindle dbus-based LIPC events
-
tweet-memo
A CLI tool to record Twitter-style short memos in Markdown files
-
chonkier
🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library
-
gh_page_tool
A github gh-pages tool for static blog site
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
uiua-doc-gen
Documentation generator for Uiua libraries
-
dioxus-typst
Typst component for Dioxus
-
unicode-box-drawing
Unicode box-drawing characters
-
dmntk-gendoc
dmntk | Documentation generator
-
renamer-rs
process and rename files or text
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
code_generator
A code generator (Currently only targets C)
-
s3grep
A fast, parallel grep tool for searching logs and unstructured content in AWS S3 buckets
-
file-size
a function formatting file sizes in 4 chars
-
rfc9839
RFC 9839 specification
-
nile-library
supporting nile
-
lindera-sqlite
Lindera tokenizer for SQLite FTS5 extention
-
japanese-text
日本語テキスト正規化ライブラリ - 全角半角変換、ひらがなカタカナ変換
-
rcut
replacement for GNU cut that supports UTF-8
-
typoglycemia
A function to convert text to typoglycemic format
-
mq-crawler
Directory crawler for batch Markdown file processing
-
matcher
UCFP matching layer for semantic and perceptual search over indexed fingerprints
-
reflexo-vec2svg
Export a frame of Typst document into svg element
-
pukram-formatting
A type to represent the formatting of the pukram markup language
-
vidyut-kosha
A Sanskrit key-value store
-
rk-utils
A collection of utility functions and data structures for rust
-
pdfrust
PDF parser
-
libxdiff
Rust bindings for the libxdiff C library
-
moobi
A random password generation CLI tool
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
zorto
A fast static site generator with executable code blocks
-
naromat
Convert text to narou novel format
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
syllable
counter for use with reading level calculations
-
ferritin-common
rustdoc navigation and search
-
ogrep
searching in indentation-structured texts
-
afrim-memory
handle of sequential codes easier for an input method
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
playbill
ASCII art title generator with random gradient effects
-
lexikit
CLI that packs an entire project into a single LLM-ready document (Markdown or //-comment style) with UTF-8 detection, binary handling, glob include/exclude filters, and .gitignore awareness
-
change-case-rs
Convert strings between camelCase, snake_case, PascalCase, kebab-case, and more
-
fcowsay
working with cowsay
-
matchpick
Find and replace multi-lines using a match-case
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
lucide-yew
Yew port of Lucide
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
csv_to_table
pretty print CSV as a table
-
ssort
CLI tool for suffix (inverse lexicographic) sorting
-
minigrep_rd
searching through lines of text
-
strip-codeblocks
strip markdown code blocks from text, preserving only the inner content
-
humnum
Human numeric sorting program — does what
sort -his supposed to do! -
orgflow-tui
A terminal user interface for orgflow - manage notes and tasks with a smooth workflow
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
gstring
String with support for Unicode graphemes
-
pinyin-sort
sort pinyin
-
jxoesneon-tectonic-io_base
Internal crate for the Tectonic project: tectonic_io_base
-
steve
Search Technical Evidence Very Easily
-
salvation-cosmic-text
Pure Rust multi-line text handling
-
chinese-telegraph
unicode to chinese telegraph code conversion
-
kanuni
AI-powered legal intelligence CLI - The Ottoman Edition
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
klirr-render
Zero-maintenance and smart FOSS generating beautiful invoices for services and expenses
-
mdbook-rustdoc-links
Link to Rust API docs by name in mdBook
-
const_format_proc_macros
detail of the
const_formatcrate -
string-width
Accurate Unicode string width calculation for terminal applications, handling emoji, East Asian characters, combining marks, and ANSI escape sequences
-
atlas-memo
Atlas Program Library Memo
-
textcat
detect text categories. It can be used to detect the language of a given text
-
excavate
A command line tool to excavate fields from multi-column texts
-
py-regex
A small wrapper around the Python regex module via PyO3
-
campfire
A tiny static site generator, greatly inspired by Zola
-
lingua-thai-language-model
The Thai language model for Lingua, an accurate natural language detection library
-
asimov-chromium-module
ASIMOV module for Chromium (and Brave, Google Chrome) bookmark import
-
ottotom-testsupport
Test helpers used for the ottotom crate tests
-
grep-regex
Use Rust's regex library with the 'grep' crate
-
mq-conv
A CLI tool for converting various file formats to Markdown
-
fop-types
Core types for Apache FOP Rust implementation
-
rfc9839-rs
RFC9839 to test for problematic Unicode code points
-
rustmax-doctest
Doctest runner for rustmax crate examples
-
ucd-util
A small utility library for working with the Unicode character database
-
boundary-report
Report generators for boundary analysis results
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
delim
parsing values from delimited strings
-
tantivy-object-store
A tantivy Directory implementation against object stores (S3, GCS, etc.)
-
typst-ts-core
Core function of Typst.ts
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
kodegen_native_notify
KODEGEN.ᴀɪ: Memory-efficient, Blazing-Fast, MCP tools for code generation agents
-
unistructgen-markdown-parser
Markdown table parser for UniStructGen
-
hangul
manipulate Hangul Syllables
-
enc-check
inspect utf-8 and utf-16 character encodings
-
probe-code
AI-friendly, fully local, semantic code search tool for large codebases
-
hex-utilities
working with hexadecimal numbers
-
miette-arborium
Arborium-powered syntax highlighter for miette diagnostics
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
lk-inside
A terminal user interface (TUI) application for interactive data analysis
-
assert-text
the testing macro tools
-
ghost-lib
Ghost Librarian — ultra-lightweight local-LLM RAG engine with Context Distillation
-
mdbook-keeper
An improved testing experience for mdbook
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
aklat
create books from markdown files (like Gitbook)
-
convert_case_extras
Extra features for convert_case
-
wildflower
Wildcard matching against strings
-
zenbase-llml
Lightweight Language Markup Language - converts data structures to XML-like markup
-
vds
Visibly distinguishable string types for identifiers and codes
-
tree-sitter-stack-graphs-java
Stack graphs for the Java programming language
-
html-auto-p
function like
wpautopin Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
abbreviation_extractor
extracting abbreviations from text
-
highlights
Convert kindle highlights into different formats
-
mention
code highlighter
-
nlf
A CLI to append newline characters (LF) at the end of text file
-
verba
working with Latin words
-
reqmd_tui
TUI tool for reqmd
-
pandoc_ast
deserializes and serializes the markdown ast for writing pandoc filters
-
rehuman
Unicode-safe text cleaning & typographic normalization for Rust
-
markdown-table
Creating markdown tables with Rust!
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
snekdown
A parser for the custom snekdown markdown syntax
-
array_tool
Helper methods for processing collections
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
mdbook-nice
A mdbook plugin to add nice css to your book
-
kiri_nif
Erlang NIF wrapper for Kiri Japanese morphological analyzer
-
trustformers-tokenizers
Tokenizers for TrustformeRS
-
stringzz
strings and opcodes extraction from various file formats
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
floating-ui-utils
Rust port of Floating UI. Utilities for Floating UI.
-
benda
Use Bend language functions and capabilities in Python!
-
aki-txpr-macro
the more easy to use libaki-*
-
flerp
CLI tool that does XYZ
-
makepad-rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
goofy-animals
Generate a name in adjective-adjective-animal form
-
j4l_minigrep
learning Project for test
-
csvpeek-rs
A CLI tool to quickly peek into, list, and filter CSV data
-
rhema_accel_chirho
FPGA acceleration: packed hierarchical bit-domain engine (Chi-Rho patent)
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
lindera-cc-cedict
A Chinese morphological dictionary for CC-CEDICT
-
voirs-g2p
Grapheme-to-Phoneme conversion for VoiRS speech synthesis
-
fasta-cleaner
Transform fasta files by upper-casing all sequence characters and removing non-ACGT sequence characters
-
awful_book_sanitizer
CLI to clean up OCR-mangled book excerpts into readable text using OpenAI-compatible APIs
-
glowpub
A glowfic to epub converter
-
codetypo-vars
Source Code Spelling Correction
-
mistral_ocr
CLI tool to convert PDF, image, and document files into Markdown using Mistral AI's OCR API
-
latin
A standard library with a focus on common operations
-
litsea-cli
Litsea is an extreamely compact word segmentation and model training tool implemented in Rust
-
mdbook-permalinks
Generate permalinks in mdBook using paths
-
shell-color
shell-colorprovides a portable, reliable way of determining color support for applications spawned by the shell -
lesspub
CLI tool for downloading Sequences from LessWrong and exporting them as EPUB format ebooks
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
zed-util
A collection of utility structs and functions used by Zed and GPUI
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
tiny-grep
grep-like text search utility written in Rust
-
kaiba
domain library - Core types and interfaces for AI persona system
-
alemat
type-safe building of MathML
-
kmdbook-driver
High-level library for running mdBook
-
caribon
A repetition detector program and library
-
flabild
A fast Markov chain-based fake word generator that produces pronounceable pseudo-words
-
sevenmark-utils
Shared utilities for SevenMark crates
-
tirith-core
Terminal security analysis engine - homograph attacks, pipe-to-shell, ANSI injection
-
aho-corasick-unsafe
Fast multiple substring searching
-
ispell
Easily run ispell, hunspell or aspell from Rust
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
char-positions
Iterate chars and their positions, i.e. line, column, and byte ranges
-
language-tokenizer
Text tokenizer for linguistic purposes, such as text matching. Supports more than 40 languages, including English, French, Russian, Japanese, Thai etc.
-
runefix-cli
Unified CLI frontend for runefix tools: width detection, markdown formatter, and terminal testing
-
mdopen
Preview markdown files in a browser
-
mdbook-summary
Summary parser for mdBook
-
node-emoji
Convert
:emoji:to Unicode using GitHub’s and EmojiDB’s emoji names -
untex
Understand and manipulate TeX files with ease
-
regex-automata
Automata construction and matching using regular expressions
-
meet
building website
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
token-parser
parsing texts into data structures
-
serde_ssml
A robust Rust library for parsing, manipulating, and generating Speech Synthesis Markup Language (SSML) documents
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
text-utils
Text utils for unescaping and align
-
fozzie
A quick fuzzy finder for the terminal
-
content-ingest
Content ingestion, validation, and normalization pipeline for text and binary data
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
chinese_detection
Classify a string as either English, Chinese, or Pinyin
-
vew
Visualize lsof output
-
ean-rs
generating and validating EAN barcodes
-
scripter
A screenplay compiler
-
twitter-text
in Rust
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
curtana
Simplified zero-cost wrapper over llama.cpp powered by lama-cpp-2
-
uwurs
UwUify your strings with uwurs!
-
simple-xml-builder
XML builder/writer
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
crate2bib-cli
A CLI tool for the crate2bib crate
-
html-compare
compare html files
-
synaptic-anthropic
Anthropic integration for Synaptic: ChatModel
-
saville_encode
Encode UTF-8 text into Peter Saville's colour code
-
htmlescape
HTML entity encoding and decoding
-
minigrepbk
A collection of utilities to make performing searching text in a certain document
-
md-kroki
Render kroki diagrams inside markdown
-
utf8-fix
Fix invalid UTF-8 sequences in-place while preserving buffer size - useful for fuzzing and mutation testing
-
promptify
A plaintext directory formatting tool for interacting with LLMs on the command line
-
custard_lib
A core library for Custard
-
minigrep_elijahkx
MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
ryaspeller
lib for searching typos in text, files and websites
-
token-dict
basic dictionary based tokenization
-
notedown_ast
Notedown Abstract Syntax Tree
-
august
& program for converting HTML to plain text
-
scribe-webservice
Web service interface for Scribe repository analysis
-
forgetless
Smart context optimization for LLMs that compresses massive content to fit your token budget
-
thediff
Difference between 2 files in percentages
-
legalis-vn
Vietnam jurisdiction support for Legalis-RS - Vietnamese legal system with socialist market economy, Labor Code, Enterprise, Investment
-
bbd-lib
Binary Braille Dump
-
textshrink
A minimalistic lib that reduces text size by replacing specific character sequences with single Unicode characters (e.g., 'mm' becomes '㎜'). On average, this compresses text by ~5%, with variation also ~5%.
-
skribo
low-level text layout
-
invisible-sun
program for generating svg "suns"
-
aozora2text
Convert Aozora Bunko format to plain text
-
mintyml
A minialist alternative syntax to HTML
-
mdtranslation
prepare multi-lingual Markdown documents
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
langsan
sanitizing language model input and output
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
regex-lite
A lightweight regex engine that optimizes for binary size and compilation time
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
fmty
Composable
core::fmtutilities -
table_to_html
interface to convert a
tabled::Tableinto a HTML table (<table>) -
az_minigrep
minigrep CLI program to search a query string from a file and print out the matched lines
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
eytan-minigrep
minigrep from "the book"
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
badascii-mdbook
Embed badascii diagrams in your mdbook. See badascii.me for the editor.
-
codive-tools
Tool implementations for Codive
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
minigrep-cli-tool
cli tool to search upon a text file for lines containing specified string
-
static_table
creates pretty tables at compiler time
-
opentalk-roomserver-types-legal-vote
OpenTalk RoomServer Types Legal Vote
-
daft-functions-utf8
UTF-8 functions for the Daft project
-
to_fraktur
Function that converts any string to fraktur font
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
sauron-markdown
parsing markdown into sauron node
-
typub-adapter-wordpress
WordPress platform adapter for typub
-
aki-mline
match line, regex text filter like a grep of linux command
-
uniwhat
Display the unicode characters text
-
lang-unicodes
Effortlessly convert and validate text across various Unicode ranges, supporting multiple languages for seamless internationalization in your applications
-
lingua-lithuanian-language-model
The Lithuanian language model for Lingua, an accurate natural language detection library
-
skera
Subsetting a font file according to provided input
-
steer-tui
Terminal UI library for Steer
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
story-dl
Story web scraping
-
maybe-regex
Wrapper for strings that may be either a regex or a plain-text string
-
dianac
An emulator, compiler, and interpreter for the Diana Compiled Language
-
ox_content_docs
Source code documentation generator for Ox Content (like cargo docs)
-
sixbit
Small packed strings
-
aki-stats
output the statistics of text, like a wc of linux command
-
wordchipper-training
BPE LLM Tokenizer Training for wordchipper
-
rdocx-html
DOCX-to-HTML and DOCX-to-Markdown conversion
-
grep-printer
grep crate's Sink trait that provides standard printing of search results, similar to grep itself
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
cargo-cargofmt
Cargo file formatter
-
write16
A UTF-16 analog of the Write trait
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
dcsv
Dyanmic csv reader,writer,editor
-
asciimath-text-renderer
Render asciimath in terminal
-
rsticle-cli
Command line tool to convert source files into narratives/articles
-
selmr
Package to create and use Simple Explainable Language Multiset Representations
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
zbuf
“Zero-copy” string and bytes buffers
-
libanubhav
management system written in Rust
-
limace
Slugify some strings
-
xpath-cli
Evaluate XPath selectors on XML or HTML documents
-
kindle_epub_converter
sanitize epubs for kindles
-
hayro-write
rewriting pages of a PDF file
-
termio
styling terminal output with CSS-like syntax
-
checkstream-classifiers
ML and pattern-based classifiers for toxicity, PII, and prompt injection detection
-
summera
TUI for webpage summarisation
-
vize_patina
Patina - The quality checker for Vize code linting
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
jlu
Command-line utilities for on-the-fly investigation of JSON Lines
-
computergeneration
compgen but all wrong
-
rustex
auto-generated LaTeX files in Rust
-
tectonic_io_base
Basic types for Tectonic's pluggable I/O backend system
-
scribe-graph
Graph-based code representation and analysis for Scribe
-
kizame
(刻め!) - CLI for MeCrab morphological analyzer and data pipeline
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
a8e-mcp
Articulate (a8e): The sovereign AI operator for your terminal
-
mdvault-core
Core library for mdvault - markdown vault management
-
llama-tokenizer
Tokenizer crate for llama.rs — deterministic text-to-token conversion
-
pinot
Fast, high-fidelity OpenType parser
-
typub-adapter-notion
Notion adapter for typub - REST API publishing with HTML to blocks conversion
-
tokenx-rs
Fast token count estimation for LLMs at 96% accuracy without a full tokenizer
-
regex_static
Compile-time validated regex, with convenience functions for lazy and static regexes
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
opengrep
Advanced AST-aware code search tool with tree-sitter parsing and AI integration capabilities
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
encoding-next-index-simpchinese
Index tables for simplified Chinese character encodings
-
synaptic-gemini
Google Gemini integration for Synaptic: ChatModel
-
kitoken
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
-
staple
powerful static site generator
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
ultron-syntaxes-themes
Syntaxes and themes dump for ultron
-
wesichain-retrieval
Retrieval stores and helpers for Wesichain
-
mdja
日本語に最適化されたMarkdownパーサー - CommonMark + GFM対応、目次生成、読了時間計算
-
kpathsea
Rust interface to the kpathsea TeX file management library
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
kindleclip
CLI tool to export a file per book in kindle 'My Clippings.txt'
-
character_frequency
counting character frequencies in a string concurrently
-
encoding_c
C API for encoding_rs
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
cp-parser
Document parsing for PDF, Markdown, and text files
-
tabwriter
Elastic tabstops
-
jxoesneon-tectonic-xetex_format
Internal crate for the Tectonic project: tectonic_xetex_format
-
xhtmlchardet
Character set detection for XML and HTML
-
google-fonts
Download and cache TTF fonts from Google
-
molten_ember
Render Markdown beautifully in the terminal 🔥
-
fontcull-klippa
Subsetting a font file according to provided input. (Vendored fork for fontcull)
-
md-inc
Include files in Markdown docs
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
mq-task
A task runner using Markdown
-
moenarchbook
Creates a book from markdown files
-
zummi
fun lib that produces spoonerisms
-
asimov-ollama-module
ASIMOV Ollama module
-
shift_or_euc_c
C API for shift_or_euc
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
cindex
CSV indexing library
-
rustdoc-markdown
convert Rust documentation to Markdown, for use with LLMs
-
code-splitter
Split code into semantic chunks using tree-sitter
-
toml_to_table
pretty print TOML as a table
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
indentation
Formatter
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
string-replace-all
String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns
-
grep-searcher
Fast line oriented regex searching as a library
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
simple-markdown-parser
handling markdown. May include extra utilities
-
search-in-terminal
A terminal-based search tool
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
jmdict-fast
Blazing-fast Japanese dictionary engine with FST-based indexing
-
cmark-syntax
Syntax highlighting for pulldown-cmark
-
groop
Outline-aware grep with syntax highlighting
-
wz
Count words, fast
-
m2h
Convert Markdown to HTML with syntax highlighting
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
varcon
Source Code Spelling Correction
-
markdown-gen
generating Markdown files
-
doccy
brace based markup language
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
e2m
A lightweight CLI tool that transforms English words into emojis using only Rust’s standard library
-
marker
finding issues in CommonMark documents
-
llm-coding-tools-rig
Lightweight, high-performance Rig framework Tool implementations for coding tools
-
grepox
Minimalist's grep written in Rust
-
fast-unescape
'unescapes' a escaped string with escape sequences into literal one
-
natural_syntax
Natural language syntax highlighting
-
wikitext_table_parser
implement for parse the wikitext's table
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
text-tables
A terminal/text table prettifier with no dependencies
-
kelp
A convert tool for Japanese
-
md-designer
A CLI tool for creating design docs in Markdown
-
exit-with-code
A CLI which returns the exit code of your choice
-
mitex-parser
Parser for MiTeX
-
kiri-native
Native Rust accelerator for Kiri Japanese morphological analyzer
-
venus-sync
Sync engine for Venus - converts .rs notebooks to .ipynb
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
mark-html
efficient Markdown to HTML parser written in Rust
-
marisa-rs
Safe Rust wrapper for the marisa-trie C++ library
-
k-lighter
syntax highlighter for K language
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
mdbook-mermaid-animate
mdbook preprocessor to add mermaid animation support, before mdbook-mermaid preprocessor
-
mq-docs
Documentation generator for mq functions and macros
-
go-brrr
Token-efficient code analysis for LLMs - Rust implementation
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
rhema_ai_chirho
AI integration: LLM providers, embeddings, vector search, query expansion
-
korrektor
work with Uzbek language text processing
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
typos
Source Code Spelling Correction
-
indent_tokenizer
Generate tokens based on indentation
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
lformat
Clone of Lua
string.formatin Rust based on Cs(n)printf -
dimgrep
command-line text search tool written in Rust, based on the minigrep example from The Rust Programming Language
-
scie
research about how to build simple code identify engine for different languages
-
mandate
Convert Markdown or YAML manuals into roff manpages
-
jxoesneon-tectonic-cfg_support
Internal crate for the Tectonic project: tectonic_cfg_support
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
kanji_hanzi_converter
Translate Japanese Kanji to Chinese Hanzi
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
iata_bcbp
IATA BCBP Parser in Rust Based on Resolution 792
-
json2markdown
convert JSON data into well-structured Markdown format
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
inline_flexstr
copy/clone-efficient inline string type for Rust
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
formatjson
Formats JSON files
-
asimov-x-module
ASIMOV module
-
supercat
A syntax highlighting alternative to cat
-
hashmoji_generator
Code generation tool for hashmoji
-
mecab-ko-core
한국어 형태소 분석 핵심 엔진 - Lattice, Viterbi, 토크나이저
-
utf8-supported
Determine the UTF-8 support of the current locale
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
deck
A command line tool to generate HTML presentations from Markdown documents
-
grep-matcher
A trait for regular expressions, with a focus on line oriented search
-
krilla-svg
Converting SVG files to PDF
-
md-parser-kma
parser that converts basic Markdown syntax (headers, bold, italic, lists) into corresponding HTML tags
-
ps-str
String transcoding library
-
syntax-html
cli application for generating syntax-highlighted HTML of code files
-
pcre2
High level wrapper library for PCRE2
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
code-agent
AI-Native Code Assistant Library
-
gret
command line tool to search for patterns and show matches in a tree structure
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
u8char
An alternative to the char type that uses UTF-8 as its in-memory representation
-
regexgrep
ripgrep tool that suports regular expressions
-
rigrep
grep from Rust Book
-
aistack
Functional text-to-function AI utilities
-
vizaminigrep
search text
-
utf8-command
UTF-8 encoded
std::process::Commandoutput -
textframe
query plain text documents by unicode offset without loading them all into memory
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
lindera-filter
Character and token filters for Lindera
-
elizaos-plugin-local-embedding
Local text embedding and tokenization plugin for elizaOS - Rust implementation
-
batuta-common
Shared utilities for the Batuta stack: formatting, system info, display helpers
-
lindera-decompress
A morphological analysis library
-
notion2md
converting Notion pages to Markdown
-
regex_quote_fixer
Rewrites grep regexpressions for the use in the regex crate
-
termwrap
Wrap Unicode text with ANSI color codes
-
markovish
Markov chain implementation for text generation
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
slugi
slugify text (e.g., 'Café au lait' → 'cafe_au_lait')
-
endf_parser
parsing ENDF-6 format nuclear data
-
unflow
DSL to convert design to code
-
toonconv
CLI tool for converting JSON to TOON (Token-Oriented Object Notation) format
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
ai-context-gen
A context generator for Rust repositories that creates structured markdown files with relevant information for LLMs and AI agents
-
lingua-persian-language-model
The Persian language model for Lingua, an accurate natural language detection library
-
dala
Dalia is a light weight formula language
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
stringsort
Pathological sorting of string characters
-
magic_string
magic string
-
fast-str
A flexible, easy-to-use, immutable, efficient
Stringreplacement for Rust -
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
aki-json-pick
The json pick out command
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
presciidoc
Preprocessing AsciiDoc for other tools
-
unitoken
Fast BPE tokenizer/trainer with a Rust core and Python bindings
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
synaptic-pdf
PDF document loader for Synaptic
-
anno-core
Core types and invariants for anno
-
good-morning
A welcome message generator
-
three-dcf-core
Document-to-dataset encoding library for LLM training data preparation. Converts PDFs, Markdown, HTML into structured formats optimized for machine learning.
-
grep-search
search query in contents lib
-
bpmf_py
A Bopomofo and Pinyin library
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
catatau
A TUI EPUB reader with fuzzy search and navigation
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
k-edit-distance
A basic edit distance implementation for korean characters
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
dekor
styling and character repository in Rust
-
codetypo-cli
Source Code Spelling Correction
-
poly_doctest
Polyglot documentation snippet test generator - extract code snippets from docs and generate tests
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
lingua-georgian-language-model
The Georgian language model for Lingua, an accurate natural language detection library
-
pdf-create
low-level, strongly-typed PDF creation library
-
symscan
Fast discovery of similar strings in bulk
-
Ds_minigrep
grep implementation from The Rust Programing Book
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
milligrep
Custom simplified implementation of grep
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
syntaxdot-tokenizers
Subword tokenizers
-
llm-tui
A Terminal User Interface (TUI) for interacting with Language Learning Models (LLM) using llm-cli
-
text-fx
A collection of text processing utilities for Rust
-
rust_nickname_generater
that generates user/nick names based on the rust language
-
llmtext
Turns any website into a single LLM-ready markdown file
-
hayro-svg
Converting PDF pages into SVG files
-
tform
format plain text into well-structured Markdown or HTML
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
rusty_regex
A minimalistic regex engine using the pipeline: Regex -> AST -> NFA -> DFA -> Match(String)
-
encoded-words
Encoded Words for usage in MIME headers
-
segtok
Sentence segmentation and word tokenization tools
-
text2art
translate text to ascii art
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
contractions
expand contractions in English
-
actiondb
A safe and efficient unstructured text (log) parsing library
-
quillmark-core
Core types and functionality for Quillmark
-
jxoesneon-tectonic-dep_support
Internal crate for the Tectonic project: tectonic_dep_support
-
morc
Dead simple, minimal markdown generator library written in Rust
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
chisel-lexers
Chisel backend lexers/scanners
-
addbib
An app to add linked bibliographies to markdown files
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
arabic_text_utils
Arabic text processing and manipulation
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
toresy
term rewriting system based on tokenization
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
strloin
copy on write slices of a string
-
ghimlink
Generates a Github-Flavoured-Markdown formatted image link to an image in the same repository
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
wordnik_list
Quick, local library to check valid words within the Wordnik word list
-
mq-lsp
Language Server Protocol implementation for mq query language
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
rhema_ingest_chirho
SWORD/OSIS/TEI/IMP importers and normalization into canonical corpus
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
ox_content_og_image
OG image generation for Ox Content
-
tokenizers-enfer
today's most used tokenizers, with a focus on performances and versatility
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
parattice
Recursive paraphrase lattice generator
-
markdown-to-html
Markdown parser that runs at hyper speeds!
-
marcus
An experimental Markdown parser written in Rust
-
aster-mcp
Aster - An AI agent framework
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
spongedown
Converts markdown to html with svgbob support
-
mitex-spec
Specification Library for MiTeX
-
tiny-ver
tiny version parser
-
hulk
An ultra simple no hassle static site generator
-
xssan
that sanitizes HTML to prevent XSS vulnerabilities
-
markdown-tables
generating markdown-formatted tables
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
hexify
Format hex
-
rins_markdown_parser
markdown parser written on Rust
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
screen_printer
displaying rectangular blocks of text to a terminal
-
rhtml
Html Utils
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
mecrab-word2vec
High-performance Word2Vec implementation with Hogwild! parallelization for MeCrab
-
portmanteau
create portmanteaux
-
jxoesneon-tectonic-xdv
Internal crate for the Tectonic project: tectonic_xdv
-
write-html
writing HTML in Rust
-
blz-cli
CLI for blz – fast local llms.txt search
-
unicodeit-cli
The command line interface to unicodeit
-
unicodeit
Converts LaTeX to Unicode (rust port)
-
lindera-python
A morphological analysis libraries and command line interface
-
spezilinter
spezifisch's linter for different file formats, linting for weirdly specific stuff
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
ik-mini-gen
Minimal async IK to EPUB/HTML/PDF downloader | Extremely minimal
-
bito-lint-core
Core library for bito-lint
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
asimov-readwise-module
ASIMOV module
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
braille_pics
producing text-art pictures using Braille characters
-
heyiwei
何意味
-
dtxt-detect
dangerous text detection, optimized for high speeds
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
kanjiwrapper
Wrapper for kanjiapi.dev
-
termbook
behind the
termbook-cli -
lingua-punjabi-language-model
The Punjabi language model for Lingua, an accurate natural language detection library
-
p101_enc
convert Olivetti P101 program to and from different encodings
-
varcon-core
Varcon-relevant data structures
-
llm-shield-scanners
Security scanners for LLM Shield toolkit
-
term-rustdoc
A TUI for Rust docs
-
mdbook-core
The base support library for mdbook, intended for internal use only
-
telegram-escape
Escape text for Telegram's MarkdownV2 format
-
unimorph-core
Core library for UniMorph morphological data
-
terraphim-markdown-parser
Terraphim Markdown Parser
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
ron_to_table
pretty print RON as a table
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
roxy_cli
A command-line static site generator
-
mdbook-diataxis
A preprocessor for mdBook which helps apply the Diátaxis documentation framework
-
mini-openai
An OpenAI API client with minimal dependencies
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
hsk
Return HSK Level for Simplified Chinese Characters
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
tattoy-wezterm-char-props
Unicode and other character properties
-
rust-md2html
CLI to convert Markdown to HTML with file watching
-
ahtml-from-markdown
Convert Markdown to ahtml HTML element trees
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
tectonic_pdf_io
Xdvipdfmx’s PDF, XDV, and image I/O APIs in C, as a Rust crate
-
cw
Count Words, a fast wc clone
-
mini__grep
grep tool
-
izihawa-tantivy-tokenizer-api
Tokenizer API of tantivy
-
rusty-grep
RustyGrep is a command-line tool written in Rust that allows users to search for patterns in files
-
asimov-dataset-cli
ASIMOV Dataset Command-Line Interface (CLI)
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
letrs-fonts
A collection of FIGlet fonts
-
grepmini
grep-like command line tool written in Rust to search for text patterns in files. Built following the Rust book's example.
-
grepster
command-line tool for searching text in files
-
pygmentize
wrapper for syntax highlighting
-
text-sanitizer
convert text to plain ASCII text
-
lindera-cli
A morphological analysis CLI
-
jxoesneon-tectonic-engine_bibtex
Internal crate for the Tectonic project: tectonic_engine_bibtex
-
mdbook-translation
prepare multi-lingual mdBook books
-
minigrep_xyz1
For finding strings in text
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
lex-babel
Format conversion library for the lex format
-
gmi2html
Convert text/gemini into HTML
-
jp_utils
Utils for working with Japanese text
-
betacode
conversion
-
typub-adapter-static
Static HTML adapter for typub - generates standalone deployable HTML files
-
base64-lib
A base64 encoder / decoder
-
rexis-rag
Rexis RAG - High-performance Retrieval-Augmented Generation framework with memory-first agents, vector search, and LLM integration
-
ox_content_vite
Vite Environment API integration for Ox Content
-
squ
command-line utility for converting quotation marks in plaintext files to "smart quotes"
-
string-overlap
A helper crate for "layering" ASCII art
-
common-words-all
Most common words sorted by ngram frequency
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
aglow
Render Markdown beautifully in the terminal 🔥
-
goya
morphological analyzer for Rust and WebAssembly
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
lindera-compress
A morphological analysis library
-
encoding-index-japanese
Index tables for Japanese character encodings
-
warrah
command-line utility and Rust library that sloppily removes code comments from a text file, supporting 60+ programming languages
-
latex-to-html
Latex to html converter
-
castle_tokenizer
Castle Tokenizer: tokenizer
-
xsv
A high performance CSV command line toolkit
-
chisel-parsers
Chisel parser front ends
-
ced
Dead easy csv editor
-
synthphonia-rs
DryadSynth solver for string transformation programs
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
tectonic_bridge_graphite2
Expose the graphite2 library to Rust/Cargo
-
mecab-ko-dict-validator
한국어 형태소 사전 검증 도구 - CSV 형식 검증, 품사 체계 검사
-
mq-view
CLI tool for rendering Markdown documents with syntax highlighting and rich text formatting
-
ox_content_ssg
Static Site Generation for Ox Content documentation
-
pray
A tui tool for preparing a prompt to the llms
-
mq-dap
Debug Adapter Protocol implementation for mq
-
repa
Peak Performance Pattern Seeker
-
nerd_font
Enumerated contents of the JetBrainsMono Nerd Font
-
lingua-serbian-language-model
The Serbian language model for Lingua, an accurate natural language detection library
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
asciidork-opts
Asciidork opts
-
suffix
arrays
-
yamc
Markdown Converter - A fast and feature-rich markdown to HTML and PDF converter
-
rure
A C API for Rust's regular expression library
-
lignin-html
HTML renderer for lignin VDOM Nodes. This crate is primarily for static and server-side rendering. For client-side use, see lignin-dom.
-
htmlfixinator
A composable HTML transformation library with filters for cleaning, modifying, and standardizing HTML content
-
promptune
A tiny tool to auto-augment prompts programmatically
-
chunk_norris
splitting large text into smaller batches for LLM input
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
asimov-maildir-module
ASIMOV module for Maildir email import
-
unicode-bidi
Unicode Bidirectional Algorithm
-
html2runes
An HTML to Text converter
-
cutters
Rule based sentence segmentation library
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
mutf8
Basic support for mutf8 strings
-
jxoesneon-tectonic-errors
Internal crate for the Tectonic project: tectonic_errors
-
regex-split
split_inclusive for the regex crate
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
encoding_c_mem
C API for encoding_rs::mem
-
lingo
detect text languages
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
tiny-clean
A lightweight, high-performance string sanitizer with configurable rules
-
validated-slice
Helper macros to implement std traits for custom validated slice types
-
strip-prefix-suffix-sane
A small utility crate providing ‘sane’
strip_prefixandstrip_suffixmethods forstrthat returnselfinstead of anOptionwhen the prefix/suffix is not found -
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
substr-iterator
Substring extractor based on characters without allocation