-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
similar
A diff library for Rust
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
const_format
Compile-time string formatting
-
tabled
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
ascii
ASCII-only equivalents to
char
,str
andString
-
unicase
A case-insensitive wrapper around strings
-
comrak
A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter
-
convert_case
Convert strings into any case
-
lazy-regex
lazy static regular expressions checked at compile time
-
pulldown-cmark
A pull parser for CommonMark
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
termimad
Markdown Renderer for the Terminal
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
lopdf
PDF document manipulation
-
html2text
Render HTML as plain text
-
diff
An LCS based slice and string diffing implementation
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
fuzzy-matcher
Fuzzy Matching Library
-
regress
A regular expression engine targeting EcmaScript syntax
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
sd
An intuitive find & replace CLI
-
regex-syntax
A regular expression parser
-
chardetng
A character encoding detector for legacy Web content
-
charabia
detect the language, tokenize the text and normalize the tokens
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
mdxjs
Compile MDX to JavaScript in Rust
-
prettydiff
Side-by-side diff for two files
-
printpdf
writing PDF files
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
mdbook
Creates a book from markdown files
-
smartcat
Putting a brain behind
cat
. CLI interface to bring language models in the Unix ecosystem 🐈⬛ -
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
shell-escape
Escape characters that may have a special meaning in a shell
-
unicode-bidi
Unicode Bidirectional Algorithm
-
const-str
compile-time string operations
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
synoptic
low-level, syntax highlighting library with unicode support
-
garde
Validation library
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
slug
Convert a unicode string to a slug
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
lindera
A morphological analysis library
-
textdistance
Lots of algorithms to compare how similar two sequences are
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
text_io
really simple to use panicking input functions
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
cargo-spellcheck
Checks all doc comments for spelling mistakes
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
hgrep
grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
str_indices
Count and convert between indexing schemes on string slices
-
text-size
Newtypes for text offsets
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
pad
padding strings at runtime
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
unindent
Remove a column of leading whitespace from a string
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
epub-builder
generating EPUB files
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
languagetool-rust
LanguageTool API bindings in Rust
-
hck
A sharp cut(1) clone
-
ricat
A Rust-Based implemenation of classic UNIX
cat
command -
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
cuid
An ipmlementation of CUID protocol in rust
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
roff
ROFF (man page format) generation library
-
create_broken_files
Create broken files from other ones
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
focaccia
no_std implementation of Unicode case folding comparisons
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
vesti
A preprocessor that compiles into LaTeX
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
mktoc
Generate Table of Contents from Markdown files
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
rapidfuzz
rapid fuzzy string matching library
-
igrepper
The interactive grepper
-
substring
method for string types
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
font-types
Scalar types used in fonts
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
norad
Read and write Unified Font Object files
-
entities
raw data needed to convert to and from HTML entities
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
cow-utils
Copy-on-write string utilities for Rust
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
stam
powerful library for dealing with stand-off annotations on text
-
percent-encoding
Percent encoding and decoding
-
difflib
Port of Python's difflib library to Rust
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
byteyarn
hyper-compact strings
-
chewing
(酷音) intelligent Zhuyin input method
-
any_ascii
Unicode to ASCII transliteration
-
pdf-extract
extract content from pdfs
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
pdf_composer
Create PDF documents from YAML Front Matter source documents
-
lindera-dictionary
A Japanese morphological dictionary
-
hypher
separates words into syllables
-
llmvm-core
The core application for llmvm
-
frawk
an efficient Awk-like language
-
ra_ap_test_utils
TBD
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
r4d
Text oriented macro processor
-
yffi
Bindings for the Yrs native C foreign function interface
-
rustyline-async
A minimal readline with multiline and async support
-
uclanr
A random word picker that gives you actually useful words
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
tauri-plugin-clipboard
A clipboard plugin for Tauri that supports text, files and image, as well as clipboard update listening
-
tantivy-analysis-contrib
A set of analysis components for Tantivy
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
glyph_brush_layout
Text layout for ab_glyph
-
jayce
tokenizer 🌌
-
regex-literal
delimited regular expression literals
-
mdcat
cat for markdown: Show markdown documents in terminals
-
asciidork-cli
Asciidork CLI
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
hyperlink
Very fast link checker for CI
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
substudy
Language-learning tools for working with parallel, bilingual subtitles and media files
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
mdbook-quiz
Interactive quizzes for your mdBook
-
ruke
A dead-simple automation tool. Inspired by Makefile and Justfile.
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
rascii_art
Advanced ASCII Art Generator
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
line-span
Find line ranges and jump between next and previous lines
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
mdbook-epub
An EPUB renderer for mdbook
-
regex-cursor
regex fork that can search discontiguous haystacks
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
savvy
R extension interface
-
ncount
A word count tool intended to derive useful stats from markdown
-
fm
Non-backtracking fuzzy text matcher
-
prop-check-rs
A Property-based testing Library in Rust
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
ferris-says
flavored replacement for the classic cowsay
-
near-facsimile
Find similar or identical text files in a directory
-
emojic
Emoji constants
-
quixote
Quizzes and tests in Markdown
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
colonnade
format tabular data for display
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
precis-profiles
PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords as defined in rfc8265; and Nicknames as defined in rfc8266
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
scraps
A static site generator that builds a wiki from a set of markdown files. Inspired by https://scrapbox.io/.
-
lorem-ipsum
Quickly generate placeholder test
-
mini_markdown
Small markdown renderer
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
os_display
Display strings in a safe platform-appropriate way
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
ultra-nlp
A NLP library
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
arrow-row
Arrow row format
-
reason-shell
Reason: A Shell for Research Papers
-
text_analysis
Analyze text stored as *.txt in provided file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words)…
-
lgtmeow
🐾 —— 「本喵觉得很不错~」
-
arf-strings
Encoding and decoding for ARF strings
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
qpdf
Rust bindings to QPDF C++ library
-
rslint_errors
Pretty error reporting library based on codespan-reporting built for the RSLint project
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
vi
An input method library for vietnamese IME
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
zalgo-codec
converting an ASCII text string to a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
sttx
belt for transforming speech-to-text data
-
autotex
Continuously compile TeX and LaTeX
-
tremor-kv
A logstash inspured key value extractor
-
mdopen
Preview markdown files in a browser
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
opml
OPML library for Rust
-
kathoey
text feminization using open corpus linguistics data
-
instant-segment
Fast English word segmentation
-
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
betacode
conversion
-
repgrep
An interactive command line replacer for
ripgrep
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
clipcount
Counting words from the clipboard content
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
yara-x-parser
A parsing library for YARA rules
-
zspell
Native Rust library for spellchecking
-
unidown
Convert Markdown to Unicode
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
ident_case
applying case rules to Rust identifiers
-
whitespacers
A whitespace JIT-compiler, for when your whitespace just isn't executing fast enough
-
unicode-ccc
Unicode Canonical Combining Class detection
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
uwc
Counts things in unicode text files
-
typed-oid
Typed Object IDs
-
mix
markdown compiler with 🚀 fuel
-
cindex
CSV indexing library
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
codespan_preprocessed
Beautiful diagnostic reporting for M4 (or cpp) preprocessed text files
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
docket
markdown to HTML documentation rendering
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
decasify
A CLI utility and library to cast strings to title-case according to locale specific style guides including Turkish support
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
whitespace-sifter
Sift duplicate whitespaces away!
-
array_tool
Helper methods for processing collections
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
asciigraph
Ascii Graph
-
ib-pinyin
一个高性能拼音匹配库
-
oxford_join
Join string slices with Oxford Commas!
-
galm
pattern matching library
-
svgbob
Transform your ascii diagrams into happy little SVG
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ
,ŋ
andê
. -
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
mdbook-pandoc
A mdbook backend that outsources most of the rendering process to pandoc
-
diffy
Tools for finding and manipulating differences between files
-
jmemo
CUI tool for maintaining memos
-
two_percent
Fuzzy Finder in rust!
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
utf8-io
Traits and types for UTF-8 I/O
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
bogrep
Full-text search for bookmarks from multiple browsers
-
moto
motivated automation
-
arabic_reshaper
Reconstruct Arabic sentences to be used in applications that doesn't support Arabic script
-
dsntk-recognizer
Decision Toolkit | Decision table recognizer
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
skyspell
Fast and handy spell checker for the command line
-
inspector-gadget
A cli-based, multi-architecture gadget-finding tool, designed for fast operation, even with large binaries like browser engines and OS kernels
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
vaporetto
pointwise prediction based tokenizer
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
console_static_text
Logging for text that should stay in the same place in a console
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
egui-dropdown
An actual dropdown list for egui
-
somedoc
A very simple document model and markup generator
-
mdbook-tools
A collection of tools for mdbook
-
aki-mcolor
mark up text with color
-
kanjidic_server
A REST API server for Kanjidic and EDRDG radical decompositions
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
text2num
Parse and convert numbers written in English, Spanish or French into their digit representation
-
evcxr
An Evaluation Context for Rust
-
nom-grapheme-clusters
Adapter that allows nom to account for unicode grapheme clusters
-
ik-rs
chinese segment, ik-analyzer for rust
-
select
extract useful data from HTML documents, suitable for web scraping
-
mdbook-tailor
mdbook preprocessor for image-tailor
-
hangeul
Korean alphabet manipulation library
-
gene
providing a log matching framework written in Rust
-
chocodye
changing the color of the chocobos' plumage in Final Fantasy XIV
-
stream-rate-limiter
A rate limiter for Tokio streams
-
pager
pipe your output through an external pager
-
advreader
Split text file into text sequences, strings and (line) comments
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
markov_namegen
Random text generators based on Markov models
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
lexical-sort
Sort Unicode strings lexically
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
domrs
Document builder and serializer
-
mdbook-reference-table
mdBook preprocessor to create reference tables
-
asmfmt
A formatter designed for programs in assembly language with AT&T syntax
-
readability
Port of arc90's readability project to rust
-
mdbook-angular
mdbook renderer to run angular code samples
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
simple_peg
A command-line peg parser implemented by Rust
-
stringsext
find multi-byte-encoded strings in binary data
-
zipcodes
Query US zipcodes without SQLite
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
pluralizer
Rust package to pluralize or singularize any word based on a count inspired on pluralize NPM package
-
grepox
Minimalist's grep written in Rust
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
stop-words
Common stop words in many languages
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
inline_colorization
format!("Lets the user {color_red}colorize{color_reset} and {style_underline}style the output{style_reset} text using inline variables");
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
txt_to_md
Command converting from a txt file to a markdown file
-
words-count
Count the words and characters, with or without whitespaces
-
marker
finding issues in CommonMark documents
-
mkwebsite
build websites using markdown
-
unicode-reverse
Unicode-aware in-place string reversal
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath
. -
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
uwu_cli
uwuifying the terminal
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
relog
Strong String Normalization
-
multimarkov
generic tool for training and using multi-order Markov chains for procedural generation applications such as generating randomized but authentic-sounding names for characters and locations
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
cbfr
A buffer that run on stack, focusing on performance and speed
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
candle_embed
Text embeddings with Candle. Fast and configurable. Use any model from Hugging Face. CUDA or CPU powered.
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
gspell
Rust bindings for gspell
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
mdi
markdown include
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
scanlex
lexical scanner for parsing text into tokens
-
linebreak
breaking a given text into lines within a specified width
-
dekor
styling and character repository in Rust
-
nu-utils
Nushell utility functions
-
line-straddler
Determine how lines through text (underlines, strikethroughs, etc) should be rendered
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
quranize
Encoding transliterations into Quran forms
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
regex_generate
Use regular expressions to generate text
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
orly
Download O'Reilly books as EPUB
-
ssbm_utils
A collection of useful tools for working with data from Super Smash Bros Melee
-
dcsv
Dyanmic csv reader,writer,editor
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
secular
No Diacr!
-
palmdoc-compression
Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77
-
html-auto-p
function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
yozuk
Chatbot for Programmers
-
char-ranges
Iterate chars and their start and end byte positions
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
mudder
Generate lexicographically-spaced strings between two strings from pre-defined alphabets
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
indent
Functions for indenting multiline strings
-
filecheck
writing tests for utilities that read text files and produce text output
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
choco
markup language for dialogue systems
-
stylin
Convert markdown to pandoc markdown with custom styles
-
align
aligning text
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
twilight-mention
working with mentions in the Twilight ecosystem
-
dnd_spellbook_maker
making pdf documents of spells that ressemble 5th edition D&D official source book spell descriptions
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
hr-id
A human-readable unique ID which supports Unicode, safe for URLs and file paths
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
crlify
A std::io::Write wrapper that replaces with on Windows
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
text-to-ascii-art
program to convert text to ASCII art
-
image-to-ascii
Converts images and gifs to ascii art
-
flatpage
file system based markdown flat page
-
docx-rust
parsing and generating docx files
-
malachi
A domain specific pattern matching language made for defining bot commands
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
indented_text_writer
IndentedTextWriter
-
miHoYo-API
A API-Wrapper of miHoYo games in Rust
-
hello-phext
A rust-native implementation of phext
-
text_layout
Text layout algorithms
-
minigrep_elijahkx
MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
xi-rope
A generic rope data structure built on top of B-Trees
-
metatron
Metatron library
-
numeric-sort
A zero-allocation, human-readable sorting library
-
human_regex
A regex library for humans
-
sbert
Sentence Bert (SBert)
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
roe
Unicode case conversion
-
chisel-lexers
Chisel backend lexers/scanners
-
linoleum
but ergonomic line editor
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
cloc
Count, or compute differences of, lines of source code and comments
-
alien_temple
This program is so hyperspecific to me that I'm not going to even try to make it work for others
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
utf16_lit
macro_rules to make utf-16 literals
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
tex-fmt
LaTeX formatter written in Rust
-
mdtrans
Markdown parser and transformer using
pest.rs
, focused on flexibility to a project’s needs -
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
thesaurus
An offline thesaurus library for Rust
-
slicestring
slicing Strings
-
mdbook-indexing
mdbook preprocessor for index generation
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
screen_printer
displaying rectangular blocks of text to a terminal
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
hyphertool
Hypertool is a command-line tool for syllabification and hyphenisation
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
catalog-of-markdown
Generate the catalog of markdown file
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
kelp
A convert tool for Japanese
-
easy_random
Generate random data easily with easy_random :)
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
etch
Not just a text formatter, don't mark it down, etch it
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
wa
WebAssembly and Rust utility library
-
mdbook-keeper
An improved testing experience for mdbook
-
streampager
pager for command output or large files
-
term-rustdoc
A TUI for Rust docs
-
minify-html-common
Common code and data for minify-html*
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
lsp-textdocument
A LSP text documents manager that map of text document
-
kataru
YAML-based dialogue engine for game development
-
loc
Count lines of code (cloc) fast
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
moon-phases
Fast command-line application to show the moon phase
-
gosyn
Golang syntax parser in Rust
-
asciimath-unicode
Convert asciimath to unicode
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
syllabize-es
Syllabize Spanish text, and much more
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
reword
some utility functions for human-readable formatting of words
-
gregex
Regex solver utilizing NFA
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
collclean
Clean up collaboration commands in LaTeX files
-
text_trees
textual output for tree-like structures
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
frida-build
Rust bindings for Frida
-
sayit
String replacements using regex
-
lister-cli
Lister: Navigate Markdown Lists
-
dtex
Better TeX
-
tre-regex
Rust safe bindings to the TRE regex module
-
sanitize-filename-reader-friendly
A filename sanitizer aiming to produce reader friendly filenames
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
mdbook-collector
A mdbook preprocessor which collects json/yaml/toml data from each post and builds a json file of it
-
asciit
A compact and visually appealing ASCII table for your terminal, featuring colored numbers and letters
-
chord3
Create pdf songbooks from chopro source
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
hmd
Custom Markdown Engine for my personal blog
-
symspell
Spelling correction & Fuzzy search
-
intuicio-data
Data module for Intuicio scripting platform
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
koji
An interactive CLI for creating conventional commits
-
ncase
Enforce a case style
-
lisho
personal link shortener with no external dependencies in under 200 lines of Rust
-
timeblok
A language for event scheduling in plain text
-
lowcharts
draw low-resolution graphs in terminal
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
fast_aug
Fast data augmentation for text
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
rust_baht_text
Convert number to Thai Baht text
-
unicode_categories
Query Unicode category membership for chars
-
strloin
copy on write slices of a string
-
rep-grep
wgrep/write-grep CLI
-
tectonic_engine_bibtex
The
bibtex
program as a reusable crate -
pandoc_types
Rust port of pandoc-types
-
wikipedia_prosesize
Count Wikipedia prose size
-
squishyid
Shorten and obfuscate IDs
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
jcalendar
Japanese Calendar for Rust
-
amongify
A very ඞ sus ඞ program
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
man
Generate structured man pages
-
hi-doc
Diagnostics formatting library
-
regex_mutator
The Nautilus regex_mutator
-
stylish-html
stylish
helpers for writing styles as HTML elements -
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
interslavic
in rust
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
command-error
Detailed error messages and status checking for
std::process::Command
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
rustpython-sre_engine
A low-level implementation of Python's SRE regex engine
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
termdiff
Write a diff with color codes to a string
-
schmfy
Schmfication library
-
akai
Next-generation, functional and simple ID system
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
readability-text-cleanup
cleaning up text from HTML using Mozilla's Readability algorithm
-
rtlicious
A nom-based parser for Yosys RTLIL files
-
soundex-rs
that calculates soundex value
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
unfurl
expanding links in text
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
extract-words
Extracts words from text without allocation
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
hh-highligh
hh: The text highlighter
-
dnd-character
A Dungeons and Dragons character generator
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
plsfix
Text cleaner upper
-
fmtt
A stupid text formatter that tries to break lines on sensible punctuations
-
bge
Rust interface for BGE Small English Embedding Library
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
mdbook-fs-summary
Summary generator for mdbook
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
bpmf_py
A Bopomofo and Pinyin library
-
unicode-casing
Titlecase helper function on characters
-
unfuck
Python 2.7 bytecode deobfuscator
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
string_wizard
manipulate string like wizards
-
detect-newline-style
Determine a string's preferred newline character
-
aki-mcycle
mark up text with cycling color
-
heatseeker
A fast, robust, and portable fuzzy finder
-
dsntk-gendoc
Decision Toolkit | Documentation generator
-
gulagcleaner_rs
Ad removal tool for PDFs
-
gix-utils
gitoxide
utilities that don’t need feature toggles -
cgi2
Create CGI programmes
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
heckcheck
A heckin small test case generator
-
matchers
Regex matching on character and byte streams
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
khat
A cat clone, nothing more nothing less
-
texcore
Create LaTeX documents using native Rust types
-
wkhtmlapp
Convert html to pdf or image
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
compute-file-server-cli
Uploads files to Fastly for serving directly from within Fastly Compute applications. Upload any type of file: images, text, video etc and serve directly from Fastly. It is ideal for…
-
case
A set of letter case string helpers
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
dptran
run DeepL translations on command line written by Rust
-
slugify
Macro for flexible slug generation
-
terraphim-markdown-parser
Terraphim Markdown Parser
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
uwu-rs
uwuifying library
-
speech-synthesis
Common traits for speech synthesis
-
dispnet-hash
Hash Library used for Dispnet
-
formatjson
Formats JSON files
-
rust-texas
generate latex documents
-
mitex-spec
Specification Library for MiTeX
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
pprint
Flexible and lightweight pretty printing library for Rust
-
anslatortray
translate from English to Pig Latin!
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
any2utf8
Convert any enncoding to utf-8
-
re_ui
Rerun GUI theme and helpers, built around egui
-
rusk
a Specification Language
-
newbase60
that implements Tantek Çelik's New Base 60
-
shortguid
Short URL-safe Base64 encoded UUIDs
-
dingtalk
Robot Util, Send text/markdown/link messages using DingTalk robot, 钉钉机器人
-
dodo-zh
manipulate the cedict dictionary and pinyin
-
slugify-rs
generate slugs from strings
-
easy-regex
Make long regular expressions like pseudocodes
-
notan_glyph
glyph's support for Notan
-
markov
A generic markov chain implementation in Rust
-
tracery
Text-expansion library
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
recursive-file-loader
recursively load files via references in the files
-
sedregex
Sed-like regex library
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
invisible_unicode
finding invisible unicode characters
-
aki-gsub
substitude text command, replace via regex
-
webreg
A CLI tool for testing regexes against web pages
-
quake_text
handling Quake strings and characters
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
runanum
Существительные с правильными окончаниями после чисел
-
slack_update
app to set Slack status, emoji and photo
-
typst-ansi-hl
highlights your Typst code using ANSI escape sequences
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
rasciigraph
function to plot ascii graphs
-
wz
Count words, fast
-
imatree
command line tool to generate aesthetically pleasing (and sometimes not so pleasing) images of whatever string you want
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
regex-cli-test
A command line tool for testing against regex patterns
-
indentation
Formatter
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
syllarust
quickly counting syllables
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
normalize-hebrew-rs
package that normalizes special symbols within Hebrew string used in the Qumran-Digital project
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
presenterm
A terminal slideshow presentation tool
-
text_manipulation_rs
generating random placeholder text in different languages
-
tibco_ems-sys
Rust bindings for the Tibco EMS C library
-
gdnative-doc
Documentation tool for gdnative
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
morph-rs
Dictionary Morphologizer for Russian language
-
opt2doc
generate documentation from Rust source code
-
md-localizer
Localize markdown with remote links
-
kasedenv
Read environment variables by lower, upper case or case-insensitive keys
-
jp_utils
Utils for working with Japanese text
-
cellumina
configure and run cellular automata
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
manchu-converter
Converts transcripted Manchu text to Manchu script with Manchu alphabet
-
regex-chunker
Iterate over the data in a
Read
type in a regular-expression-delimited way -
aki-xtee
copy standard input to each files and standard output
-
redpatterns
a list of patterns for scanners 📟
-
ini_puga
manage INI files
-
gret
command line tool to search for patterns and show matches in a tree structure
-
vibrato
viterbi-based accelerated tokenizer
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
mdbook-bib
mdbook plugin allowing to load and present a bibliography in BibLaTex format in your books and cite its references
-
bureau
extracting dependency licenses, used by nertsio
-
text_maid
Text maid
-
correct_word
A No brainer 'did you mean' library for Rust
-
tectonic_io_base
Basic types for Tectonic's pluggable I/O backend system
-
ryaspeller
lib for searching typos in text, files and websites
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
uniquewords-rs
Count the frequencies of words in text file(s)
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
bubble-bath
Small and quick HTML sanitizer
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
minigrep_panda
just a test
-
shq
sh quoting tool and library
-
pdf_encoding
Font related encodings
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
cfasttext-sys
fastText ffi binding
-
prefix
A customizable pretty printer for FIX messages
-
rs-wordcloud
☁ Flexible WordCloud implementation for ⚙Rust
-
text2art
translate text to ascii art
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
remove-markdown-links
Turns
[example](https://example.com)
intoexample
. That’s it -
markdown-gen
generating Markdown files
-
uulm-sopra23-group03-core
Core library for implementing Battle of the Centerländ
-
read_chars
An iterator over characters read from some I/O source
-
cyrconv
A funny faux cyrillic character mapper
-
rscx-mdx
MDX support for RSCx
-
dmntk-gendoc
DMNTK | Documentation generator
-
stardict_wrapper
Rust implement stardict
-
groop
Outline-aware grep with syntax highlighting
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
proc-macro-regex
A proc macro regex library
-
genpdf
User-friendly PDF generator written in pure Rust
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
wit_owo
Finally, an adequate library for wit.ai in Rust
-
rsnltk
Rust-based Natural Language Toolkit
-
unidoc
Unite all Markdown
-
csi
processing a directory of text files. Particularly useful for building static websites.
-
minbpe
Port of Andrej Karpathy's minbpe to Rust
-
spoilers
High-level Rust bindings for CTranslate2
-
stylish-stringlike
API for string-like objects that have styles applied
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the
mantra
framework -
chardet
rust version of chardet
-
dmos
Djot HTML renderer with advanced features
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
distrs
PDF, CDF, and percent-point/quantile functions for the normal and Student’s t distributions
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
string_manip_rust
Demo of managing projects
-
r-matrix
Rust port of cmatrix
-
path2regex
Express style path to RegExp utility
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
patterns
no_std compatible pattern scan library using simd
-
mdbook-tagger
A mdbook preprocessor to collect and build tag data in Json format
-
ansi-width
Calculate the width of a string when printed to the terminal
-
jawk
JSON AWK
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
RustMqtt
Mqtt Encoding and decoding package
-
mdbook-chapter-list
mdbook preprocessor for listing subchapters
-
markdown-extract
Extract sections of a markdown file
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
re_space_view_text_document
space view that shows a single text box
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
clima
A minimal Markdown reader in the terminal
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
flowistry_ide
Information Flow in the IDE for Rust
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
poppler
Wrapper for the GPL-licensed Poppler PDF rendering library
-
pullup
Convert between markup formats
-
keyvalues-parser
A parser/renderer for vdf text
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
refac
Transform some text given a generic natural language prompt
-
indent_write
Write adapters to add line indentation
-
pdf_seekers
parser and information extractor from PDF documents based on keyword search functionality (powered by Rust)
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
CFDTCP
Short text clustering for determining center points and correlation judgment [one-hot,Jenks Natural Breaks]
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
notedown_ast
Notedown Abstract Syntax Tree
-
presciidoc
Preprocessing AsciiDoc for other tools
-
lindera-cli
A morphological analysis command line interface
-
enum-ts
TypeScript Enum pattern matcher codegen
-
hitori
Generic compile-time regular expressions
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
minigrepzzz
An application from the documentation
-
didaskalia
over-complicated language to store text entries and sequences across different languages
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
seshat-unicode
A Unicode Library for Rust. Unicode 15.1.0 ready. XID_Start and XID_Continue are also available.
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
tagsearch
Filter plaintext files based on @keyword tags
-
kas-text
Text layout and font management
-
translitrs
Transliteration utility for Serbian language
-
rusty-x
A snippet manager in Rust
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
jput
puts and putc on unicode-width align for Rust
-
event_parser
Rust NLP library for parsing English natural language into icalendar events
-
punycode
Functions to decode and encode Punycode
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
ranting
Linguistic formatting placeholder extensions for rust
-
json_to_table
pretty print JSON as a table
-
mdbook-post
A CLI for add post to mdbook
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
grep-reader
short text for crates.io
-
ultra
Cryptanalysis of the Engima
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
markdown-toc
Markdown Table of Contents generator
-
slidedeck
Create an HTML slide deck from Markdown
-
cermic
Print .png images in your terminal as ascii art!
-
bibliofile
A TUI epub reader inspired by DOS-era programs
-
minigrep_empty
读取文件中的指定字符串信息
-
codevis
turning your code into one large image
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
apidoc-attr
Apidoc attr
-
senax-inflector
Pluralizes and singularizes English nouns. This crate is forked from https://github.com/whatisinternet/Inflector.
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
ob
A Blog and RSS system written in Rust
-
pact_matching
Pact-Rust support library that implements request and response matching logic
-
stam-python
STAM is a library for dealing with standoff annotations on text
-
unflow
DSL to convert design to code
-
md-include
include any file in markdown files
-
group-similar
Group similar values based on Jaro-Winkler distance
-
md-designer
A CLI tool for creating design docs in Markdown
-
rsclp
A lib crate to parse proccess arguments
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
spark_sdk
Rust sdk of the Spark
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
abjad
Calculate the numerical abjad value of Arabic-script text
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
campfire
A tiny static site generator, greatly inspired by Zola
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
mathematica-notebook-filter
mathematica-notebook-filter
parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily -
char_trie
Text segmentation based on trie tree, High performance, support for custom dictionary
-
spongebob
A command to convert text to the Spongebob meme case
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
terminal_cli
A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.
-
help_crafter
help message generator without hussle
-
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
wool
Preview Github Markdown Offline
-
grace-cli
CLI tool for processing files and strings
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
bstr
A string type that is not required to be valid UTF-8
-
kansuji
漢数字と数字の相互変換のためのライブラリ
-
grammalecte_client
Grammalecte HTTP client
-
aho-corasick
Fast multiple substring searching
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
hh-highlight
hh: The text highlighter
-
textos
Texts, strings, formatting, unicode…
-
text-colorizer
Transitionary package
-
lorem-ipsum-generator
allows you to generate lorem ipsum text, simply and easily
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
snp_sminigrep
my practice
-
hello_lib
Demonstrate Generics Function
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
tradukisto
Kinda useful natural language translation library and utility
-
delay_writer
Wraps a writer and delays its output after each newline
-
csvsc
Build processing chains for CSV files
-
mitex
TeX2Typst converter
-
ccase
Command line interface to convert strings into any case
-
bk-tree
A Rust BK-tree implementation
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
testcall
companinon crate to bintest, implements test facilities
-
dwrote
Lightweight binding to DirectWrite
-
wordmarkov
but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
-
env2toml
Convert env vars to toml text
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
cha-rs
Extract specific characters from an input
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
terminal-clipboard
a minimal cross-platform clipboard
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
sourceannot
render snippets of source code with annotations
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
story-dl
Story web scraping
-
recase
Changes the convention case of input text
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
mutf8
Basic support for mutf8 strings
-
spellabet
Convert characters into spelling alphabet code words
-
ayda
Ask your Documents Anything. A tool for querying your documents with a large language model.
-
hfs_nfd
Handle Apple's unique NFD-like Unicode normalization, which is used in HFS+, in Rust
-
ragegun
Performs lexica based analysis on text (i.e. age, gender, PERMA, OCEAN personality traits, ..)
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
ferrissay
cowsay
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
cheetah
An extremely fast and flexible static site generator
-
transportations_library
Knowledge stack for transportation
-
sc_decode
Special character decoding
-
nu_plugin_ron
A Nushell plugin for parsing RON text
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
subject-classifier
classifying a commit by it's subject
-
hyperscan-sys
Hyperscan bindings for Rust with Multiple Pattern and Streaming Scan
-
ucfirst
Uppercase the first letter of a string
-
bbcode-tagger
BBCode tree parser and tagger
-
cattocol
Combine two text into one text as columns
-
tgrep
Toy grep that honors .gitignore
-
graphannis-cli
command-line interface to the new backend implementation of the ANNIS linguistic search and visualization system
-
prototty_render
Traits for defining a renderer and a view
-
braille-rs
working with braille characters and drawing with them
-
imperative
Check for imperative mood in text
-
ipa-translate
translating between IPA and ASCII text
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
show-bytes
Display bytes as printable ascii with escape sequences as needed
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
boss
Baseball Open Source Software
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
skill-tree
generate graphviz files to show roadmaps
-
utf8-command
UTF-8 encoded
std::process::Command
output -
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
fast_symspell
Spelling correction & Fuzzy search
-
mdict
Rust implement mdict
-
matrix_display
display a matrix in command line with pretty unicode separators
-
charmap
one-to-(none/one/many) character mapping
-
refalizer
Make sure your branch names are good to drive
-
twitter-text
in Rust
-
chisel-parsers
Chisel parser front ends
-
rust_stringz
string manipulation library for Rust that you should probably don't need. It's just for learning purposes.
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
korrektor
work with Uzbek language text processing
-
fiat-lux
Offline terminal-accessible Bible
-
wattle
tools for rust
-
waifu-calendar
fetch your favorite characters' birthdays from AniList
-
bebop-lang
LISP based Markdown preprocessor for creating html documents!
-
reddish
A Rust utility library
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
detect-indent
Detect the indentation of code
-
syntastica-themes
A collection of themes for syntastica
-
uwuify
fastest text uwuifier in the west
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
markov_strings
A simplistic Markov chain text generator
-
cutters
Rule based sentence segmentation library
-
no-comment
Remove rust-style line and block comments from a char iterator
-
null-terminated-str
FFI-friendly utf-8 string, enabling const null-terminated str and caching of the non-terminated string to avoid frequent allocation
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
lindera-tokenizer
A morphological analysis library
-
hulk
An ultra simple no hassle static site generator
-
falcom-sjis
Falcom-compatibile Shift JIS implementation
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
synox
Program synthesis of string transformations from input-output examples
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
rustplexity
bigram-based perplexity calculator, useful for filtering out boilerplate or other abnormal text
-
pascii
image => text
-
pulldown-cmark-fork
A pull parser for CommonMark
-
vape
full width aesthetics
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
neo-mime
Strongly Typed Mimes
-
dr
Command-line data file processing in Rust
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
gimme
Pull useful data out of your clipboard
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
spacey
a tiny yet performant whitespace vmm/interpreter
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
awabi
A morphological analyzer using mecab dictionary
-
mdzk
Plain text Zettelkasten based on mdBook
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
trie-match
Fast match macro
-
wcrs
GNU wc in Rust
-
regex-split
split_inclusive for the regex crate
-
mdbook-rust
Enhanced Rust support for MDBook
-
cli_utils_hoijui
A tiny CLI utilities library, providing functions and constants useful in many CLI tools
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
m4b/dryad
An almost-parallel, semi-functioning, dynamic linker experiment, written in Rust
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
streamparser
Adapt simple stream parsers for different types of streams
-
text_lines
Information about lines of text in a string
-
parattice
Recursive paraphrase lattice generator
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
cdragon-rst
Work with Riot translation files
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
hxgm30-noise
Noise-generation for use in hexagram30 projects
-
hyphenation_commons
Proemial code for the
hyphenation
library -
mandown
Markdown to groff (man page) converter
-
pink-runtime
A very minimal replecement based DSL, intended for math
-
encoding-next
Character encoding support for Rust
-
lines_lossy
extension to
BufRead
with a functionlines_lossy
that works likeBufRead::lines
but with lossy UTF-8 decoding -
grammateus
facilitate working with Ancient Greek words
-
spinning_cube
Perspective projection of a spinning cube, using just ASCII characters
-
character-set
High performance
set.contains(char)
-
aki-mline
match line, regex text filter like a grep of linux command
-
mdtranslation
prepare multi-lingual Markdown documents
-
umlauts
text transformation of german umlauts
-
mupdf
Safe Rust wrapper to MuPDF
-
trigram
Trigram-based string similarity for fuzzy matching
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
roxy_syntect
Roxy plugin for providing syntax highlighting via Syntect
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
armnod
random string generator
-
afrim-translator
Manage the predication system of the afrim input method
-
bzzz
Bee-Speak Translator
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
stfu8
Sorta Text Format in UTF-8
-
rustrings
Strings manipulation for Rust
-
caser
Change text between PascalCase, camelCase, and snake_case
-
text-diff
text diffing and assertion library
-
spongemock
CLI-tool for random text capitalization
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
emoji
Every emoji, their metadata, and localized annotations
-
nfa_regex
NFA regex engine for text processing
-
text_unit
Newtypes for text offsets
-
ascii_converter
converting between different ascii representations
-
syntax-error
Syntax error report
-
strizer
minimal and fast library for text tokenization
-
inslice
A command-line utility for filtering text input by columns and rows
-
character_frequency
counting character frequencies in a string concurrently
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
utfx
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
zalgo-codec-common
converting an ASCII text string to a single unicode grapheme cluster and back
-
markdown-to-html
Markdown parser that runs at hyper speeds!
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
pulldown_mdbook
A pull parser for mdBook
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
genkit
A common generator kit for static site generator
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
asciidork-ast
Asciidork AST nodes
-
confusables
around Unicode confusables/homoglyphs
-
goodname
assist you with cool naming of your methods and software
-
afrim-memory
handle of sequential codes easier for an input method
-
clippers
Cross-platform clipboard management library
-
common_substrings
Finding all common strings
-
xpanda
Unix shell-like parameter expansion/variable substitution
-
faster-chars-count
counting length of chars faster than Chars::count()
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
snekdown
A parser for the custom snekdown markdown syntax
-
mdbook-open-gh-issue
mdbook preprocessor to add a open-on-github link on every page
-
mqtt-analyzer
Help analyzing MQTT messages
-
indentasy
Indent like a party
-
ansi-to-tui
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
minigrep_vielenkz
grep using rust
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
ctf-brute
Brute-force utilities for Rust
-
texting
string helpers
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
zw
encoding and decoding text using zero-width characters
-
static_format
Format strings with no runtime overhead
-
sortuniq
Find or count unique values in an input stream
-
polyvalid
A validator for package names, namespace, username and app name on wasmer
-
kvu
The simplest command line tool to manage key-value pair lines
-
imstr
Cheaply clonable and slicable immutable strings
-
str-block
Remove the initial newline and common indentation from string literals
-
yozuk-helper-english
English NLP utilities for Yozuk
-
uiuifree-dictionary
My MecabDic Util
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
notegraf
Core library for building a graph-oriented notebook
-
unindenter
unindent text
-
rutie
The tie between Ruby and Rust
-
filenamify
Convert a string to a valid safe filename
-
cyrla
two-way conversion between latin and cyrillic script
-
bidi
Unicode Bidirectional Algorithm (UBA)
-
rl-sys
Native bindings to libreadline
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
veryfi
Module for communicating with the Veryfi OCR API
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
repub
convert markdown documents to epub
-
monkey-printer
infinite nr of monkeys you could write Shakespeare
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
montchik_minigrep
Testdrive of the Rust program
-
zalgo-text
A command line tool for generating zalgo text
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
markdown-composer
composing markdown documents
-
text_distance
A collection of approximate string matching algorithms
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
forbidden-bands
8-bit string handling library
-
ftrace
trace files and paths
-
html-compare
compare html files
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
simdnbt
an unnecessarily fast nbt decoder
-
mdcat-http-reqwest
Reqwest-based HTTP resource handler for mdcat
-
ucd-raw
Uninterpreted access to the unicode UCD
-
uwl
A management stream for bytes and characters
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
uwildmat
wildmat/uwildmat matching
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
enso-lazy-reader
An efficient buffered reader
-
json-peek
Amature JSON parser library designed for my specific need
-
runestr-pancjkv
rune
-based Pan-CJKV support -
koelner-phonetik
koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words
-
solstice_glyph
A fast text renderer for solstice, powered by glyph_brush
-
markovish
Markov chain implementation for text generation
-
rex-regextract
extracts key value pairs out of text
-
uwubot
discord bot for uwuifying text
-
mini_paste
Fast-to-compile equivalent to
::paste
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
glcp
Small library to help you set and get text from a global clipboard
-
looking-glass
reflection & type-erasure library for Rust
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
cowonverter
owo-ify text files. provides a simple function for this purpose.
-
naromat
Convert text to narou novel format
-
destiny_rs
An async library to interact with Bungie's destiny 2 api
-
adauguet_minigrep
First program from the Rust Book
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
mask-text
mask text with multiple masking options
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
framework
detector for different frameworks in one projects
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
kirjat-rs
prices for finnish textbooks from multiple stores
-
anagrambot
find anagrams of words
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
playin-cards
♠ Library for French-suited playing cards
-
quickner-core
A fast and simple NER tool
-
mdbook-twiki
twiki backend for mdbook
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
smart-date
Parses relative and absolute dates in natural language text
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
rst_parser
a reStructuredText parser
-
console-traits
Basic console traits, such as setting char position and writing strings
-
pinot
Fast, high-fidelity OpenType parser
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
pra
Print Random ASCII
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
lf_lint
Linter that checks that all files in <PATH> ends with a newline
-
nibi
static site generator. (WIP, implementing)
-
str_overlap
Methods for finding the overlap between two string slices
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
ctrl-z
A composable reader to treat
0x1A
as an end-of-file marker -
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
macro_colors
colorful printing macros
-
beary
augment Bear, the note-taking app
-
matchable
Check if text is matching against string or regex in one API
-
mdplayscript
An extension of Markdown for play scripts
-
august
& program for converting HTML to plain text
-
asciidork-parser
Asciidork parser
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
mdbook-unlink
A mdBook backend that validates local links
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
pdf-min
Very minimal crate for writing PDFs
-
text_alignment
Align your text in Rust in the CLI
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
maud-pulldown-cmark
An adapter between maud and pulldown-cmark
-
text_searcher
text searcher tool
-
wtf8-rs
WTF-8 encoding
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
utf-cli
A UTF code viewer
-
small-bwt
BWT construction in small space
-
csr
encrypting and decrypting many types using caesar cipher. This shouldn't be used for any serious security but was more of an exercise in generics and creating a rust crate.
-
fmty
Composable
core::fmt
utilities -
unicode_converter
CLI tool to convert data between various Unicode encodings
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
hyper-old-types
HTTP types from hyper 0.11.x
-
mdx
in Rust
-
flw
Process text via configurable tasks
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
markdown-includes
Include other documents, table of content, or rust-doc in Markdown using a simple template system
-
mdbook-superimport
Import code/text from other files into your mdbook - without the link rot
-
lexa-wildcard-matching
Correspondance de caractères génériques sur les chaînes de caractères
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
md2gemtext
for converting Markdown into gemtext
-
igpay-atinlay
Translate text to Pig Latin
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
xhtmlchardet
Character set detection for XML and HTML
-
whitespace
Encode arbitrary data whitespaces and vice versa
-
caribon
A repetition detector program and library
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
bookrafter
This repository contains code related to bookrafter rendering
-
gzip-cmp
that makes a distance measurement between binary data based on the difference of the compressed data length
-
wcounter
Give the word and count the appearance
-
text-scanner
A UTF-8 char-oriented, zero-copy, text and code scanning library
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
uchr
translate char values to the characters themselves, and vice versa
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
catdream
Sleeping cat dreams your text
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
textedit-merge
sorting and merging textedits
-
mdbook-fix-cjk-spacing
mdbook preprocess that fixes CJK line breaks
-
text-utils
Text utils for unescaping and align
-
univstring
The Universal String(Rust String, CString, Wide(UTF-16/UTF-32) CString) trait
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
write16
A UTF-16 analog of the Write trait
-
polyhorn-ios
Polyhorn for iOS
-
genex
Text-expansion library
-
charisma
Generate ASCII Art using AI 🦾🧠
-
fastchr
Faster memchr using SIMD intrinsics
-
cowsay
program that generates ASCII art pictures of a cow with a message
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
modeling
tools to analysis different languages by Ctags
-
wordnet_stemmer
Wordnet stemmer
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
dumbfuzz
dumb library for fuzzy search
-
transcript
A transcriber for European scripts
-
hema_minigrep
grep clone
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
rnltk
Natural Language Toolkit for Rust
-
libxdiff
Rust bindings for the libxdiff C library
-
learning_fromdocs_minigrep
Project made on the path toward learning Rust from its docs. Completely personal and to test how to upload to crates.io.
-
tectonic_bridge_flate
Exposing flate compression to the Tectonic C code
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
cw
Count Words, a fast wc clone
-
ropey
A fast and robust text rope for Rust
-
strip_markdown
remove markdown syntax from markdown files
-
the-book-tui
A TUI application to load, read, and search The Rust Book
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
istr
a fast string interning library based off of ustr
-
word_filter
A Word Filter for filtering text
-
genere
randomization of text respecting grammatical gender of sentences
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
verba
working with Latin words
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
encoded-words
Encoded Words for usage in MIME headers
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
vndb_rs
Bare-bones VNDB api
-
pillar
small tool to format lines into columns
-
text-tokenizer
Custom text tokenizer
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
shoebill
A Wadler/Leijen style pretty-printer
-
strange
A static website generator
-
csv_to_table
pretty print CSV as a table
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifiying filenames
-
cur
that will hunt for your regular expression
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
charx
A replacement for char::is_ascii*
-
insim_core
Core types and traits for the insim crate
-
clparser
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format. (fork marcaddeo/clparse)
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
rammer
A Baysian Text Classifier for Spam/Ham
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
morc
Dead simple, minimal markdown generator library written in Rust
-
mdbook-najan
Preprocessor for the Najan mdBook
-
file-search
File indexing and search
-
linkcheck
extracting and validating links
-
ra_rustc_lexer
lexer used by rustc, zero stability gurantees
-
uiuifree-normalize
uiuifree text normalize
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
shutil
Shell utility helper library
-
founder
The package provides a font toolbox
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
swrite
Infallible alternatives to write! and writeln! for Strings
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
libflagup
Display a country's flag as an emoji
-
netxt
Handling TODO .txt files
-
skribo
low-level text layout
-
pretok
A string pre-tokenizer for C-like syntaxes
-
string-overlap
A helper crate for "layering" ASCII art
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
libwizard
creating a simple http web server or a (somewhat static) API
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
zp
Copy the contents of the source file or the std output buffer to the clipboard, allowing users to easily paste the contents into another file or program
-
fmt-extra
Extra formaters not shipped in the standard library
-
mudders
Generating Lexicographically-Evenly-Spaced Strings, or: Mudder.js in Rust
-
arbitrator
Format text based on a set of rules and regexes
-
tabwriter
Elastic tabstops
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
mdbook-docker-run
Plugin for mdbook which runs a script in docker and outputs the result
-
mdbook-footnote
mdbook preprocessor for footnotes
-
mime_4
Strongly Typed Mimes
-
ngrams
Generate n-grams from sequences
-
sf-api
API to send commands to the Shakes & Fidget servers and parse their responses into characters
-
file-size
a function formatting file sizes in 4 chars
-
tradu
command to traslate using DeepL
-
yarner-block-links
A Yarner plugin that adds to each code block a list of links to all referenced and all referencing blocks
-
crypto-invert
Unicode Upside-Down Mapping
-
icu_casemapping
Unicode case mapping algorithms
-
twitter_text_parser
Parser for twitter-text in Rust
-
moscato
Outline scaler for OpenType glyphs
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
treelight
A syntax highlighter for the web using tree-sitter
-
haoxue-dict
Chinese dictionary and word segmenter
-
e_book_sync_library
Synchonize e-book with your local e-library
-
basic-text-internals
Basic Text string literal implementation details
-
resrap
A parser library
-
LatexImgGen-lib
generating Latex images of equations with an edition number to avoid cache flushing and timestamp attributes
-
tectonic_status_base
Basic types for reporting status messages to a user
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
lindera-decompress
A morphological analysis library
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
timfmt
A small utility for formatting code as Tim likes it
-
twitter_text_conformance
twitter-text in Rust
-
chinese2digits
The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.
-
textract
extract text from various types of files
-
scoundrel
An engine for working with ASCII roguelikes
-
kincaid
A word statistics library in Rust
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
text-to-json
Convert text to json in rust
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
mdbook-morsels
Morsels plugin for Mdbook
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
zummi
fun lib that produces spoonerisms
-
pulldown_typst
A pull parser for Typst markup
-
crustword
Crusty Crosswords
-
subscript-compiler
A modern LaTeX rendition
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
untex
Understand and manipulate TeX files with ease
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
ogrep
searching in indentation-structured texts
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
unicode-vo
Unicode vertical orientation detection
-
boxy
Declarative builder for Unicode box-drawing characters
-
kbremap
Custom keyboard layouts for windows
-
corpus-count
Util to count words and character ngrams in a corpus
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
unic-segment
UNIC — Unicode Text Segmentation Algorithms
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
unidok
A powerful, readable, easy-to-learn markup language
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
fancy_print
Easily print animated ASCII text to the terminal!
-
spdkit-surface
short text for crates.io
-
collapse
Trim and collapse consecutive whitespace to a single space
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
qm-role-build
Role builder from markdown tables
-
sola
A 16-bit stack machine and forth language
-
text_styled
Trait to style String and &str
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
trim
trim whitespaces from files
-
quill_delta_pdf
Convert Quill Delta to PDF
-
casaubon
A feature-poor static site generator
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
fgr-rs
A Find & Grep command line utility with SQL-like syntax
-
ansi-cut
cutting a string while preserving colors
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
markdown-linkify
Markdown preprocessor for substiting link shorthands to valid links according to configurable regexes and custom substitution implementations
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
marko
Programmtically format text with Markdown syntax
-
wfst4str
Python library based on rustfst for manipulatig strings with wFSTs
-
indoc
Indented document literals
-
pathmut
Command line utility for extracting, removing, and replacing components from path strings
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
tre-regex-sys
Rust bindgen bindings to the TRE regex module
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
synctex_sys
Rust Bindings to libsynctex's C API
-
struckdown
A structured markdown / commonmark library for Rust
-
md_converter
My markdown to html converter
-
rustex
auto-generated LaTeX files in Rust
-
irssi-sys
Automatically generated bindings to irssi
-
codes-iso-15924
This package contains an implementation of the ISO 15924 Codes for the representation of names of scripts specification
-
regex-automata
Automata construction and matching using regular expressions
-
ezemoji
Catigoryized Emoji's
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
glob-match
An extremely fast glob matcher
-
simple_csv
CSV parsing implementation
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
arabic-script
An expressive API for the characters of the Arabic script
-
qpdf-sys
Rust bindings to QPDF C++ library via FFI and bindgen
-
pdfutil
PDF document manipulation
-
cli-animate
Enrich your Rust CLIs with stunning, easy-to-implement animations
-
text-sanitizer
convert text to plain ASCII text
-
sixbit
Small packed strings
-
finl-charsub
Character substitution module for in-line replacement of fixed strings
-
mdbook-wordcount
Word count for mdbook, inspired by the mdbook tutorial
-
moenarchbook
Creates a book from markdown files
-
diff_match_patch
Diff Match Patch is a high-performance library in Rust-lang that manipulates plain text
-
textblocks
working with text blocks
-
mdbook-translation
prepare multi-lingual mdBook books
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
braille_pics
producing text-art pictures using Braille characters
-
scripter
A screenplay compiler
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
basic_lexer
Basic lexical analyzer for parsing and compiling
-
minigrep-carlossgv
grep implementation in Rust
-
top-english-words
retrieve top words from the English language
-
iconv-compat-win-sys
iconv bindings for Rust
-
minigrepdemothorndal
A mini grep exercise from rust book
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
ocr_b_checksum
Generates OCR B Checksums
-
wordfreq-model
Model loaders for wordfreq-rs
-
hina
:]
-
aki-txpr-macro
the more easy to use libaki-*
-
token-read
reading whitespace delimited files intended for competitive programming
-
mdtransform
program that renders Markdown as HTML for websites
-
mintmark
Markdown renderer for Epson TM-U220B receipt printers
-
decline-word
Choose word form based on given number
-
asciidork-backend
Asciidork backend
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
strings
String utilities, including an unbalanced Rope
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
analisar
A Lua parser for Rust
-
minigrep5
grep implementation in Rust
-
simple-localization
localization library for text translation
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
ssml-parser
parsing speech sythnesis markup language
-
romulus
a stream editor like sed
-
comment-strip
Remove comments out of text files
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
bionic-ebooks
Takes an EPUB file and generate a copy with bionic like font applied
-
trim_lines
An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.
-
mocker
transforming text with uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
varcon-core
Varcon-relevant data structures
-
hunspell-rs
Rust bindings to the Hunspell library
-
furigana
Map furigana to a word given its reading
-
hunspell-sys
Bindings to the hunspell C API
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
readwise
wrapper for the Readwise API
-
wildflower
Wildcard matching against strings
-
utf
UTF-8
-
libpcre-sys
Native bindings to libpcre
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
kryptor
command-line cryptography tool
-
fsays
flavored replacement for the classic cowsay
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
markdown-table
Creating markdown tables with Rust!
-
llmvm-core-lib
llmvm core application
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
shallow
long text
-
intname
Full English name for any integer of any primitive integer type
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
every-range
Iterator for interleaving included and excluded ranges
-
yozuk-model
NLP model generator for Yozuk
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
tfidf-summarizer
Basic tf-idf compute for documents
-
catmark
Console printer for CommonMark
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
ellie_tokenizer
Tokenizer for ellie language
-
gqlog
👾 filter your json logs with graphql 👾
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
topfew
CLI to find high frequency occurrences in structured text files
-
md-tui
A terminal markdown viewer
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
code-span
Add additional infomation to code character
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
jmdict
The free Japanese dictionary database JMdict, pre-digested for use in Rust libs and apps
-
ewin-com
editor for Window(GUI) users.No need to remember commands
-
mul
Bengali stemmer
-
termwrap
Wrap Unicode text with ANSI color codes
-
pygmentize
wrapper for syntax highlighting
-
ucd-util
A small utility library for working with the Unicode character database
-
case-macro
Macros for changing the casing of things
-
pdf-create
low-level, strongly-typed PDF creation library
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
llmvm-outsource-lib
outsource backend for llmvm
-
esl01-drawdag
Parse an ASCII DAG into parent relations
-
moguls
Let the words of financial moguls inspire and guide you in your quest for financial excellence and understanding
-
s_minigrep
project from rust book
-
difference
text diffing and assertion library
-
rodarte-minigrep
CLI for simple plain-text searching
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
mdlint-cli
tbd
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
leven
Make your own blog!
-
snarky-parrot
markov chain text generator
-
ascii_tree
generates ascii trees
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
caseless
Unicode caseless matching
-
sauron-markdown
parsing markdown into sauron node
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
justify
plaintext while handling Unicode gracefully
-
encoding-index-korean
Index tables for Korean character encodings
-
aki-stats
output the statistics of text, like a wc of linux command
-
autoruby
Easily generate furigana for various document formats
-
literate
programming tool that extracts code written in your Markdown files
-
owoify
Text owoification library
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
fancy_mocker
transforming text into alternating uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
saurus
Markdown to LaTeX Transpiler
-
yozuk-core-skillset
Set of default Yozuk skills
-
ansi-to-tui-forked
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
spacebar
An anti-plagiarism tool based on null width characters
-
lines
Utililities for iterating readers efficiently line-by-line
-
grep-rezza
An app for search text in file
-
lithe
A Slim template engine by using Pest
-
pcre2
High level wrapper library for PCRE2
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
dhoni
converting Bengali text into their phonetic counterpart
-
summertail
Summarize your tail -f output, by reporting how many lines match regexes
-
ammonia
HTML Sanitization
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
ghimlink
Generates a Github-Flavoured-Markdown formatted image link to an image in the same repository
-
encoding-next-index-tradchinese
Index tables for traditional Chinese character encodings
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
hsk
Return HSK Level for Simplified Chinese Characters
-
english-language-parser
parser of English sentences created for KMA Rust course
-
informa
font compiler for digital typography
-
rigrep
grep from Rust Book
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
pencil_case
but effective library for writing GUI in bevy
-
demoji
Remove all emojis from a string
-
tzgrep
grep tar.gz
-
milligrep
Custom simplified implementation of grep
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
syllable
counter for use with reading level calculations
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
rustextile
Textile markup language parser for Rust
-
latin1str
Windows-1252 string types
-
slideck
Create an HTML slide deck from Markdown
-
mdbook-all-the-markdowns
Render all markdown files in a given folder structure
-
morse-nostd
A nostd version of the morse crate
-
mmgrep
search in files
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
csvre
replacing data in CSV columns with regular expressions
-
single_source
Generate code files from snippets in md tutorial files
-
sparklet
small flashcards library
-
enc-check
inspect utf-8 and utf-16 character encodings
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
portmanteau
create portmanteaux
-
cistring
A string type that preserve case, but compares insensitiveley
-
assert-text
the testing macro tools
-
am-parse
converting asciimath to other formats
-
find_unicode
Find Unicode characters, the easy way!
-
m_lexer
extensible regular expressions based lexer
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
blockcounter
Counts the blocks in a stream
-
case-conv
Faster case conversion crate
-
hashmoji
hash functions that returns emojis
-
anagram
A collection of anagram utility functions
-
ultron
A web base text-editor with strong focus on displaying multi-width characters correctly
-
str_to_bytes
parse a string representations of numbers or ascii text and return a Vec<u8>
-
detone
Decompose Vietnamese tone marks
-
gestalt_ratio
Calculate the gestalt pattern matching ratio between two strings
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
termbook
behind the
termbook-cli
-
kindleclip
CLI tool to export a file per book in kindle 'My Clippings.txt'
-
yarner-lib
build plugins for the Literate Programming tool Yarner
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
toml_document
Document-style TOML parser. Allows editing TOML document in a way which preserves whitespace and comments.
-
uniwhat
Display the unicode characters text
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
ron_to_table
pretty print RON as a table
-
trevordmiller
Personal CLI
-
strainer
Command-line tool for finding duplicate lines in text files
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
doccy
brace based markup language
-
latex_snippet
Convert even erroneous LaTeX snippets into HTML
-
ttf_word_wrap
Wraps text based on character width
-
const_format_proc_macros
detail of the
const_format
crate -
ptero-cli
A text steganography CLI tool for Social Media
-
geml
Generator-orientated ML parser
-
rescue-blanket
Escape values while they are being formatted
-
scie
research about how to build simple code identify engine for different languages
-
txtframe
Creates a frame for text
-
spongedown
Converts markdown to html with svgbob support
-
tweak
when/then clauses to run
-
basic-text
Basic Text strings and I/O streams
-
h_hangul
Korean Characters
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
unic-bidi
UNIC — Unicode Bidirectional Algorithm
-
p4d-mdproof
Markdown to PDF converter
-
mdtohtml
markdown to html renderer (with a couple of missing features)
-
ferris-finder
Unicode character finder with fuzzy search
-
textcat
detect text categories. It can be used to detect the language of a given text
-
demoji-rs
demoji implementation for Rust
-
ced
Dead easy csv editor
-
wordcut-engine
Word segmentation/breaking library
-
lyt
A static site generator written in Rust
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
fast2s
A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.
-
fwuffgrep
Basic implementation of a grep command written in rust
-
html2runes
An HTML to Text converter
-
case_convert
Converts the first letter of a Rust String to uppercase
-
image_ascii
transform images into ASCII art. Both into text and images consisting of letters.
-
askama-filters
Extra template filters for Askama
-
corg
A cog-like tool
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
pikchr-cli
PIC-like diagramming language to SVG converter
-
nb2fr
Convert integer number to literal French text
-
tb_normalization
normalization utf8 string, loc dau vietnamese and some language
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
lean-sys
Bindings to Lean 4's C API
-
rustscii
ASCII game framework
-
conllu-utils
working with the CoNLL-U dependency format
-
bocu1
BOCU-1 compressed unicode encoding
-
filecount
A modern high-performance open source file analysis library for automating localization tasks
-
typos-dict
Source Code Spelling Correction
-
platina
parameterized golden testing library
-
noneifempty
Convert T to None if empty
-
minigrep_macano953
A fun grep tool written as my first incursion into Rust. Not intended to be inherited and used in production code
-
promkit
A toolkit for building your own interactive command-line tools
-
terminal-emoji
safely displaying emoji inside of terminals
-
bible-reference-rs
Extract Bible references from plain text
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
dialogi
A dialog parser
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
pdf_form
programatically filling out pdf forms
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
aklat
create books from markdown files (like Gitbook)
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
pdf_forms
programatically filling out pdf forms
-
bullshit-rs
A BullshitGenerator implementation in Rust
-
bitfont
Takes an ASCII string and generates a vector containing a bitmap font, for easy overlay into images
-
highlights
Convert kindle highlights into different formats
-
xim-ctext
compound text en/decoder
-
local-encoding
encoding/decoding string with local charset. It usefull for work with ANSI strings on Windows.
-
mojimoji-rs
fast converter between Japanese hankaku and zenkaku characters, mojimoji
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
jp-location-relation
隣接する市区町村の一覧を取得
-
suffix
arrays
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
retest
Command-line regular expression tester
-
guarding_parser
Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
hemtt-tokens
A token library for hemtt
-
prettify-markdown
Format Markdown at the speed of Rust
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
react-html
react html components for frender
-
typed-dialogflow
typed Google Dialogflow client
-
web-grep
A Grep Tool for HTML or XML
-
encoding-next-index-korean
Index tables for Korean character encodings
-
mdrend-this-is-only-a-beta-demo
Renders markdown to HTML
-
txt_processor
A little library for text processing
-
github-slugger
A slugger for GitHub headings
-
regex-lite
A lightweight regex engine that optimizes for binary size and compilation time
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
asciimath-text-renderer
Render asciimath in terminal
-
bos_books_codes
that handles 3-character Bible Books Codes
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
kanabake
Turn Romaji strings into Hiragana or Katakana
-
math_test
Generates math questions for elementary school kids
-
japhonex
Japanese phone number checker for Rust
-
hoedown
bindings for the Hoedown markdown processor
-
txtpic
Generate Unicode art from images
-
mdbook-wikilink
Support for wikilinks on mdBook
-
encoding_c_mem
C API for encoding_rs::mem
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
datex
package to automatically process your data and make it look pretty in LaTeX
-
hline
a grep-like tool that highlights lines in files
-
latex-to-html
Latex to html converter
-
vextractor
vextractor-cli
-
trexter
Text progression tracking library
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
codam-mdbook
Creates a book from markdown files
-
interpol
String interpolation macros
-
route-pattern
A parser and matcher for route patterns
-
x1b
State tracker for 0x1b terminal escape codes. Also gives random characters names.
-
re2
Wrapper for the re2 C++ regex library
-
minigrep-danidim13
A test crate based on Rust book examples
-
simple-text-pattern
Matching simple text patterns
-
case_style
Converts from and to different case styles
-
aki-json-pick
The json pick out command
-
psa
PSA(Project structure analysis) is a analyzer for analysis project struct
-
html_to_epub
A command line converts .html file to .epub file
-
fingers
a finger client library
-
ugrip
A i/o command that searches for word in a file
-
tablers
create tables and export them into CSV or PDF. Without external deps
-
rslnp
A configurable parser for scopes list notation (SLN)
-
minigrep_v1
A lib from rust book, search word in contents
-
jellybean-pack-1
Sweet syntax highlighting with tree-sitter
-
rsplitter
Splitter is a tool to split words from text that contains no spaces
-
mystem
Wrapper around Yandex Mystem for Rust
-
minigrep-bahadir
A fun project to learn the great language Rust
-
lexer-generator
Lexer derived from Regex patterns with user customizeable tokens
-
basen
Convert binary data to ASCII with a variety of supported bases
-
lindera-filter
Character and token filters for Lindera
-
mepple
English dictionary as a library
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
stardict_lib
Rust implement stardict
-
briefcase
Convert to any case!
-
mdbook-newday
Command line utility to add a new line for today to the top of an mdbook SUMMARY.md book
-
kma-rustlang-vadym-polishchuk-english-parser
parser of English sentences created for KMA Rust course
-
readable-readability
Really fast readability
-
scrambler
command line tool to scramble letters
-
adobe-cmap-parser
parse Adobe CMap files
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
virgil
A Static Site Generator
-
indentation_flattener
From indented input, generate plain output with indentation PUSH and POP codes
-
is-vowel
Heuristically test whether a character is a vowel letter
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
encoding8
various 8-bit encodings
-
seven_seg
Seven-segment digital display for terminal
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
mojibake
Encode/Decode bytes as emoji base2048
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
texc-latex
Contains LaTeX templates for TeXCreate
-
dismej
convert text to discord emojis
-
indented
Format data with indentation
-
font-index
Static index for system fonts
-
company_hr
My approach to the third problem from chapter 8.3 (Common Collections) of 'The Book'
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
tex
The νTeX typesetting engine
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
shingles
in rust
-
bgrep
grep tailored to handle binary patterns and files
-
xsv
A high performance CSV command line toolkit
-
munemo-rs
Turn an integer into a more rememberable word, or vice-versa
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
twemoji-rs
A word-cloud image generation crate
-
lindera-compress
A morphological analysis library
-
spyglass
Search engine for documents, inspired by bioinformatics
-
wordninja
port of the Word Ninja English word splitting library
-
markx
markdown parser
-
rut
A small UTF-8 parsing library for applications that need to parse individual
char
s -
corpus-preproc
A preprocessor for text and HTML corpora
-
vaporetto_rules
Rule-base filters for Vaporetto
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
hex_table
Format a byte slice into a hex table
-
dictionary-1024
a mnemonic dictionary with 1024 words, 10 bits per word
-
grepy
search files
-
reap-grep
wgrep/write-grep CLI
-
export-kindle
Export Kindle book list to JSON or YAML
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
kase
Convert from one case to another. Input case is determined on a best-guess basis, but can be overridden with the --from flag.
-
nib
static site generator
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
stylometry
A package to perform stylometry operations
-
unidades
playing with physical units
-
mercurius
Extended pulldown-cmark
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
encoding-index-japanese
Index tables for Japanese character encodings
-
atm_parser_helper_common_syntax
Parsing utilities for the common syntax specification
-
newline-converter
Newline byte converter library
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
goya
morphological analyzer for Rust and WebAssembly
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
text-tables
A terminal/text table prettifier with no dependencies
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
overlap
shows overlap text in files
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
deface
Lightweight markup to HTML converter
-
rusty_code_code_for_book
my book_rusty code
-
daveparr_minigrep
command line tool to search for a string in a file based on 'The Rust Programming Language' book
-
stopwords
from popular text processing frameworks
-
conveyance
A stop-gap CLI for conveyancing
-
poetry-book
Create a poetry book in latex, starting from plain text
-
lingua-latvian-language-model
The Latvian language model for Lingua, an accurate natural language detection library
-
code-tour
Enhanced example-based learning, i.e. awesome examples user experience
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
notedown-rs
Notedown Text parser and formatter
-
kaomji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
mdbook-trace
A traceable document preprocessor for mdbook
-
table_to_html
interface to convert a
tabled::Table
into a HTML table (<table>
) -
ra_ap_ide_ssr
Structural search and replace of Rust code
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
tblcompare
A tabular data comparison utility
-
glyphana
Quickly find, inspect & collect unicode glyps
-
char_reader
Safely read wild streams as chars or lines
-
charclass
define and modify unicode character classes
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
kth-lines
Command line tool for filtering stdin lines that just work
-
cjieba-sys
unsafe ffi to cppjieba
-
strng
Owned UTF-8 string
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
texrender
Thin wrapper around running
latexmk
to render LaTeX documents. Also supports generating Tex documents. -
encoding
Character encoding support for Rust
-
indexrs
inefficient multi-language search index
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
exit-with-code
A CLI which returns the exit code of your choice
-
unicode-line-stacker
Tiny library for stacking Unicode line-drawing characters on top of each other
-
aqua_parse
A parser library
-
slicer
that slices string slices into smaller string slices
-
ascii-to-hex
A small, simple library to converting an ASCII text string into its hexadecimal equivalent
-
contractions
expand contractions in English
-
spellcheck
A spell-checker based on the statistical algorithm described by Peter Norvig in http://norvig.com/spell-correct.html
-
pdf-merger-rs
app for merge PDF files
-
pseudolocalize
Pseudolocalization crate for Rust
-
rdftk_iri
IRI and URI specifications
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
infisearch_lang_ascii
Basic ascii tokenizer for InfiSearch
-
tectonic_bridge_graphite2
Expose the graphite2 library to Rust/Cargo
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
latexify
Shared definition for turn a rust object into latex code
-
esc
Escape characters in strings
-
betaconvert
CLI for Betacode conversion
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
mdbook-inline-highlighting
mdBook preprocessor that enables support for inline highlighting
-
panduck-latex
Use panduck to generate XeLaTeX
-
saku
efficient rule-based Japanese Sentence Tokenizer
-
mdtable-cli
that makes creating tables in markdown much easier!
-
scenic
Presentation tool that utilizes LaTex Beamer package
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
lindera-analyzer
A morphological analysis library
-
ucd-parse
parsing data files in the Unicode character database
-
wz-conf
Configuration options for wz
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
mdbook-readme-summary
A mdbook Preprocessor that generates table for content for each subdirectory README files
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
scannedpdf
A fast, low memory footprint, simple images to PDF crate
-
conv-rs
CLI converter for multi extensions
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
lyn
A scanner useful for creating recursive descent parsers
-
minigrep_maybe
Just another minigrep from Rust book
-
random_access_unicode
Provide random access to the unicode characters in a file
-
encoding-next-index-japanese
Index tables for Japanese character encodings
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate
-
simplearrayhash
-
translation_api
translation API
-
nib-cli
A cli for a yet another static site generator Nib
-
japanese-ruby-filter
Japanese ruby notation parser
-
rep-cli
Replace text file in bulk
-
equt-md-ext
Extend event iterator
-
mdbook-to-example
Turns an mdbook book into a Rust example
-
toc
generate toc from markdown or html
-
static_table
creates pretty tables at compiler time
-
asciifolding
ascii folding library
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
unidecode
pure ASCII transliterations of Unicode strings
-
text_art_cn
A text art generator can include any kind of characters depending on the font. The art is composed of Braille characters like ⠐ ⠝ ⠿ ⣫ ⣷
-
brainfuck_interpreter
brainfuck interpreter written in Rust
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
dechode
Dechoder to ASCII
-
unic-ucd-block
UNIC — Unicode Character Database — Unicode Blocks
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
luaparse
A Lua 5.3 parser
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
tpng
A small tool that prints truecolor png renderings to the terminal using unicode block characters
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
hashmoji_generator
Code generation tool for hashmoji
-
enchode
ASCII enchoder