Stars
- All languages
- Assembly
- Batchfile
- Bikeshed
- C
- C#
- C++
- CMake
- CSS
- Clojure
- Cython
- Dart
- Dockerfile
- Earthly
- Elixir
- Go
- HTML
- Handlebars
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Mermaid
- Mustache
- Objective-C
- PHP
- Pascal
- PowerShell
- Python
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Shell
- TypeScript
- V
- Vala
- Vim Script
- Vue
The GoCSV package aims to provide easy CSV serialization and deserialization to the golang programming language
ClickHouse® is a real-time analytics database management system
Rust-tokenizer offers high-performance tokenizers for modern language models, including WordPiece, Byte-Pair Encoding (BPE) and Unigram (SentencePiece) models
Quickner is a new tool to quickly annotate texts for NER (Named Entity Recognition). It is written in Rust and accessible through a Python API.
Neural network transition-based dependency parser (in Rust)
Rust-based Natural Language Toolkit using Python Bindings
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Spelling correction & Fuzzy search based on Symmetric Delete spelling correction algorithm.
SeekStorm - sub-millisecond full-text search library & multi-tenancy server in Rust
over 6_00_000 english words data set arranged with each words frequency
A tool for converting dictionary files aka glossaries. Mainly to help use our offline glossaries in any Open Source dictionary we like on any modern operating system / device.
Wiktionary dump file parser and multilingual data extractor
A feature-rich dictionary lookup program, supporting multiple dictionary formats (StarDict/Babylon/Lingvo/Dictd) and online dictionaries, featuring perfect article rendering with the complete marku…
Open-Source Queryable Formatted English Dictionary, in multiple formats based on The Online Plain Text English Dictionary (OPTED) dictionary
Ready-made tokenizer library for working with GPT and tiktoken
a CSV of every english word, part of speech, and definition. as well as a web scraping script that generates that data for you
💫 Models for the spaCy Natural Language Processing (NLP) library
A tool for extracting plain text from Wikipedia dumps
A list of the top 3 million+ English words in Project Gutenberg, along with their frequency.
update-golang is a script to easily fetch and install new Golang releases with minimum system intrusion
Familiar asyncio ORM for python, built with relations in mind
Prisma Client Go is an auto-generated and fully type-safe database client
Linux命令大全搜索工具,内容包含Linux命令手册、详解、学习、搜集。https://git.io/linux
Collection of handy online tools for developers, with great UX.