Lists (1)
Sort Name ascending (A-Z)
Stars
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
A simple zero-config tool to make locally trusted development certificates with any names you'd like.
Fast hamming-distance range searches via native GiST Indexing facility in PostgreSQL
Distribute and run LLMs with a single file.
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured …
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
An alternative to original alert, confirm and prompt.
🧑🚀 The better identity infrastructure for developers and the open-source alternative to Auth0.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
Data annotation component library --provided as NPM packages
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
A curated list of python scripts for automating your tasks
AiEditor is a next-generation rich text editor for AI.
Python PDF Parser (Not actively maintained). Check out pdfminer.six.
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute
Convert PDF to markdown + JSON quickly with high accuracy
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。