OCR offline image text recognition command line windows program
The data structure for multimodal data
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Voice Recognition to Text Tool
Open-source, code-first Python toolkit for building, evaluating, etc.
Toloka-Kit is a Python library for working with Toloka API
AlphaFold 3 inference pipeline
Distribute and run LLMs with a single file
No-code LLM Platform to launch APIs and ETL Pipelines
Transcribe any audio to text, translate and edit subtitles 100% locall
Qwen2.5-VL is the multimodal large language model series
Conversational voice AI agents
Official MiniMax Model Context Protocol (MCP) server
Image polygonal annotation with Python
Analyze computation-communication overlap in V3/R1
A low code Machine Learning service that personalizes articles
Interaction model for connecting buyers to complete purchases
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
The ChatGPT Retrieval Plugin lets you easily find personal documents
Python scraper based on AI
Neural Search
Automate browser-based workflows with LLMs and Computer Vision
Go efficient multilingual NLP and text segmentation
Semantic Search & Call Graphs for AI Agents
Async PHP client/server API for the telegram MTProto protocol