Stars
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
Universal File Online Preview Project based on Spring-Boot
📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
GLM-4 series: Open Multilingual Multimodal Chat LMs
SoftVC VITS Singing Voice Conversion
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
A backup repository of Clash For Linux, built on Clash Core
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
LlamaIndex is a data framework for your LLM applications
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…
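Source code for arcface-pytorch, which can be used to train your own models.
A facenet-pytorch library that can be used to train your own face recognition models.
A pure C++ cross-platform LLM acceleration library, callable from Python; ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models and runs smoothly on mobile devices.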
A Python framework for performing information retrieval experiments, building on http://terrier.org/
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.