Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 24,930 3,373 Updated Jan 18, 2025

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,216 150 Updated Sep 3, 2024

THUDM / CogVLM

a state-of-the-art-level open visual language model | multimodal pre-trained model

Python 6,281 426 Updated May 29, 2024

AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,598 184 Updated Dec 27, 2024

zjukg / KG-MM-Survey

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

369 19 Updated Dec 10, 2024

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 15,828 1,142 Updated Jan 19, 2025

letta-ai / letta

Letta (formerly MemGPT) is a framework for creating LLM services with memory.

Python 14,025 1,508 Updated Jan 16, 2025

phidatahq / phidata

Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.

Python 17,934 2,426 Updated Jan 19, 2025

1Panel-dev / MaxKB

💬 Ready-to-use, flexible RAG Chatbot. A knowledge-base question-answering system built on large models and RAG.

Python 12,680 1,659 Updated Jan 18, 2025

dataelement / bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

Python 7,528 1,268 Updated Jan 16, 2025

devv-ai / devv

An AI-powered search engine for developers.

1,439 28 Updated Jul 23, 2024

lizhe2004 / Awesome-LLM-RAG-Application

Resources on applications built on LLMs with the RAG pattern

1,024 59 Updated Jan 17, 2025

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 28,613 2,723 Updated Jan 17, 2025

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,127 88 Updated Aug 6, 2024

lllyasviel / IC-Light

More relighting!

Python 7,371 436 Updated Nov 28, 2024

yipoh / AesBench

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Python 221 6 Updated Aug 15, 2024

LlamaFamily / Llama-Chinese

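Llama Chinese community. The Llama3 online demo and fine-tuned models are now available, and the latest Llama3 learning resources are compiled in real time. All code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use.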

Python 14,360 1,283 Updated Sep 5, 2024

developersdigest / llm-answer-engine

Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper

TypeScript 4,782 758 Updated Sep 28, 2024

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 146,038 27,377 Updated Dec 28, 2024

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 814 41 Updated Nov 23, 2024

philz1337x / clarity-upscaler

Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative

Python 4,028 422 Updated Oct 11, 2024

ZHO-ZHO-ZHO / ComfyUI-BRIA_AI-RMBG

Unofficial implementation of BRIA RMBG Model for ComfyUI

Python 746 56 Updated May 22, 2024

PKU-YuanGroup / MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Python 1,249 126 Updated Jan 5, 2025

eLavin11

Stars

GAIR-NLP / anole

InternLM / InternLM-XComposer

Kwai-Kolors / Kolors

GuijiAI / duix.ai

ChunmingHe / awesome-diffusion-models-in-low-level-vision

csuhan / OneLLM

fudan-generative-vision / hallo

crewAIInc / crewAI