View eLavin11's full-sized avatar


Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 709 36 Updated Aug 5, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,716 164 Updated Dec 26, 2024

Kolors Team

Python 4,114 304 Updated Nov 13, 2024
C++ 4,131 612 Updated Dec 30, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 613 33 Updated Oct 22, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,148 1,072 Updated Sep 14, 2024

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 24,930 3,373 Updated Jan 18, 2025

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,216 150 Updated Sep 3, 2024

A state-of-the-art open visual language model | Multimodal pre-trained model

Python 6,281 426 Updated May 29, 2024

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,598 184 Updated Dec 27, 2024

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

369 19 Updated Dec 10, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 15,828 1,142 Updated Jan 19, 2025

Letta (formerly MemGPT) is a framework for creating LLM services with memory.

Python 14,025 1,508 Updated Jan 16, 2025

Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.

Python 17,934 2,426 Updated Jan 19, 2025

💬 Ready-to-use, flexible RAG Chatbot. A knowledge-base question-answering system built on large models and RAG.

Python 12,680 1,659 Updated Jan 18, 2025

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

Python 7,528 1,268 Updated Jan 16, 2025

An AI-powered search engine for developers.

1,439 28 Updated Jul 23, 2024

Resources on LLM-based applications built with the RAG pattern

1,024 59 Updated Jan 17, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 28,613 2,723 Updated Jan 17, 2025

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,127 88 Updated Aug 6, 2024

More relighting!

Python 7,371 436 Updated Nov 28, 2024

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Python 221 6 Updated Aug 15, 2024

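Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are collected in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use.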

Python 14,360 1,283 Updated Sep 5, 2024

Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper

TypeScript 4,782 758 Updated Sep 28, 2024

Stable Diffusion web UI

Python 146,038 27,377 Updated Dec 28, 2024

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 814 41 Updated Nov 23, 2024

Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative

Python 4,028 422 Updated Oct 11, 2024

Unofficial implementation of BRIA RMBG Model for ComfyUI

Python 746 56 Updated May 22, 2024

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Python 1,249 126 Updated Jan 5, 2025