Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR.

299 12 Updated Sep 2, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 18,370 1,300 Updated Nov 21, 2024

baaivision / DIVA

Diffusion Feedback Helps CLIP See Better

Python 224 12 Updated Aug 24, 2024

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,078 393 Updated Dec 10, 2024

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 38,478 4,345 Updated Dec 14, 2024

baaivision / EVE

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Python 245 4 Updated Oct 2, 2024

datawhalechina / leedl-tutorial

Hung-yi Lee Deep Learning Tutorial (recommended by Prof. Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 14,033 2,926 Updated Dec 13, 2024

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,593 158 Updated Dec 12, 2024

lxtGH / OMG-Seg

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,335 50 Updated Dec 11, 2024

iterative / dvc

🦉 Data Versioning and ML Experiments

Python 14,016 1,194 Updated Dec 10, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,792 117 Updated Oct 30, 2024

AiuniAI / Unique3D

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 3,133 252 Updated Sep 18, 2024

fudan-generative-vision / hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,595 1,313 Updated Sep 14, 2024

lucasjinreal / ImageTokenizer

imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; it supports both images and video.

Python 30 Updated Jun 22, 2024

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 591 29 Updated Nov 20, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

30,492 1,671 Updated Aug 1, 2024

lucidrains / self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Models, from Meta AI

Python 1,347 73 Updated Apr 11, 2024

LvTianlei 2793145003

Stars

jeohalves / longkey

richards199999 / Thinking-Claude

hyz317 / StdGEN

multimodal-art-projection / AutoKaggle

nuno-faria / tetris-sql

Peterande / D-FINE

BestAnHongjun / SentenceVAE

X-PLUG / mPLUG-DocOwl

InternLM / HuixiangDou

SakanaAI / AI-Scientist

MaybeShewill-CV / segment-anything-u-specify

THUDM / CogVideo

Alpha-VLLM / Lumina-mGPT

ragavsachdeva / magi