Skip to content
View zhaoshitian's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@Alpha-VLLM

Block or report zhaoshitian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Causal-CoG Causal-CoG Public

    [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"

    Python 11

  2. Alpha-VLLM/Lumina-mGPT Alpha-VLLM/Lumina-mGPT Public

    Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

    Python 510 22

  3. Likelihood-Composition-Toolkit Likelihood-Composition-Toolkit Public

    [EMNLP'24 Findings] Implementation of "Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models"

    1

  4. pptx-agent pptx-agent Public

    Parse a PPTX file to JSON format and the decomposed images.

    1