Skip to content
View adarshxs's full-sized avatar
😼
😼

Highlights

  • Pro

Organizations

@ACM-VIT

Block or report adarshxs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 15,845 1,144 Updated Jan 19, 2025

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 358 38 Updated Sep 24, 2024

Convert PDF to HTML without losing text or format.

HTML 4,723 425 Updated Jul 4, 2024

Materials for learning SGLang

168 10 Updated Jan 8, 2025

A port of muerrilla's sd-webui-Detail-Daemon as a node for ComfyUI, to adjust sigmas that control detail.

Python 507 15 Updated Nov 4, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 3,461 258 Updated Aug 10, 2024
Python 2 Updated Dec 30, 2024

An awesome & curated list of best LLMOps tools for developers

Shell 4,263 417 Updated Jan 17, 2025

An open-source RAG-based tool for chatting with your documents.

Python 20,416 1,577 Updated Jan 17, 2025

noise_step: Training in 1.58b With No Gradient Memory

TeX 214 10 Updated Dec 25, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,804 363 Updated Jan 16, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 7,399 713 Updated Jan 19, 2025

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 14,491 1,659 Updated Jan 19, 2025

Open and efficient video watermarking

Jupyter Notebook 287 29 Updated Jan 16, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 22,990 1,896 Updated Jan 18, 2025

A blender addon for generating meshes with AI

Python 437 25 Updated Jan 13, 2025

An interactive multilingual learning platform powered by Sarvam AI, AI4Bharat, and OpenAI.

Python 1 Updated Nov 6, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,400 230 Updated Jan 14, 2025

Train VAE like a boss

Jupyter Notebook 252 11 Updated Oct 21, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,005 359 Updated Dec 18, 2024

[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance

Python 244 15 Updated Mar 18, 2024

A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.

Go 242 11 Updated Aug 20, 2024

Efficient Triton Kernels for LLM Training

Python 4,201 244 Updated Jan 19, 2025

face-to-sticker

Python 630 65 Updated Mar 1, 2024

This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.

199 12 Updated Dec 30, 2024

Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish

Jupyter Notebook 167 5 Updated Jul 31, 2024

LLM101n: Let's build a Storyteller

31,044 1,698 Updated Aug 1, 2024

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Python 244 15 Updated Aug 11, 2024

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

642 37 Updated Aug 3, 2024

A generative speech model for daily dialogue.

Python 33,750 3,661 Updated Jan 13, 2025
Next