Skip to content
View rxmao's full-sized avatar

Block or report rxmao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,005 362 Updated Dec 6, 2024
Python 317 40 Updated May 30, 2024

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Cuda 685 34 Updated Dec 7, 2024

Making LLaVA Tiny via MoE-Knowledge Distillation

Python 70 4 Updated Oct 24, 2024

Are gradient information useful for pruning of LLMs?

Python 40 8 Updated Apr 22, 2024

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.

Python 25 3 Updated Dec 5, 2024
Python 29 4 Updated Dec 10, 2024

Efficient LLM Inference over Long Sequences

Python 304 12 Updated Dec 6, 2024

Focus on prompting and generating

Python 41,964 6,026 Updated Aug 21, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 2,022 129 Updated Dec 3, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 899 105 Updated Oct 7, 2024

Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)

Python 45 6 Updated Nov 17, 2024

Pruning the VLLMs

Python 64 2 Updated Dec 9, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,179 120 Updated Dec 13, 2024

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …

Python 8,769 1,682 Updated Dec 9, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 6,159 418 Updated Dec 6, 2024

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 19,842 2,020 Updated Nov 23, 2024

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Python 108 4 Updated Nov 28, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,860 4,844 Updated Dec 14, 2024

VIINA: Violent Incident Information from News Articles on the 2022 Russian Invasion of Ukraine

269 21 Updated Dec 13, 2024

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

Python 2,944 229 Updated Dec 13, 2024

An open-source RAG-based tool for chatting with your documents.

Python 17,970 1,397 Updated Dec 11, 2024

An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)

Python 275 728 Updated Dec 2, 2024

An Open-Source Package for Information Retrieval

Python 156 20 Updated Oct 8, 2024

RSL-SQL: Robust Schema Linking in Text-to-SQL Generation

Python 27 3 Updated Nov 29, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,166 400 Updated Dec 13, 2024
Python 56 8 Updated May 21, 2024

Build resilient language agents as graphs.

Python 7,211 1,154 Updated Dec 14, 2024

A MULTI-GENERATOR ENSEMBLE FRAMEWORK FOR NATURAL LANGUAGE TO SQL

178 3 Updated Dec 13, 2024

More relighting!

Python 6,951 409 Updated Nov 28, 2024
Next