- Meta AI
- New York, USA
- https://koustuvsinha.com
- @koustuvsinha
Highlights
- Pro
Stars
- ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).
- 🔥 Aurora Series: a more efficient multimodal large language model series for video.
- 💭👀 precognition.nvim - uses virtual text and gutter signs to show available motions.
- Accelerating the development of large multimodal models (LMMs) with the one-click evaluation module lmms-eval.
- A flexible and efficient codebase for training visually-conditioned language models (VLMs).
- Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision.
- Machine Learning Engineering Open Book.
- Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023).
- Cross-platform, fast, feature-rich, GPU-based terminal.
- A Neovim plugin for interactively running code with the Jupyter kernel; a fork of magma-nvim with improvements in image rendering, performance, and more.
- ICLR 2024 Spotlight: curation/training code, metadata, distribution, and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering.
- A high-throughput and memory-efficient inference and serving engine for LLMs (a minimal usage sketch follows this list).
- Utilities for decoding deep representations (like sentence embeddings) back to text.
- Open-Sora: Democratizing Efficient Video Production for All.
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties.
- [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding.
- 20+ high-performance LLMs with recipes to pretrain, finetune, and deploy at scale.
- HT-Step is a large-scale article-grounding dataset of temporal step annotations on how-to videos.
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV 2021].
- PyTorch code and models for V-JEPA self-supervised learning from video.
- [CVPR 2024 Highlight][VideoChatGPT] ChatGPT with video understanding, plus many more supported LMs such as MiniGPT-4, StableLM, and MOSS.
- [NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering.
- Fast Differentiable Tensor Library in JavaScript and TypeScript with Bun + Flashlight.
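The "high-throughput and memory-efficient inference and serving engine for LLMs" entry above matches vLLM's tagline. Assuming it is vLLM, here is a minimal offline-generation sketch; the model name, prompt, and sampling settings are illustrative choices, not taken from this profile.

```python
# Minimal offline generation sketch, assuming the starred engine is vLLM.
# Model name, prompt, and sampling settings below are example values.
from vllm import LLM, SamplingParams

prompts = ["Summarize mechanistic interpretability in one sentence."]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# Load the model; vLLM manages the KV cache and batching internally.
llm = LLM(model="facebook/opt-125m")

# Batched generation; each result carries the prompt and its completions.
outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(output.prompt, "->", output.outputs[0].text)
```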