Starred repositories
A comprehensive library for computational molecular biology
A trainable PyTorch reproduction of AlphaFold 3.
Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders
A terminal workspace with batteries included
Ergonomic and modular web framework built with Tokio, Tower, and Hyper
ETL, Analytics, Versioning for Unstructured Data
A specification that Python filesystems should adhere to.
A Rust HTTP server for Python applications
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.
ClickHouse® is a real-time analytics DBMS
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse
Train transformer language models with reinforcement learning.
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.
Optimal transport tools implemented with the JAX framework, to get differentiable, parallel and jit-able computations.
jax-triton contains integrations between JAX and OpenAI Triton
A Python tool to enforce dependencies, using modular architecture 🌎 Open source 🐍 Installable via pip 🔧 Able to be adopted incrementally - ⚡ Implemented with no runtime impact ♾️ Interoperable with…
DuckDB-powered Postgres for high performance apps & analytics.
Efficient Triton Kernels for LLM Training