Stars
Examples and guides for using the OpenAI API
Azure Databricks MLOps sample for Python-based source code that uses MLflow without MLflow Project.
Build high-quality LLM apps, from prototyping and testing to production deployment and monitoring.
Open, Multi-modal Catalog for Data & AI
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
DuckDB is an analytical in-process SQL database management system
Python implementation of the classic Mastermind game using Pygame with a computer vision feature.
Hydra is a framework for elegantly configuring complex applications
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Spark: The Definitive Guide's Code Repository
Example code from the Learning Spark book
Open source platform for the machine learning lifecycle
Apache Spark - A unified analytics engine for large-scale data processing
Deep Learning Pipelines for Apache Spark
Always know what to expect from your data.
Fixes mojibake and other glitches in Unicode text, after the fact.
A game theoretic approach to explain the output of any machine learning model.
This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-storage-transfer
Google BigQuery connector for pandas
Google Cloud Client Library for Python
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
An Open Source Machine Learning Framework for Everyone