Starred repositories

Techniques and numbers for estimating system's performance from first-principles

Rust 4,032 159 Updated Sep 15, 2024

A curated collection of free Machine Learning related eBooks

99 33 Updated May 9, 2018

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 5,414 935 Updated Mar 5, 2024
JavaScript 276 61 Updated Nov 6, 2023

ScyllaDB cluster setup guide using podman

Shell 2 1 Updated May 25, 2023

Grafana panel to integrate with any kind of HTTP/REST API

TypeScript 52 36 Updated Dec 15, 2023

OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.

C++ 1,605 321 Updated Jan 29, 2025

QuestDB is a high performance, open-source, time-series database

Java 14,871 1,207 Updated Feb 9, 2025

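🔥LeetCode solutions in any programming language | Solutions in multiple programming languages for LeetCode, 剑指 Offer (Coding Interviews, 2nd Edition), and 程序员面试金典 (Cracking the Coding Interview, 6th Edition)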

Java 32,873 8,327 Updated Feb 8, 2025

Awesome LeetCode resources to learn Data Structures and Algorithms and prepare for Coding Interviews.

Java 7,715 1,974 Updated Jan 26, 2025

Examples on how to use the command line tools in Avro Tools to read and write Avro files

154 57 Updated May 1, 2024

Data, Analytics & AI. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

Rust 8,158 762 Updated Feb 8, 2025

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and mo…

Python 3,701 262 Updated Dec 15, 2024

Add Try It Out option on Redoc

TypeScript 30 11 Updated Jul 24, 2024

A distributed block-based data storage and compute engine

C++ 156 19 Updated Jan 27, 2025

What's in your data? Extract schema, statistics and entities from datasets

Python 1,457 167 Updated Feb 6, 2025

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Python 12,694 1,689 Updated Feb 6, 2025

An orchestration platform for the development, production, and observation of data assets.

Python 12,476 1,579 Updated Feb 9, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,362 545 Updated Feb 5, 2025

LZ4 compression for Java

Java 1,120 252 Updated Sep 19, 2024

A collection of Kotlin Multiplatform cryptographic hashing functions.

Kotlin 96 4 Updated Feb 5, 2025

Extremely fast non-cryptographic hash algorithm

C 9,478 799 Updated Feb 5, 2025

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collec…

Jupyter Notebook 2,689 121 Updated Jan 10, 2025

VictoriaMetrics: fast, cost-effective monitoring solution and time series database

Go 13,140 1,278 Updated Feb 9, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 17,173 4,298 Updated Feb 9, 2025

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Python 2,005 223 Updated Feb 7, 2025

Visual analysis and diagnostic tools to facilitate machine learning model selection.

Python 4,313 562 Updated Sep 27, 2024

A light-weight, flexible, and expressive statistical data testing library

Python 3,607 320 Updated Feb 9, 2025

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,306 2,842 Updated Feb 8, 2025

A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profiling data 🚀

71 6 Updated May 7, 2024