#data-science

  1. lance

    A columnar data format that is 100x faster than Parquet for random access

    v2.0.1 140K #apache-arrow #data-analytics #data-science #machine-learning
  2. rgwml

    ONLY 🤯 RUST-dominant AI, Data Science & Machine Learning RUST Library designed to minimize developer cognitive load, and replicate the Python Pandas Library with OpenAI, XGBoost…

    v1.3.81 18K #artificial-intelligence #csv #google-big-query #machine-learning #mysql #data-science #sql-server #openai #mssql-server #xgboost
  3. lance-bitpacking

    Vendored copy of https://github.com/spiraldb/fastlanes for use in Lance

    v2.0.1 128K #apache-arrow #data-analytics #machine-learning #data-science #data-format
  4. trueno-viz

    SIMD/GPU/WASM-accelerated visualization library for data science and ML

    v0.1.23 7.4K #data-science #gpu #graphics #wasm #visualization
  5. lace

    A probabilistic cross-categorization engine

    v0.9.0 #oracle #simulation #categorical #animal #column #uncertainty #cross-categorization #codebook #predict #data-science
  6. fluxor

    versatile Rust web framework designed for data science and computing science applications

    v1.1.2 #web-apps #data-science #web-framework #async
  7. fsst

    FSST string compression for Lance

    v2.0.1 137K #apache-arrow #data-analytics #data-science #machine-learning
  8. quantrs2-ml

    Quantum Machine Learning module for QuantRS2

    v0.1.2 #quantum-computing #quantum-machine-learning #machine-learning #data-science #artificial-intelligence
  9. lance-datagen

    A columnar data format that is 100x faster than Parquet for random access

    v2.0.1 134K #apache-arrow #data-analytics #machine-learning #data-science
  10. fluxor_cli

    Fluxor CLI: a command-line tool that allows developers to quickly and efficiently create project starters for the Fluxor web framework

    v1.1.2 #web-framework #data-science #cli #web
  11. lance-datafusion

    Internal utilities used by other lance modules to simplify working with datafusion

    v2.0.1 140K #apache-arrow #data-analytics #data-science #machine-learning
  12. wbi-rs

    + CLI to fetch, store, visualize, and summarize World Bank indicator data

    v0.1.11 #plot #data-science #worldbank #science
  13. r4pm

    Process Mining CLI for working with (object-centric) event data

    v0.4.4 #process-mining #events #cli #data-science #data-mining #web-ui
  14. concision

    toolkit for designing machine-learning models in Rust

    v0.3.1 #machine-learning #data-science
  15. kerblam

    A project management tool for data science and bioinformatics

    v1.2.1 450 #data-science #container #execution #virtualization
  16. datahugger

    fetching data and metadata from DOI or URL

    v0.2.0 #data-science #research #science
  17. lance-file

    Lance file format

    v2.0.1 139K #apache-arrow #data-analytics #data-science #machine-learning
  18. graphina

    A graph data science library for Rust

    v0.3.0-alpha.4 #graph-algorithms #data-science #graph-theory #graph-analytics #graph-data
  19. lance-namespace

    Lance Namespace Core APIs

    v2.0.1 108K #apache-arrow #data-science #machine-learning #data-analytics
  20. lance-namespace-impls

    Lance Namespace Implementations

    v2.0.1 18K #apache-arrow #data-science #machine-learning #data-analytics
  21. lance-encoding

    Encoders and decoders for the Lance file format

    v2.0.1 137K #apache-arrow #data-analytics #machine-learning #data-science
  22. lace_cc

    Core of the Lace cross-categorization engine library

    v0.7.0 700 #lace #engine #data-science #cross-categorization #data-model #sparse-data #categorical #machine-learning #logp #posterior
  23. lance-table

    Lance table format

    v2.0.1 139K #apache-arrow #machine-learning #data-analytics #data-science
  24. lance-io

    I/O utilities for Lance

    v2.0.1 139K #apache-arrow #data-analytics #machine-learning #data-science
  25. lance-index

    Lance indices implementation

    v2.0.1 140K #apache-arrow #machine-learning #data-analytics #data-science
  26. rusty-logging

    Logging for OpsML

    v0.6.0 1.1K #artificial-intelligence #quality-control #model-deployment #monitoring #logging #machine-learning #governance #data-science #generative-ai #opsml
  27. lance-linalg

    A columnar data format that is 100x faster than Parquet for random access

    v2.0.1 140K #apache-arrow #data-analytics #machine-learning #data-science
  28. lance-jni

    JNI bindings for Lance Columnar format

    v0.31.0 1.8K #apache-arrow #machine-learning #data-analytics #data-science
  29. lance-arrow

    Arrow Extension for Lance

    v2.0.1 143K #apache-arrow #machine-learning #data-analytics #data-science
  30. ndtensor

    An n-dimensional tensor

    v0.1.1 170 #tensor #data-science
  31. lance-encoding-datafusion

    Encoders and decoders for the Lance file format that rely on datafusion

    v0.30.0 2.5K #apache-arrow #data-analytics #machine-learning #data-science
  32. snowflake-connector

    Connect to Snowflake

    v0.4.0 260 #data-science #database #snowflake
  33. lance-geo

    Lance's geospatial extension providing geospatial UDFs

    v2.0.1 27K #apache-arrow #data-analytics #machine-learning #data-science #data-format
  34. json2csv

    convert JSON to CSV

    v0.2.0 #convert-json #json-csv #data-science #record #input #json-key #array-value
  35. find_peaks

    Find peaks that match criteria in 1D data

    v0.1.5 204K #data-science #spectrum #prominence #signal
  36. sparsers

    sparse-rs: sparse matrix computation written in Rust

    v0.1.0 #scientific-computing #data-science #linear-algebra
  37. lance-testing

    A columnar data format that is 100x faster than Parquet for random access

    v2.0.1 24K #apache-arrow #data-analytics #machine-learning #data-science
  38. newslookout

    A web scraping platform built for news scanning, using LLMs for text processing, powered by Rust

    v0.4.9 1.1K #data-science #machine-learning #model-deployment #analytics
  39. amadeus

    Harmonious distributed data processing & analysis in Rust. parquet postgres aws s3 cloudfront elb json csv logs hadoop hdfs arrow common crawl

    v0.4.3 #data-science #constellation #distributed
  40. lance-examples

    Lance examples in Rust

    v2.0.1 #apache-arrow #data-analytics #machine-learning #data-science #data-format
  41. axion-data

    A high-performance data processing library written in Rust, providing DataFrame and Series functionality similar to pandas

    v0.1.1 #dataframe #pandas #data-science #statistics #analytics
  42. concision-utils

    Concision is a toolkit for designing machine-learning models in Rust

    v0.2.8 #machine-learning #data-science #scsys #toolkit
  43. concision-data

    additional tools for working with datasets

    v0.3.1 #machine-learning #data-science #toolkit
  44. xpttools

    XPT read library in Rust and CLI tool to convert to CSV

    v0.2.2 #csv #data-analysis #xpt #data-science #clinical-data
  45. concision-ext

    implements additional models using the concision framework

    v0.3.1 #machine-learning #data-science #toolkit
  46. RustFrames

    A blazing-fast, memory-safe alternative to NumPy + Pandas, written in Rust

    v1.0.0 #rustframes #dataframe #numpy #data-science #pandas #linear-algebra
  47. concision-traits

    implements the core modules for the concision framework

    v0.3.1 #machine-learning #data-science
  48. cox-hazards

    Cox proportional hazards regression with elastic net regularization

    v0.2.1 100 #machine-learning #statistics #data-science #survival-analysis #cox-regression
  49. scidataflow

    A command-line tool to manage scientific research project data

    v0.8.11 290 #bioinformatics #data-science #reproducibility
  50. kaggle

    Unofficial Rust implementation of the Kaggle API

    v2.0.0 370 #dataset #data-science
  51. concision-transformer

    implements the transformer model using the concision framework

    v0.2.8 #machine-learning #data-science #scsys #toolkit
  52. concision-kan

    implements the kan model using the concision framework

    v0.2.8 #machine-learning #data-science #scsys #toolkit
  53. otters-rs

    High-performance vector search with metadata filtering

    v0.1.0-alpha3 #vector-search #semantic-search #data-science #similarity-search #embedding
  54. concision-init

    various random distribution and initialization routines for the concision framework

    v0.3.1 #machine-learning #data-science #toolkit
  55. concision-neural

    implements various abstractions for designing neural networks

    v0.2.8 #machine-learning #data-science #scsys #toolkit
  56. sklears

    A comprehensive machine learning library in Rust, inspired by scikit-learn

    v0.1.0-beta.1 #machine-learning #scikit-learn #data-science #rust #machine-learning-ml
  57. lace_data

    Data definitions and data container definitions for Lace

    v0.3.0 #lace #data-science #data-source #machine-learning-data #container #categorical #ml #user-guide #dataframe #weather
  58. concision-math

    Concision is a toolkit for designing machine-learning models in Rust

    v0.1.21 160 #machine-learning #data-science #scsys #toolkit
  59. rotoml

    A native Rust AutoML pipeline toolkit

    v0.1.2 #machine-learning #data-science #ai-agent #automl
  60. concision-params

    implements the core modules for the concision framework

    v0.3.1 #machine-learning #data-science #toolkit
  61. live-iron

    A performant, extensible cellular and genetic automata library for Rust

    v0.1.2 110 #cellular-automata #data-science #machine-learning
  62. sklears-simd

    High-performance SIMD acceleration primitives for the Sklears machine learning ecosystem

    v0.1.0-beta.1 #data-science #machine-learning #scikit-learn #rust #machine-learning-ml
  63. lance-tools

    Tools for interacting with Lance files and tables

    v2.0.1 #apache-arrow #data-analytics #machine-learning #data-science
  64. SparseDOKs

    Sparse-matrix DOK implementations

    v0.1.0 #sparsedoks #sparse-matrix #data-science #dok #mm-multiplication
  65. light-snowflake-connector

    Lightweight wrapper around Snowflake's REST API

    v0.1.1 #rest #snowflake #data-science
  66. concision-s4

    implements the s4 model using the concision framework

    v0.2.8 #machine-learning #data-science #scsys #toolkit
  67. concision-models

    implements additional models using the concision framework

    v0.2.8 #machine-learning #data-science #scsys #toolkit
  68. confusion_matrix

    Confusion matrix implementation for storing results from a classification experiment and providing statistical information

    v1.1.0 #machine-learning #data-science #analysis
  69. moose

    Encrypted learning and data processing framework

    v0.2.2 #machine-learning #secure-computation #data-science #distributed #cryptography
  70. specds

    A spec-driven data science pipeline generator using LLMs

    v0.1.0 #data-science #codegen #data-engineering
  71. lace_consts

    Default constants for Lace

    v0.2.1 #lace #constant #data-science #survey #machine-learning #dataframe #user-guide #weather #variance #uncertainty
  72. evolution-slicer

    Data slicing components for evolution

    v1.3.0 #fixed-length #evolution #iceberg #component #old #evolve #data-analytics #apache-arrow #delta-lake #data-science
  73. fluent_data

    A low footprint streaming data modelization library and service

    v1.2.4 #data-streaming #algorithm #data-science
  74. concision-snn

    Synaptic Neural Networks for the Concision Machine Learning Framework

    v0.2.8 #machine-learning #data-science #scsys #toolkit
  75. ppca

    Probabilistic Principal Component Analysis model

    v0.5.0 200 #machine-learning #data-science #missing-values #dimension-reduction
  76. evolution-parser

    Data parsing functionality for evolution

    v1.3.0 #fixed-length #evolution #file-format #iceberg #old #evolve #data-analytics #delta-lake #apache-arrow #data-science
  77. lance-core

    Lance Columnar Format -- Core Library

    v2.0.1 140K #apache-arrow #data-analytics #data-science #machine-learning
  78. rusty_science

    An easy to learn and use ML toolkit for rust

    v0.1.1 110 #machine-learning #data-science #machine-learning-ml #ml
  79. jiro_nn

    Neural Networks framework with model building & data preprocessing features

    v0.8.1 #machine-learning #neural-network #gradient-descent #data-analysis #data-science
  80. wisard

    nets implementation in Rust

    v0.0.3 #machine-learning #neural-network #data-science #weightless
  81. evolution-mocker

    Mocking components of evolution

    v1.3.0 #fixed-length #mocking #data-science #multi-threading #evolution #iceberg #evolve #schema-file #data-analytics #delta-lake
  82. evolution-target

    Output targets for evolution

    v1.3.0 #target #fixed-length #data-analytics #data-science #evolution #iceberg #evolve #delta-lake #apache-arrow
  83. automat

    Data wrangling from the command line

    v0.0.8 #data-science #data-analysis #command-line #tabular-data #command-line-data #data-manipulation #wrangling
  84. ravencol

    Tabular data manipulation

    v0.1.4 #csv #dataframe #data-science #data-manipulation
  85. neural_networks_rust

    Neural Networks framework with model specification & data preprocessing features

    v0.5.0 #neural-network #machine-learning #data-analysis #gradient-descent #data-science
  86. jyafn

    Computational graphs for Data Science that compile to machine code

    v0.3.1 110 #onnx #data-science #ml-ops #graph-data
  87. ff_k_center

    A linear-time k-center algorithm with fairness conditions and worst-case guarantees that is very fast in practice. Includes python bindings.

    v1.2.2 #k-center #data-science #fairness #algorithm
  88. aorist

    Cdylib for aorist project. Can be accessed from Python.

    v0.0.14 #python #machine-learning #data-science #codegen #repetitive #dag #data-replication #cdylib
  89. evolution-builder

    Builder implementations for evolution

    v1.3.0 #fixed-length #builder #data-science #file-format #evolution #iceberg #evolve #parquet #indra-db #data-analytics
  90. evolution-common

    Common util components of evolution

    v1.3.0 #fixed-length #evolution #iceberg #file-format #component #evolve #data-analytics #delta-lake #apache-arrow #data-science
  91. fast-neural-network

    A heavily parallelized neural network library designed for speed and flexibility

    v0.7.0 460 #neural-network #data-science #machine-learning #parallel
  92. ream

    Data language for building maintainable social science datasets

    v0.4.2 #data-science #dataset #csv #social #maintainable #template-engine #data-points
  93. rusty_learn

    A small, educational, scikit-learn-like machine learning library written in Rust

    v0.1.1 #machine-learning #data-science #machine-learning-ml #ml
  94. evolution-schema

    Schema implementations for evolution

    v1.3.0 #schema-evolution #fixed-length #data-analytics #data-science #schema-file #iceberg #evolve #delta-lake #apache-arrow
  95. snowflake-deserializer

    Connect to Snowflake, used with snowflake-connector crate

    v0.4.0 300 #snowflake-connector #data-science #snowflake
  96. amadeus-types

    Harmonious distributed data analysis in Rust

    v0.4.3 #amadeus #data-science #distributed #distributed-data
  97. telraam-rs

    Telraam API CLI and library for collecting data from IoT devices

    v0.1.0 #iot #data-analysis #iot-data #traffic #data-science #street #traffic-analysis #open-data
  98. concision-core

    implements the core modules for the concision framework

    v0.3.1 #machine-learning #data-science
  99. evolution-writer

    Output target writers for evolution

    v1.3.0 #fixed-length #target #writer #data-science #evolution #iceberg #evolve #parquet #schema-file #data-analytics
  100. feature-factory

    A high-performance feature engineering library for Rust powered by Apache DataFusion

    v0.1.1-alpha #data-science #feature-engineering #feature-selection #feature-extraction
  101. concision-linear

    Concision is a complete data-science toolkit written in Rust

    v0.1.14 140 #data-science #toolkit #scsys
  102. aorist_primitives

    Primitive macros for the aorist project

    v0.0.14 #aorist #data-science #macro #python #ml-ops #universe #repetitive #machine-learning #machine-learning-data
  103. aorist_ast

    AST (Abstract Syntax Tree) building blocks for the aorist project

    v0.0.14 #ast #aorist #machine-learning #building-block #ml-ops #data-science #repetitive
  104. ssam

    Short for split sampler. Splits one or more text-based input files into multiple sets using random sampling. This is useful for splitting data into training, test and development sets, or whatever sets you desire.

    v0.2.0 #random #data-science #nlp #linguistics
  105. aorist_attributes

    Definitions for various kinds of data attributes in the aorist project

    v0.0.14 #aorist #attributes #data-science #define #ml-ops #machine-learning #repetitive #py #data-replication
  107. aorist_constraint

    Example constraint crate for the aorist project

    v0.0.14 #aorist #constraints #machine-learning #data-science #ml-ops #repetitive #machine-learning-ml #py
  108. presto-cli

    Presto accelerates preprocessing with precision

    v0.1.0 #tui #data-analysis #data-science
  109. mars

    A data science notebook

    v0.0.2 #notebook #data-science #context
  110. DeepIron

    machine learning and deep learning

    v0.1.4 240 #deepiron #deep-learning #machine-learning #data-science #rust
  111. deep_rust

    Machine learning crate in Rust (under dev)

    v0.1.1 #deep-learning #machine-learning #data-science #analytics
  112. lance-test-macros

    A columnar data format that is 100x faster than Parquet for random access

    v2.0.1 #apache-arrow #data-analytics #machine-learning #data-science
  113. sci_rust

    A scientific Rust library

    v0.0.2 #scientific-computing #data-science #machine-learning #linear-algebra #breaking-change #numerical-computation #statistics
  114. kornia-core

    Lightweight tensor library in Rust for computer vision

    v0.1.7 280 #computer-vision #tensor #low-level #thread-safe #3d #artificial-intelligence #image-resizing #convert-images #python-bindings #data-science
  115. parsnip

    Data science metrics (presently categorical only) for Rust

    v0.3.0 #data-science #categorical #metrics #gini #presently #impurity
  116. cogset

    Generic implementations of clustering algorithms. Includes k-means, DBSCAN and OPTICS.

    v0.2.0 1.1K #cluster-analysis #data-science #dbscan
  117. rusty_kan

    Kolmogorov-Arnold Networks in Rust

    v0.1.1 #deep-learning #data-science #machine-learning #rust
  118. concision-gnn

    Concision is a complete data-science toolkit written in Rust

    v0.1.14 140 #data-science #scsys #toolkit
  119. lrtc

    Compression-based low-resource text classification as introduced in Jiang et al (2023)

    v0.1.4 #text-classification #machine-learning #data-science
  120. egui_heatmap

    Navigable heatmap for use together with egui

    v0.4.5 #data-science #gui #image
  121. pachyderm

    The official Pachyderm Rust library

    v0.4.1 #data-science #big-data #kubernetes #big-data-analytics #analytics
  122. wandb

    Weights & Biases Rust SDK

    v0.18.7-alpha.1 #biases #weights #artificial-intelligence #rust-sdk #model #data-science #machine-learning #ml-ops #reinforcement-learning #hyperparameter-optimization
  123. overdose

    Fast, Row Oriented, Kotlin, Scala-like dataframe

    v0.1.0 #data-science #concurrency
  124. concision-macros

    custom macros for the concision framework

    v0.3.1 #machine-learning #data-science #toolkit
  125. amadeus-core

    Harmonious distributed data analysis in Rust

    v0.4.3 #data-science #distributed #distributed-data
  126. pmrs

    Rust support to process mining functions. Includes a library and a small cli-interface.

    v0.0.2 #data-mining #data-science #machine-learning #process-mining #performance
  127. datasaurust

    Blazingly fast implementation of the Datasaurus paper

    v0.1.0 #plot #statistics #data #data-science #science
  128. quickmath

    A quick command-line math evaluator

    v0.2.3 #computer-science #expression-evaluator #command-line-calculator #math-expression #expression-evaluation #data-science
  129. kddbscan

    A k-Deviation Density Based Clustering Algorithm (kDDBSCAN)

    v0.1.0 #density #deviation #data-science #dynamic
  130. oner_induction

    1R rule induction algorithm

    v0.2.1 #machine-learning #rules #data-science
  131. exotic

    very rare deep learning framework for rust😏. (See also exotic_macro)

    v0.1.3 #deep-learning #machine-learning #framework #rare #data-science
  132. oner_quantize

    1R numeric quantization algorithm

    v0.1.0 #machine-learning #data-science #rules
  133. rds

    Rust Data Science

    v0.0.3 #data-science #array #blas #im #file-format #matrix-multiplication #reimplement #plot #real-world
  134. jmspack-rust

    functions that James finds useful

    v0.1.0 #matrix #data-science #data-manipulation #machine-learning #array
  135. geocode

    Find location information by using Google Maps or DataScienceToolkit API

    v0.1.1 #google-maps #find-location #information #data-science #google-api
  136. concision-derive

    custom derive macros for the concision framework

    v0.3.1 #machine-learning #data-science #toolkit