    Repositories list

    • SRI Group Website
      HTML
      MIT License
      8910 Updated Mar 21, 2025
    • Automated Classification of Model Errors on ImageNet (NeurIPS 2023)
      Jupyter Notebook
      Apache License 2.0
      1600 Updated Mar 18, 2025
    • Human-Guided Fair Classification for NLP (ICLR 2023, Spotlight)
      Python
      Creative Commons Zero v1.0 Universal
      0500 Updated Mar 18, 2025
    • JavaScript
      0000 Updated Mar 18, 2025
    • cuts

      Public
      Python
      MIT License
      0610 Updated Mar 17, 2025
    • psi

      Public
      Exact Inference Engine for Probabilistic Programs
      JetBrains MPS
      Boost Software License 1.0
      1813230 Updated Mar 13, 2025
    • Python
      0000 Updated Mar 13, 2025
    • ToolFuzz

      Public
      ToolFuzz is a fuzzing framework designed to test your LLM Agent tools.
      Python
      MIT License
      01300 Updated Mar 12, 2025
    • Black-Box Detection of Language Model Watermarks (ICLR 2025)
      Python
      0000 Updated Mar 10, 2025
    • matharena

      Public
      Evaluation of LLMs on latest math competitions
      Python
      MIT License
      03730 Updated Mar 10, 2025
    • ward

      Public
      Ward: Provable RAG Dataset Inference via LLM Watermarks (ICLR 2025)
      Python
      MIT License
      0100 Updated Feb 26, 2025
    • Python
      MIT License
      55400 Updated Feb 16, 2025
    • Python
      MIT License
      0500 Updated Feb 16, 2025
    • Python
      Apache License 2.0
      0400 Updated Feb 14, 2025
    • MathConstruct: Challenging LLM Reasoning with Constructive Proofs
      Python
      MIT License
      1300 Updated Feb 14, 2025
    • Implementation of Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation
      Python
      Apache License 2.0
      0300 Updated Feb 11, 2025
    • JavaScript
      MIT License
      0100 Updated Feb 10, 2025
    • Python
      MIT License
      01000 Updated Feb 3, 2025
    • CTBench

      Public
      Python
      1200 Updated Jan 20, 2025
    • Python
      MIT License
      21800 Updated Jan 9, 2025
    • SynthPAI

      Public
      A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)
      HTML
      MIT License
      53600 Updated Nov 28, 2024
    • JavaScript
      0100 Updated Nov 7, 2024
    • ChromeER

      Public
      C++
      BSD 3-Clause "New" or "Revised" License
      3073017 Updated Nov 4, 2024
    • synthetiq

      Public
      OpenQASM
      MIT License
      1300 Updated Oct 12, 2024
    • Python
      41620 Updated Oct 2, 2024
    • The website for "Watermark Stealing in Large Language Models".
      HTML
      459100 Updated Sep 27, 2024
    • Controlled Text Generation via Language Model Arithmetic
      Python
      MIT License
      1521621 Updated Sep 15, 2024
    • ConStat

      Public
      A statistical test for contamination detection in language models.
      Python
      Apache License 2.0
      1500 Updated Jul 29, 2024
    • dl2

      Public
      DL2 is a framework that allows training neural networks with logical constraints over numerical values in the network (e.g. inputs, outputs, weights) and to query networks for inputs fulfilling a logical formula.
      Python
      MIT License
      158655 Updated Jul 25, 2024
    • diffai

      Public
      A certifiable defense against adversarial examples by training neural networks to be provably robust
      Python
      MIT License
      2621911 Updated Jul 25, 2024