-
ER-Evaluation (Python package on PyPI): Toolkit for the (statistical) evaluation of entity resolution, record linkage, and micro-clustering models.
-
StreamlitSurvey (Python package on PyPI): Streamlit apps can be a bit tricky to get right. Especially if you want multi-pages forms/surveys that keep track of progress and that can be restored from a saved state. This Streamlit component makes it easy to incorporate more complex forms on in your apps.
-
StringCompare (Python/C++ package): Most implementations of the Levenshtein distance are pretty bad. Here's how the basic algorithm can be implemented with minimal time and memory complexity.
-
CSVMeta (Python package on PyPI): You know how excel has been repeatedly blamed for high-profile disasters? Well, CSV files are part of the problem. If you're going to use CSV files, you need to specify your CSV dialect in a metadata file. This package reads/write CSV files with attached dialect information.
-
assert (R package on CRAN): Informative assertions in 16 lines of code with some R introspection magic (no dependencies!)
-
cache (R package on CRAN): On-disk caching for data science workflows in 15 lines of code (no dependencies!)
-
dgaFast/dga (R package on CRAN): I needed the dga R package (that I know maintain) to go fast. I re-implemented bottlenecks in C++ and did some linear algebra magic for a 100x speedup.
-
Fractals (Javascript App): More than 10 years ago, I deep dived into browser multithreading to create a Javascript Mendelbrot set visualization app.
-
Welcome to the Moon App (Javascript App): I love board games! I created this app as a companion to the wonderful Welcome to the Moon board game.
-
FingermatchR (R & C package): Ever needed an R wrapper to fingerprint matching C libraries? No? Well, a Duke postdoc did and sweet-talked me into building this... It may not be the most useful thing, but it was interesting!
🧑💻
Data Scientist @ American Institutes for Research
Duke Statistical Science PhD
-
Duke University
- Durham, NC
- https://olivierbinette.ca/
- in/olivier-binette
Pinned Loading
-
Awesome-Entity-Resolution
Awesome-Entity-Resolution PublicList of entity resolution software and resources.
-
er-evaluation
er-evaluation PublicAn End-to-End Evaluation Framework for Entity Resolution Systems
-
-
StringCompare
StringCompare PublicEfficient String Comparison Functions and Fuzzy String Matching
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.