-
Georgia Institute of Technology
- Atlanta, GA
- https://www.youtube.com/c/MichaelGalarnyk
- @GalarnykMichael
Stars
A toolkit and dataset for multimodal IPO filing analysis.
A demonstration on how to create a virtual environment bridge to handle package dependencies
[EMNLP 2025 System Demonstrations] ConfReady is an easy-to-use Llama or GPT powered web interface which can be used to empower authors to reflect on their work and assist authors with conference ch…
[KDD'25] This is the official code repo for our KDD'25 paper "Calibrating Pre-trained Language Classifier on LLM-generated Noisy Labels vis Iterative Refinement"
Codebase for VideoConviction, accepted at KDD 2025 (D&B Track)
This is the official repository for the paper "Words That Unite The World: A Unified Framework for Deciphering Global Central Bank Communications"
ACLReady, a retrieval-augmented language model application that can be used to empower authors to reflect on their work and assist authors with the ACL checklist.
Collection of Summer 2026 tech internships!
The study explores the connection between Reddit sentiment and Bitcoin market dynamics. Through graphical analysis and deep learning models, it examines the correlation between Reddit sentiment and…
A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, accompanying The 'Problem' of Human Label Variation: On Grou…
Python tutorials in both Jupyter Notebook and youtube format.
Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Easily generate information-rich, publication-quality tables from R
Shingho is a PySpark based statistical library designed for Big Data applications.
Execute Python code on the fly and display results in Tableau visualizations:
Interactive computing for complex data processing, modeling and analysis in Python 3
Modern databases can contain massive volumes of data. Within this data lies important information that can only be effectively analyzed using data mining. Data mining tools and techniques can be us…
Using the Python PyQt4 library to create a GUI layout with several functionalities.
Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzi…
Python tutorials and puzzles to share with the world!
Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)



