Skip to content

mdozmorov/MachineLearning_notes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 

Repository files navigation

Machine- and Deep Learning resources

License: MIT PR's Welcome

Machine and deep learning and data analysis resources. Please, contribute and get in touch! See MDmisc notes for other programming and genomics-related notes.

Table of content

Cheatsheets

Awesome Deep Learning

Keras, Tensorflow

PyTorch

JAX

JAX is a combination of Automatic Differentiation and XLA (Accelerated Linear ALgebra). XLA is a compiler developed by Google to work on TPU units. Jax has Numpy as its higher layer of abstraction, and works the same way on CPU, GPU, and TPU (much faster).

  • awesome-jax - JAX - A curated list of resources

  • JAX - Jupyter (Colab) notebooks introducing JAX basic (jit, vmap, pmap, grad, and other) and advanced concepts, by @yvrjsharma

Graph Neural Networks

Transformers

DL Books

DL Courses & Tutorials

DL Videos

DL Papers

DL Papers Genomics

  • genomicsnotebook - Genomics Data Analysis with Jupyter Notebooks on Azure.

  • Machine Learning for Genomics - ML4GLand is a community for that develops and maintains tools (primarily in Python) for genomics sequence based machine learning.

  • SEQUOIA - a linearized transformer model for gene expression prediction from pathology slides. Uses UNI (foundational model for slides), compared with ResNet50. Compared with tRNAsformer, HE2RNA. Trained on 7584 tumor samples across 16 cancer types (TCGA), validated on independent cohorts (CPTAC, Tempus). BRCA shows best performance, the model predicts the risk of breast cancer recurrence. About 15K out of 20K genes can be predicted, well-known signature genes are predicted best. Detected 272 genes significantly associated with recurrence. Predicts spatially-specific gene expression. Python, GitHub.

    Paper Pizurica, Marija. “Digital Profiling of Gene Expression from Histology Images with Linearized Attention.” Nature Communications, 2024.

DL Tools

Auto ML

DL models

DL projects

Language models

ChatGPT, Gemini, NotebookLM, Claude, OpenRouter, Groq, Storm

Music, voice, audio

  • buzz - Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

  • ebook2audiobook - Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!

  • Jukebox - music generation neural network. Hierarchical Vector Quantised-Variational AutoEncoder (VQ-VAE) architecture, three separate temporal resolutions. Able to generate singing from lyrics, extend music examples. Dhariwal et al., “Jukebox: A Generative Model for Music.”, Blog post with examples of generated music

  • june - Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit

  • Magenta - Music and Art Generation with Machine Intelligence

  • OpenVoice - voice cloning tool, transfer voice tones to pronounce different words, even in different language.

  • Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time. Learn voice characteristics from a short audio clip and perform text-to-speech conversion using this voice.

  • Project DeepSpeech - A TensorFlow implementation of Baidu's DeepSpeech architecture. Transcribe audio data, English model available. Documentation

  • SpeechBrain - A PyTorch-based Speech Toolkit for speech/speaker recognition, speech enhancement, processing, and more. GitHub repo

  • vampnet - music generation with masked transformers. arXiv paper, supplementary page

Image, vision

DL Misc

  • geospy.ai - location identification from photos

  • app.wombo.art - deep generative model dreaming awesome images from text, Android and iOS apps available. Tweet describing the VQGAN+CLIP technology behind it

  • CSrankings - A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas. Website

  • ColossalAI - A Unified Deep Learning System for Big Model Era. Scaling deep learning models using data, pipeline, tensor, and sequence parallelism. 1D, 2D, 2.5D, 3D distributed operators. Examples of each. Written in PyTorch, needs a configuration file defining parallelism. Benchmarked against DeepSpeed, Megatron-LM.

    Paper Li, Shenggui, Jiarui Fang, Zhengda Bian, Hongxin Liu, Yuliang Liu, Haichen Huang, Boxiang Wang, and Yang You. “Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training,” n.d.

Awesome Machine learning

ML Books

ML Courses & Tutorials

ML Videos

ML Papers

ML Tools

  • mlr3 - Machine learning in R R package, the unified interface to classification, regression, survival analysis, and other machine learning tasks. GitHub repo, mlr3gallery - Examples of problems and code solutions, mlr3 Manual - mlr3 bookdown. More on the mlr3 package site, including videos

ML Misc

Material in Chinese

  • Autopilot-Notes - Autonomous driving notes summarizing the basics, hardware, perception, position, planning, control, product, tools, and manufacturing plan topics.

Material in Russian

  • Scientific_graphics_in_python - matplotlib for scientific graphics. 3 parts, 13 chapters. By Pavel Shabanov

  • ml-course-hse - machine learning course at the Computer Sciences Department, High Schoool of Economy. Multiple years, videos

  • mlcourse_open - OpenDataScience Machine Learning course (Both in English and Russian). Python-based ML course, with video lectures. Video

  • DL_CSHSE_spring2018 - Deep learning, Anton Osokin, Higher School of Economics, Computer Sciences Department (Russian), course material, and video lectures

  • Ordinary Differential Equations - Обыкновенные дифференциальные уравнения, Интерактивный учебник, Илья Щуров (НИУ ВШЭ)

  • Calculus - Математический анализ, Записки лекций, Илья Щуров (НИУ ВШЭ). Tweet

  • mathprofi.ru - Высшая математика – просто и доступно. Mirror

About

Machine learning and deep learning resources

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published