Machine Learning Algorithms Cheat Sheet

Table of Contents

1. Supervised Learning
Linear Regression
Logistic Regression
Decision Trees
Random Forests
Support Vector Machines (SVM)
k-Nearest Neighbors (k-NN)
Naive Bayes
Gradient Boosting Machines (GBM)
Neural Networks
2. Unsupervised Learning
k-Means Clustering
Hierarchical Clustering
Principal Component Analysis (PCA)
Independent Component Analysis (ICA)
Association Rules
Autoencoders
3. Reinforcement Learning
Q-Learning
Deep Q-Networks (DQN)
Policy Gradients
Actor-Critic Methods
4. Semi-Supervised and Self-Supervised Learning
Self-Training
Co-Training
5. Ensemble Methods
Bagging
Boosting
Stacking

Supervised Learning

1. Linear Regression

Purpose: Predict continuous target variables.


Key Concept: Models the relationship between input features and output as a linear combination.
Equation: ( y = \beta_0 + \beta_1x_1 + \beta_2x_2 + \dots + \beta_nx_n + \epsilon )
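
A minimal fit/predict sketch with scikit-learn (listed under Resources below); the toy data and values are illustrative:

import numpy as np
from sklearn.linear_model import LinearRegression

X = np.array([[1.0], [2.0], [3.0], [4.0]])  # single feature x1
y = np.array([2.1, 4.0, 6.2, 7.9])          # roughly y = 2x

model = LinearRegression().fit(X, y)
print(model.intercept_, model.coef_)        # estimated beta_0 and beta_1
print(model.predict([[5.0]]))               # prediction for x1 = 5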

2. Logistic Regression

Purpose: Binary classification.


Key Concept: Uses the logistic function to model the probability of a class.
Equation: ( P(Y=1) = \frac{1}{1 + e^{-(\beta_0 + \beta_1x_1 + \dots + \beta_nx_n)}} )
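
A minimal sketch with scikit-learn; the toy data is illustrative:

import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]])
y = np.array([0, 0, 0, 1, 1, 1])

clf = LogisticRegression().fit(X, y)
print(clf.predict_proba([[2.0]]))  # [P(Y=0), P(Y=1)] for x1 = 2.0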

3. Decision Trees

Purpose: Classification and regression.


Key Concept: Splits data into subsets based on feature values.
Advantages: Easy to interpret, handles both numerical and categorical data.
Disadvantages: Prone to overfitting.
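
A short scikit-learn sketch; max_depth is an illustrative guard against the overfitting noted above:

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)  # shallow tree resists overfitting
print(clf.predict(X[:3]))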

4. Random Forests

Purpose: Classification and regression.


Key Concept: Ensemble of decision trees, each trained on a bootstrap sample with a random subset of features (bagging plus feature randomness).
Advantages: Reduces overfitting, handles large datasets well.
Key Parameters: Number of trees, max depth.
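
A minimal sketch using the two key parameters above (values are illustrative):

from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, y = load_iris(return_X_y=True)
clf = RandomForestClassifier(n_estimators=100, max_depth=5, random_state=0)  # number of trees, max depth
clf.fit(X, y)
print(clf.predict(X[:3]))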

5. Support Vector Machines (SVM)

Purpose: Classification and regression.


Key Concept: Finds the hyperplane that best separates classes with maximum margin.
Kernel Trick: Enables handling non-linear relationships.
Common Kernels: Linear, Polynomial, RBF.
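
A minimal sketch with scikit-learn's SVC using the RBF kernel (parameter values are illustrative):

from sklearn.datasets import load_iris
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
clf = SVC(kernel="rbf", C=1.0, gamma="scale").fit(X, y)  # kernel trick handles non-linear boundaries
print(clf.predict(X[:3]))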

6. k-Nearest Neighbors (k-NN)

Purpose: Classification and regression.


Key Concept: Predicts from the k closest training examples: the majority label for classification, the average value for regression.
Advantages: Simple; no explicit training phase (lazy learning).
Disadvantages: Computationally intensive during prediction, sensitive to irrelevant features.
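
A minimal sketch; note that fit only stores the training data, matching the lazy-learning point above:

from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
clf = KNeighborsClassifier(n_neighbors=5).fit(X, y)  # k = 5; fit just memorizes the data
print(clf.predict(X[:3]))  # distance computations happen here, at prediction time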

7. Naive Bayes

Purpose: Classification.
Key Concept: Based on Bayes' Theorem with the assumption of feature independence.
Variants: Gaussian, Multinomial, Bernoulli.
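
A minimal sketch using the Gaussian variant, which suits continuous features:

from sklearn.datasets import load_iris
from sklearn.naive_bayes import GaussianNB

X, y = load_iris(return_X_y=True)
clf = GaussianNB().fit(X, y)
print(clf.predict(X[:3]))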

8. Gradient Boosting Machines (GBM)

Purpose: Classification and regression.


Key Concept: Builds models sequentially, each new model correcting errors of the previous ones.
Popular Implementations: XGBoost, LightGBM, CatBoost.
Advantages: High predictive performance, handles missing data.
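
A minimal sketch with scikit-learn's built-in GBM; XGBoost, LightGBM, and CatBoost expose a similar fit/predict interface (parameter values are illustrative):

from sklearn.datasets import load_iris
from sklearn.ensemble import GradientBoostingClassifier

X, y = load_iris(return_X_y=True)
# learning_rate shrinks each tree's correction; n_estimators = boosting rounds
clf = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1, max_depth=3)
clf.fit(X, y)
print(clf.predict(X[:3]))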

9. Neural Networks

Purpose: Classification, regression, and many other tasks.


Key Concept: Composed of layers of interconnected nodes (neurons) that can capture complex patterns.
Types: Feedforward Neural Networks, Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN).
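
A minimal feedforward-network sketch via scikit-learn's MLPClassifier (layer sizes are illustrative):

from sklearn.datasets import load_iris
from sklearn.neural_network import MLPClassifier

X, y = load_iris(return_X_y=True)
clf = MLPClassifier(hidden_layer_sizes=(16, 16), max_iter=2000, random_state=0)  # two hidden layers
clf.fit(X, y)
print(clf.predict(X[:3]))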

Unsupervised Learning

1. k-Means Clustering

Purpose: Partition data into k distinct clusters.


Key Concept: Minimizes within-cluster variance.
Parameters: Number of clusters (k), distance metric.
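
A minimal sketch; k and the random data are illustrative:

import numpy as np
from sklearn.cluster import KMeans

X = np.random.RandomState(0).rand(100, 2)  # toy 2-D points
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
print(km.labels_[:10])      # cluster assignment per point
print(km.cluster_centers_)  # the 3 learned centroids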

2. Hierarchical Clustering

Purpose: Create a hierarchy of clusters.


Key Concept: Either agglomerative (bottom-up) or divisive (top-down).
Linkage Criteria: Single, complete, average, Ward.
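
A minimal agglomerative sketch; the linkage string selects one of the criteria listed above:

import numpy as np
from sklearn.cluster import AgglomerativeClustering

X = np.random.RandomState(0).rand(50, 2)
hc = AgglomerativeClustering(n_clusters=3, linkage="ward").fit(X)  # or "single", "complete", "average"
print(hc.labels_[:10])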

3. Principal Component Analysis (PCA)

Purpose: Dimensionality reduction.


Key Concept: Transforms data to a new coordinate system with orthogonal principal components.
Uses: Feature reduction, visualization.
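
A minimal sketch projecting 4-D data to 2-D for visualization:

from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X, _ = load_iris(return_X_y=True)
pca = PCA(n_components=2)             # keep the two strongest components
X2 = pca.fit_transform(X)
print(pca.explained_variance_ratio_)  # variance captured per component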

4. Independent Component Analysis (ICA)

Purpose: Separate a multivariate signal into additive, independent components.


Key Concept: Maximizes statistical independence.
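
A minimal blind-source-separation sketch with scikit-learn's FastICA; the mixed signals are synthetic:

import numpy as np
from sklearn.decomposition import FastICA

rng = np.random.RandomState(0)
t = np.linspace(0, 8, 2000)
S = np.c_[np.sin(2 * t), np.sign(np.sin(3 * t))]  # two independent sources
X = S @ rng.rand(2, 2)                            # observed mixtures
S_est = FastICA(n_components=2, random_state=0).fit_transform(X)  # recovered up to order/scale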

5. Association Rules

Purpose: Discover interesting relations between variables in large databases.


Key Concepts: Support, Confidence, Lift.
Algorithms: Apriori, Eclat.
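
A minimal Apriori sketch, assuming the third-party mlxtend library is installed; the basket data and thresholds are illustrative:

import pandas as pd
from mlxtend.frequent_patterns import apriori, association_rules

# one-hot basket data: rows = transactions, columns = items
df = pd.DataFrame({"bread":  [1, 1, 0, 1],
                   "butter": [1, 1, 0, 0],
                   "milk":   [0, 1, 1, 1]}).astype(bool)
frequent = apriori(df, min_support=0.5, use_colnames=True)
rules = association_rules(frequent, metric="lift", min_threshold=1.0)
print(rules[["antecedents", "consequents", "support", "confidence", "lift"]])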

6. Autoencoders

Purpose: Learn efficient codings of input data.


Key Concept: Neural network architecture with encoder and decoder parts.
Uses: Dimensionality reduction, anomaly detection.
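
A minimal Keras sketch (TensorFlow assumed installed); layer sizes and epochs are illustrative:

import numpy as np
from tensorflow import keras

X = np.random.rand(1000, 20).astype("float32")  # toy data

inputs = keras.Input(shape=(20,))
code = keras.layers.Dense(4, activation="relu")(inputs)       # encoder: 20-D -> 4-D bottleneck
outputs = keras.layers.Dense(20, activation="sigmoid")(code)  # decoder: reconstruct the input
autoencoder = keras.Model(inputs, outputs)
autoencoder.compile(optimizer="adam", loss="mse")
autoencoder.fit(X, X, epochs=5, batch_size=32, verbose=0)     # note: target = input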

Reinforcement Learning

1. Q-Learning

Purpose: Learn the value of actions in states to derive an optimal policy.


Key Concept: Off-policy temporal difference learning.
Equation: ( Q(s, a) \leftarrow Q(s, a) + \alpha [r + \gamma \max_{a'} Q(s', a') - Q(s, a)] )
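
A minimal tabular sketch of the update rule above; state/action counts and hyperparameters are illustrative:

import numpy as np

n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma = 0.1, 0.9  # learning rate, discount factor

def q_update(s, a, r, s_next):
    """One temporal-difference update of the Q-table."""
    td_target = r + gamma * Q[s_next].max()  # r + gamma * max_a' Q(s', a')
    Q[s, a] += alpha * (td_target - Q[s, a])

q_update(s=0, a=1, r=1.0, s_next=2)
print(Q[0])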

2. Deep Q-Networks (DQN)

Purpose: Combine Q-Learning with deep neural networks.


Key Concept: Uses neural networks to approximate Q-values.
Features: Experience replay, target networks.

3. Policy Gradients

Purpose: Optimize the policy directly.


Key Concept: Uses gradient ascent on expected rewards.
Algorithms: REINFORCE, Proximal Policy Optimization (PPO).

4. Actor-Critic Methods

Purpose: Combine value-based and policy-based methods.


Key Concept: The actor updates the policy; the critic evaluates it with a learned value function.
Examples: A3C, DDPG.

Semi-Supervised and Self-Supervised Learning

1. Self-Training

Purpose: Utilize unlabeled data to improve model performance.


Key Concept: Iteratively pseudo-label unlabeled data with the model's most confident predictions, then retrain on the expanded labeled set.
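
A minimal sketch with scikit-learn's SelfTrainingClassifier; masking half the iris labels stands in for real unlabeled data:

from sklearn.datasets import load_iris
from sklearn.semi_supervised import SelfTrainingClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
y_partial = y.copy()
y_partial[::2] = -1  # -1 marks a sample as unlabeled

clf = SelfTrainingClassifier(SVC(probability=True)).fit(X, y_partial)  # base model must expose predict_proba
print(clf.predict(X[:3]))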

2. Co-Training

Purpose: Use multiple views of data to train models.


Key Concept: Two models train on different feature sets (views) of the same data and label unlabeled examples for each other.

Ensemble Methods

1. Bagging (Bootstrap Aggregating)

Purpose: Reduce variance and prevent overfitting.


Key Concept: Train multiple models on different bootstrap samples and aggregate predictions.
Example: Random Forest.
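
A minimal sketch: 50 trees on bootstrap samples, aggregated by majority vote (counts are illustrative):

from sklearn.datasets import load_iris
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
clf = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50, random_state=0)
clf.fit(X, y)
print(clf.predict(X[:3]))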

2. Boosting

Purpose: Reduce bias and build strong predictive models.


Key Concept: Sequentially train models, each focusing on errors of the previous ones.
Examples: AdaBoost, Gradient Boosting, XGBoost.
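
A minimal AdaBoost sketch; each round upweights the samples the previous learners got wrong:

from sklearn.datasets import load_iris
from sklearn.ensemble import AdaBoostClassifier

X, y = load_iris(return_X_y=True)
clf = AdaBoostClassifier(n_estimators=50, random_state=0).fit(X, y)
print(clf.predict(X[:3]))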

3. Stacking

Purpose: Combine multiple models to improve performance.


Key Concept: Use a meta-model to aggregate predictions from base models.
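
A minimal sketch: two base models feed a logistic-regression meta-model (the choice of models is illustrative):

from sklearn.datasets import load_iris
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
base = [("svm", SVC()), ("tree", DecisionTreeClassifier())]
clf = StackingClassifier(estimators=base, final_estimator=LogisticRegression())
clf.fit(X, y)  # the meta-model learns from the base models' cross-validated predictions
print(clf.predict(X[:3]))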

Additional Algorithms and Techniques

Support Vector Regression (SVR)

Purpose: Regression using SVM principles.


Key Concept: Fits a function while ignoring errors that fall inside a predefined epsilon-margin around it.
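
A minimal sketch; epsilon sets the width of the error-free tube (values are illustrative):

import numpy as np
from sklearn.svm import SVR

X = np.linspace(0, 5, 50).reshape(-1, 1)
y = np.sin(X).ravel()
reg = SVR(kernel="rbf", C=1.0, epsilon=0.1).fit(X, y)  # errors inside the epsilon-tube are ignored
print(reg.predict([[2.5]]))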

Elastic Net

Purpose: Regularized regression combining L1 and L2 penalties.


Key Concept: Balances L1-driven feature selection against L2 coefficient shrinkage.
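
A minimal sketch; l1_ratio mixes the two penalties (1.0 = pure lasso, 0.0 = pure ridge):

import numpy as np
from sklearn.linear_model import ElasticNet

rng = np.random.RandomState(0)
X = rng.rand(100, 10)
y = 3.0 * X[:, 0] + 0.1 * rng.rand(100)  # only the first feature matters

reg = ElasticNet(alpha=0.1, l1_ratio=0.5).fit(X, y)
print(reg.coef_)  # irrelevant coefficients are shrunk toward zero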

Gaussian Mixture Models (GMM)

Purpose: Probabilistic clustering.


Key Concept: Assumes data is generated from a mixture of several Gaussian distributions.
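
A minimal sketch; unlike k-means, GMM also returns soft (probabilistic) assignments:

import numpy as np
from sklearn.mixture import GaussianMixture

X = np.random.RandomState(0).rand(200, 2)
gmm = GaussianMixture(n_components=3, random_state=0).fit(X)
print(gmm.predict(X[:5]))        # hard cluster labels
print(gmm.predict_proba(X[:5]))  # per-component membership probabilities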

t-Distributed Stochastic Neighbor Embedding (t-SNE)


Purpose: Data visualization.
Key Concept: Reduces dimensions while preserving local structure.
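
A minimal sketch reducing 4-D data to 2-D for plotting; the perplexity value is illustrative:

from sklearn.datasets import load_iris
from sklearn.manifold import TSNE

X, _ = load_iris(return_X_y=True)
X2 = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(X)  # perplexity ~ neighborhood size
print(X2.shape)  # (150, 2), ready for a scatter plot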

Hidden Markov Models (HMM)

Purpose: Model sequential data.


Key Concept: States are hidden and emit observable events.
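
A minimal sketch, assuming the third-party hmmlearn library is installed; the observations are synthetic:

import numpy as np
from hmmlearn.hmm import GaussianHMM

X = np.random.RandomState(0).rand(100, 1)  # toy 1-D observation sequence
hmm = GaussianHMM(n_components=2, n_iter=50, random_state=0).fit(X)
states = hmm.predict(X)  # most likely hidden-state sequence (Viterbi)
print(states[:10])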

Key Concepts and Terms

Overfitting: Model performs well on training data but poorly on unseen data.
Underfitting: Model is too simple to capture underlying patterns.
Bias-Variance Tradeoff: Balance between model complexity and generalization.
Cross-Validation: Technique to assess model performance by partitioning data (see the sketch after this list).
Regularization: Techniques to prevent overfitting (e.g., L1, L2).
Feature Scaling: Standardizing features to improve model performance.
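
A minimal sketch tying cross-validation and feature scaling together; putting the scaler inside a pipeline ensures each fold is scaled on its own training split:

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
model = make_pipeline(StandardScaler(), SVC())
scores = cross_val_score(model, X, y, cv=5)  # 5-fold cross-validation
print(scores.mean())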

Resources and Libraries

Python Libraries:

Scikit-learn: Comprehensive ML algorithms.


TensorFlow & Keras: Deep learning frameworks.
PyTorch: Flexible deep learning library.
XGBoost, LightGBM, CatBoost: Gradient boosting implementations.

Books:

"Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Aurélien Géron
"Pattern Recognition and Machine Learning" by Christopher M. Bishop
"The Elements of Statistical Learning" by Trevor Hastie, Robert Tibshirani, and Jerome Friedman

Online Courses:

Coursera's Machine Learning by Andrew Ng


edX's MicroMasters in Statistics and Data Science
Udacity's Machine Learning Nanodegree
