Bagging, Boosting, Decision Trees, Random Forest

Introduction to Decision Trees
Before Starting the Course
● Upon completion of this course, you will have acquired general knowledge of introductory artificial intelligence algorithms and data analysis.
● Because these topics are conceptually demanding and the underlying logic rarely settles during a single lesson, a great deal of individual practice is required.
● The slides rely on visuals rather than long blocks of text, so it is extremely important to take notes during the lesson.
● The slide titles are sufficient starting points for the basic algorithms; research each title further on sites that offer plenty of practice exercises and theoretical material.
Decision Trees
A decision tree uses a tree structure to represent a set of possible decision paths and an outcome for each path.

The decision tree is one of the most commonly used classification techniques.

It is very easy to understand and interpret, and the process of arriving at an estimate is completely transparent.
Structure of Decision Trees
Root: The first cell of a decision tree is called the root (root node). Each observation is first split as “Yes” or “No” according to the condition at the root.

Node: Below the root are the internal nodes. Each observation is classified further with the help of these nodes, and the complexity of the model increases as the number of nodes increases.

Leaf: At the bottom of the decision tree are the leaves (leaf nodes). The leaves give us the final result.
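As an illustrative sketch (not from the slides), scikit-learn's export_text can print a fitted tree so the root condition, the internal nodes, and the leaves are all visible; the Iris dataset and the max_depth value are arbitrary choices for the example.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)
clf = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)

# The first split is the root, indented splits are internal nodes,
# and the "class:" lines are the leaves that give the final result.
print(export_text(clf, feature_names=["sepal len", "sepal wid",
                                      "petal len", "petal wid"]))
```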
Decision Trees Application
Let's examine the following two-dimensional data, which has four class labels.
A simple decision tree built on this data will iteratively split the data along one axis or the other according to some quantitative criterion and, at each level, assign the label of each new region by a majority vote of the points within it.

Note that after the first split, every point in one of the branches may already share the same label, so there is no need to subdivide that branch further. At each level, every region is split again along one of the two features, except for nodes that already contain only a single class (color).
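A minimal sketch of this idea, assuming scikit-learn and a synthetic two-dimensional, four-class dataset generated with make_blobs (the dataset and the depth limit are illustrative choices, not the data from the slide):

```python
from sklearn.datasets import make_blobs
from sklearn.tree import DecisionTreeClassifier

# Two-dimensional points grouped into four class labels
X, y = make_blobs(n_samples=300, centers=4, random_state=0)

# Each level of the tree splits a region along one of the two axes;
# regions that already contain a single class are not split further.
tree = DecisionTreeClassifier(max_depth=4, random_state=0).fit(X, y)
print("training accuracy:", tree.score(X, y))
```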
How to Calculate Decision Trees
Gini Index
Pure means that all data in a selected subset of the dataset belongs to the same class.

Impure means that the data is a mix of different classes.

Gini impurity is a measure of the probability that a randomly chosen sample will be misclassified if it is labelled at random according to the distribution of class labels in the dataset.

If our dataset is pure, the probability of misclassification is 0. If our sample is a mix of different classes, the probability of misclassification is high.
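As a small sketch of the definition above (the function name and the toy label lists are my own, not from the slides), the Gini impurity of a set of labels can be computed directly from the class proportions:

```python
import numpy as np

def gini_impurity(labels):
    """Probability that a randomly drawn sample is misclassified when it is
    labelled at random according to the class distribution of the set."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

print(gini_impurity(["A", "A", "A", "A"]))   # pure set   -> 0.0
print(gini_impurity(["A", "A", "B", "B"]))   # 50/50 mix  -> 0.5
```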
Entropy
To construct a decision tree, we need to decide which questions to ask and in what order. Each question at each stage of the tree eliminates some possibilities and leaves others open.

Example: After learning that an animal has no more than five legs, we have eliminated the possibility that it is a grasshopper, but we have not ruled out the possibility that it is a duck. Each possible question partitions the remaining possibilities according to its answers.

Entropy is a measure of the uncertainty of a random variable. The higher the entropy, the greater the uncertainty, and the more information we gain by resolving it.
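A comparable sketch for entropy (again, the function name and toy examples are my own), measured in bits with base-2 logarithms:

```python
import numpy as np

def entropy(labels):
    """Shannon entropy of the class distribution, in bits."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

print(entropy(["duck"] * 8))                        # pure set            -> 0.0
print(entropy(["duck"] * 4 + ["grasshopper"] * 4))  # maximum uncertainty -> 1.0
```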
Decision Trees Advantages
● They can work with both continuous and discrete data
● They need little data preprocessing: no outlier removal or feature scaling is required
● Decision trees are easily visualized and the classification rules are clearly visible, so they are easy to understand and interpret
● They can be used for multi-output problems (see the sketch below)
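A brief sketch of the multi-output point, assuming scikit-learn: DecisionTreeClassifier accepts a two-dimensional target array, so a single tree can predict several outputs at once (the synthetic data below is purely illustrative):

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 4))

# Two output columns: a single tree predicts both labels at once.
y = np.column_stack([(X[:, 0] > 0).astype(int),
                     (X[:, 1] + X[:, 2] > 0).astype(int)])

clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
print(clf.predict(X[:3]))   # shape (3, 2): one prediction per output
```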
Decision Trees Visualization
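A possible way to produce the kind of visualization this slide refers to, assuming scikit-learn and matplotlib (the Iris dataset and the depth limit are illustrative choices):

```python
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, plot_tree

iris = load_iris()
clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(iris.data, iris.target)

# Draw the fitted tree: each box shows the split condition,
# the impurity, the sample counts, and the majority class.
plt.figure(figsize=(10, 6))
plot_tree(clf, filled=True,
          feature_names=iris.feature_names,
          class_names=list(iris.target_names))
plt.show()
```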
Ensemble Learning
Ensemble learning is the idea that several prediction algorithms, working together, give more successful results than any single algorithm on its own.

Example: A random forest builds many decision trees for the same problem and uses them together to solve it.

Common combination strategies (a max-voting sketch follows this list):
● Max Voting
● Averaging
● Weighted Averaging
● Stacking
● Blending
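As a sketch of the max-voting strategy, assuming scikit-learn (the choice of base models and dataset is illustrative), VotingClassifier with voting="hard" lets each model cast a vote and returns the majority label:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Max voting: each model casts a vote and the majority label wins.
ensemble = VotingClassifier(
    estimators=[("tree", DecisionTreeClassifier(max_depth=3)),
                ("knn", KNeighborsClassifier()),
                ("logreg", LogisticRegression(max_iter=1000))],
    voting="hard")

print(cross_val_score(ensemble, X, y, cv=5).mean())
```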
Bootstrapping
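The slide above presumably illustrated bootstrapping with a figure; as a plain NumPy sketch (entirely illustrative), a bootstrap sample draws n observations with replacement, so some points repeat and the rest are left "out of bag":

```python
import numpy as np

rng = np.random.default_rng(0)
data = np.arange(10)   # the original dataset

# Draw n items *with replacement* from the data: this is a bootstrap sample.
bootstrap_sample = rng.choice(data, size=data.size, replace=True)
out_of_bag = np.setdiff1d(data, bootstrap_sample)

print("bootstrap sample:", bootstrap_sample)
print("out-of-bag points:", out_of_bag)
```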
Bagging
Bootstrap Aggregation, or Bagging, is a widely used ensemble learning method for reducing variance on a noisy dataset.

After generating random data subsets (bootstrap samples) from the main dataset, multiple weak models are trained on them independently, and their predictions are aggregated.

The random forest algorithm is an extension of the bagging method that uses both bagging and feature randomness to build a forest of uncorrelated decision trees.
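A hedged sketch of bagging with scikit-learn's BaggingClassifier (the dataset, number of trees, and subsample fraction are illustrative assumptions, not values from the slides):

```python
from sklearn.datasets import make_blobs
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_blobs(n_samples=400, centers=4, random_state=0)

# Many trees, each trained independently on a random bootstrap subset;
# their votes are aggregated to reduce variance.
bagging = BaggingClassifier(DecisionTreeClassifier(),
                            n_estimators=100,
                            max_samples=0.8,
                            bootstrap=True,
                            random_state=0)

print(cross_val_score(bagging, X, y, cv=5).mean())
```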
Random Forest
This concept operates under the ensemble method called bagging, in which multiple fitted estimators are combined to reduce the effect of overfitting.

Bagging uses a collection of parallel estimators, each of which fits the data independently, and averages their results to find a better classification.

A collection of randomized decision trees combined in this way is known as a random forest.
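A minimal random forest sketch, assuming scikit-learn (the Iris dataset and hyperparameters are illustrative); max_features="sqrt" is the feature-randomness ingredient that distinguishes a random forest from plain bagging:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Bagging plus feature randomness: each split considers only a random
# subset of the features, which decorrelates the individual trees.
forest = RandomForestClassifier(n_estimators=200, max_features="sqrt",
                                random_state=0)
forest.fit(X_train, y_train)
print("test accuracy:", forest.score(X_test, y_test))
```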
Boosting
Boosting is an ensemble learning method that combines a set of weak learners into a strong learner in order to minimize training errors.

In boosting, a random sample of data is selected and the models are trained sequentially: each model tries to compensate for the weaknesses of the previous one.

At each iteration, the weak rules from each classifier are combined to form a single strong prediction rule.
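A short boosting sketch using scikit-learn's AdaBoostClassifier (the dataset and number of estimators are illustrative assumptions); by default it boosts decision stumps sequentially and combines them into one weighted rule:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)

# Weak learners are trained one after another; each focuses on the samples
# the previous ones got wrong, and the weighted combination of all of them
# forms a single strong prediction rule.
boosted = AdaBoostClassifier(n_estimators=100, random_state=0)
print(cross_val_score(boosted, X, y, cv=5).mean())
```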
Boosting App
Source: [Link] edac1174e971
7/3 = 2.33, 11/3 = 3.66
