0% found this document useful (0 votes)

41 views5 pages

RandomForest Project Report

Uploaded by

navshu35

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views5 pages

RandomForest Project Report

Uploaded by

navshu35

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Random Forest Classification on Social Network Ads Dataset

Project Title: Random Forest Classification on Social Network Ads

Name: Naveen Kumar

Date: August 1, 2025

Abstract

This project uses the Random Forest Classification algorithm to predict whether a user on a social network

will purchase a product based on their age and estimated salary. The dataset is preprocessed using standard

scaling, and results are evaluated using accuracy and a confusion matrix. Visualization of the decision

boundary shows clear class separation. The model achieves high performance on the test set and

demonstrates the effectiveness of ensemble learning.

Table of Contents

1. Introduction

2. Literature Review

3. Problem Statement

4. Data Collection and Preprocessing

5. Methodology

6. Implementation

7. Results

8. Discussion

9. Conclusion

10. References

11. Appendices

12. Acknowledgments
Random Forest Classification on Social Network Ads Dataset

Introduction

This project explores a supervised machine learning approach to predict user behavior in online

advertisements. By using Random Forest, a powerful ensemble method, we aim to improve classification

accuracy over traditional single-tree models. The goal is to accurately predict if a person will buy a product

based on simple demographic inputs like age and salary.

Literature Review

Ensemble methods like Random Forest are widely known for reducing overfitting and increasing prediction

accuracy. Earlier studies show that decision trees are prone to high variance, which Random Forest

overcomes by averaging many trees. Applications in marketing and user behavior prediction have

demonstrated significant gains through such methods.

Problem Statement

Predict whether a user will purchase a product based on age and estimated salary.

Assumptions:

- Only two features are used.

- Binary classification (0 = No, 1 = Yes).

Limitations:

- Does not account for other possible influences (e.g., device used, browsing time, gender).

Data Collection and Preprocessing

Dataset: Social_Network_Ads.csv

Features: Age, Estimated Salary

Random Forest Classification on Social Network Ads Dataset

Target: Purchased (0 or 1)

Data was split into training and testing sets (75% train, 25% test). Feature scaling was applied using

StandardScaler.

Methodology

We used Random Forest Classification with:

- 10 decision trees (n_estimators=10)

- Entropy criterion for split decisions

Rationale:

- Random Forest reduces overfitting

- Handles non-linearly separable data better than linear models

Implementation

Language: Python

Libraries: scikit-learn, pandas, matplotlib, numpy

Code Example:

from sklearn.ensemble import RandomForestClassifier

classifier = RandomForestClassifier(n_estimators=10, criterion='entropy', random_state=0)

classifier.fit(x_train, y_train)

Results

Model Predictions:
Random Forest Classification on Social Network Ads Dataset

classifier.predict([[30, 87000]]) -> [1]

classifier.predict([[40, 0]]) -> [0]

Confusion Matrix:

[[64 4]

[ 3 29]]

Accuracy: 93%

Decision boundary shows clear class separation.

Discussion

The model performed well with 93% accuracy. False positives and negatives were minimal. Some

misclassifications may be due to the limited feature space or overlapping classes in the dataset.

Conclusion

The Random Forest model provided strong performance for this binary classification task. This approach

could be expanded by incorporating additional features for even better predictive power. This work

demonstrates the real-world applicability of ensemble models in marketing and recommendation systems.

References

- Scikit-learn documentation

- Breiman, L. (2001). 'Random Forests'. Machine Learning.

- Dataset: Social_Network_Ads.csv
Random Forest Classification on Social Network Ads Dataset

Appendices

How to Reproduce:

1. Install dependencies:

pip install numpy pandas matplotlib scikit-learn

2. Run the script: python social_ads_rf.py

Acknowledgments

Thanks to the open-source community and contributors of scikit-learn and matplotlib.

Random Forest Classification
No ratings yet
Random Forest Classification
8 pages
07 - Model Selection & Building
No ratings yet
07 - Model Selection & Building
17 pages
Practical No4 - 5 ML
No ratings yet
Practical No4 - 5 ML
11 pages
Customer Churn Prediction in Telecom
No ratings yet
Customer Churn Prediction in Telecom
4 pages
5.random Forest
No ratings yet
5.random Forest
12 pages
Present
No ratings yet
Present
20 pages
Random Forest in ML
No ratings yet
Random Forest in ML
13 pages
Randon Forest
No ratings yet
Randon Forest
34 pages
6x3 Tech Star Summit 2025 Template 2 Final
No ratings yet
6x3 Tech Star Summit 2025 Template 2 Final
1 page
Machine Learning Random Forest Algorithm - Javatpoint
100% (1)
Machine Learning Random Forest Algorithm - Javatpoint
14 pages
Random - Forest - Classification - Ipynb - Colab
No ratings yet
Random - Forest - Classification - Ipynb - Colab
3 pages
Machine Learning With Random Forests - by Knoldus Inc. - Knoldus - Technical Insights - Medium
No ratings yet
Machine Learning With Random Forests - by Knoldus Inc. - Knoldus - Technical Insights - Medium
12 pages
ML Asst.-01
No ratings yet
ML Asst.-01
21 pages
ChatGPT Randomforest
No ratings yet
ChatGPT Randomforest
4 pages
Random Forest Algorithm Unit 3
No ratings yet
Random Forest Algorithm Unit 3
2 pages
Random Forest Model Assumptions
No ratings yet
Random Forest Model Assumptions
33 pages
Random Forest Algorithm
No ratings yet
Random Forest Algorithm
9 pages
Random Forest
No ratings yet
Random Forest
25 pages
A Brief Survey On Random Forest Ensembles in Classification Model
No ratings yet
A Brief Survey On Random Forest Ensembles in Classification Model
8 pages
AttiqAhmadAfsar Lab 13
No ratings yet
AttiqAhmadAfsar Lab 13
5 pages
Random Forest Algorithm in Machine Learning Random Forest Random Forests or Random Decision Trees Decision Trees
No ratings yet
Random Forest Algorithm in Machine Learning Random Forest Random Forests or Random Decision Trees Decision Trees
6 pages
Random Forest for ML Enthusiasts
No ratings yet
Random Forest for ML Enthusiasts
4 pages
Random Forest Classifier
No ratings yet
Random Forest Classifier
9 pages
Python Implementation of Random Forest Algorithm
No ratings yet
Python Implementation of Random Forest Algorithm
10 pages
Random Forest
No ratings yet
Random Forest
20 pages
Random Forest
No ratings yet
Random Forest
13 pages
Decision Trees vs. Random Forests
No ratings yet
Decision Trees vs. Random Forests
1 page
Random Forest Algorithm 1
100% (2)
Random Forest Algorithm 1
14 pages
Lab 10 - Random Forest Classifier
100% (1)
Lab 10 - Random Forest Classifier
3 pages
03 - Random Forest
No ratings yet
03 - Random Forest
24 pages
Machine Learning With Random Forests and Decision Trees PDF
No ratings yet
Machine Learning With Random Forests and Decision Trees PDF
171 pages
Decision Tree, Random Forest
No ratings yet
Decision Tree, Random Forest
37 pages
2023AIB1008 Lab08
No ratings yet
2023AIB1008 Lab08
8 pages
DA PRA WEEK 13 (Random Forest) - 054551
No ratings yet
DA PRA WEEK 13 (Random Forest) - 054551
12 pages
ML: Decision Trees & Random Forests
No ratings yet
ML: Decision Trees & Random Forests
25 pages
Random Forest - Basics
100% (1)
Random Forest - Basics
9 pages
Bank Marketing Campaign Analysis
100% (2)
Bank Marketing Campaign Analysis
14 pages
CTR Presentation
No ratings yet
CTR Presentation
9 pages
Random Forest Algorithm Updated
No ratings yet
Random Forest Algorithm Updated
11 pages
Lecture-12 Machine Learning With Python
No ratings yet
Lecture-12 Machine Learning With Python
18 pages
Da MS
No ratings yet
Da MS
24 pages
Overcoming Random Forest Thesis Challenges
100% (3)
Overcoming Random Forest Thesis Challenges
4 pages
Machine Learning - Random Forest
No ratings yet
Machine Learning - Random Forest
6 pages
FAI Lecture - 4-10-2023 PDF
No ratings yet
FAI Lecture - 4-10-2023 PDF
27 pages
015 - Random Forest
No ratings yet
015 - Random Forest
15 pages
Random Forest
No ratings yet
Random Forest
29 pages
Classification
No ratings yet
Classification
6 pages
Classification Research 1
No ratings yet
Classification Research 1
4 pages
Random Forest
100% (1)
Random Forest
11 pages
Ensemble Learning and Random Forests Guide
No ratings yet
Ensemble Learning and Random Forests Guide
33 pages
Dac Phase 2
No ratings yet
Dac Phase 2
5 pages
Module 5 Machine Learning
No ratings yet
Module 5 Machine Learning
36 pages
Deep Learning and Neural Networks
No ratings yet
Deep Learning and Neural Networks
21 pages
Random Forest
No ratings yet
Random Forest
6 pages
Lecture 19 Different Classification Models
No ratings yet
Lecture 19 Different Classification Models
22 pages
Mobile Campus Navigation System Design
No ratings yet
Mobile Campus Navigation System Design
33 pages
Web 8000 - Faq - 16 0005
No ratings yet
Web 8000 - Faq - 16 0005
6 pages
Last Mile Mobile App for Logistics
No ratings yet
Last Mile Mobile App for Logistics
12 pages
Functions and Evaluation of Microprocessors
No ratings yet
Functions and Evaluation of Microprocessors
60 pages
ISO27001 CheatSheet Explanation
No ratings yet
ISO27001 CheatSheet Explanation
2 pages
Arcpy and Arcgis Geospatial Analysis With Python Silas Toms Digital Version 2025
No ratings yet
Arcpy and Arcgis Geospatial Analysis With Python Silas Toms Digital Version 2025
98 pages
SACS 5.3 Enhancements
100% (1)
SACS 5.3 Enhancements
32 pages
Data Dictionary For Proposed Systems SDM
No ratings yet
Data Dictionary For Proposed Systems SDM
8 pages
Postgre UserReactive Log 100225
No ratings yet
Postgre UserReactive Log 100225
23 pages
Lexium 28 Servo Drives and BCH2 Servo Motors
No ratings yet
Lexium 28 Servo Drives and BCH2 Servo Motors
26 pages
MySQL Interview Questions Guide
No ratings yet
MySQL Interview Questions Guide
88 pages
CSAT Pro User Guide v1 11 0
No ratings yet
CSAT Pro User Guide v1 11 0
27 pages
Cisco 7600 Series Supervisor Engine Guide
No ratings yet
Cisco 7600 Series Supervisor Engine Guide
47 pages
The Future of Cloud Computing 2024 To 2030
No ratings yet
The Future of Cloud Computing 2024 To 2030
9 pages
Exercise 3 - Wireframe Geometry Creation and Editing - Rev A
No ratings yet
Exercise 3 - Wireframe Geometry Creation and Editing - Rev A
33 pages
Solving The Black Box Problem Carlos Zednik
No ratings yet
Solving The Black Box Problem Carlos Zednik
24 pages
Superpave Gyratory Compactor
100% (1)
Superpave Gyratory Compactor
4 pages
Extensive Testing Version1.1
No ratings yet
Extensive Testing Version1.1
1 page
Effective Metrics For Software Process
No ratings yet
Effective Metrics For Software Process
13 pages
Civil Engineering Laboratories Overview
No ratings yet
Civil Engineering Laboratories Overview
4 pages
RMA Evolution - Local Authorization Consistency Check (2022)
No ratings yet
RMA Evolution - Local Authorization Consistency Check (2022)
3 pages
Christian CHIKOMOLA RUZUBA B.SC.: Professional Summary
No ratings yet
Christian CHIKOMOLA RUZUBA B.SC.: Professional Summary
6 pages
New Driving License Application Slip
No ratings yet
New Driving License Application Slip
2 pages
Missing Child Identification System
No ratings yet
Missing Child Identification System
88 pages
Resume Latika Bhatia
No ratings yet
Resume Latika Bhatia
2 pages
Plan Preparation Manual PDF
No ratings yet
Plan Preparation Manual PDF
81 pages
EU Declaration for MiR200 Autonomous Vehicle
No ratings yet
EU Declaration for MiR200 Autonomous Vehicle
1 page
Sketch Plan: LOT 5048 and 5455 and LOT 1, (LRC) PCS-5626 CAD 220 Tacloban Cadastre
No ratings yet
Sketch Plan: LOT 5048 and 5455 and LOT 1, (LRC) PCS-5626 CAD 220 Tacloban Cadastre
1 page
Affordable Rustproof Grease Test Chamber
No ratings yet
Affordable Rustproof Grease Test Chamber
4 pages
LSMW BOM Upload Guide
No ratings yet
LSMW BOM Upload Guide
10 pages

RandomForest Project Report

Uploaded by

RandomForest Project Report

Uploaded by

Random Forest Classification on Social Network Ads Dataset

Project Title: Random Forest Classification on Social Network Ads

Date: August 1, 2025

demonstrates the effectiveness of ensemble learning.

4. Data Collection and Preprocessing

based on simple demographic inputs like age and salary.

demonstrated significant gains through such methods.

- Only two features are used.

- Binary classification (0 = No, 1 = Yes).

Data Collection and Preprocessing

Features: Age, Estimated Salary

We used Random Forest Classification with:

- 10 decision trees (n_estimators=10)

- Entropy criterion for split decisions

- Random Forest reduces overfitting

- Handles non-linearly separable data better than linear models

Libraries: scikit-learn, pandas, matplotlib, numpy

from sklearn.ensemble import RandomForestClassifier

classifier = RandomForestClassifier(n_estimators=10, criterion='entropy', random_state=0)

classifier.predict([[30, 87000]]) -> [1]

classifier.predict([[40, 0]]) -> [0]

Decision boundary shows clear class separation.

- Breiman, L. (2001). 'Random Forests'. Machine Learning.

pip install numpy pandas matplotlib scikit-learn

2. Run the script: python social_ads_rf.py

Thanks to the open-source community and contributors of scikit-learn and matplotlib.

You might also like