Campus Placement Analyzer: Using Supervised Machine Learning Algorithms
Abstract -- The main aim of every academic aspirant is placement in a reputed MNC, and even an institute's reputation and yearly admissions depend upon the placements it provides to its students. So, any system that predicts the placements of students will have a positive impact on an institute, increasing its intake and reducing some of the workload of the institute's training and placement office (TPO). With the help of machine learning techniques, knowledge can be extracted from previously placed students, and the placement of upcoming students can be predicted. The data used for training is taken from the same institute for which the placement prediction is done. Suitable data pre-processing methods are applied along with feature selection. Some domain expertise is used for pre-processing as well as for handling outliers that crept into the dataset. We have used various machine learning algorithms like Logistic Regression, SVM, KNN, Decision Tree and Random Forest, and advanced techniques like Bagging, Boosting and Voting Classifiers, achieving 78% accuracy with XGBoost and 78% with the AdaBoost classifier.
Keywords: Pre-processing, Feature Selection, Domain Expertise, Outliers, Bagging, Boosting, SVM, KNN, Logistic Regression
1. INTRODUCTION
Nowadays, placement plays an important role in a world full of unemployment. Even the ranking and rating of institutes depend upon the average package and the number of placements they provide.

The main objective of this model is to predict whether a student might get placed or not. Different kinds of classifiers were applied, i.e. Logistic Regression, SVM, Decision Tree, Random Forest, KNN, AdaBoost, Gradient Boosting and XGBoost. For this, the overall academic record of the students is taken into consideration. As placement activity takes place in the last year of academics, the last-year semesters are not taken into consideration.

2. RELATED WORK

An accuracy of 71.66% with tested real-life data indicates that the system is reliable for carrying out its major objective, which is to help teachers and the placement cell [2].

Ajay Kumar Pal and Saurabh Pal (2013) predicted the placement of students after completing an MCA using three classification algorithms selected in Weka. The best algorithm on the placement data is Naïve Bayes classification, with an accuracy of 86.15% and a total model build time of 0 seconds. The Naïve Bayes classifier also has the lowest average error, at 0.28, compared to the others [3].
www.ijcat.com 358
International Journal of Computer Applications Technology and Research
Volume 8–Issue 09, 358-362, 2019, ISSN:-2319–8656
different statuses, i.e. Dream Company, Core Company, Mass Recruiters, Not Eligible and Not Interested [5].

3. DATASET DESCRIPTION AND SYSTEM FLOW

The approach followed is shown in Figure 3.

Figure 3. System flow: Data Gathering → Pre-processing → Feature Selection → Model Training → Model Selection → Prediction

3.1 Pre-processing

- We merged the 12th-standard and diploma marks into a single column for both.
- Some of the tuples were from an M.Tech background, so we dropped them; in the “current_aggregate” column we also dropped the NA values, because those whole rows were NA.
- We replaced all NA values in the columns “Current_Back_Papers” and “Current_Pending_Back_Papers”, and in the semester-wise “Sem_Back_Papers” and “Sem_Pending_Back_Papers” columns, with 0, because these were null only if the student had no backlogs.
- Using LabelEncoder from the preprocessing API in sklearn, we encoded the labels of the columns “Degree_Specializations”, “Campus”, “Gender”, “year_down” and “educational_gap”.
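A minimal sketch of these pre-processing steps, assuming pandas and scikit-learn (the DataFrame contents here are hypothetical samples; only the column names come from the dataset):

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Hypothetical sample rows; the real data comes from the institute's records.
df = pd.DataFrame({
    "Current_Back_Papers": [1.0, None, 2.0],   # null means no backlogs
    "Campus": ["A", "B", "A"],
    "Gender": ["M", "F", "M"],
})

# Backlog columns are null only when a student has no backlogs, so fill with 0.
df["Current_Back_Papers"] = df["Current_Back_Papers"].fillna(0)

# Encode categorical columns as integer labels.
for col in ["Campus", "Gender"]:
    df[col] = LabelEncoder().fit_transform(df[col])

print(df)
```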
3.2 Feature Selection
Using machine-learning feature-selection techniques such as “Ridge”, “Lasso”, “RFE”, “plot importance”, “F1 score” and “feature importance”, we obtained the following outputs.
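One of these rankings, tree-based feature importance, can be sketched as follows (the data here is synthetic, not the placement dataset):

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.random((200, 3))            # stand-in for marks/backlog features
y = (X[:, 0] > 0.5).astype(int)     # label driven entirely by feature 0

tree = DecisionTreeClassifier(random_state=0).fit(X, y)
# feature_importances_ ranks features by their contribution to the splits;
# here feature 0 should dominate.
print(tree.feature_importances_)
```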
Figure: Feature selection using “feature importance” with a Decision Tree
Figure 3.2.3 Feature Selection Using “Ridge”

Feature scores using “F1 score”:

Sem4_Aggregate_Marks        312.063809
Current_Aggregate_Marks     286.086537
Sem2_Aggregate_Marks        164.183078
12th_/_Diploma_Aggre_marks  142.208129
Sem1_Aggregate_Marks        139.183936
Sem6_Aggregate_Marks        136.333959
Sem5_Aggregate_Marks        131.988165
10th_Aggregate_Marks        128.526784
Sem6_Back_Papers            128.526784
live_atkt                    47.908927
Sem5_Back_Papers             45.382049
Sem4_Back_Papers             43.547352
Figure: Feature Selection Using “Lasso”
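Coefficient-based selection with Ridge and Lasso, as in the figures above, can be sketched like this (synthetic data; scikit-learn assumed):

```python
import numpy as np
from sklearn.linear_model import Lasso, Ridge

rng = np.random.default_rng(0)
X = rng.random((100, 4))                        # hypothetical feature matrix
y = 3.0 * X[:, 1] + rng.normal(0, 0.1, 100)     # target driven by feature 1

# Larger absolute coefficients mark more influential features;
# Lasso additionally drives weak coefficients to exactly zero.
ridge = Ridge(alpha=1.0).fit(X, y)
lasso = Lasso(alpha=0.1).fit(X, y)
print(np.abs(ridge.coef_).argmax(), np.abs(lasso.coef_).argmax())
```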
[2] Senthil Kumar Thangavel, Divya Bharathi P, Abijith Sankar, “Student Placement Analyzer: A Recommendation System Using Machine Learning”, 2017 International Conference on Advanced Computing and Communication Systems (ICACCS-2017), Coimbatore, India, Jan. 6-7, 2017.

Figure 5.1 Layering of Classifiers

We used a Decision Tree as the base classifier, over that an AdaBoost classifier, and over that a Bagging classifier, because we want to tune the accuracy of the model.
[3] Ajay Kumar Pal, Saurabh Pal, “Classification Model of Prediction for Placement of Students”, I.J. Modern Education and Computer Science, 2013, 11, 49-56, published online 11 November 2013.
6. RESULT AND CONCLUSION

AdaBoost (DT): 77%
XGBoost: 78%