0% found this document useful (0 votes)

46 views28 pages

Anu Internshipreport

The internship report by S B Anusha focuses on developing a machine learning model for loan prediction to enhance decision-making in the banking sector. It details the project objectives, methodologies, and the importance of accurate loan predictions in mitigating risks and improving customer satisfaction. The report also outlines the technical requirements, data preprocessing steps, and various machine learning algorithms employed in the project.

Uploaded by

thenameisappleabhi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views28 pages

Anu Internshipreport

Uploaded by

thenameisappleabhi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

THE OXFORD COLLEGE OF SCIENCE

Department of Computer Applications

INTERNSHIP REPORT
ON

“LOAN PREDICTION”
Submitted in partial fulfilment of the requirements for the award of the Degree of

Bachelor of Computer Applications

Submitted by

S B Anusha (U03MS21S0394)

Machine Learning Internship at

PRINSTON SMART ENGINEERS

Under the guidance of

Mr. AKASH V

Department of Computer Science and Applications

Bangalore University
CERTIFICATE

Certified that the Project work entitled “LOAN PREDICTION”

carried out by S B ANUSHA (U03MS21S0394) is a bona-fide
student of The Oxford College of science, 4th sector, HSR Layout,
Bengaluru in partial fulfilment of the requirement of VI semester
(Machine Learning Project) Bachelor of computer application, during
the year 2023 – 2024. It is certified that all corrections/suggestions
indicated for Internal Assessment have been incorporated in the report
deposited in the departmental library. The Internship Project report
has been approved as it satisfies the academic requirements with
respect of the Mini Project work prescribed for the said degree.
ACKNOWLEDGEMENT

While presenting this Machine Learning Project on “Loan

Prediction”, I feel it is my duty to acknowledge the help rendered to
us by various people.

I endure our humble and sincere gratitude to Dr. Susil Kumar Sahoo,
Head of Department of Computer Science and Applications, for
his great encouragement and valuable support.

I sincerely acknowledge the guidance and support of our internship

guide, Mr. Akash V, Prinston Smart Engineers who provided
valuable advice throughout the course of the project.
ABSTRACT

Loan prediction is a critical task in the banking and financial

sector, aiming to predict the likelihood of a loan applicant defaulting
on their loan. Accurate predictions can significantly enhance decision-
making processes, reduce financial risk, and increase profitability for
financial institutions. This research focuses on developing a predictive
model for loan approval using machine learning techniques.

The dataset, sourced from a leading financial institution, includes

features such as applicant income, loan amount, credit history,
employment status, and other demographic variables. Various data
preprocessing techniques are applied to handle missing values, encode
categorical variables, and normalize numerical features. The study
explores several machine learning algorithms, including Logistic
Regression, Decision Trees, Random Forests, Support Vector
Machines, and Gradient Boosting.

By leveraging advanced machine learning techniques, this

research provides a reliable and efficient approach to loan prediction,
offering valuable insights for risk management and strategic planning
in the banking industry.
DECLARATION

I, S B ANUSHA, hereby declare that this project work entitled

LOAN PREDICTION is submitted in fulfilment for the award of the
degree of BACHELOR OF COMPUTER APPLICATIONS of
Bangalore University. I further declare that I have not submitted this
project report either in part or in full to any other university for the
award of any degree.

S B ANUSHA
(U03MS21S0394)
CONTENTS

1. Introduction
1.1 Problem Statement
1.2 Objective
1.3 Future Scope
2. Requirements Specification
2.1 Hardware Requirements
2.2 Software Requirements
3. System Definition
3.1 Project Description
3.2 Importance of Loan Prediction
3.3 Working Description
3.4 Libraries Used
3.5 Dataset
3.6 Advantages
3.7 Disadvantages
4. Implementation
5. Snapshots
6. Conclusion
CHAPTER 1
INTRODUCTION

Arthur Samuel, a pioneer in the field of artificial intelligence and

computer gaming, coined the term “Machine Learning”. He defined
machine learning as – “A Field of study that gives computers the
capability to learn without being explicitly programmed.” The process
starts with feeding good quality data and then training our
machines(computers) by building machine learning models using the
data and different algorithms. The choice of algorithms depends on
what type of data we have and what kind of task we are trying to
automate.

How ML works?
 Gathering relevant data for the problem you are trying to solve.
This can come from various sources like databases, sensors,
online repositories, etc. The quality and quantity of data
significantly impact te performance of the machine learning
model.
 Data Preprocessing- Raw data often needs to be cleaned and
formatted before it can be used to train a model. It includes steps
like filling in or removing the missing data points, scaling
features to common range or distribution, converting categorical
data into numerical formats and dividing the data into training
and testing sets.
 Selecting an appropriate algorithm based on the nature of
dataset and building models on the training set.
 Evaluating the trained model using the testing set to assess its
performance. Common metrics include accuracy, precision,
recall, F1-score, mean squared error, etc.
- Linear Algebra
- Statistics and Probability
- Calculus
- Graph Theory
- Programming Skills (Languages like Python, R, MATLAB,
C++, or Octave)

How we split data in Machine Learning?

 Train-Test Split: The training set is used to train the model and
includes majority of the data. The testing set is used to evaluate
the model’s performance and includes the remaining data.
 Validation Data: This part of data is used to do frequent
evaluation of the model, fit ni the training dataset along with
improving involved hyper parameters. This data plays its role
when the model is actually training.
 Testing Data: Once the model is completely trained, testing
data provides an unbiased evaluation. The model will predict
some values after feeding some inputs of testing data. After
prediction, we evaluate the model by comparing it with actual
output present in testing data.

1.1 Problem Statement

Loans are the major requirement of the modern world. By this,
banks get a major part of the total profit. It is beneficial for
students to manage their education and living expenses, and for
people to buy any kind of luxury like houses, cars, etc. But
when it comes to deciding whether the applicant’s profile is
relevant to be granted with loan or not, banks have to look after
many aspects. This project mainly focuses on identifying the
customer segments, those who are eligible for loan aspects so
that they can specifically target these customers.

1.2 Objectives
The objectives of developing a loan prediction model using
machine learning can be outlined as follows:
 Develop a system that can automatically assess and predict
the eligibility of loan applicants, reducing the time and
resources spent on manual evaluations.
 Improve the accuracy of loan approval decisions, ensuring
that only applicants who meet certain criteria and have a
huge likelihood of repayment are approved
 Minimize the risk of default by identifying high-risk
applicants through predictive modeling.
 Ensure consistent and unbiased loan approval decisions by
removing subjective human judgements.
 Provide faster loan approval or rejection feedback to
applicants, enhancing customer satisfaction.
 Gain insights into the key factors that influence loan
approval and default rates.
 Streamline operations and reduce the cost associated with
manual processing of loan applications.
 Ensure that the loan process complies with regulatory
requirements and guidelines.
 Develop a scalable solution that can handle increasing
volumes of loan applications without compromising
performance.
 Maintain transparency and explainability in the decision-
making process to meet compliance standards.

1.3 Future Scope

1. Social Media and Online Behavior: Using data from social
media, online transactions and digital footprints to assess
credit worthiness.
2. IoT and Smart Devices: Leveraging data from smart
devices and IoT to gain insights into an applicant’s
financial behavior and stability.
3. Dynamic Scoring Models: Developing real-time credit
scoring systems that update with new data, providing more
accurate and current assessments.
4. Deep Learning: Utilizing deep neural networks to capture
complex patterns in large datasets for improved prediction
accuracy.
5. Transfer Learning: Applying knowledge gained from one
financial domain to another to enhance model
performance.
6. Fraud Detection: Integrating predictive models with fraud
detection systems to identify and mitigate fraudulent
applications in real-time.
7. Microloans and Nano Loans: Developing models that can
assess the creditworthiness for Microloans and Nano
loans, providing financial services to underserved
populations.
8. Ecosystem Partnerships: Collaborating with fintech
companies, banks, and other financial institutions to create
an interconnected ecosystem that leverages loan prediction
models.
9. Green Loans: Developing models that promote and
support green financing initiatives and sustainable
investments.
CHAPTER 2
REQUIREMENTS SPECIFICATION

2.1 SOFTWARE REQUIREMENTS

 Operating System – Windows 10/11
 Languages used in Python
 Jupyter Notebook
 Libraries like Numpy, Pandas, Matplotlib, Sci-kit Learn,
Seaborn
 Version Control
 Security

2.2 HARDWARE REQUUIREMENTS

 Processor
 Processor Speed – 1 GHz
 Memory – 2 GB RAM
 SSD with 1 TB capacity
 Mouse or any other pointing device
 Keyboard
 Display device – Color Monitor
CHAPTER 3
SYSTEM DEFINITION

3.1 Project Description

The “Loan Prediction” project aims to develop a machine
learning model that predicts the eligibility of applicants for loan
approval based on their personal and financial data. The goal is to
streamline the loan approval process, minimize default risks, and
enhance customer satisfaction through efficient, accurate, and
automated decision-making.
1. Data Collection: The project begins by obtaining the Loan
Prediction dataset which gathers historical loan application data
from a financial institution or open-source database.

2. Data Preprocessing: This process is essential to ensure data

quality and usability. It includes steps like handling missing
values, encoding categorical variables, and scaling numerical
features to prepare the dataset for analysis.

3. Feature Engineering: It creates new features or modify the

existing ones to improve model performance and selects
relevant features based on statistical analysis and domain
knowledge.

4. Model Building: Here the data is split into training, testing and
validations sets. Various machine learning models like Logistic
Regression, Decision Trees, Random Forest, etc. are trained to
predict the loan approval status of the applicants.

5. Model Evaluation: The performance of the models is evaluated

using metrics such as accuracy, precision, recall, etc. The
performance is then compared across different models to choose
the suitable one.
6. Model Interpretation: The feature importance is analyzed to
understand the key factors influencing loan approval. This will
help to ensure that the model is interpretable and explainable to
stakeholders.

7. Deployment: A user interface or API is developed and

integrated into the loan processing system. The deployment
environment should be secure, scalable and efficient.

8. Monitoring and Maintenance: The model performance is

continuously monitored and updated with new data to maintain
accuracy and relevance.

3.2 Importance of Loan Prediction

1. Risk Mitigation
 Assessing Creditworthiness: Accurate loan prediction
models help banks and financial institutions evaluate the
creditworthiness of applicants. By identifying potential
defaulters, institutions can make informed decisions about
whether to approve or reject loan applications.

2. Optimizing Lending Practices

 Personalized Loan Products: Loan prediction models
enable lenders to tailor loan products to individual
applicants based on their risk profile. This personalization
can include adjusting interest rates, loan amounts, and
repayment terms.

3. Regulatory Compliance
 Fair Lending Practices: Loan prediction models help
ensure that lending decisions are based on objective data,
reducing the risk of discrimination and bias. This is
crucial for complying with regulations that mandate fair
lending practices.

4. Operational Efficiency
 Automation of Loan Processing: Predictive models can
automate the evaluation of loan applications, significantly
reducing the time and effort required for manual
assessments. This leads to faster loan processing times and
improved customer satisfaction.

5. Enhanced Customer Experience

 Quick Decision Making: Automated loan prediction
models enable quicker decision-making, providing
applicants with rapid feedback on their loan applications.
This improves the overall customer experience and
satisfaction.

6. Continuous Improvement
 Learning from Data: Machine learning models improve
over time as they are exposed to more data. This
continuous learning process helps institutions refine their
predictive capabilities and adapt to changing market
conditions and borrower behaviours.

7. Strategic Decision Making

 Informed Decision-Making: Data-driven insights from
loan prediction models support strategic decision-making
at higher levels of management. This includes portfolio
management, risk assessment, and long-term planning.

3.3 Working Description

In this data-driven project, we will predict whether a loan
applicant will be approved or not based on their personal and financial
information.
1. Data Collection
 Gather historical loan application data, which includes
features like applicant demographics, income details, loan
amount, credit history and loan status.

2. Data Preprocessing
 Handling Missing Values
 Encoding Categorical Variables
 Feature Scaling
 Exploratory Data Analysis (EDA)

3. Feature Engineering
 Creating New Features: Combine existing features to
create new ones that may better represent the underlying
patterns (e.g., Total_Income = ApplicantIncome +
CoapplicantIncome).

 Feature Selection: Use statistical tests, correlation analysis

and domain knowledge to select the most relevant features
for the model.

4. Data Splitting
 Split the dataset into training and testing sets.

5. Model Selection and Training

 Train multiple machine learning models and select the
best performing one.

6. Model Evaluation
 Evaluate the model using the validation set to tune the
model and avoid overfitting. The different metrics include
accuracy, precision, recall etc.

7. Model Interpretation
 Understand which features are most important in the
model’s decision-making process. Tools like SHAP
(SHapley Additive exPlanation) can help with
interpretability.

8. Testing and Validation

 Test the final model on the test set to evaluate its
performance on unseen data.

9. Deployment
 Develop an API or user-interface for the model to be used
in real-time loan application processing.

10. Monitoring and Maintenance

 Continuously monitor the model’s performance and
update it with new data to maintain its accuracy and
relevance.

3.4 Libraries used

For the Loan Prediction project, several Python libraries are
used for various tasks such as data manipulation, visualization and
analysis.
1. NumPy (‘import numpy as np’): NumPy (Numerical Python)
is a fundamental library for scientific computing in Python. It
provides support for arrays, matrices, and many mathematical
functions to operate on these data structures efficiently.

2. Pandas (‘import pandas as pd’): Pandas is a powerful and

flexible open-source data analysis and data manipulation library
for Python. It provides data structures and functions needed to
work on structured data seamlessly and efficiently.

3. Matplotlib (‘import [Link] as plt’): Matplotlib is a

comprehensive library for creating static, animated, and
interactive visualizations in Python. It is widely used in data
science, data analysis, and scientific research to create a variety
of plots and charts.

4. Seaborn (‘import seaborn as sns’): Seaborn is a powerful and

user-friendly data visualization library for Python, built on top
of Matplotlib. It provides a high-level interface for drawing
attractive and informative statistical graphics, making it a
popular choice among data scientists and analysts.

5. Sci-Kit Learn (‘from sklearn…’): Scikit-learn (often

abbreviated as sklearn) is one of the most popular and powerful
machine learning libraries in Python. It provides simple and
efficient tools for data mining and data analysis, built on
NumPy, SciPy, and Matplotlib.

3.5 Dataset
The data of the individuals who applied for the loan, is used for
the analysis. This contains various details of the applicants like
marital status, education, their income, their employment details, etc.
Key features include:
1. Loan_ID: Unique identifier for each loan application.
2. Gender: Gender of the applicant (Male/Female).
3. Married: Marital status of the applicant (Yes/No).
4. Dependents: Number of dependents (0, 1, 2, 3+).
5. Education: Education level (Graduate/Not graduate).
6. Self_Employed: Self-employment status (Yes/No).
7. ApplicantIncome: Monthly income of the applicant.
8. CoapplicantIncome: Monthly income of the co-applicant.
9. LoanAmount: Loan amount in thousands.
10. Loan_Amount_Term: Term of the loan in months.
11. Credit_History: Credit history (1: Good, 0: Bad)
12. Property_Area: Property location (Urban/Semiurban/Rural)
13. Loan_Status: Target variable indicating the loan approval status
(Y: Approved, N: Not Approved).

3.6 Advantages
 Accuracy: Advanced predictive models can assess the credit
worthiness of applicants more accurately than the traditional
methods, reducing the likelihood of defaults.
 Operational Efficiency: Automating the loan approval process
reduces the need for manual intervention, cutting down on labor
costs and minimizing human errors.
 Fraud Detection: Predictive models can identify suspicious
patterns that may indicate fraudulent applications, saving the
institution from potential financial losses.
 Risk Assessment: Machine learning models can evaluate a vast
array of data points to predict the risk associated with each loan
applicant more comprehensively.

3.7 Disadvantages
While loan prediction models can significantly improve the
efficiency and accuracy of the loan approval process, they are not
without their drawbacks. Here are some potential disadvantages of
using loan prediction models:
 Incomplete or Inaccurate Data: The model's accuracy is highly
dependent on the quality of the data. Incomplete or inaccurate
data can lead to incorrect predictions.
 Over-fitting & Under-fitting: If the model is too complex, it
may over fit the training data, capturing noise instead of the
underlying pattern. This reduces its performance on new,
unseen data. Conversely, a model that is too simple may under-
fit, failing to capture the underlying trends in the data.
 Complexity: Some advanced models, such as ensemble
methods or neural networks, can be difficult to interpret and
explain to stakeholders. This lack of transparency can be
problematic in regulated industries like finance.
 Privacy: The use of personal data in loan prediction models
raises privacy concerns. Ensuring compliance with data
protection regulations (e.g., GDPR) is essential.
CHAPTER 4
IMPLEMENTATION (CODE)

Executed in Jupyter Notebook Environment

#import statements
import pandas as pd
import numpy as np
import [Link] as plt
import seaborn as sns

#loading the dataset

df = pd.read_csv("loanpred_dataset.csv")
[Link](10)

#analyzing the data

print([Link])
print([Link]())
print([Link]())

#data wrangling
print([Link]())
print([Link]().sum())
#checking if there are any null values
lpp = [Link](deep=True)
#Copying the dataset into another variable

lpp["Gender"].fillna("Others", inplace=True)
lpp["Married"].fillna("Unmarried", inplace=True)
lpp["Self_Employed"].fillna("No", inplace=True)

#train and test data

x = lpp[["LoanAmount"]]
y = lpp.Loan_Status
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.2,
random_state=2)
from sklearn.linear_model import LinearRegression
slr = LinearRegression()
[Link](x_train,y_train)

#KNN Algorithm
from [Link] import KNeighborsClassifier
knn = KNeighborsClassifier(n_neighbors=3, metric= 'euclidean')
[Link](x_train, y_train)
xpred = [Link](x_test)
print(xpred)
from [Link] import classification_report, accuracy_score
print(accuracy_score(y_test, xpred))
#Decision Tree Classifier
from [Link] import DecisionTreeClassifier
clf = DecisionTreeClassifier()
[Link](x, y)

#printing confusion matrix and plot tree

from [Link] import confusion_matrix
confusion_matrix(y,ypred)
from [Link] import plot_tree
plot_tree(clf,feature_names=["LoanAmount"], class_names=["Y","N"])

#Heat map
[Link](figsize=(10, 6))
corr = [Link]()
[Link](corr, annot=True, cmap='coolwarm')
[Link]('Correlation Heatmap')
[Link]()
CHAPTER 5
SNAPSHOTS

5.1 Libraries Imported and Dataset Loaded

Figure 5.1 Libraries Imported and Dataset Loaded

5.2 Filling the null values

Figure 5.2 Filling the null values

5.3 Correlation Heat map

Figure 5.3 Correlation Heatmap

5.4 Training the Data using Linear Regression

Figure 5.4 Training the data using Linear Regression

5.5 KNN Classification

Figure 5.5 KNN Classification

5.7 Decision Tree Classifier

Figure 5.7 Decision Tree Classifier

CHAPTER 6
CONCLUSION

Loan Prediction is a critical process for financial institutions,

enabling them to assess the risk associated with loan applications and
make data-driven decisions. By these techniques, lenders can
significantly improve their risk management practices, enhance the
accuracy of their lending decisions and reduce the incidents of loan
defaults.
The implementation of loan prediction models offers significant
benefits, including risk mitigation, improved decision making,
operational efficiency, and regulatory compliance. It has also raised
awareness about the ethical considerations and responsibilities
associated with automated decision-making, especially in domains
with significant real-world consequences like finance. By leveraging
advanced data analytics and machine learning techniques, financial
institutions can develop robust models that accurately predict loan
defaults, thereby supporting sustainable and profitable lending
practices. Continuous innovation and improvement in these models
are necessary to address ongoing challenges and meet the dynamic
needs of the financial sector.

ML and Ai Synopsis
No ratings yet
ML and Ai Synopsis
8 pages
Machine Learning for Loan Approval Prediction
No ratings yet
Machine Learning for Loan Approval Prediction
60 pages
Loan Prediction Using Artificial Intelligence and Machine Learning
No ratings yet
Loan Prediction Using Artificial Intelligence and Machine Learning
24 pages
Loan Approval Prediction Model
No ratings yet
Loan Approval Prediction Model
7 pages
Wa0000.
No ratings yet
Wa0000.
58 pages
Shailesh Synopsis 1
No ratings yet
Shailesh Synopsis 1
99 pages
Loan Approval Prediction with ML Techniques
No ratings yet
Loan Approval Prediction with ML Techniques
19 pages
Loan Approval Prediction Using Supervised Learning Algorithm
No ratings yet
Loan Approval Prediction Using Supervised Learning Algorithm
11 pages
Loan Risk Prediction Using Machine Learning
No ratings yet
Loan Risk Prediction Using Machine Learning
31 pages
1822 B.E Cse Batchno 92
No ratings yet
1822 B.E Cse Batchno 92
69 pages
Loan Prediction with Data Analytics
No ratings yet
Loan Prediction with Data Analytics
31 pages
Loan Approval - PPT
No ratings yet
Loan Approval - PPT
19 pages
Loan Prediction System Overview
No ratings yet
Loan Prediction System Overview
5 pages
Loan Prediction Using Artificial Intelligence and Machine Learning
No ratings yet
Loan Prediction Using Artificial Intelligence and Machine Learning
23 pages
Arpit Pal E2 17 Report Loan-Prediction-System
No ratings yet
Arpit Pal E2 17 Report Loan-Prediction-System
34 pages
Loan Eligibility Prediction
No ratings yet
Loan Eligibility Prediction
14 pages
Phase 2 Loan Prediction
No ratings yet
Phase 2 Loan Prediction
26 pages
Wa0003.
No ratings yet
Wa0003.
6 pages
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
No ratings yet
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
11 pages
Loan Predictor Using ML: Internship Report
No ratings yet
Loan Predictor Using ML: Internship Report
12 pages
Loan Eligibility Prediction with ML
No ratings yet
Loan Eligibility Prediction with ML
28 pages
Bank Loan Prediction with Machine Learning
No ratings yet
Bank Loan Prediction with Machine Learning
4 pages
Loan Eligibility Prediction with ML
No ratings yet
Loan Eligibility Prediction with ML
9 pages
2022 V13i1198
No ratings yet
2022 V13i1198
12 pages
Sat - 6.Pdf - Prediction of Modernized Loan Approval System Based On Machine Learning Approach
No ratings yet
Sat - 6.Pdf - Prediction of Modernized Loan Approval System Based On Machine Learning Approach
11 pages
Paper 14014
No ratings yet
Paper 14014
9 pages
Loan Eligibility Prediction System
No ratings yet
Loan Eligibility Prediction System
8 pages
Research Paper ALAS
No ratings yet
Research Paper ALAS
4 pages
Research Paper
No ratings yet
Research Paper
14 pages
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
No ratings yet
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
22 pages
Bank Loan Approval via ML
No ratings yet
Bank Loan Approval via ML
13 pages
Loan Approval Prediction Project Report
No ratings yet
Loan Approval Prediction Project Report
58 pages
minipptPOWER 1pdf
No ratings yet
minipptPOWER 1pdf
16 pages
Loan Default Prediction with ML Models
No ratings yet
Loan Default Prediction with ML Models
17 pages
Loan Prediction with Machine Learning
No ratings yet
Loan Prediction with Machine Learning
4 pages
Loan Approval Prediction Using ML Techniques
No ratings yet
Loan Approval Prediction Using ML Techniques
36 pages
Loan Eligibility Prediction with ML
No ratings yet
Loan Eligibility Prediction with ML
72 pages
Loan Approval Prediction with ML Models
No ratings yet
Loan Approval Prediction with ML Models
11 pages
B.tech It Batchno 50
No ratings yet
B.tech It Batchno 50
31 pages
Loan Prediction Project Report
No ratings yet
Loan Prediction Project Report
3 pages
Loan Approval Prediction Using Machine Learning
No ratings yet
Loan Approval Prediction Using Machine Learning
14 pages
Loan Approval Prediction System Using Machina Learning
No ratings yet
Loan Approval Prediction System Using Machina Learning
4 pages
Bank Loan Prediction Using ML
No ratings yet
Bank Loan Prediction Using ML
65 pages
Loan Approval Prediction with Naive Bayes
No ratings yet
Loan Approval Prediction with Naive Bayes
9 pages
YB Corr Project Report
No ratings yet
YB Corr Project Report
43 pages
Loan Prediction System
No ratings yet
Loan Prediction System
8 pages
Decision Tree Model for Loan Approval
No ratings yet
Decision Tree Model for Loan Approval
7 pages
Literature Survey
No ratings yet
Literature Survey
3 pages
Loan Approval Prediction Project Report
No ratings yet
Loan Approval Prediction Project Report
22 pages
Wa0001.
No ratings yet
Wa0001.
8 pages
Loan Approval System Based On Machine Learning Approach
100% (1)
Loan Approval System Based On Machine Learning Approach
55 pages
AI-Based Loan Approval System Report
100% (1)
AI-Based Loan Approval System Report
20 pages
Loan Prediction with Machine Learning
No ratings yet
Loan Prediction with Machine Learning
25 pages
Loan Prediction System with AutoML
No ratings yet
Loan Prediction System with AutoML
52 pages
Loan Approval Prediction Using Machine Learning
No ratings yet
Loan Approval Prediction Using Machine Learning
16 pages
Loan Approval Prediction Based On Machine Learning Approach: Kumar Arun, Garg Ishan, Kaur Sanmeet
No ratings yet
Loan Approval Prediction Based On Machine Learning Approach: Kumar Arun, Garg Ishan, Kaur Sanmeet
4 pages
Pptloan
No ratings yet
Pptloan
8 pages
Credit Card Score Prediction Models
No ratings yet
Credit Card Score Prediction Models
8 pages
Loan Approval Prediction with ML
No ratings yet
Loan Approval Prediction with ML
10 pages
Six Sigma for Business Leaders
No ratings yet
Six Sigma for Business Leaders
373 pages
13 434 Mflow Extrusion Plastometer PI en
No ratings yet
13 434 Mflow Extrusion Plastometer PI en
11 pages
Mechasys Sales Deck - XR Projector - en - USD
No ratings yet
Mechasys Sales Deck - XR Projector - en - USD
21 pages
Understanding Epidemiology Basics
No ratings yet
Understanding Epidemiology Basics
106 pages
General Surveying Overview and Techniques
No ratings yet
General Surveying Overview and Techniques
3 pages
3-2 Internship
No ratings yet
3-2 Internship
30 pages
P1 Febr2025
No ratings yet
P1 Febr2025
17 pages
Calibration of Gas Measurement Instruments
No ratings yet
Calibration of Gas Measurement Instruments
3 pages
Mobilelab ICMR
No ratings yet
Mobilelab ICMR
9 pages
GPT-4 Translation Error Detection
No ratings yet
GPT-4 Translation Error Detection
8 pages
GD&T Form Measurement
No ratings yet
GD&T Form Measurement
19 pages
Hach Method 10242 Revision 1.2 March 2022
No ratings yet
Hach Method 10242 Revision 1.2 March 2022
12 pages
Common Instrumentation Measurement Terms
No ratings yet
Common Instrumentation Measurement Terms
3 pages
Types of Sensors and Transducers
No ratings yet
Types of Sensors and Transducers
84 pages
Machine Learning Internship Report
No ratings yet
Machine Learning Internship Report
43 pages
Smai Lecture 04 Perf Measures Classification
No ratings yet
Smai Lecture 04 Perf Measures Classification
42 pages
Understanding Quantitative Research in SHS
No ratings yet
Understanding Quantitative Research in SHS
8 pages
Q2 Practical Research 2 DLL Week 008
No ratings yet
Q2 Practical Research 2 DLL Week 008
13 pages
Headspace GC for Residual Solvents in Docetaxel
No ratings yet
Headspace GC for Residual Solvents in Docetaxel
6 pages
0580 Y20 SM 4
No ratings yet
0580 Y20 SM 4
12 pages
Analyst Specialization and Conglomerate Stock Breakups: Stuartc - Gilson, Paulm - Healy, Christopherf - Noe
No ratings yet
Analyst Specialization and Conglomerate Stock Breakups: Stuartc - Gilson, Paulm - Healy, Christopherf - Noe
18 pages
Surrey Phys 1102-1220 Manual-3
No ratings yet
Surrey Phys 1102-1220 Manual-3
125 pages
School Opening and Diagnostic Tests Guide
No ratings yet
School Opening and Diagnostic Tests Guide
157 pages
RCSI Sample Size Calculation Guide
No ratings yet
RCSI Sample Size Calculation Guide
36 pages
NLP NB
No ratings yet
NLP NB
52 pages
Analytical Method Validation: Interview Questions and Answers
No ratings yet
Analytical Method Validation: Interview Questions and Answers
9 pages
Analysis of Ethylene Glycols and Propylene Glycols: Standard Test Methods For
No ratings yet
Analysis of Ethylene Glycols and Propylene Glycols: Standard Test Methods For
14 pages
AQA GCSE Required Practicals
No ratings yet
AQA GCSE Required Practicals
5 pages
Multi-Channel CNN for Thyroid Diagnosis
No ratings yet
Multi-Channel CNN for Thyroid Diagnosis
11 pages

Anu Internshipreport

Uploaded by

Anu Internshipreport

Uploaded by

THE OXFORD COLLEGE OF SCIENCE

Department of Computer Applications

Bachelor of Computer Applications

Machine Learning Internship at

Under the guidance of

Department of Computer Science and Applications

Certified that the Project work entitled “LOAN PREDICTION”

While presenting this Machine Learning Project on “Loan

I sincerely acknowledge the guidance and support of our internship

Loan prediction is a critical task in the banking and financial

The dataset, sourced from a leading financial institution, includes

By leveraging advanced machine learning techniques, this

I, S B ANUSHA, hereby declare that this project work entitled

Arthur Samuel, a pioneer in the field of artificial intelligence and

How we split data in Machine Learning?

1.1 Problem Statement

1.3 Future Scope

2.1 SOFTWARE REQUIREMENTS

2.2 HARDWARE REQUUIREMENTS

3.1 Project Description

2. Data Preprocessing: This process is essential to ensure data

3. Feature Engineering: It creates new features or modify the

5. Model Evaluation: The performance of the models is evaluated

7. Deployment: A user interface or API is developed and

8. Monitoring and Maintenance: The model performance is

3.2 Importance of Loan Prediction

2. Optimizing Lending Practices

5. Enhanced Customer Experience

7. Strategic Decision Making

3.3 Working Description

 Feature Selection: Use statistical tests, correlation analysis

5. Model Selection and Training

8. Testing and Validation

10. Monitoring and Maintenance

3.4 Libraries used

2. Pandas (‘import pandas as pd’): Pandas is a powerful and

3. Matplotlib (‘import [Link] as plt’): Matplotlib is a

4. Seaborn (‘import seaborn as sns’): Seaborn is a powerful and

5. Sci-Kit Learn (‘from sklearn…’): Scikit-learn (often

Executed in Jupyter Notebook Environment

#loading the dataset

#analyzing the data

#train and test data

#printing confusion matrix and plot tree

5.1 Libraries Imported and Dataset Loaded

Figure 5.1 Libraries Imported and Dataset Loaded

5.2 Filling the null values

5.3 Correlation Heat map

5.4 Training the Data using Linear Regression

Figure 5.4 Training the data using Linear Regression

5.5 KNN Classification

5.7 Decision Tree Classifier

Figure 5.7 Decision Tree Classifier

Loan Prediction is a critical process for financial institutions,

You might also like