Deep Learning Based Car Damage Detection, Classification and Severity
Deep Learning Based Car Damage Detection, Classification and Severity
Volume
Ritik Gandhi, International Journal of Advanced 10,inNo.5,
Trends September
Computer - October
Science and 2021
Engineering, 10(5), September - October 2021, 2947 – 2953
International Journal of Advanced Trends in Computer Science and Engineering
Available Online at http://www.warse.org/IJATCSE/static/pdf/file/ijatcse031052021.pdf
https://doi.org/10.30534/ijatcse/2021/031052021
Received Date : August 06, 2021 Accepted Date : September 15, 2021 Published Date : October 06, 2021
of the images were also done [15]. Since the dataset is small,
ABSTRACT we used Data Augmentation Techniques to synthetically
enlarge the dataset. We consider the common car damages
In the accident insurance industry, settling the claim is a such as bumper dent, bumper scratch, door dent, door scratch,
time-consuming process since it is a manual process and there glass shattered, scratch, smash and some images of no
is a gap between the optimal and the actual settlement. Using damaged cars.
deep learning models, we are not only trying to speed up the A number of techniques were tried such as directly training a
process but also provide better customer service and increase CNN, using a pre-trained CNN model, using transfer
the profitability of insurance companies. In this paper we are learning from large CNNs and building an ensemble
using various pretrained models such as VGG 16, VGG 19, classifier. Out of all the techniques tried we observed that
Resnet50 and Densenet and based on these models, selecting transfer learning works the best. The damaged part
the best performing models. We initially check whether the classification and its localization was done using the
car is damaged or not using the Resnet50 model and if it’s a YOLOv3 model which classifies an image and draws the
damaged one we use the WPOD-net model to detect the bounding boxes around them. To add more to it, YOLOv3
license plate. To identify the damaged region, we use the framework is significantly faster than R-CNN models because
YOLO model. At last, comes the damage severity which is it has a bigger network and residual networks are added by
implemented using the Densenet model. After implementing adding various shortcut connections. To extract and read the
various models, we find out that transfer learning gives better license plate we used the WPOD-net model which detects the
results than fine-tuning. In addition to that we propose a license plate no matter how different the distortion is and
framework that integrates all of this into one application and further rectifies the license plate area to a rectangular shape so
in turn helps in the automation of the insurance industry. that the detections can be further fed to the OCR network. The
damage severity of the car was done on 3 parameters: Minor,
Key words: Deep Learning, Damage assessment (detection, Moderate, Severe.
classification and severity), Pre-trained CNN Models, YOLO Although a lot many minor factors were taken into account so
as to make the model as much best performing as possible but
1. INTRODUCTION along the way the focus was on the influence of certain
hyper-parameters and searching theoretically defined ways to
The Global Auto Insurance market is projected to reach $1.06 adapt them [2].
trillion by 2027 and still a lot of money is being wasted when Since Deep Learning is one of the best techniques when it
it comes to claims. The traditional method involves a tedious comes to image processing related tasks, a major challenge
process wherein the customer submits the claim documents to was to reduce the model training time since a traditional CNN
the agent who in turn submits the claim documents to the model can be very time-consuming to perform image
company and then an external evaluator inspects the unit and classification tasks and identify the correct weights for the
correspondingly prepares his reports. The company reviews it network by multiple forward and backward iterations.
and issues the LOA and then sends the car to the shop for
repairs. This has forced the insurance firms to look out for 2. RELATED WORK
solutions that include fair assessment and faster agreement of
claims. Whenever object detection comes into play, deep learning has
In this paper we try to employ different Convolutional Neural always shown promising results. The most popular detection
Network (CNN) models such as VGG16, VGG19, Resnet50, algorithms include the Convolutional Neural Networks
Densenet ,etc and based on the accuracy, select the models (CNN), since they perform well for many computer vision
that work best for us. We couldn’t find any publicly available tasks such as visual object recognition and detection [3][4].
dataset for the same and therefore created our own dataset by With computing resources based on transfer learning
web scraping different sites. Manual filtering and annotating solutions and extensive use of data, deep learning has been
outstanding in image classification [5][6].
2947
Ritik Gandhi, International Journal of Advanced Trends in Computer Science and Engineering, 10(5), September - October 2021, 2947 – 2953
2948
Ritik Gandhi, International Journal of Advanced Trends in Computer Science and Engineering, 10(5), September - October 2021, 2947 – 2953
2949
Ritik Gandhi, International Journal of Advanced Trends in Computer Science and Engineering, 10(5), September - October 2021, 2947 – 2953
The images for the damaged and not damaged classes are
equal leading to no class imbalance in both the training and
testing data folder.
The license plate reader was tested over a dataset of the Indian
car license plate images and the score was determined by
comparing the predicted sequences of both Pytesseract and
CNN with the actual value, using a Sequence Matcher.
The best of the two was taken as the score of that image. The
accuracy was close to 80.34%.
2950
Ritik Gandhi, International Journal of Advanced Trends in Computer Science and Engineering, 10(5), September - October 2021, 2947 – 2953
Stage 3: Damaged Part Classification and Localization Since there is no proper data, LabelImg tool was used for
using YOLO creating bounding boxes and giving classes. Certain specific
files were created for training yolo. Since we had 3 classes the
The damaged part classification is done using the YOLO YOLO was trained for 7000 epochs and the weights were
model, if the car is damaged then then this model is used to saved for multiples of 1000. The observation showed that
localize the damaged part of the car. YOLO refers to “YOU YOLO custom 5000 weights gave better results than other
LOOK ONLY ONCE” and is one of the most versatile models.
models when it comes to object detection. It classifies and YOLO divides all the input images into the SxS grid system
finds damaged part of a car in an image and draws the and each grid is responsible for object detection. The grid
bounding boxes around them. cells are actually responsible for prediction the boundary
boxes for the detected objects. For every box there are 5 main
attributes that are to be considered x and y for coordinates, w
and h for width and height of the object, and a confidence
score for the probability that the box containing the object. 9
classes are used for classification such as bumper dent,
bumper scratch, door dent, door scratch, windshield damaged
etc.
2951
Ritik Gandhi, International Journal of Advanced Trends in Computer Science and Engineering, 10(5), September - October 2021, 2947 – 2953
Fig 17. Minor damage On the original data for stage 1 that is to check whether the
car is damaged or not we can observe that Densenet trained on
all layers is performing better than other models. The
accuracy of this model is 96.3%, precision of 94.9% and
recall of 97.8%.
For stage 2 which is the Damage Localization again Densenet
outperformed the rest of the models with an accuracy of
76.5%, precision of 76.8% and recall of 74.4%. For stage 3
Resnet perfromed better than rest of the models with an
accuracy of 67.8%, precision of 68.5% and recall of 67.3%.
2952
Ritik Gandhi, International Journal of Advanced Trends in Computer Science and Engineering, 10(5), September - October 2021, 2947 – 2953
2953