0% found this document useful (0 votes)
29 views40 pages

Merged

This internship report presents a study on predicting the Remaining Useful Life (RUL) of electric vehicle (EV) batteries using deep learning models, specifically LSTM, RNN, and CNN. The research addresses the challenges of battery degradation due to various factors, including temperature, and aims to develop a robust framework for accurate life forecasting. The findings are expected to enhance battery management systems and contribute to sustainable mobility solutions.

Uploaded by

J.A.S SANTHOSH
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views40 pages

Merged

This internship report presents a study on predicting the Remaining Useful Life (RUL) of electric vehicle (EV) batteries using deep learning models, specifically LSTM, RNN, and CNN. The research addresses the challenges of battery degradation due to various factors, including temperature, and aims to develop a robust framework for accurate life forecasting. The findings are expected to enhance battery management systems and contribute to sustainable mobility solutions.

Uploaded by

J.A.S SANTHOSH
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

LIFE TIME PREDICTION OF EV BATTERY USING

MACHINE LEARNING MODELS.


Internship Report submitted in partial fulfillment of the requirements for the
degree of

BACHELOR OF ENGINEERING

IN

COMPUTER SCIENCE AND ENGINEERING

Submitted By

J.A.S SANTHOSH–22CS133

Under the guidance of

[Link] R
Professor and Head, Battery Engineering Lab
Department of Engineering Design Indian
Institute of Technology Madras, Chennai-600036

COMPUTERSCIENCEDEPARTMENT INDIAN INSTITUTE OF


ST. JOSEPH’S COLLEGE OF TECHNOLOGY MADRAS,
ENGINEERING DEPARTMENT OF
(AFFILIATEDTOANNAUNIVERSITY) ENGINEERING DESIGN,
OMR, CHENNAI – 600 119 CHENNAI – 600 036
DECLARATION

I here by declare that this entitled “LIFE TIME PREDICTION OF EV


BATTERY USING DEEP LEARNING MODELS“submitted for the award of
the degree of Bachelor of Engineering in Computer Science and Engineeringto St.
Joseph’s College Of Engineering is a record ofbonafide work carried out by J A S
SANTHOSH under the supervision of [Link] R, Professor Department
of Engineering Design,IITM,Chennai.

I further declare that the work reported in this report has not been submittedand
will not be submitted, either in part or in full, for the award of any other degree or
diploma in this institute or any other institute or university.

Place: Chennai

Date:30/06/2025 Signature of the Candidate

1
TABLE OF CONTENTS

SNO. TITLE PAGE


NO
1 INTRODUCTION 4

2 PROBLEMSTATEMENTANDSYNOPSIS 6

3 BLOCKDIAGRAM 13

4 DEVELOPMENT ENVIRONMENT DETAILS 14

5 DATA COLLECTION AND PREPROCESSING 15

6 ALGORITHM DEVELOPMENT 20

7 MODEL DEVELOPMENT AND EVALUATION 24

8 PERFORMANCE EVALUATION AND TESTING 27

9 FUTURE SCOPE 32

10 CONCLUSION 34

11 REFERENCES 36

2
ACKNOWLEDGEMENT

With immense pleasure and deep sense of gratitude, I wish to express my


sincerethanks to my supervisor [Link] R, Professor, IITM, Chennai
without their motivation and continuous encouragement, this project work would
not have been successfully completed.

I am grateful to the Chairman of St. Joseph’s College Of Engineering,


[Link], the Chairman for motivating me to carry out research in
the St. Joseph’s College of Engineering and also for providing me with
infrastructural facilities and many other resources needed for my project.

I express my sincere thanks to [Link], HOD, Department of


Computer Science Engineering, St. Joseph’s College Of Engineering for her kind
words of support and encouragement.

Place: Chennai

Date:30/06/2025 J A S SANTHOSH

3
INTRODUCTION
The global shift towards sustainable transportation has accelerated the adoption of electric
vehicles (EVs), driven by the urgent need to reduce carbon emissions, dependence on fossil
fuels, and environmental pollution. At the heart of this transition lies the lithium-ion battery,
the primary power source for EVs. Despite their advantages in energy density, efficiency, and
rechargeability, lithium-ion batteries are prone to gradual degradation over time due to a
combination of electrochemical, mechanical, and thermal factors. This degradation process
significantly impacts vehicle performance, operational safety, and maintenance schedules.
Therefore, predicting the Remaining Useful Life (RUL) of these batteries is a critical
challenge in modern EV technology and maintenance systems.

RUL refers to the estimated time or usage cycles remaining before a battery reaches the end
of its useful performance threshold. Accurate RUL estimation plays a pivotal role in
minimizing the risk of sudden failures, optimizing energy management strategies, and
planning timely battery replacements. It also supports predictive maintenance frameworks,
enhances user confidence, and improves the total cost of ownership of EVs. However, the
prediction of RUL is inherently complex due to the nonlinear, time-dependent, and
temperature-sensitive nature of battery aging mechanisms.
Traditional approaches to battery life estimation have primarily relied on physics-based
models or empirical degradation curves. While these methods provide a theoretical
understanding, they often lack adaptability to varying operational profiles and environmental
conditions. They also require extensive domain expertise, precise modeling of internal battery
chemistry, and calibration with numerous experimental constants, which limits their
scalability across diverse battery types and usage scenarios.

In contrast, data-driven methods—particularly those rooted in machine learning and deep


learning—offer a scalable and robust alternative. By learning patterns directly from historical
performance data, these models can generalize across diverse scenarios and capture complex
relationships that may not be visible through analytical modeling. Deep learning, in
particular, has emerged as a powerful tool for time-series forecasting due to its ability to
model long-term dependencies, nonlinearities, and dynamic behaviors inherent in battery
degradation processes.

4
This report explores the application of deep learning models such as Long Short-Term
Memory (LSTM), Recurrent Neural Networks (RNN), and Convolutional Neural Networks
(CNN) for the task of RUL prediction in EV batteries. These models were trained and
validated on a rich dataset consisting of various operational features—such as current,
voltage, temperature, capacity, and time—collected from battery test cycles under controlled
laboratory conditions. A unique aspect of this study is the inclusion of temperature variations
(25°C, 30°C, and 50°C) to examine their influence on battery health and model performance.
Temperature is a critical factor affecting electrochemical reactions inside the battery, and
understanding its impact enables more accurate life predictions and safer thermal
management in real-world EV systems.

The motivation for selecting LSTM, RNN, and CNN stems from their proven efficacy in
sequence modeling and signal interpretation tasks. LSTM is particularly adept at capturing
long-term temporal dependencies, making it well-suited for time-series data like battery
degradation trends. RNN provides a baseline for sequential learning, while CNN,
traditionally used for spatial data, can extract hierarchical features when applied to
transformed input data such as those obtained through Fast Fourier Transform (FFT). FFT
enhances the representation of raw signals by revealing patterns in the frequency domain that
might otherwise be hidden in the time domain.

By employing these models and comparing their performance across different thermal
environments, this project aims to determine the optimal deep learning strategy for accurate
and robust battery life forecasting. Achieving high predictive accuracy (R² > 0.8) under these
varying conditions is a key target. The findings from this work are expected to contribute
meaningfully to the fields of battery health management, electric vehicle safety, and
intelligent transportation systems, aligning with India's growing focus on electric mobility
and green technologies.

Furthermore, the application of such models is not limited to EVs but extends to all domains
where lithium-ion batteries are used—such as aerospace, defense, consumer electronics, and
renewable energy storage. Thus, this research not only addresses a vital engineering
challenge but also aligns with global priorities of energy efficiency, environmental
sustainability, and advanced AI-driven diagnostics.

5
PROBLEM STATEMENT

 The advancement of electric vehicle (EV) technology represents a pivotal shift in


modern transportation systems, aimed at reducing greenhouse gas emissions,
improving energy security, and transitioning toward sustainable mobility solutions.
Central to the performance, reliability, and commercial viability of EVs is the lithium-
ion battery, which functions as the primary energy storage system. However, lithium-
ion batteries are inherently susceptible to gradual degradation due to a combination of
electrochemical aging, calendar aging, temperature fluctuations, mechanical stress,
and charge-discharge cycling. This degradation significantly influences the battery's
state of health (SOH) and limits the total mileage, efficiency, and safety of EVs over
time.

 One of the most critical challenges in this context is the accurate prediction of the
Remaining Useful Life (RUL) of the EV battery—defined as the estimated
operational time or number of cycles left before the battery reaches its end-of-life
threshold, beyond which its performance becomes unreliable or unsafe. Inaccurate or
delayed identification of battery degradation can lead to unexpected failures,
increased maintenance costs, compromised user safety, and inefficient energy
management. On the contrary, over-conservative replacement strategies may result in
premature discarding of batteries, contributing to economic losses and environmental
hazards related to battery disposal.

 Traditional approaches to battery life prediction are largely based on empirical models
or physics-based degradation analysis, such as equivalent circuit models (ECM),
electrochemical impedance spectroscopy (EIS), and capacity fade models. While
these methods offer interpretability and are grounded in the physical chemistry of
batteries, they often require complex parameter estimation, domain-specific
knowledge, and are unable to adapt to highly dynamic and non-linear usage patterns.
Furthermore, these models do not generalize well when exposed to real-world
operational variabilities such as fluctuating loads, inconsistent charge/discharge
cycles, and diverse environmental conditions, particularly temperature.

6
 To overcome these limitations, recent advancements in Artificial Intelligence (AI) and
Machine Learning (ML), particularly Deep Learning (DL), have emerged as
promising alternatives. These models have the capability to learn complex patterns
from large volumes of high-dimensional data and can effectively model the temporal
dependencies, nonlinearities, and multi-factorial influences that characterize battery
degradation. Among these, Long Short-Term Memory (LSTM) networks,
Recurrent Neural Networks (RNN), and Convolutional Neural Networks (CNN)
are highly effective in modeling time-series data and signal variations, which makes
them suitable for the prediction of battery life.

 Despite this promise, several technical gaps and research challenges remain
unaddressed:
 How can we ensure that deep learning models are robust across different operational
temperatures, which significantly affect electrochemical reactions inside the battery?
 Can a unified model accurately predict RUL across varying thermal environments
(e.g., 25°C, 30°C, 50°C), or is temperature-specific training necessary?
 How does the performance of temporal models like LSTM compare with frequency-
domain models such as CNN when Fast Fourier Transform (FFT) is applied to input
features?
 What are the trade-offs between model complexity, training time, interpretability, and
predictive performance in a real-time, embedded EV system?
 How do we ensure the model generalizes well to unseen battery usage profiles and
maintains prediction accuracy above industrial thresholds (R² > 0.8)?
 To address these questions, this project aims to develop a comprehensive deep
learning-based framework for accurately estimating the RUL of EV batteries. The
system will leverage a multivariate dataset consisting of features such as current,
voltage, temperature, instantaneous capacity, cumulative capacity, true State of
Charge (SoC), and time, collected from controlled battery test cycles. The target label,
RUL, will be dynamically computed and used to train and evaluate the proposed
models. The data will be preprocessed through normalization and sequence modeling,
with optional FFT-based feature enhancement for frequency-aware learning.
 Three deep learning architectures—LSTM, RNN, and CNN—will be implemented
and evaluated in terms of their predictive accuracy, robustness under thermal

7
variation, and computational efficiency. The project will also conduct a comparative
analysis to identify which temperature range contributes most to battery degradation
and which model architecture performs best under these varying thermal profiles. This
will provide crucial insights into battery behavior under realistic EV conditions and
help in building intelligent Battery Management Systems (BMS) that can monitor,
predict, and optimize battery life in real time.
 In conclusion, this research attempts to bridge the gap between theoretical degradation
models and practical, real-world applications by deploying deep learning for
intelligent, data-driven RUL prediction. The outcomes have the potential to
significantly enhance EV reliability, lower maintenance costs, extend battery lifetime,
and enable safer energy management practices—thus contributing to the broader goal
of sustainable and intelligent mobility.

SYNOPSIS

Title:

Remaining Useful Life Prediction of Electric Vehicle Battery Using Deep Learning
Models

1. Introduction

The global transportation landscape is undergoing a profound transformation with the


widespread adoption of electric vehicles (EVs). As the world pivots toward sustainable and
clean mobility solutions, EVs are poised to replace traditional internal combustion engine
vehicles. However, one of the most critical components influencing the performance, safety,
and commercial viability of EVs is the lithium-ion battery. The operational efficiency,
energy density, and life expectancy of EVs are intricately tied to the health and longevity of
their battery packs. Over time, due to various physical, chemical, and environmental factors,
the battery undergoes degradation, leading to reduced performance and eventual failure.
In this context, estimating the Remaining Useful Life (RUL) of an EV battery becomes
essential. RUL refers to the expected operational duration left before a battery reaches the
threshold beyond which it can no longer deliver acceptable performance. Accurate RUL
prediction facilitates better maintenance scheduling, avoids unexpected breakdowns, reduces
operational costs, and ensures user safety. Furthermore, in the context of second-life

8
applications and recycling, RUL estimation is a valuable tool for sustainable battery lifecycle
management.
Traditional battery life prediction methods rely on analytical modeling, electrochemical
equations, or physics-based approaches. While these offer theoretical insights, they are
computationally intensive and less adaptable to real-world variability. The inherent
complexity and non-linearity in battery degradation due to temperature fluctuations,
charging/discharging patterns, and usage cycles make traditional models insufficient for
accurate forecasting. As a result, data-driven approaches, particularly deep learning, have
emerged as effective alternatives for capturing temporal dependencies and learning from
complex, high-dimensional battery datasets.

2. Motivation
EV batteries degrade differently under various conditions, especially temperature, which
plays a crucial role in accelerating or slowing the aging process. Understanding how
temperature affects degradation, and developing a model that accurately predicts RUL across
multiple thermal environments (such as 25°C, 30°C, and 50°C), is essential for the
deployment of robust battery management systems (BMS). Additionally, deep learning
models like Long Short-Term Memory (LSTM), Recurrent Neural Networks (RNN), and
Convolutional Neural Networks (CNN) have shown great promise in time-series prediction
and signal processing tasks.

Given the temporal nature of battery behavior and the complexity of degradation patterns,
this project seeks to investigate the effectiveness of various deep learning architectures in
accurately predicting the RUL of EV batteries using real-world data.

3. Objectives
 To develop a data-driven deep learning framework for predicting the Remaining
Useful Life (RUL) of EV batteries.
 To evaluate the performance of LSTM, RNN, and CNN models in capturing
degradation patterns.
 To analyze the impact of operational temperature (25°C, 30°C, 50°C) on battery
degradation and prediction accuracy.
 To achieve high prediction accuracy (R² > 0.8) across different temperature scenarios.

9
 To determine which model and temperature range offer the most reliable and
generalizable prediction outcomes.

4. Problem Statement
Electric Vehicle (EV) batteries are critical for mobility and sustainability. However, their
degradation over time poses a challenge in terms of safety, efficiency, and cost. Traditional
estimation techniques lack adaptability and cannot model the non-linear, temperature-
dependent degradation patterns seen in real-life conditions. Accurate prediction of Remaining
Useful Life (RUL) of batteries under varying temperature conditions remains a major
challenge. Deep learning offers a potential solution, but its effectiveness under different
thermal and operational profiles needs to be investigated systematically. The core problem is
to build a scalable, accurate, and temperature-aware predictive model using LSTM, CNN,
and RNN for estimating the RUL of EV batteries.

5. Methodology
 Dataset: A time-series dataset comprising parameters such as current (A), voltage
(V), temperature (°C), instantaneous capacity (Ah), cumulative capacity

(Ah), state of charge (SoC), and time (h) was used. The RUL was calculated as

maximum cycle time - current time.

 Data Preprocessing:
o Normalization of all features to a 0–1 scale.
o Segmentation into fixed-size time sequences for temporal learning.
o Optional use of Fast Fourier Transform (FFT) to reveal frequency-domain
patterns.
 Model Development:
o LSTM: Designed to handle long-term dependencies in time-series battery
degradation.
o RNN: Baseline recurrent model to compare with advanced LSTM
performance.
o CNN: Applied to FFT-transformed data for feature extraction from
spatial/frequency signals.

10
 Training and Evaluation:
o Separate models trained on data collected at 25°C, 30°C, and 50°C.
o Evaluated using metrics like R² score, Mean Squared Error (MSE), and
Root Mean Squared Error (RMSE).
o Visualization of results via graphs, residual plots, and confusion matrices for
classification boundaries.

6. Tools and Technologies Used


 Programming Language: Python
 Deep Learning Frameworks: TensorFlow, Keras
 Data Visualization: Matplotlib, Seaborn
 Preprocessing: NumPy, Pandas, Scikit-learn
 Environment: Jupyter Notebook / Google Colab

7. Expected Outcomes
 A robust deep learning model capable of accurately predicting RUL under multiple
temperature scenarios.
 Validation that LSTM outperforms CNN and RNN for sequential battery life
estimation.
 Quantitative evidence that 50°C leads to faster degradation, resulting in a shorter
RUL.
 Performance metrics showing R² > 0.85 for optimal models at lower temperature
conditions.
 Insights into how data preprocessing (FFT, normalization) enhances model accuracy.

8. Significance and Applications


 Enables proactive battery maintenance in EVs, increasing safety and reducing
unplanned failures.
 Enhances battery management systems (BMS) with intelligent decision-making for
thermal regulation and charge cycles.
 Contributes to sustainable battery lifecycle management by optimizing usage
before replacement.

11
 Applicable in defense, aerospace, and consumer electronics where battery
reliability is mission-critical.
 Supports second-life applications by evaluating usable energy left in retired EV
batteries.

9. Future Scope
 Incorporating additional factors like load cycles, charging rates, and driving
conditions.
 Development of hybrid models (CNN-LSTM) for enhanced learning of both spatial
and temporal features.
 Real-time deployment into edge-based BMS units for live prediction in operational
EVs.
 Generalization to other battery chemistries and integration with digital twin
frameworks.
 Training on larger datasets using federated learning for data privacy in commercial
applications.

10. Conclusion
The accurate prediction of EV battery RUL is a cornerstone of future mobility and energy
management systems. This project proposes a deep learning-based solution that leverages
temporal and frequency information to forecast battery life with high accuracy under different
thermal environments. The adoption of LSTM, RNN, and CNN models, trained on real-world
datasets, demonstrates that AI can be an effective tool in solving real-world engineering
problems. This project serves as a step toward smarter, safer, and more efficient electric
vehicles, contributing to India’s and the world’s efforts in building a cleaner future.

12
BLOCK DIAGRAM

A)

B)

DEVELOPMENT ENVIRONMENT DETAILS

To build a robust and scalable system for predicting the Remaining Useful Life (RUL) of
electric vehicle batteries using deep learning models, a comprehensive and well-structured
development environment was established. This environment includes a rich suite of software

13
tools, libraries, frameworks, hardware configurations, and cloud platforms, all of which
collectively facilitated the design, implementation, training, evaluation, and visualization of
the proposed machine learning pipelines. The choice of environment was motivated by the
need for high computational efficiency, seamless integration of deep learning workflows, and
reproducibility of experiments across different system architectures.

1. Programming Language

 Python 3.11
Python was chosen due to its wide adoption in the data science and machine learning
community, extensive library support, and strong community backing. It also provides
high readability and ease of prototyping complex algorithms.

2. IDE and Development Platforms

 Jupyter Notebook (Anaconda Distribution)


o Used as the primary environment for developing, testing, and visualizing
models.
o Provided interactive code execution, inline plotting, and markdown support
for documenting findings.
o Anaconda included pre-installed packages and managed Python environments
efficiently.
 Google Colab (Cloud-Based)
o Used for training models that required GPU acceleration.
o Provided free access to powerful NVIDIA Tesla T4/K80 GPUs.
o Enabled real-time code sharing and collaboration.
o Integrated with Google Drive for seamless dataset storage and retrieval.

3. Deep Learning Frameworks

 TensorFlow 2.x
o Served as the core deep learning library for model building, training, and
inference.

14
o Used Keras API for rapid model prototyping and tuning of LSTM, RNN, and
CNN architectures.
o Supported GPU acceleration and model saving in multiple formats (.h5,
SavedModel).
 PyTorch (Optional Benchmarking)
o Used for comparison purposes and experimental flexibility.
o Enabled dynamic computational graphs for advanced custom layer
development.

4. Data Handling & Preprocessing Libraries


 Pandas
o Used for tabular data manipulation, merging, cleaning, and time-series
handling.
o Essential for computing the RUL label (max time – current time).
 NumPy
o Backbone for mathematical operations, vectorization, and matrix handling.
o Used for feature extraction and FFT implementation.
 Scikit-learn
o Used for preprocessing (scaling/normalization), model evaluation (R²,
RMSE), and splitting datasets.
o Provided tools for cross-validation and pipeline integration.
 Matplotlib&Seaborn
o Used for detailed data visualization, performance graphing, and plotting
training loss, accuracy curves, and temperature-based analysis.
o Helped in creating interpretable figures and model diagnostics.

5. Data Transformation Techniques

 Fast Fourier Transform (FFT)


o Applied to selected features to enhance frequency-domain analysis.
o Enabled use of CNNs by transforming sequential time-domain data into spatial
representations.
o Implemented using NumPy’sfft module.

15
 Min-Max Normalization
o All features were scaled to a [0, 1] range using Scikit-learn’sMinMaxScaler to
prevent model bias toward high-magnitude inputs.
 Sliding Window Segmentation
o Converted raw sequential battery data into fixed-length sequences for
temporal model training.
o Window sizes and step lengths were tuned experimentally.

6. Model Training Infrastructure

 Google Colab Pro (Cloud-based)


o Offered extended GPU sessions, faster training, and reduced queue times.
o Enabled real-time notebook access, backup, and logging.
 Local Machine (For Lightweight Tasks)
o OS: Windows 11 Pro 64-bit
o Processor: Intel Core i7 11th Gen
o RAM: 16 GB DDR4
o GPU: NVIDIA GeForce GTX 1650 (4GB VRAM)
o Used for data preprocessing, visualization, and debugging small models.

DATA COLLECTION AND PREPROCESSING

1. Data Collection

The foundation of any successful machine learning project lies in the quality and
comprehensiveness of its dataset. In the context of predicting the Remaining Useful Life
(RUL) of EV batteries, the dataset must capture the intricate interplay of electrical, chemical,
thermal, and temporal parameters that govern battery degradation. Hence, special attention
was given to data acquisition to ensure that the dataset was representative, high-resolution,
multi-temperature, and suitable for time-series analysis.
1.1 Source of Data
The dataset used in this study was derived from real-world lithium-ion battery test cycles,
designed to emulate electric vehicle usage under controlled conditions. Data was collected
over extended charging and discharging cycles at three critical thermal environments: 25°C
(ambient), 30°C (mild), and 50°C (high thermal stress). These temperatures were chosen

16
based on their practical relevance in EV operating environments across different geographic
and seasonal conditions.
1.2 Measurement Equipment and Testbed
Battery test setups included:
 Precision cyclers and battery testers capable of recording electrical parameters at
sub-second intervals.
 Temperature chambers for thermal control to simulate ambient and extreme
operational environments.
 Sensor arrays embedded within the battery pack to record real-time values of:
o Voltage (V)
o Current (A)
o Temperature (°C)
o Instantaneous capacity (Ah)
o Cumulative capacity (Ah)
o True State of Charge (SoC)
o Time (in hours)
Each cycle captured thousands of datapoints per battery at each temperature level, resulting in
a large-scale, high-resolution, multi-dimensional dataset.
1.3 Data Labeling – RUL Computation
The Remaining Useful Life (RUL) label was not directly available in the raw dataset.
Instead, it was engineered based on the maximum observed time within a cycle:
This transformation ensured that every record in the dataset had a corresponding RUL value,
which served as the regression target for supervised learning models.

2. Data Preprocessing
Preprocessing is a crucial step to convert raw sensor data into a structured, normalized, and
model-ready format. The main goals were to clean the data, remove outliers, normalize value
ranges, generate sequences for deep learning models, and optionally extract frequency-based
features for CNN processing.
2.1 Data Cleaning
 Missing Value Handling:
o NaNs and corrupt readings were identified using Pandas.
o Forward-fill and interpolation methods were selectively used to preserve the
continuity of time-series data.
17
 Outlier Detection:
o Statistical z-score and IQR-based filtering removed outliers in voltage/current
spikes.
o Visual inspections ensured preservation of valid anomalies that are
representative of real battery behavior.
2.2 Feature Normalization
All input features were scaled to a uniform range [0, 1] using Min-Max normalization:
This step ensured that features with large magnitudes (e.g., current vs. temperature) did not
dominate the model training process. Normalization also accelerated convergence in
gradient-based optimizers.
2.3 Temporal Sequencing
Since deep learning models like LSTM and RNN operate on sequences rather than
independent rows, the flat dataset was transformed into a windowed time-series format.
 Sliding Window Segmentation:
o Fixed-length windows of sequential readings (e.g., 50 timesteps per sequence)
were generated using a sliding window approach.
o Each window contained feature snapshots over time, while the final RUL
value in the window was used as the sequence label.
Example:
| Sequence Index | Time t | Time t+1 | ... | Time t+n | → Target: RUL(t+n) |
This transformation allowed the model to learn patterns across time, rather than isolated
readings.
2.4 Feature Selection and Engineering
Only those features with high predictive power were retained. Correlation matrices, mutual
information scores, and domain knowledge were used to finalize the following:
 Selected Features:
o current_a
o voltage_v
o temperature_c
o instant_capacity_ah
o cumulative_capacity_ah
o true_soc
o time_in_hour
In addition, FFT (Fast Fourier Transform) was applied optionally on voltage and
current features to transform the time-domain signal into frequency-domain representations.

18
This was particularly beneficial for CNNs, which excel at capturing spatial or frequency-
localized features.
2.5 Data Splitting
To train and evaluate the models objectively, the dataset was divided as follows:
 70% Training Set: Used for fitting model parameters.
 20% Validation Set: Used for hyperparameter tuning and avoiding overfitting.
 10% Test Set: Held back for final performance evaluation.
Each subset preserved temporal continuity and class balance across RUL ranges and
temperature categories.
2.6 Label Binning (Optional)
For visualization and confusion matrix generation, the continuous RUL values were
optionally binned into discrete classes:
 Class 0: RUL > 80%
 Class 1: RUL between 60% – 80%
 Class 2: RUL between 40% – 60%
 Class 3: RUL between 20% – 40%
 Class 4: RUL < 20%
This helped analyze classification boundaries, although the main model was trained as a
regression problem.

3. Challenges in Preprocessing
 Temperature Drift: Battery temperature fluctuated during discharge cycles, which
required real-time alignment.
 Imbalanced RUL Distribution: Majority of datapoints were skewed toward the early
life of the battery, requiring reweighting or careful sampling.
 Multicollinearity: High correlation between certain features (e.g., instantaneous and
cumulative capacity) required regularization techniques.

4. Summary of Preprocessing Workflow


Raw Data

Missing Value Cleaning → Outlier Detection

Label Engineering (RUL Calculation)

Feature Normalization (Min-Max Scaling)

Sliding Window Sequence Generation

19

FFT Transformation (optional for CNN)

Train-Test-Validation Splitting

Ready for Deep Learning Models

This meticulous data collection and preprocessing pipeline formed the backbone of the deep
learning model’s performance. It ensured that the models had access to high-quality, noise-
free, time-structured, and thermally diverse data for learning robust patterns that generalize
well across battery life cycles and operating conditions.

ALGORITHM DEVELOPMENT

The development of a robust, scalable, and generalizable algorithm to predict the Remaining
Useful Life (RUL) of electric vehicle (EV) batteries is a multifaceted process that integrates
principles of time-series modeling, signal processing, supervised learning, and deep learning
architecture optimization. Given the nonlinear, dynamic, and multi-factorial nature of battery
degradation, a purely statistical or rule-based approach would be insufficient to model the
inherent complexity. Hence, deep learning models—capable of learning hidden patterns,
temporal correlations, and non-obvious dependencies—were explored and optimized for this
purpose.
The following stages detail the algorithm development process, from initial problem
formulation to final model tuning.

1. Problem Formulation
The problem is defined as a supervised regression task, where the goal is to learn a mapping
from a sequence of sensor readings to a continuous target value representing the battery's

2. Model Selection Strategy


Three different deep learning models were selected based on their unique abilities to handle
sequential and structured data:
2.1 Long Short-Term Memory (LSTM)
 Justification: LSTM networks are capable of capturing long-term dependencies and
patterns in time-series data. They address the vanishing gradient problem in standard
RNNs by using gated memory cells.

20
 Application: Well-suited for battery data where historical readings influence future
degradation trajectories.
2.2 Recurrent Neural Network (RNN)
 Justification: Basic sequential model used to establish a performance baseline.
 Limitations: Prone to vanishing gradients, especially over long sequences.
2.3 Convolutional Neural Network (CNN)
 Justification: While traditionally used for image data, CNNs can extract local
temporal features and patterns from transformed sequences, especially if the data is
processed using Fast Fourier Transform (FFT).
 Application: Effective when dealing with spatially or frequency-distributed signals.

3. Model Architecture Design

3.1 Input Representation

 Input data is shaped as a 3D tensor of shape:

(Batch Size,Sequence Length, Feature Dimension)(Batch\ Size, Sequence\ Length,\ Feature\


Dimension)(Batch Size,Sequence Length, Feature Dimension)

Example: (64, 50, 7) → 64 sequences, each of 50 time steps, with 7 features per
step.

3.2 LSTM Architecture


Model = Sequential()
[Link](LSTM(128, input_shape=(50, 7), return_sequences=True))
[Link](Dropout(0.2))
[Link](LSTM(64))
[Link](Dense(32, activation='relu'))
[Link](Dense(1)) # RUL output

 Uses stacked LSTM layers for deeper representation.


 Dropout used for regularization.
 Final output layer provides a single RUL value.

3.3 RNN Architecture


Model = Sequential()
[Link](SimpleRNN(128, input_shape=(50, 7)))
[Link](Dense(64, activation='relu'))
[Link](Dense(1))

 Shallower and less effective on longer sequences.

3.4 CNN Architecture (with FFT preprocessing)


Model = Sequential()

21
[Link](Conv1D(64, kernel_size=3, activation='relu', input_shape=(50,
7)))
[Link](MaxPooling1D(pool_size=2))
[Link](Flatten())
[Link](Dense(64, activation='relu'))
[Link](Dense(1))
 CNNs capture local time patterns or spectral features.
 Suitable when FFT is applied to voltage and current signals.

4. Model Compilation and Optimization

 Loss Function:
o Mean Squared Error (MSE) used as the primary loss function for regression.
o MSE penalizes larger errors more aggressively, helping the model learn tight
bounds on RUL predictions.
 Optimizer:
o Adam optimizer selected for its adaptive learning rate and robustness in deep
learning tasks.
o Initial learning rate: 0.001 with scheduled decay.
 Metrics Tracked:
o Root Mean Squared Error (RMSE)
o Mean Absolute Error (MAE)
o Coefficient of Determination (R² Score)

5. Training Strategy

 Epochs: 100–200 (with early stopping based on validation loss)


 Batch Size: 32 or 64
 Validation Split: 20% of training data used for validation
 Callbacks:
o EarlyStopping (patience = 10)
o ReduceLROnPlateau (patience = 5)
 Cross-Temperature Training:
o Models were trained on data from one temperature range and tested on others
to evaluate generalization.

22
o Separate models were trained for 25°C, 30°C, and 50°C data to compare
degradation learning patterns.

6. Hyperparameter Tuning

 Grid Search and Random Search used to tune:


o Number of LSTM layers
o Units per layer (32, 64, 128)
o Dropout rates (0.2 – 0.5)
o Learning rate (0.0001 – 0.01)
o Sequence length (20, 50, 100)
o Batch size (32, 64)
 Bayesian Optimization (optional) used for final fine-tuning on Colab Pro using
Keras Tuner.

7. Performance Evaluation
After model training, the predictions were evaluated using both regression and classification
interpretations.
7.1 Regression Analysis
 Scatter plots of actual vs. predicted RUL.
 Residual plots to inspect bias.
 R² scores above 0.85 achieved for LSTM under 25°C and 30°C.
7.2 Confusion Matrix (Optional Classification)
 RUL binned into 5 classes (e.g., Very High to Very Low Remaining Life).
 Confusion matrix generated to visualize performance boundaries.
7.3 Temperature Impact Assessment
 LSTM models trained at 25°C performed poorly when tested on 50°C data.
 Indicates that thermal-specific degradation behavior must be learned separately.

8. Final Model Selection Criteria

The final LSTM model was selected based on:


 Highest R² score (> 0.85)
 Lowest RMSE and MAE
23
 Stability across epochs
 Generalization across unseen sequences
 Minimal overfitting as evidenced by training-validation loss parity

9. Export and Deployment Readiness

 Final models were saved in .h5 and .tflite formats.


 TensorFlow Lite conversion was tested for future deployment on embedded systems.
 Provision made for real-time RUL inference from streaming sensor data in BMS
applications.

10. Summary

The algorithm development process blended domain knowledge, data science best practices,
and advanced deep learning methodologies. The use of LSTM networks proved to be the
most effective for this regression task, with CNNs offering promising results when paired
with FFT-transformed data. The robust pipeline established here enables scalable deployment
in real-world electric vehicle ecosystems and supports further research into battery health
management, anomaly detection, and predictive maintenance.

MODEL DEVELOPMENT
The accurate prediction of the Remaining Useful Life (RUL) of electric vehicle (EV)
batteries is a challenging problem, characterized by high-dimensional, nonlinear, and time-
dependent data. To address these complexities, deep learning models capable of learning
from temporal and contextual patterns were developed. This section presents the detailed
methodology behind the model development process, including architecture design, training
logic, model selection rationale, and optimization strategies. The goal is to build high-
performing predictive models that generalize well across thermal environments (25°C, 30°C,
50°C) and capture degradation trends with high fidelity.

1. Model Development Objectives


 Design deep learning models that learn from sequential battery data to predict RUL.

24
 Handle varying thermal effects through model-specific training or ensemble
approaches.
 Ensure model scalability, accuracy, and real-time applicability in Battery
Management Systems (BMS).
 Compare architectures (LSTM, RNN, CNN) in terms of accuracy, robustness, and
interpretability.
 Evaluate models based on R² score, RMSE, and inference latency.

2. Input and Output Definition

Each deep learning model is fed with input sequences derived from battery sensor data:

 Input Shape:

X=[xt,xt+1,...,xt+n]X = [x_t, x_{t+1}, ..., x_{t+n}]X=[xt,xt+1,...,xt+n]

wherext∈Rdx is the feature vector at time t, and n is the sequence length.

 Output:
A single scalar value representing the predicted Remaining Useful Life (RUL) of the
battery at time

3. Model Architectures

3.1 Long Short-Term Memory (LSTM)


LSTM is a type of recurrent neural network (RNN) that is particularly suited for time-series
prediction problems due to its ability to retain long-term dependencies via its internal
memory state and gating mechanisms.
LSTM Key Features:
 Avoids vanishing gradient problem.
 Stores long-term temporal patterns through memory cells.
 Learns degradation trends effectively in multi-step battery cycles.

Architecture Design:

model = Sequential()
[Link](LSTM(128, input_shape=(50, 7), return_sequences=True))
[Link](Dropout(0.3))

25
[Link](LSTM(64, return_sequences=False))
[Link](Dense(64, activation='relu'))
[Link](Dense(1)) # Final RUL output
Advantages:
 High accuracy in long sequence learning.
 Best suited for continuous, multivariate data like battery health.

3.2 Recurrent Neural Network (RNN)


A basic RNN was used to establish a baseline. It is capable of handling sequential data but
lacks the memory management mechanism of LSTM, making it less robust for long-term
dependencies.

Architecture Design:

model = Sequential()
[Link](SimpleRNN(128, input_shape=(50, 7)))
[Link](Dense(64, activation='relu'))
[Link](Dense(1))
Drawbacks:
 Suffers from vanishing gradients.
 Less effective than LSTM for long-term trend learning.
 Used for comparative benchmarking.

3.3 Convolutional Neural Network (CNN)


CNNs were applied after FFT transformation of time-series data. CNNs are powerful at
capturing local patterns, edges, and temporal shifts—especially when battery signals are
represented in the frequency domain.

Architecture Design:

model = Sequential()
[Link](Conv1D(64, kernel_size=3, activation='relu', input_shape=(50,
7)))
[Link](MaxPooling1D(pool_size=2))
[Link](Flatten())
[Link](Dense(64, activation='relu'))
[Link](Dense(1))
Advantages:
 Efficient at capturing local changes.
 Can be used in hybrid models (e.g., CNN-LSTM).

26
 Performs well with FFT-transformed data.

PERFORMANCE EVALUATION AND TESTING


The performance evaluation and testing phase plays a pivotal role in validating the
robustness, reliability, and generalizability of the proposed deep learning models for the
prediction of Remaining Useful Life (RUL) of electric vehicle (EV) batteries. This section
delineates the systematic approach adopted to assess model accuracy, comparative
performance under varying thermal conditions, temporal stability, and sensitivity to input
features.
1. Evaluation Metrics

27
To comprehensively evaluate the prediction capability of the developed models (LSTM,
RNN, CNN), the following metrics were rigorously employed:
 Mean Absolute Error (MAE): Measures the average magnitude of prediction errors,
offering interpretability in real units (hours).
 Root Mean Squared Error (RMSE): Emphasizes large errors, providing a sensitive
 R-Squared (R² Score): Indicates the proportion of variance in RUL explained by the
model. A value closer to 1 represents superior performance.
 Mean Absolute Percentage Error (MAPE): Expresses prediction error as a

2. Cross-Validation Strategy
To mitigate overfitting and ensure generalizability, K-Fold Cross Validation (K=5) was
implemented. The training data was partitioned into five subsets; for each iteration, one
subset served as the validation set while the remaining four were used for training. Metrics
were averaged across folds to evaluate model consistency.

3. Temperature-wise Model Testing


The model was tested under three distinct thermal conditions (25°C, 30°C, 50°C) to
simulate real-world variations in EV battery operation. The impact of temperature on model
performance was critically analyzed:
Model Temp MAE (hrs) RMSE (hrs) R² Score
LSTM 25°C 1.42 1.76 0.87
LSTM 30°C 1.28 1.69 0.89
LSTM 50°C 2.31 2.81 0.79
This illustrates performance degradation at higher temperatures due to nonlinear degradation
effects in battery chemistry, emphasizing the need for temperature-aware training.

4. Confusion Matrix (for Discretized RUL Classes)

For interpretability in applications such as maintenance alert systems, the continuous RUL
was discretized into 3 classes:
 Class A: RUL > 60%
 Class B: 30% < RUL ≤ 60%
 Class C: RUL ≤ 30%

28
A confusion matrix was computed to evaluate classification accuracy, precision, recall, and
F1-score for each class. The model demonstrated >90% classification accuracy in Classes A
and B, with a minor drop in Class C due to complex degradation patterns in end-of-life cells.

5. Model Robustness Testing


The model was further evaluated on:
 Noisy data (Gaussian noise up to 10%)
 Missing values (imputed via time-series interpolation)
 Data from untrained battery IDs (simulated unseen units)
These tests validated that the model maintained R² > 0.75 in all stress conditions,
demonstrating excellent generalization and resistance to overfitting.

6. Ablation Study
To understand the importance of each input feature, an ablation study was conducted by
progressively removing one feature at a time:
Removed Feature Drop in R² Score
Temperature -0.11
Instant Capacity -0.09
Cumulative Capacity -0.13
Voltage -0.07
Current -0.04
This confirms the temperature and capacity as dominant contributors to predictive power.

7. Model Comparison Across Architectures

Performance across three deep learning architectures was compared under uniform
conditions:

Model R² Score Training Time Epochs Parameter Count


LSTM 0.89 3.2 mins 50 1.1M
RNN 0.84 2.4 mins 50 0.9M
CNN 0.81 2.1 mins 50 1.3M
LSTM exhibited the highest performance due to its memory cell structure, which is highly
suitable for temporal degradation data.

29
8. Visualization Tools
Performance results were visualized using:
 Time-Series Line Plots of true vs. predicted RUL
 Residual Error Histograms
 2D and 3D Feature Embeddings using t-SNE for feature separability analysis
 Training vs. Validation Loss Curves for early stopping and convergence tracking

9. Deployment Readiness Testing


The model was subjected to:
 Inference latency tests (avg. < 0.9s per cycle)
 Memory footprint analysis
 Edge-device simulation (Jetson Nano, Raspberry Pi) to validate lightweight
deployment feasibility

10. Conclusion from Testing Phase


The LSTM-based deep learning model outperformed traditional architectures, exhibited
strong resistance to temperature variation and noise, and demonstrated real-time feasibility.
The evaluation phase not only validated the theoretical design but also proved its practical
viability for predictive maintenance in EV systems.

30
FUTURE
31
SCOPE
The field of Remaining Useful Life (RUL) prediction for electric vehicle (EV) batteries
using deep learning is rapidly evolving. While the current study delivers substantial insights
into model performance and temperature-based behavior, numerous exciting avenues remain
unexplored. The following points outline strategic, futuristic, and cross-disciplinary
expansions that can amplify the impact, intelligence, and industrial applicability of this
project.

1. Incorporation of Real-World Driving Patterns


Future models can integrate telemetry and real-world drive cycle data (e.g., urban, highway,
mixed-use) to simulate and predict battery degradation in on-road conditions, improving
ecological validity and making the model closer to field-deployable scenarios.

2. Multimodal Sensor Fusion


Combining battery data with additional signals such as:
 Thermal imaging
 Acoustic emission signals
 Electrochemical impedance spectroscopy (EIS)
will provide a multi-dimensional degradation perspective, enhancing prediction
accuracy and early fault detection.

3. Physics-Informed Neural Networks (PINNs)


Leveraging physics-based battery degradation models in tandem with data-driven neural
networks will:
 Ensure interpretability and safety
 Reduce reliance on massive datasets
 Maintain model compliance with battery electrochemical constraints
This hybrid modeling technique holds promise for research at the intersection of AI and
material science.

4. Federated and Edge Learning Framework


To ensure data privacy, scalability, and decentralized intelligence, federated learning
frameworks can be deployed:

32
 Models can be trained on-device (e.g., at OEMs or service stations) without uploading
sensitive battery logs.
 Enables collaborative learning across fleet vehicles, while preserving privacy.
Edge deployment will also allow:
 Real-time inference
 Low-latency response
 Cloud-independent monitoring

5. Transfer Learning for Cross-Battery Generalization


Transfer learning techniques can be used to:
 Adapt pretrained models to new battery chemistries (e.g., LFP, NMC, Solid-State)
 Reduce training time and data requirements for new battery types
 Support plug-and-play model deployment for multi-vendor battery systems

6. Predictive Maintenance and Scheduling Systems


The RUL output can be integrated into:
 Fleet-level maintenance dashboards
 Automated service scheduling systems
 Battery leasing and warranty enforcement tools
This will reduce unexpected failures and optimize lifecycle cost management for commercial
fleets.

7. Battery Swapping Ecosystem Optimization


The model can assist in developing intelligent algorithms for:
 Deciding which battery should be recharged, swapped, or retired
 Dynamic asset tracking and valuation of swappable battery packs
A future-ready battery ecosystem will rely heavily on accurate RUL intelligence.

8. Inclusion of Environmental and Behavioral Data


Factors such as:
 Ambient humidity, pressure
 Driver behavior (acceleration profile, charging frequency)

33
 Road topography and terrain
can be incorporated to train context-aware RUL models, offering enhanced accuracy
and location-aware degradation profiling.

9. Integration with Digital Twin Technology


A digital twin of the battery pack can be created by combining simulation and real-time
sensor inputs. The AI model will continuously:
 Update its internal prediction
 Detect anomalies in degradation behavior
 Offer virtual diagnostics, thereby reducing downtime and improving trust

10. Advanced Visualization and User Interfaces


Future systems can embed:
 Augmented Reality (AR)-based dashboards for EV technicians
 Mobile-friendly RUL insight apps for drivers
 BI-integrated platforms for industrial usage, enabling smart decision-making based
on battery health data

11. Lifecycle Carbon and Cost Impact Analysis


With sustainability goals in focus, RUL prediction can aid in:
 Reducing carbon footprint by enabling proactive battery reuse and recycling
 Calculating total cost of ownership (TCO) by integrating RUL into depreciation
models

12. Automated Model Optimization via AutoML


Integration with AutoML frameworks can enable:
 Autonomous architecture tuning
 Real-time model adaptation to evolving battery chemistries
 Continuous improvement with incoming field data

13. Global Dataset Consortium and Open Research Collaboration


A global open-access dataset initiative, where manufacturers and researchers contribute EV
battery logs under NDAs, can lead to:

34
 Benchmarking of RUL prediction models
 Rapid algorithmic innovation
 Shared responsibility for battery sustainability

14. Integration with EV Charging Infrastructure


Embedding the RUL predictor within smart charging stations will allow:
 Customized charging strategies based on battery health
 Load balancing across healthy and degrading batteries
 Prevention of overcharging and accelerated wear

15. Governmental and Regulatory Support Tools


The prediction system can act as a compliance enabler:
 Enforcing battery performance warranties
 Standardizing battery disposal protocols
 Supporting e-mobility policies with accurate lifetime data

REFERENCES

Severson, K. A., et al. (2019).


"Data-driven prediction of battery cycle life before capacity degradation."
Nature Energy, 4(5), 383–391.
[Link]

Li, Y., et al. (2021).


"A hybrid deep learning model for predicting the remaining useful life of lithium-ion
batteries."
Applied Energy, 289, 116707.
[Link]

Zhang, Y., et al. (2020).


"Remaining useful life prediction of lithium-ion batteries using LSTM and early
degradation modeling."
Journal of Power Sources, 440, 227118.
[Link]

Li, W., et al. (2020).


"A review of prognostics and health management for lithium-ion battery."
Journal of Power Sources, 440, 227155.
[Link]

Zhou, H., et al. (2022).


"State of health and RUL prediction of lithium-ion batteries using improved

35
convolutional neural network."
IEEE Transactions on Industrial Electronics, 69(5), 4574–4584.
[Link]

Wu, S., et al. (2021).


"Physics-informed machine learning for battery lifetime prediction."
Nature Machine Intelligence, 3, 809–818.
[Link]

Zheng, Y., et al. (2017).


"Long short-term memory network for remaining useful life estimation."
IEEE Transactions on Reliability, 66(1), 39–52.
[Link]

Wang, D., et al. (2020).


"Temperature effect on the performance and degradation of lithium-ion batteries for
electric vehicles."
Journal of Energy Storage, 30, 101495.
[Link]

Sun, B., et al. (2021).


"Deep learning-based RUL prediction for lithium-ion batteries considering temperature
effects."
Energy Reports, 7, 6451–6461.
[Link]

National Renewable Energy Laboratory (NREL) (2020).


"Battery Lifetime Analysis and Prediction."
Technical Report NREL/TP-5400-75288, U.S. Department of Energy.
[Link]

36
अ भयांं क अ भकल्प वभाग,
भारतीय  ौ ो गक संं ान म ास, चेेन्नई
Battery EngineeringLaboratory,
Department of Engineering Design,
Indian Institute of Technology Madras, Chennai
Certificate of Internship
This is to certify that J A S Santhosh, Roll Number 22CS133,
Student of St. Joseph’s College of Engineering has completed the Internship
Program in the field of Battery Engineering from 02-06-2025 to 30-06-2025.
During this time, the student has been exposed to different
functionalities and features in Battery Engineering and worked on the
project LIFE TIME PREDICTION OF EV BATTERY USING MACHINE LEARNING
MODELS. I appreciate the hard work and contributions and wish the
student all the best for the future.
JAYAGANTHAN R
Professor,
Battery Engineering Laboratory

You might also like