Diabetes Detection Using Deep Learning Algorithms 2018
Diabetes Detection Using Deep Learning Algorithms 2018
com
ScienceDirect
ICT Express 4 (2018) 243–246
www.elsevier.com/locate/icte
Abstract
Diabetes is a metabolic disease affecting a multitude of people worldwide. Its incidence rates are increasing alarmingly every year. If untreated,
diabetes-related complications in many vital organs of the body may turn fatal. Early detection of diabetes is very important for timely treatment
which can stop the disease progressing to such complications. RR-interval signals known as heart rate variability (HRV) signals (derived from
electrocardiogram (ECG) signals) can be effectively used for the non-invasive detection of diabetes. This research paper presents a methodology for
classification of diabetic and normal HRV signals using deep learning architectures. We employ long short-term memory (LSTM), convolutional
neural network (CNN) and its combinations for extracting complex temporal dynamic features of the input HRV data. These features are passed
into support vector machine (SVM) for classification. We have obtained the performance improvement of 0.03% and 0.06% in CNN and CNN-
LSTM architecture respectively compared to our earlier work without using SVM. The classification system proposed can help the clinicians to
diagnose diabetes using ECG signals with a very high accuracy of 95.7%.
⃝c 2018 The Korean Institute of Communications and Information Sciences (KICS). Publishing Services by Elsevier B.V. This is an open access
article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Keywords: Deep learning; Diabetes; Heart rate variability; ECG; CNN; LSTM
Fig. 1. (A) and (B) Sample heart rate signal for normal and diabetic subject.
4. Description of dataset
The Electrocardiograms (ECG) of 20 people each from
the diabetes and normal group were collected for 10 min
with people lying down in a relaxed supine position. The
heart rate time series data is derived from ECG signals using
Pan and Tompkins algorithm. This real-time algorithm can
effectively detect QRS complexes in an ECG signal based on
its morphological features like slope, amplitude and width. It
involves processes like digital bandpass filtering (to reduce
false detections due to noise) and thresholding operations (to
increase detection sensitivity). The ECG signal is sampled at
500 Hz. 71 datasets (same number for diabetic and normal
group) each were extracted from the recorded data. Each dataset
contains 1000 number of samples. The input data (Fig. 1) is Fig. 2. Overview of proposed architecture.
passed to deep learning algorithms without any further pre-
processing.
6. Experiments and results
5. Proposed architecture All experiments are run on GPU enabled TensorFlow [10]
An overview of proposed architecture is shown in Fig. 2. with Keras [11] framework. In this work, we use the same
This is composed of 3 main sections. In input layer, the heart configuration that we had used in our early paper [3]. In
rate variability of raw ECG signal are given as input to deep this work, we extract features in deep learning network, com-
learning architecture. This contains 5 CNN layers and each prised of CNN-LSTM architecture and pass into SVM for
layer follows maxpooling. First two CNN layers contain 64 and classification. LSTM has the capability to handle long-term
128 filters with filter length 3 and maxpooling with pooling dependencies in a data sequence. To decide the kernel function,
length 2. Next two CNN layers contain 256 and 512 filters we run two trail of experiment for SVM with linear and RBF
with filter length 3 and maxpooling with pooling length 4. A kernel. SVM with RBF kernel performed better. These SVM
model are implemented using Scikit-learn. The detailed 5-fold
last CNN layer contains 1024 filters with filter length 3 and
cross-validation accuracy is reported in Table 2. In almost
maxpooling with pooling length 6. This feature map is passed
all the network structures, SVM has performed better in 5-
into LSTM layer. LSTM contains 70 memory blocks which
fold cross-validation with accuracy which is comparable to the
learns the time domain features. This follows dropout 0.1.
fully connected linear with nonlinear activation function for
This randomly removes the neurons along with its connections. classification. Thus, we claim that the combination of SVM in
Finally, the features are passed into SVM for classification. The penultimate layer for classification with deep learning layers for
SVM used RBF kernel. This is defined as follows with samples feature extraction can achieve the best performance.
s and s1
( )
∥s − s1∥2 7. Conclusion and future work
K (s, s1) = exp (1)
2σ 2 Considerable part of human population is under the grip of
diabetes which is incurable. If not managed well, diabetes can
246 Swapna G., Vinayakumar R., Soman K.P. / ICT Express 4 (2018) 243–246