Article
Ensemble Deep Learning for Cervix Image Selection
toward Improving Reliability in Automated Cervical
Precancer Screening
Peng Guo 1,*, Zhiyun Xue 1, Zac Mtema 2, Karen Yeates 3,4,5, Ophira Ginsburg 6, Maria Demarco 7,
L. Rodney Long 1, Mark Schiffman 7 and Sameer Antani 1
1 Communications Engineering Branch, Lister Hill National Center for Biomedical Communications,
U.S. National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA;
[Link]@[Link] (Z.X.); rlong@[Link] (L.R.L.); santani@[Link] (S.A.)
2 Ifakara Health Institute, P.O. Box 53, Ifakara, Tanzania; zacmtema@[Link]
3 Department of Medicine, Queen’s University, Kingston, ON K7L3N6, Canada; yeatesk@[Link]
4 School of Global Public Health, New York University, New York, NY 10012, USA
5 Pamoja Tunaweza Research Centre, Moshi, Tanzania
6 Perlmutter Cancer Center, NYU Langone Health, New York, NY 10016, USA;
[Link]@[Link]
7 Division of Cancer Epidemiology and Genetics (DCEG), National Cancer Institute,
Bethesda, MD 20892, USA; [Link]@[Link] (M.D.); schiffmm@[Link] (M.S.)
* Correspondence: [Link]@[Link]; Tel.: +1-301-827-4171
Received: 22 May 2020; Accepted: 24 June 2020; Published: 3 July 2020
Abstract: Automated Visual Evaluation (AVE) is a deep learning algorithm that aims to improve
the effectiveness of cervical precancer screening, particularly in low- and medium-resource regions.
It was trained on data from a large longitudinal study conducted by the National Cancer Institute
(NCI) and has been shown to accurately identify cervices with early stages of cervical neoplasia for
clinical evaluation and treatment. The algorithm processes images of the uterine cervix taken with
a digital camera and alerts the user if the woman is a candidate for further evaluation. This requires
that the algorithm be presented with images of the cervix, which is the object of interest, of
acceptable quality, i.e., in sharp focus, with good illumination, without shadows or other occlusions,
and showing the entire squamo-columnar transformation zone. Our prior work has addressed some
of these constraints to help discard images that do not meet these criteria. Non-cervix or otherwise
inadequate images could lead to suboptimal or incorrect results, and their manual removal is labor
intensive and time-consuming, particularly when working with large retrospective collections
acquired with inadequate quality control. In this work, we present a novel ensemble deep learning
method that determines whether an image shows the cervix to a sufficient extent, and uses this to
identify cervix and non-cervix images in a smartphone-acquired cervical image dataset. The
ensemble combines the assessments of three deep learning architectures, RetinaNet, Deep SVDD,
and a customized CNN (Convolutional Neural Network), each using a different strategy to arrive
at its decision, i.e., object detection, one-class classification, and binary classification. We examined
the performance of each individual architecture and of an ensemble of all three. An average
accuracy and F-1 score of 91.6% and 0.890,
respectively, were achieved on a separate test dataset consisting of more than 30,000 smartphone-
captured images.
Keywords: deep learning; cervical cancer; cervix/non-cervix; ensemble; one-class classification
Diagnostics 2020, 10, 451; doi:10.3390/diagnostics10070451
1. Introduction
There were nearly 570,000 new cases of cervical cancer diagnosed in 2018 according to the World
Health Organization (WHO) [1]. Visual inspection of the cervix after application of dilute (3–5%)
acetic acid (VIA) is considered a simpler and less expensive test than other cervical cancer screening
modalities. Cervical tissue whitening after acetic acid application (called
acetowhitening or AW) is an indicator of Human Papillomavirus (HPV) infection. However, not all
AW lesions are indicative of cervical intraepithelial neoplasia (CIN), and not all CIN turns white with
acetic acid application. This results in relatively poor-quality assessments using VIA: the typical
sensitivity and specificity of VIA are in the range of ~70–80% [2–4], and high subjectivity has also
been reported [5], with only a 56.8% complete decision agreement rate among 20 gynecologists over
919 cases. A breakthrough deep learning method, called Automated Visual Evaluation (AVE) [6],
was shown to successfully classify digitized cervix images, outperforming human experts on a test
dataset, especially for a key age group of women. The method was proven on a large, high-quality
collection of over 60,000 digitized images taken with a specialized SLR camera in a 7-year
longitudinal study of over 9000 women [7,8]. We recognize that future images, which are likely to
be acquired with low-cost handheld devices such as smartphones, could impact AVE performance
if the cervix images are of low quality. In response, we have embarked on a multi-national,
multi-organization collaborative effort [9] to develop and validate a smartphone-based AVE app
with enhanced training on a wide variety of images. We envisage that the app will include
pre-classification image quality assessment to ensure reliable results.
Among the various tasks necessary for reliable AVE performance, two have drawn our attention:
ensuring that the images used for processing (i) are of adequate quality; and (ii) have adequate
coverage of necessary anatomical landmarks, such as the os, the squamo-columnar junction, and the
transformation zone. Acceptable image quality can be defined in many respects, namely focus,
illumination, occlusion, and content coverage. So far, we have conducted the following peripheral
studies to support AVE: (i) focus determination: we developed a deep learning algorithm to detect
sharp cervical regions in cervix images captured using mobile devices [10]; and (ii) anatomical
landmark detection: we developed a deep learning algorithm to automatically locate and segment
the os [11]. Both of these studies [10,11] assume that the input images are images of the cervix.
However, we have observed in our visual analysis of cervical image datasets, particularly those
retrospectively collected from the records of patients receiving routine gynecological care, that they
often contain images with insufficient cervical information and/or non-cervix images. Possible
reasons include accidental camera trigger presses and use of the device for training (in which case
precise image acquisition is not crucial), among others. Several examples of non-cervix images from
a dataset are shown in Figure 1. It is necessary to remove these non-cervix images since they disrupt
model training and could adversely impact classifier performance. Manual inspection during and
after image capture can be done, but it is labor intensive, distracting during clinical procedures, and
prone to human error.
Our goal is to develop an automatic approach that labels images as cervix or non-cervix. This is a
step toward a pre-processing quality control module that would ensure that non-cervix images are
not associated with study subject data or patient data.
Figure 1. Non-cervix image samples in one collected dataset.
One of the challenges in automatically labeling these images is the large visual variety in image
content, since cervices differ morphologically with each individual's age, parity, and related
anatomical changes. Other factors creating variability include image focus, illumination, specular
reflection from the moist tissue, presence of clinical instruments (such as the speculum and swab),
and variable zoom. Secondly, our datasets are highly imbalanced, containing many more cervix
images than non-cervix images.
To the best of our knowledge, there is no previously reported study on classifying an image as
cervix or non-cervix. We propose using an ensemble of three methods, aiming to take advantage of
the strengths of each: one-class classification, binary classification, and object detection,
implemented by Deep SVDD [12], a customized CNN (Convolutional Neural Network), and
RetinaNet [13], respectively.
The rest of the manuscript is organized as follows: Section 2 describes the details of datasets used
in this study; Section 3 describes the methods; Section 4 presents the experiments and a discussion of
the results, with Section 5 offering conclusions.
2. Image Data
Four datasets are used in this study: MobileODT, Kaggle, SEVIA, and COCO2017 [14]. Of these,
MobileODT, Kaggle, and SEVIA are cervix image datasets, while the COCO dataset is of stock
photography images. We use the MobileODT, Kaggle, and COCO datasets for training and
validation, and we use the SEVIA dataset for testing. All datasets are de-identified and exempt from
IRB review. Note that the training, validation, and testing sets share no images. All these datasets
exhibit large intra- and inter-dataset variety with respect to object illumination, lighting conditions,
object color, object position, size, and other related visual factors.
2.1. MobileODT Dataset
The images in this dataset were provided by MobileODT ([Link]); they were originally collected
mainly for the purpose of documenting the colposcopy procedure using the company's hand-held
colposcope (EVA), which incorporates a mobile device for image capture. The dataset
contains 9000 images labeled by 6 experts (at Hospital Universitario de Caracas), and verified for
completeness by staff at MobileODT and the National Cancer Institute (NCI). The dataset exhibits
excellent natural variety in the appearance of women’s cervices due to age, parity, medical history,
and ethnicity. We use this image dataset for training the deep learning algorithms on the cervix
class. We apply the cervix detector from [10] to detect cervix regions and record these regions in
bounding box format as part of the annotation for training the RetinaNet framework used in our
study. There are 7984 cervix images in this dataset. All of them are used to train the Deep SVDD and
the customized CNN. Of the total dataset, 3118 images have bounding box annotations and are used
for training the RetinaNet.
2.2. Kaggle Dataset
The dataset on Kaggle [15] provided for the “Intel & MobileODT Cervical Cancer Screening
Competition” was originally released for classifying the images into three cervix types based on
changes in their visual appearance; these three types are related to the changing, age-related,
relationships between squamous and glandular epithelium of the cervix. Cervix region contours for
the images in this set are also available [16]. For RetinaNet training, we compute and store a
tight-fitting bounding box for each image, i.e., the smallest rectangle enclosing the given cervix
contour. We observe that the image quality in this set varies widely, noting the presence of motion
blur, speculum interference, specular reflection, poor lighting, and poor camera positioning, among
other issues. We use the 1481 training images from [15] in our training, and the 512 test images from
[15] in validation.
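For illustration, a minimal sketch of this bounding box computation, assuming each contour is available as an array of (x, y) points (the function name and sample values below are hypothetical):

```python
import numpy as np

def contour_to_bbox(contour: np.ndarray) -> tuple:
    """Return the tightest axis-aligned box (x1, y1, x2, y2) enclosing a contour.

    `contour` is an (N, 2) array of (x, y) points along the cervix boundary.
    """
    x1, y1 = contour.min(axis=0)
    x2, y2 = contour.max(axis=0)
    return int(x1), int(y1), int(x2), int(y2)

# Hypothetical four-point contour.
contour = np.array([[120, 340], [410, 300], [430, 620], [140, 650]])
print(contour_to_bbox(contour))  # (120, 300, 430, 650)
```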
2.3. COCO Dataset
COCO [14] is a large-scale dataset collected for the general-purpose tasks of object detection,
segmentation, captioning, etc. It contains hundreds of thousands of stock photography images
categorized into 80 classes, e.g., airplane, motorcycle, dog, and cat (examples shown in the 3rd row
of Figure 2), thereby providing us with a variety of samples whose characteristics differ from those
of cervix images and fulfilling our need for a good source of non-cervix features in training. We use
COCO2017 [14] to supplement our non-cervix image data in training and validation. Since no cervix
is present in these images, we annotate each image with (i) a coordinate sequence containing no
coordinates, indicating "absence of RoI" (for RetinaNet); and (ii) the non-cervix class label. The
quantitative details of the images used from this dataset for training and validation of each method
are shown in Table 1.
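For concreteness, a sketch of how such annotations could be stored, assuming a keras-retinanet-style CSV convention in which a row with empty coordinates and class marks an image without the object (the file and image paths are hypothetical):

```python
import csv

# Hypothetical annotation rows: (path, x1, y1, x2, y2, class_name).
rows = [
    ("mobileodt/img_0001.jpg", 120, 300, 430, 650, "cervix"),  # cervix with RoI box
    ("coco/000000000139.jpg", "", "", "", "", ""),             # non-cervix: empty RoI
]

with open("annotations.csv", "w", newline="") as f:
    csv.writer(f).writerows(rows)
```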
2.4. SEVIA Dataset
This dataset was collected across 4 regions of Tanzania by the “Smartphone-Enhanced program
of Visual Inspection with Acetic acid (SEVIA)” for cervical cancer screening in low-resource regions
[17]. The program uses the smartphone camera as a training tool to allow health providers to
maintain and strengthen their skills in VIA. This dataset is suitable for our study in four aspects: (i)
it contains a large number of images (31,967); (ii) it contains non-cervix images, and while small
percentage-wise, their number (1816) is still substantial; (iii) large image quality variation can be
observed among the cervix images, where the most common degradation is motion blur; and (iv)
many of the non-cervix images in this dataset are visually similar to actual cervix images, such as
hand-drawn illustrations of the cervix (4th row in Figure 2), or share visual features with the cervix,
such as shape, edges, and color. These images were probably included in the dataset for the purpose
of medical training. The SEVIA dataset may therefore be considered a very challenging image
collection for cervix/non-cervix classification. We use it as the test dataset.
Figure 2. Examples of images from the datasets used in this study, showing quality variability. 1st
row: MobileODT dataset; 2nd row: Kaggle dataset; 3rd row: COCO dataset; 4th row: SEVIA dataset
(only the first two images are cervix images; the remaining images in this row are non-cervix image
samples from this dataset).
3. Methods
In this study, we address the problem of cervix/non-cervix classification from two aspects: (1)
building a balanced dataset containing both cervix and non-cervix images, and (2) utilizing multiple
deep learning architectures. To address the data imbalance in the Kaggle and MobileODT datasets,
we use images from COCO2017 as our non-cervix images in training and validation. The COCO
images are randomly selected from the 80 general-domain categories and are labeled as non-cervix
regardless of their original COCO labels. Together with the cervix images from the MobileODT and
Kaggle datasets, they form balanced training and validation datasets. For testing, we use the SEVIA
dataset, which is not seen by any of the models during training. The detailed dataset description is
given in Table 1.
We propose an ensemble method that consists of three deep learning architectures: RetinaNet, Deep
SVDD, and a customized CNN. We use RetinaNet since it has been reported to achieve
state-of-the-art performance for multi-class object detection on public datasets such as ImageNet
and COCO. In addition, we use Deep SVDD, which was proposed for anomaly detection [12], also
known as the "one-class classification" problem [18]. The core of the method is to train a neural
network that encodes the common factors of variation of the training data (consisting mainly of
"normal" data) into a hypersphere of minimum volume, so that the target-class (normal) data can be
separated from counter-examples (anomalous) by the hypersphere. The concept suits the scenario in
our study, where the "anomalous" samples can take an unlimited variety of appearances. By
obtaining a good "normality description", we aim to accurately distinguish the "normal" samples
(cervix images) from the "anomalous" samples (non-cervix images). In addition to RetinaNet and
Deep SVDD, we propose a binary classifier, a sequential CNN with a simple architecture consisting
of four sequentially connected convolutional blocks followed by two fully-connected layers. The
three methods, RetinaNet, Deep SVDD, and the customized CNN, differ in several aspects, such as
the algorithm type, the network architecture, and how class labels are predicted, and are
complementary to each other. We fuse their outputs via a voting scheme, making an ensemble
method. The ensemble method, as well as each individual model, is tested on a separate image set
that is not used in training or validation. The performance of the ensemble method is compared
with that of each model. Limitations of the methods are also discussed through an analysis of the
error cases in both validation and testing. Below is a brief introduction to each architecture.
3.1. RetinaNet
We use an object detection network, RetinaNet [13], in this study. It has reportedly achieved
state-of-the-art performance for multi-class object detection on public datasets such as ImageNet
[19] and COCO [14]. It has a feature pyramid network [20] in which features are computed
separately at multiple scales and then merged through convolutional operations (top-down
pathways) (Figure 3). Furthermore, with its proposed "focal loss" function, RetinaNet is reported to
better handle class imbalance by focusing on hard, misclassified examples. To achieve faster
convergence in training, we initialize RetinaNet with pre-trained weights obtained in one of our
related studies [10], which also used a subset of the cervix images captured with a MobileODT
mobile device.
As introduced in Sections 2.2 and 2.3, to prepare annotations for training this supervised algorithm,
we (i) annotate cervix images with a class label ("cervix") and cervix bounding box coordinates; and
(ii) annotate non-cervix images with a class label ("non-cervix") and an empty coordinate sequence.
Based on the information learned from the annotated regions of interest (RoI), the network outputs
a score associated with each candidate bounding box. The score indicates the probability (0–1) that
the bounding-box-enclosed region is the desired target class. Non-maximum suppression is then
applied to merge similar bounding boxes with high IoU (Intersection over Union) values. Lastly, a
class label is generated by thresholding the score of each bounding box. If no bounding box is
predicted with a score higher than the threshold, the image is considered a non-cervix image. The
threshold is selected based on performance on the validation dataset; more details are presented in
Section 4.
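A minimal sketch of this image-level decision rule, assuming the detector returns post-NMS per-box confidence scores (the function name and the example threshold are illustrative; the actual threshold is chosen on the validation set as described above):

```python
import numpy as np

def label_from_detections(box_scores: np.ndarray, threshold: float = 0.5) -> str:
    """Label an image from RetinaNet-style detector output.

    `box_scores` holds post-NMS confidence scores of candidate cervix boxes;
    an image with no box scoring above `threshold` is treated as non-cervix.
    """
    if box_scores.size > 0 and box_scores.max() > threshold:
        return "cervix"
    return "non-cervix"

print(label_from_detections(np.array([0.12, 0.81])))  # cervix
print(label_from_detections(np.array([])))            # non-cervix
```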
Figure 3. RetinaNet workflow. The convolutional layers (blue block on the left) extract features at
different scales (blue, yellow, and green layers in the middle), followed by NMS (Non-maximum
Suppression) and thresholding.
3.2. Deep SVDD
The second classifier we use is Deep SVDD [12], a one-class classification method built with a deep
learning architecture. It learns a deep neural network transformation φ(·; W) (with weights W) that
attempts to map most of the target data representations into a "hypersphere" of minimum volume,
so that new samples belonging to the target category are mapped within this hypersphere whereas
samples belonging to any other category are mapped outside of it (Figure 4). The role of the deep
architecture is to extract features from the input images and learn common factors that represent
the target category, in our case, the cervix images. This one-class classification concept is a good fit
for our study scenario, where the "anomalous" samples can have unlimited representations.
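For reference, this intuition is made explicit by the soft-boundary Deep SVDD objective from [12], which jointly shrinks the hypersphere radius R and penalizes samples mapped outside it (ν ∈ (0, 1] trades off radius against boundary violations; the last term is weight decay over the network weights):

\[
\min_{R,\,\mathcal{W}} \; R^2 \;+\; \frac{1}{\nu n}\sum_{i=1}^{n} \max\bigl\{0,\; \lVert \phi(x_i;\mathcal{W}) - c \rVert^2 - R^2 \bigr\} \;+\; \frac{\lambda}{2}\sum_{\ell=1}^{L} \lVert W^{\ell} \rVert_F^2
\]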
We modify the original Deep SVDD structure in [12] slightly to make it suitable for encoding larger
images. The architecture proposed in [12] used LeNet [21]-type CNNs for the MNIST and CIFAR-10
datasets. Most images in these datasets have a size of 32 × 32, which is much smaller than the size of
our cellphone images. To make the structure more suitable for encoding images as large as 128 ×
128, we increase the number of units in the final dense layer. The architecture has three
convolutional modules with 32, 64, and 128 filters of size 5 × 5, respectively, and a final dense layer
of 256 units.
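A minimal PyTorch sketch of this encoder and the resulting decision rule, assuming 128 × 128 RGB inputs and bias-free layers (as in the Deep SVDD formulation, to avoid a trivial hypersphere collapse); the center c and radius R are assumed to come from training:

```python
import torch
import torch.nn as nn

class CervixEncoder(nn.Module):
    """Three conv modules (32/64/128 filters of size 5 x 5) and a 256-unit dense layer."""
    def __init__(self, rep_dim: int = 256):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, 5, padding=2, bias=False), nn.LeakyReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 5, padding=2, bias=False), nn.LeakyReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, 5, padding=2, bias=False), nn.LeakyReLU(), nn.MaxPool2d(2),
        )
        # A 128 x 128 input yields 128 feature maps of 16 x 16 after three poolings.
        self.fc = nn.Linear(128 * 16 * 16, rep_dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.fc(self.features(x).flatten(1))

def is_cervix(net: CervixEncoder, x: torch.Tensor, c: torch.Tensor, radius: float) -> bool:
    """A sample mapped inside the learned hypersphere is treated as cervix."""
    with torch.no_grad():
        dist_sq = torch.sum((net(x) - c) ** 2, dim=1)
    return bool((dist_sq <= radius ** 2).item())
```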
Figure 4. Deep SVDD [12] workflow from input data distribution to output data distribution. The
"hypersphere" (the circle in the right subfigure) is found by the algorithm as the smallest
hypersphere, with center c and radius R, that includes all (or the majority) of the target samples in
the feature space.
3.3. Customized CNN
Compared with the above two deep learning architectures, viz., Deep SVDD and RetinaNet, our
customized CNN is conceptually and architecturally simpler. Following the design of classic CNNs
such as VGG and LeNet, we build the CNN as a sequential architecture (Figure 5) consisting of four
convolutional blocks and two fully-connected layers. In each of the four convolutional blocks, ReLU
(Rectified Linear Unit) activation is used for every convolutional layer, and a max pooling layer
follows the last convolutional layer of each block. The first two blocks ("Conv block 1" in Figure 5)
consist of 2 convolutional layers each, and the last two blocks ("Conv block 2" in Figure 5) consist of
4 convolutional layers each. The two fully-connected layers consist of 64 nodes and 2 nodes,
respectively. Softmax activation is used for the last fully-connected layer. The loss function is
categorical cross-entropy and the optimization is performed via Stochastic Gradient Descent (SGD).
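A minimal Keras sketch of this architecture, assuming 256 × 256 RGB inputs (Table 2); the filter counts and 3 × 3 kernels are illustrative assumptions, as they are not specified above:

```python
from tensorflow.keras import layers, models, optimizers

def build_custom_cnn(input_shape=(256, 256, 3)):
    """Sequential CNN: four conv blocks, then 64-node and 2-node dense layers."""
    model = models.Sequential()
    # Two "Conv block 1" blocks: 2 conv layers each, max pooling after the last.
    model.add(layers.Conv2D(32, 3, padding="same", activation="relu",
                            input_shape=input_shape))
    model.add(layers.Conv2D(32, 3, padding="same", activation="relu"))
    model.add(layers.MaxPooling2D(2))
    model.add(layers.Conv2D(64, 3, padding="same", activation="relu"))
    model.add(layers.Conv2D(64, 3, padding="same", activation="relu"))
    model.add(layers.MaxPooling2D(2))
    # Two "Conv block 2" blocks: 4 conv layers each, max pooling after the last.
    for filters in (128, 256):
        for _ in range(4):
            model.add(layers.Conv2D(filters, 3, padding="same", activation="relu"))
        model.add(layers.MaxPooling2D(2))
    model.add(layers.Flatten())
    model.add(layers.Dense(64, activation="relu"))    # ReLU here is an assumption
    model.add(layers.Dense(2, activation="softmax"))  # cervix vs. non-cervix
    model.compile(optimizer=optimizers.SGD(learning_rate=1e-4, momentum=0.9),
                  loss="categorical_crossentropy", metrics=["accuracy"])
    return model
```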
Figure 5. Customized CNN (Convolutional Neural Network) architecture.
3.4. Ensemble Method
The output scores of the individual networks are obtained through different mechanisms.
RetinaNet uses sigmoid activation at the output, our customized CNN uses the softmax function,
and both limit the output scores to the range (0, 1). Deep SVDD, however, uses a CNN to encode the
input and find the "hypersphere" (radius) for distinguishing samples belonging to different
categories. The output of Deep SVDD represents the "encoded distance" of the given sample from
the center of the hypersphere, which is unbounded. Because these scores are not directly
comparable, we combine the outputs of the three methods by voting on the predicted labels (the
cervix or non-cervix label predicted by each method) rather than on the score values. The class label
that receives at least two votes is assigned to the input image as the final decision of the ensemble
method.
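A minimal sketch of this voting rule (the per-model labels are assumed to be the strings used above):

```python
from collections import Counter

def ensemble_label(retinanet: str, svdd: str, cnn: str) -> str:
    """Majority vote over three per-model labels ('cervix' or 'non-cervix').

    With three voters and two classes, one label always receives at least
    two votes, so the decision is well defined.
    """
    return Counter([retinanet, svdd, cnn]).most_common(1)[0][0]

print(ensemble_label("cervix", "non-cervix", "cervix"))  # cervix
```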
4. Experiments
4.1. Data Preparation and Implementation Details
Table 1 summarizes the data splits used in this work. Note that the number of validation images
from the MobileODT and Kaggle datasets is the same for all three methods. The total number of
validation images is adjusted by the number of COCO images used, with respect to the ratio of
cervix images to non-cervix images in training.
Table 1. Summary of data split and quantitative details of images used from each dataset.

| Dataset | Training (RetinaNet) | Training (Deep SVDD) | Training (Custom CNN) | Validation (All Methods) | Testing (All Methods) |
|---|---|---|---|---|---|
| MobileODT | 3118 | 3118 | 7170 | 814 | 0 |
| Kaggle | 1481 | 1481 | 1481 | 512 | 0 |
| COCO | 4599 | 0 | 8561 (contains the 4599 images used for training RetinaNet) | 1326 | 0 |
| SEVIA | 0 | 0 | 0 | 0 | 30,151 + 1816 (in 10 folds; last fold has 3016 cervix images) |
The test dataset contains 31,967 images (30,151 cervix images + 1816 non-cervix images), which is a
significant imbalance between the two categories. Due to this imbalance, we split the cervix images
and generate a "10-fold" testing scenario. In each fold, we have: (i) all the non-cervix images (1816
images); and (ii) 3015 randomly selected cervix images (the last fold has 3016 images). Note that no
image is selected more than once; in other words, the cervix images do not overlap between test
folds. The purpose of splitting the whole set of cervix images into folds is to maintain relatively
balanced test sets, so that in each test, the performance measures we use, such as accuracy,
sensitivity (Sens.), specificity (Spec.), and F-1 score, are calculated without the bias introduced by
dataset imbalance [22]. We then use the average of these measures to evaluate the performance of
each individual algorithm as well as the ensemble model. Each image is resized per architecture; the
input image size for each network is listed in Table 2.
Table 2. Input image dimensions for all three architectures used.

| Method | Resized Dimension (w × h) |
|---|---|
| Deep SVDD | 128 × 128 |
| RetinaNet | 480 × 640 |
| Customized CNN | 256 × 256 |
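A minimal sketch of the 10-fold test construction described above (the function name and path lists are hypothetical):

```python
import random

def make_test_folds(cervix_paths, noncervix_paths, n_folds=10, seed=0):
    """Split cervix images into disjoint folds; every fold reuses all non-cervix images.

    With 30,151 cervix images and 10 folds, nine folds receive 3015 cervix
    images each and the last fold receives the remaining 3016.
    """
    paths = cervix_paths[:]
    random.Random(seed).shuffle(paths)
    per_fold = len(paths) // n_folds  # 3015 for 30,151 images
    folds = []
    for i in range(n_folds):
        start = i * per_fold
        end = start + per_fold if i < n_folds - 1 else len(paths)  # last fold: 3016
        folds.append({"cervix": paths[start:end], "non_cervix": list(noncervix_paths)})
    return folds
```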
The training is performed on one Nvidia Tesla V100 graphics processing unit (GPU). The
hyper-parameters for each model are:
• RetinaNet: The batch size is 4, with a learning rate starting at 0.001. The weight decay is set to
0.0001 and the momentum to 0.9. The model uses ResNet50 [23] as the backbone and is initialized
with the pre-trained RetinaNet weights obtained in [10]. Augmentations of rotation, shearing,
shifting, and x/y flipping are used. Python 3.6 and Keras 2.2.4 are used.
• Deep SVDD: The batch size is 8 and the weight decay factor is λ = 0.5 × 10⁻⁵. The base learning
rate is set to 0.001. The Python version is 3.7 and the PyTorch version is 0.5. Our Deep SVDD
network is modified from the PyTorch implementation in [24].
• Customized CNN: The batch size is 16. The learning rate is set to 10⁻⁴ and the momentum to 0.9.
Categorical cross-entropy loss is used and the weights are updated via Stochastic Gradient Descent
(SGD). Augmentations of scaling, horizontal/vertical shifting, and rotation are used. Python 3.6 and
Keras 2.2.4 are used.
4.2. Results and Discussion
The performance of each individual method is evaluated first, followed by the performance of the
ensemble model.
4.2.1. Results of RetinaNet
The performance of the RetinaNet model on the test sets is shown in Table 3a,b. It achieves an
average accuracy of 91.1%, an average F-1 score of 0.877, and an AUC of 0.94 (Figure 6a). From the
error examples shown in Figure 6b, we observe that the error cases (mainly false negatives) are
hand-drawn sketches of the cervix. These misclassifications are due to visual features, e.g., edges
and colors, that are similar to cervical features.
Table 3. Performance measurement (a) and the confusion matrix of one test fold (b) for the
RetinaNet approach on test images. Positive: non-cervix image. (Sens. = Sensitivity, Spec. =
Specificity, Std. = Standard deviation, Avg. = Average).

(a)

| Method | Accuracy (Avg./Std.) | Sens. (Avg.) | Spec. (Avg./Std.) | F-1 Score (Avg.) |
|---|---|---|---|---|
| RetinaNet | 0.911/0.010 | 0.918 | 0.908/0.011 | 0.877 |

(b)

| | True Positive | True Negative |
|---|---|---|
| Predicted Positive | 1667 | 279 |
| Predicted Negative | 149 | 2736 |
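As a worked check, a short sketch computing these measures from the Table 3b counts, treating non-cervix as the positive class as above; it reproduces this fold's accuracy of about 0.911 and an F-1 of about 0.886, consistent with the reported cross-fold averages:

```python
def fold_metrics(tp, fp, fn, tn):
    """Accuracy, sensitivity, specificity, and F-1 from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    sensitivity = tp / (tp + fn)   # recall on the positive (non-cervix) class
    specificity = tn / (tn + fp)
    f1 = 2 * tp / (2 * tp + fp + fn)
    return accuracy, sensitivity, specificity, f1

# Counts from Table 3b (rows: predictions; columns: true labels).
print(fold_metrics(tp=1667, fp=279, fn=149, tn=2736))
# -> approximately (0.911, 0.918, 0.908, 0.886)
```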
Figure 6. RetinaNet results: (a) ROC curve from one of the test folds; and (b) false negative error
examples.
4.2.2. Results of Deep SVDD
As shown in Table 4a,b, our modified Deep SVDD achieves an average F-1 score of 0.863 across
the folds, and the “most normal” samples are shown in Figure 7.
Table 4. Performance measurement (a) and the confusion matrix of one test fold (b) for the Deep
SVDD approach on test images. Positive: non-cervix image. (Sens. = Sensitivity, Spec. = Specificity,
Std. = Standard deviation, Avg. = Average).

(a)

| Method | Accuracy (Avg./Std.) | Sens. (Avg.) | Spec. (Avg./Std.) | F-1 Score (Avg.) |
|---|---|---|---|---|
| Deep SVDD | 0.889/0.010 | 0.922 | 0.876/0.010 | 0.863 |

(b)

| | True Positive | True Negative |
|---|---|---|
| Predicted Positive | 1674 | 394 |
| Predicted Negative | 142 | 2621 |
We observe that there are some green-filtered cervix images from the Kaggle dataset [15] in the
training/validation sets, as shown in Figure 7; there are no such green-filtered images in the SEVIA
dataset. It is worthwhile to point out, however, that these images are predicted with large "anomaly
scores" [12] in the Kaggle validation set, indicating a significant visual difference from the majority
of the "cervix image" samples used to train Deep SVDD. On further examination, we find that
green-filtered images constitute less than 0.1% of all training cervix images, which may be
insufficient for the model to learn this as an acceptable appearance for cervix images.
Figure 7. Validation examples from the Deep SVDD approach. Left: images predicted as cervix in
the validation set. Right: images predicted as non-cervix, although all of the images shown are
cervix image samples with a "green filter" applied.
4.2.3. Results of Customized CNN
As shown in Table 5a,b, the customized CNN achieves an average accuracy of 83.0% on the test
folds. Its overall performance in classifying the cervix image class is lower than that of the previous
two methods, viz., RetinaNet and Deep SVDD. However, it obtains an accuracy of over 93% on the
non-cervix images in the validation set with a threshold value of 0.75. We use this parameter setting
in the test phase.
Table 5. Performance measurement (a) and the confusion matrix of one test fold (b) for the
customized CNN approach on test images. Positive: non-cervix image. (Sens. = Sensitivity, Spec. =
Specificity, Std. = Standard deviation, Avg. = Average).

(a)

| Method | Accuracy (Avg./Std.) | Sens. (Avg.) | Spec. (Avg./Std.) | F-1 Score (Avg.) |
|---|---|---|---|---|
| Customized CNN | 0.830/0.010 | 0.924 | 0.780/0.012 | 0.810 |

(b)

| | True Positive | True Negative |
|---|---|---|
| Predicted Positive | 1678 | 667 |
| Predicted Negative | 138 | 2348 |
4.2.4. Results of the Ensemble Method
We obtain the hyper-parameters for each of the above three methods from their best validation
performance and use these parameters in the ensemble method during the testing stage. Given an
input test image, the label cervix or non-cervix is assigned if that specific label is voted for at least
twice. As shown in Table 6a,b, the ensemble method achieves an average testing accuracy of 91.6%
with an F-1 score of 0.890.
Table 6. Performance measurement (a) and the confusion matrix of one test fold (b) for the
ensemble approach on test images. Positive: non-cervix image. (Sens. = Sensitivity, Spec. =
Specificity, Std. = Standard deviation, Avg. = Average).

(a)

| Method | Accuracy (Avg./Std.) | Sens. (Avg.) | Spec. (Avg./Std.) | F-1 Score (Avg.) |
|---|---|---|---|---|
| Ensemble | 0.916/0.004 | 0.935 | 0.900/0.008 | 0.890 |

(b)

| | True Positive | True Negative |
|---|---|---|
| Predicted Positive | 1698 | 301 |
| Predicted Negative | 118 | 2715 |
The specificity (0.900) achieved by the ensemble method is very close to that of RetinaNet, which
yields the best specificity (0.908) among the three individual methods. A higher sensitivity (0.935) is
achieved by the ensemble method, indicating an improvement in classifying non-cervix images
over the individual methods. Compared with each individual method, the ensemble method
detects more non-cervix images at the cost of "losing" a small number of cervix images. Meanwhile,
we observe that many error cases here show visual characteristics similar to those found for each
individual model. Among the cervix images that are misclassified as non-cervix images (false
positives), there are often factors, such as motion blur, low lighting, and the presence of distractors
such as pubic hair, that reduce the visual clarity of the cervix as well as the sharpness of the entire
image (shown in Figure 8). While, in reality, these are cervix images, they would typically not be
used in visual analysis by human experts due to poor quality. Thus, although our results are
technically "wrong", the ability of our ensemble method to "reject" these images is advantageous for
quality control of cervix image acquisition and for use in machine learning algorithms.
Figure 8. Samples of cervix images that are “misclassified” as non-cervix images.
5. Conclusions
Toward our goal of achieving reliable AVE performance, it is critical that the algorithm be
presented with images that (1) contain the cervix; and (2) are of adequate quality. In this paper, we
have presented an ensemble deep learning method which detects non-cervix images so that they can
be eliminated from AVE processing. The ensemble structure consists of three deep learning methods,
viz., RetinaNet, Deep SVDD, and a customized CNN, that are selected to take advantage of their
complementary features, viz., object detection, one-class classification, and binary classification. The
decisions of these three models are combined via a voting scheme.
The individual methods as well as the ensemble decisions are evaluated against a large
smartphone-acquired cervix image dataset that has many non-cervix images. The results from our
testing show that the ensemble method outperforms individual deep learning methods and achieves
an accuracy and F-1 score of 91.6% and 0.890, respectively. While this performance is respectable, we
analyzed the error cases to better understand why the ensemble was wrong. We observe that low
image quality factors, such as motion blur, poor illumination, and incomplete/occluded cervix view
(shown in Figure 8) among others, are responsible for these errors. As a result, a more comprehensive
image quality and cervix/non-cervix classifier is necessary to ensure that only the most appropriate
images are presented to the AVE. There are other errors that are more difficult to resolve. Note that
the sketches of the cervix, which were possibly used for training (shown in Figure 6b), caused false
negatives, i.e., non-cervix images classified as cervix. This could be addressed by adding such
examples to the training dataset.
Our approach can support cervix image acquisition in two usage scenarios: the method can be used
either as a standalone module to process all images uploaded to a data server, or as an embedded
module in an image capture device to process each image shortly after it is captured. The cervix
images identified by the classifier can then be sent to human experts for further review, or to
another automatic algorithm.
We believe that such data quality algorithms are extremely important, not only for cleaning the "big
data" datasets used in developing robust disease detection algorithms, but also for quality
assurance of machine learning algorithms in routine clinical use.
Author Contributions: Conceptualization, Z.X. and S.A.; software, P.G.; validation, Z.X., L.R.L. and S.A.; formal
analysis, P.G.; investigation, Z.X., L.R.L. and S.A.; data curation, P.G., Z.X., Z.M. and K.Y.; writing—original
draft preparation, P.G.; writing—review and editing, Z.M., K.Y., O.G., M.D., M.S., Z.X., L.R.L. and S.A.;
visualization, P.G.; supervision, Z.X. and S.A.; project administration, S.A.; funding acquisition, S.A. All authors
have read and agreed to the published version of the manuscript.
Funding: This work was supported by the Intramural Research Program of the Lister Hill National Center for
Biomedical Communications (LHNCBC), the National Library of Medicine (NLM), and the U.S. National
Institutes of Health (NIH).
Acknowledgments: In addition to the above-mentioned organizations, we thank Mark Schiffman of the National
Cancer Institute for his kind review of the manuscript. We are also grateful to MobileODT and The Kilimanjaro
Cervical Screening Project (SEVIA) for providing us the image datasets.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Human Papillomavirus (HPV) and Cervical Cancer. Available online: [Link]
room/fact-sheets/detail/human-papillomavirus-(hpv)-and-cervical-cancer (accessed on 2 October 2019).
2. Bhattacharyya, A.; Nath, J.; Deka, H. Comparative study between pap smear and visual inspection with
acetic acid (VIA) in screening of CIN and early cervical cancer. J. Mid-Life Health 2015, 6, 53–58.
3. Egede, J.; Ajah, L.; Ibekwe, P.; Agwu, U.; Nwizu, E.; Iyare, F. Comparison of the accuracy of Papanicolaou
test cytology, visual inspection with acetic acid, and visual inspection with Lugol's iodine in screening for
cervical neoplasia in southeast Nigeria. J. Glob. Oncol. 2018, 4, 1–9.
4. Sritipsukho, P.; Thaweekul, Y. Accuracy of visual inspection with acetic acid (VIA) for cervical cancer
screening: A systematic review. J. Med. Assoc. Thai. 2010, 93, 254–261.
5. Jeronimo, J.; Massad, L.S.; Castle, P.E.; Wacholder, S.; Schiffman, M.; National Institutes of Health (NIH)-
American Society for Colposcopy and Cervical Pathology (ASCCP) Research Group. Interobserver
agreement in the evaluation of digitized cervical images. Obstet. Gynecol. 2007, 110, 833–840.
6. Hu, L.; Bell, D.; Antani, S.; Xue, Z.; Yu, K.; Horning, M.P.; Gachuhi, N.; Wilson, B.; Jaiswal, M.S.; Befano, B.;
et al. An observational study of deep learning and automated evaluation of cervical images for cancer
screening. J. Natl. Cancer Inst. 2019, 111, 923–932.
7. Bratti, M.C.; Rodriguez, A.C.; Schiffman, M.; Hildesheim, A.; Morales, J.; Alfaro, M.; Guillén, D.;
Hutchinson, M.; Sherman, M.E.; Eklund, C.; et al. Description of a seven-year prospective study of human
papillomavirus infection and cervical neoplasia among 10,000 women in Guanacaste, Costa Rica. Rev.
Panam. Salud Publica 2004, 15, 75–89.
8. Herrero, R.; Schiffman, M.; Bratti, C.; Hildesheim, A.; Balmaceda, I.; Sherman, M.E.; Greenberg, M.;
Cárdenas, F.; Gómez, V.; Helgesen, K.; et al. Design and methods of a population-based natural history
study of cervical neoplasia in a rural province of Costa Rica: The Guanacaste Project. Rev. Panam. Salud
Publica 1997, 1, 362–375.
9. AI Approach Outperformed Human Experts in Identifying Cervical Precancer. Available online:
[Link]
(accessed on 15 May 2020).
10. Guo, P.; Singh, S.; Xue, Z.; Long, L.R.; Antani, S. Deep learning for assessing image focus for automated
cervical cancer screening. In Proceedings of the 2019 IEEE EMBS International Conference on Biomedical
& Health Informatics (BHI), Chicago, IL, USA, 19–22 May 2019; pp. 1–4, doi:10.1109/BHI.2019.8834495.
11. Guo, P.; Xue, Z.; Long, L.R.; Antani, S. Deep learning cervix anatomical landmark segmentation and
evaluation. SPIE Med. Imaging 2020, doi:10.1117/12.2549267.
12. Ruff, L.; Vandermeulen, R.A.; Görnitz, N.; Deecke, L.; Siddiqui, S.A.; Binder, A.; Müller, E.; Kloft, M. Deep
one-class classification. In Proceedings of the 35th International Conference on Machine Learning, PMLR,
Stockholm, Sweden, 10–15 July 2018; Volume 80, pp. 4393–4402.
13. Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal loss for dense object detection. In Proceedings of
the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017.
14. Common Objects in Context (COCO) Dataset. Available online: [Link] (accessed on
20 May 2020).
15. Intel & MobileODT Cervical Cancer Screening Competition, March 2017. Available online:
[Link] (accessed on 13 May 2020).
16. Fernandes, K.; Cardoso, J.S. Ordinal image segmentation using deep neural networks. In Proceedings of
the International Joint Conference on Neural Networks, Rio de Janeiro, Brazil, 8–13 July 2018; pp. 1–7.
17. Yeates, K.; Sleeth, J.; Hopman, W.; Ginsburg, O.; Heus, K.; Andrews, L.; Giattas, M.R.; Yuma, S.; Macheku,
G.; Msuya, A.; et al. Evaluation of a Smartphone-Based Training Strategy among Health Care Workers
Screening for Cervical Cancer in Northern Tanzania: The Kilimanjaro Method. J. Glob. Oncol. 2016, 2, 356–
364.
18. Moya, M.M.; Koch, M.W.; Hostetler, L.D. One-class classifier networks for target recognition applications.
In Proceedings World Congress on Neural Networks; International Neural Network Society: Portland, OR,
USA, 1993; pp. 797–801.
19. Deng, J.; Dong, W.; Socher, R.; Li, L.J.; Li, K.; Fei-Fei, L. ImageNet: A large-scale hierarchical image database.
In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA,
20–25 June 2009.
20. Lin, T.Y.; Dollar, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object
detection. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition
(CVPR), Honolulu, HI, USA, 21–26 July 2017.
21. Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition.
Proc. IEEE 1998, 86, 2278–2324.
22. Johnson, J.M.; Khoshgoftaar, T.M. Survey on deep learning with class imbalance. J. Big Data 2019, 6, 27.
23. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the 2016
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016.
24. A PyTorch Implementation of the Deep SVDD Anomaly Detection Method. Available online:
[Link] (accessed on 23 April 2020).
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license ([Link]