Deep Learning Based Neck Models for Object Detection
Abstract—Artificial intelligence is the science of enabling computers to act without being explicitly programmed. In particular, computer vision is one of its innovative fields; it manages how computers acquire comprehension from videos and images. In previous decades, computer vision has been involved in many fields such as self-driving cars, efficient information retrieval, effective surveillance, and a better understanding of human behaviour. Based on deep neural networks, object detection is actively growing and pushing the limits of detection accuracy and speed. Object detection aims to locate each object instance in an image or a video sequence and assign a class to it. Object detectors are usually built from a backbone network designed for feature extraction, a neck model for feature aggregation, and finally a head for prediction. Neck models, which are the subject of study in this paper, are neural networks used to fuse high-level features with low-level features and are known for their efficiency in object detection. The aim of this study is to present a review of neck models and then a benchmark that researchers and scientists can use as a guideline for their work.

Keywords—Object detection; deep learning; computer vision; neck models; feature aggregation; feature fusion

I. INTRODUCTION

Object detection is often called image detection, object identification, or object recognition; all these concepts are synonymous. It is a computer vision method for locating instances of objects in an image or video sequence. Object detection algorithms, therefore, typically rely on machine learning or deep learning techniques to obtain meaningful results. When humans look at images or videos, they can locate and recognize objects of interest easily. The goal of object detection is to mimic this intelligence using a computer. With recent advancements in deep learning-based computer vision models, object detection use cases are spreading more than ever before. A wide range of applications has been implemented, for instance, self-driving cars, object tracking, anomaly detection, and video surveillance.

Object detection can be divided into two main categories: deep learning-based techniques and machine learning-based techniques. Deep learning-based techniques can be separated into two approaches: one-stage detectors and two-stage detectors. Deep learning-based object detectors share a common pipeline: starting from the input, a backbone model for feature extraction, then a neck model for feature fusion, and finally a head model for class/box prediction.

The neck of an object detector refers to the additional layers existing between the backbone [1] and the head. Their role is to collect feature maps from different stages. Neck models are composed of several top-down paths and several bottom-up paths. The idea behind this feature aggregation is to let low-level features interact more directly with high-level features by mixing information from the high-level features into the low-level ones. Necks achieve aggregation and feature interaction across many layers, since the distance between the two feature maps is large. Several methods can be implemented in this part, for example, PAN [2] or FPN [3] (see Fig. 1).

The head is the last model of the object detector; it predicts the bounding boxes and classes of objects. It can be a dense prediction head, which belongs to one-stage detectors such as YOLO [4], SSD [5], and CenterNet [6], or a sparse prediction head, which belongs to two-stage detectors such as Fast R-CNN [7], Faster R-CNN [8], and Mask R-CNN [9] (see Fig. 1). On the one hand, one-stage detectors have high inference speeds; these models predict bounding boxes in a single step without using region proposals. On the other hand, two-stage detectors have high localization and recognition accuracy: firstly, they use a Region Proposal Network to generate regions of interest; secondly, they send the region proposals on for object classification and bounding-box regression.

We hope that our benchmarking study can provide a timely comparison of neck models for object detection and help practitioners and researchers to further master research on object detection models. The rest of our study is organized as follows: In Section 2, we discuss the different existing related works about feature aggregation. In Section 3, we list the neck neural networks used for feature fusion in object detection and discuss their architectures and categories. In Section 4, our comparative study is presented. In Section 5, we highlight the different notable results, and Section 6 covers the discussion. Finally, in Section 7, we conclude and discuss future directions.
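To make the neck's role concrete, the listing below gives a minimal PyTorch sketch of an FPN-style top-down fusion [3]. It is an illustrative sketch, not the exact configuration of any detector compared here: the channel counts assume a ResNet-50-like backbone, and the class and layer names (SimpleFPNNeck, lateral, output) as well as the nearest-neighbour upsampling are our own assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleFPNNeck(nn.Module):
    """Minimal FPN-style neck: fuses backbone maps C3-C5 through a
    top-down path with 1x1 lateral connections (sketch, see [3])."""

    def __init__(self, in_channels=(512, 1024, 2048), out_channels=256):
        super().__init__()
        # 1x1 lateral convs project each backbone stage to a common width
        self.lateral = nn.ModuleList(
            nn.Conv2d(c, out_channels, kernel_size=1) for c in in_channels
        )
        # 3x3 convs smooth each merged map before it reaches the head
        self.output = nn.ModuleList(
            nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1)
            for _ in in_channels
        )

    def forward(self, feats):
        # feats: [C3, C4, C5], ordered from fine to coarse resolution
        laterals = [conv(f) for conv, f in zip(self.lateral, feats)]
        # Top-down path: upsample the coarser level and add it to the finer
        # one, so high-level semantics flow directly into low-level maps
        for i in range(len(laterals) - 1, 0, -1):
            laterals[i - 1] = laterals[i - 1] + F.interpolate(
                laterals[i], size=laterals[i - 1].shape[-2:], mode="nearest"
            )
        return [conv(x) for conv, x in zip(self.output, laterals)]  # P3-P5

# Shapes as a ResNet-50 backbone would produce them for a 256x256 input
c3 = torch.rand(1, 512, 32, 32)
c4 = torch.rand(1, 1024, 16, 16)
c5 = torch.rand(1, 2048, 8, 8)
print([p.shape for p in SimpleFPNNeck()([c3, c4, c5])])
```

A PAN-style neck [2] would append a second, bottom-up pass over the resulting P3-P5 maps, shortening the path from low-level features back up to the head.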
Fig. 1. Models' Taxonomy of Object Detectors in each Part: Backbone, Head, and Neck.
IV. COMPARISON

Table I below lists the models that we are going to compare based on different comparison metrics. The measures were gathered carefully to cover several methods.

The table presents the deep learning models applied to the object detection task on the COCO dataset. The Model column gives the model used to predict classes and bounding boxes; the Backbone column gives the network used for feature extraction, where the associated number refers to the number of layers; and the Neck column gives the feature aggregation network used.

In full, Table I contains the model's reference, journal and year, name, backbone, neck, AP, AP50, AP75, APS, APM, and APL (see Table I).
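The AP columns follow the standard COCO evaluation protocol: AP averages precision over IoU thresholds 0.50:0.95, AP50 and AP75 fix the IoU threshold at 0.50 and 0.75, and APS, APM, and APL restrict evaluation to small, medium, and large objects. As a sketch of how such numbers are produced, the snippet below runs the pycocotools reference evaluation; the two file paths are placeholders for a ground-truth annotation file and a detector's JSON results.

```python
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

# Placeholder paths: COCO ground truth and detections in COCO result format
coco_gt = COCO("annotations/instances_val2017.json")
coco_dt = coco_gt.loadRes("detections.json")  # [{image_id, category_id, bbox, score}, ...]

ev = COCOeval(coco_gt, coco_dt, iouType="bbox")
ev.evaluate()
ev.accumulate()
ev.summarize()

# ev.stats holds the metrics reported in Table I:
# stats[0]=AP, stats[1]=AP50, stats[2]=AP75,
# stats[3]=APS, stats[4]=APM, stats[5]=APL
ap, ap50, ap75, ap_s, ap_m, ap_l = ev.stats[:6]
print(f"AP={ap:.3f} AP50={ap50:.3f} AP75={ap75:.3f}")
```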
TABLE I. DETAILED COMPARISONS ON MULTIPLE POPULAR BASELINE OBJECT DETECTORS ON THE COCO DATASET

Ref   Journal     Model           Backbone            Neck     AP    AP50  AP75  APS   APM   APL
[18]  CVPR 2019   Libra R-CNN     ResNet-50           FPN      38.7  59.9  42.0  22.5  41.1  48.7
[18]  CVPR 2019   Libra R-CNN     ResNet-101          FPN      40.3  61.3  43.9  22.9  43.1  51.0
[18]  CVPR 2019   Libra R-CNN     ResNeXt-101         FPN      43.0  64.0  47.0  25.3  45.6  54.6
[8]   NIPS 2015   Faster R-CNN    ResNet-50           FPN      37.8  58.7  40.6  21.3  41.0  49.5
[8]   NIPS 2015   Faster R-CNN    ResNet-50           AdaFPN   39.0  58.8  41.8  22.6  42.3  50.0
[8]   NIPS 2015   Faster R-CNN    ResNet-50           AugFPN   38.8  61.5  42.0  23.3  42.1  47.7
[8]   NIPS 2015   Faster R-CNN    ResNet-101          AugFPN   41.5  63.9  45.1  23.8  44.7  52.8
[8]   NIPS 2015   Faster R-CNN    ResNeXt-101-32x4d   AugFPN   41.9  64.4  45.6  25.2  45.4  52.6
[8]   NIPS 2015   Faster R-CNN    ResNeXt-101-64x4d   AugFPN   43.0  65.6  46.9  26.2  46.5  53.9
[8]   NIPS 2015   Faster R-CNN    MobileNet-v2        AugFPN   34.2  56.6  36.2  19.6  36.4  43.1
[28]  ICCV 2019   FCOS            ResNet-50           AugFPN   37.9  58.0  40.4  21.2  40.5  47.9
[28]  ICCV 2019   FCOS            ResNet-50           FPN      39.1  57.9  42.1  23.3  43.0  50.2
[28]  ICCV 2019   FCOS            ResNet-50           AdaFPN   40.1  58.6  43.2  24.1  43.6  50.6
[28]  ICCV 2019   FCOS            ResNeXt-101         FPN      42.7  62.2  46.1  26.0  45.6  52.6
[9]   ICCV 2017   Mask R-CNN      ResNet-101          FPN      38.2  60.3  41.7  20.1  41.1  50.2
[9]   ICCV 2017   Mask R-CNN      ResNeXt-101         FPN      39.8  62.3  43.4  22.1  43.2  51.2
[9]   ICCV 2017   Mask R-CNN      ResNet-50           AugFPN   39.5  61.8  42.9  23.4  42.7  49.1
[9]   ICCV 2017   Mask R-CNN      ResNet-101          AugFPN   42.4  64.4  46.3  24.6  45.7  54.0
[9]   ICCV 2017   Mask R-CNN      ResNet-50           A²-FPN   36.6  59.3  39.1  19.8  39.3  48.0
[9]   ICCV 2017   Mask R-CNN      ResNet-101          A²-FPN   37.9  60.8  40.5  20.6  41.8  50.1
[29]  CVPR 2018   Cascade R-CNN   ResNet-50           FPN      36.5  59.0  39.2  20.3  38.8  46.4
[29]  CVPR 2018   Cascade R-CNN   ResNet-101          FPN      38.8  61.1  41.9  21.3  41.8  49.8
[29]  CVPR 2018   Cascade R-CNN   ResNet-101          AC-FPN   45.0  64.4  49.0  26.9  47.7  56.6
[30]  ICCV 2017   RetinaNet       ResNet-101          FPN      39.1  59.1  42.3  21.8  42.7  50.2
[30]  ICCV 2017   RetinaNet       ResNeXt-101         FPN      40.8  61.1  44.1  24.1  44.2  51.2
[30]  ICCV 2017   RetinaNet       ResNet-50           AugFPN   37.5  58.4  40.1  21.3  40.5  47.3
[30]  ICCV 2017   RetinaNet       MobileNet-v2        AugFPN   34.0  54.0  36.0  18.6  36.0  44.0
[31]  arXiv 2019  RetinaMask      ResNet-50           FPN      39.4  58.6  42.3  21.9  42.0  51.0
[32]  CVPR 2019   Grid R-CNN      ResNeXt-101         FPN      43.2  63.0  46.6  25.1  46.5  55.2
[33]  CVPR 2019   HTC             ResNeXt-101         FPN      47.1  63.9  44.7  22.8  43.9  54.6
[33]  CVPR 2019   HTC             ResNet-50           FPN      38.4  60.0  41.5  20.4  40.7  51.2
[33]  CVPR 2019   HTC             ResNet-101          FPN      39.7  61.8  43.1  21.0  42.2  53.5
[33]  CVPR 2019   HTC             ResNet-50           A²-FPN   39.8  62.3  43.0  21.6  42.4  52.8
[33]  CVPR 2019   HTC             ResNet-101          A²-FPN   40.8  63.6  44.1  22.3  43.5  54.4
[33]  CVPR 2019   HTC             ResNeXt-101         A²-FPN   42.1  65.3  45.7  23.6  44.8  56.0
[34]  CVPR 2020   DetectoRS       ResNeXt-101-DCN     RFP      53.3  71.6  58.5  33.9  56.5  66.9
[35]  arXiv 2021  CenterNet2      Res2Net-101-DCN     BiFPN    56.4  74.0  61.6  38.7  59.7  68.6
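To reproduce selections such as the "six top average precision" comparison in Section V, Table I can be ranked programmatically. A small sketch with pandas follows; only a handful of rows are transcribed here, all copied from the table above.

```python
import pandas as pd

# A subset of Table I (Model, Backbone, Neck, AP), copied from above
rows = [
    ("CenterNet2",    "Res2Net-101-DCN",   "BiFPN",  56.4),
    ("DetectoRS",     "ResNeXt-101-DCN",   "RFP",    53.3),
    ("HTC",           "ResNeXt-101",       "FPN",    47.1),
    ("Cascade R-CNN", "ResNet-101",        "AC-FPN", 45.0),
    ("Grid R-CNN",    "ResNeXt-101",       "FPN",    43.2),
    ("Libra R-CNN",   "ResNeXt-101",       "FPN",    43.0),
    ("Faster R-CNN",  "ResNeXt-101-64x4d", "AugFPN", 43.0),
]
df = pd.DataFrame(rows, columns=["Model", "Backbone", "Neck", "AP"])

# The six best backbone/neck fusions by average precision
print(df.nlargest(6, "AP"))
```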
5) HTC: For the HTC [33] model, the fusion of ResNeXt-101 and A²-FPN leads in performance; the second most performant fusion is ResNeXt-101 with FPN. Among the models based on a ResNet backbone, ResNet-50 with A²-FPN works better than ResNet-50 with FPN (see Fig. 12).

6) Cascade R-CNN: Cascade R-CNN [29] performance was led by merging ResNet-101 and AC-FPN. The combination of ResNet-101 as a backbone with an FPN neck gained less performance (see Fig. 13).

Fig. 13. Cascade R-CNN Comparison based on Different Feature Aggregation Models.

7) RetinaNet: Regarding RetinaNet [30], firstly, ResNeXt-101 as a backbone with FPN as the feature aggregation model gained the highest performance compared to the other fusions; secondly came ResNet-101 with FPN; thirdly, ResNet-50 with AugFPN; and finally, MobileNet-v2 with AugFPN (see Fig. 14).

Fig. 14. RetinaNet Comparison based on Different Feature Aggregation Models.

8) Six top average precision: On the one hand, after extracting the six best models in terms of average precision, we preferred to compare the methods that gain the top average precision. On the other hand, in terms of performance and based on our spider chart, CenterNet2 achieves the best performance. The best method is based on Res2Net-101-DCN as a backbone and BiFPN as a feature aggregation model. The second rank goes to DetectoRS, based on ResNeXt-101-DCN as a backbone and RFP as a feature aggregation model (see Fig. 15).

Fig. 15. Multicriteria Comparison based on Different Feature Aggregation Models.

VI. DISCUSSION

In this paper, we have systematically depicted the importance of object detection components, covering the deep learning methodologies used in object detection, including two-stage detectors and one-stage detectors.

Firstly, we started by presenting object detection methodologies, which have been categorized into traditional methods and deep learning-based methodologies. Secondly, we discussed the main arrangement of deep learning-based object detection, which includes a backbone, usually pretrained, used to extract features; then a feature aggregation model, called the neck, for merging high- and low-level features; and finally, the head used for prediction.

Based on our comparative study, we notice that CenterNet2, with Res2Net-101-DCN as a backbone and BiFPN as a feature fusion model, leads the performance and gains widespread dominance thanks to its supremacy regarding all criteria.

DetectoRS, with ResNeXt-101-DCN as a backbone and RFP as a feature fusion model, reaches the second score. HTC gains the third position with its high performance based on ResNeXt-101 as a backbone and FPN. We also notice that there is no intersection between the compared algorithms: each algorithm maintains its advantage over the lower-ranked ones across all criteria.

This comparison has also revealed the importance of making a benchmark in order to have a global, straightforward view of how to build efficient models with high performance.

The comparison has been made based on a set of criteria. The scores for each evaluated method were calculated using the weighted score model: each criterion is multiplied by a weight and the weighted values are summed into an overall score. The various scores have not only helped us determine an overall ranking, but they have also shown each method's internal strengths and weaknesses concerning each criterion; a minimal sketch of this scoring follows below.
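As a hedged illustration of the weighted score model, the sketch below aggregates the Table I criteria into a single score per method. The equal weights are a neutral assumption for demonstration only (the paper does not publish its exact weighting); the metric values are copied from Table I.

```python
# Criteria from Table I and assumed (equal) weights; adjust as needed
criteria = ["AP", "AP50", "AP75", "APS", "APM", "APL"]
weights = [1 / len(criteria)] * len(criteria)

models = {
    # values copied from Table I for the top-ranked fusions
    "CenterNet2 (Res2Net-101-DCN + BiFPN)": [56.4, 74.0, 61.6, 38.7, 59.7, 68.6],
    "DetectoRS (ResNeXt-101-DCN + RFP)":    [53.3, 71.6, 58.5, 33.9, 56.5, 66.9],
    "HTC (ResNeXt-101 + FPN)":              [47.1, 63.9, 44.7, 22.8, 43.9, 54.6],
}

def weighted_score(values):
    """Weighted score model: sum over criteria of weight * value."""
    return sum(w * v for w, v in zip(weights, values))

# Rank methods by their overall weighted score, best first
for name, vals in sorted(models.items(), key=lambda kv: weighted_score(kv[1]), reverse=True):
    print(f"{weighted_score(vals):6.2f}  {name}")
```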
On the one hand, we retain from this review and comparison study that the components of deep learning-based object detection models (backbone, neck, and head) highly impact performance. On the other hand, using more layers generally gives higher performance.

VII. CONCLUSION

From the study at hand, it has been noticed that many scientists and researchers from a diversity of backgrounds are working day after day on the object detection field, due to its utmost importance. New models are appearing every month with the growth of deep learning.

This comparison could be used as a support by handing researchers a scientific comparison of different object detection methodologies and their main models, in order to build performant models.

A comparison of the necks used for feature aggregation between high- and low-level features has been presented. We have been interested in presenting different necks and analysing the performance of their overall models.

Future work will focus on the implementation of some of the different deep learning-based object detection models. We aim to implement, test, and analyze the results.

REFERENCES
[1] S. Bouraya and A. Belangour, "Object detectors' convolutional neural networks backbones: a review and a comparative study," vol. 9, no. 11, pp. 1379–1386, 2021.
[2] S. Liu, L. Qi, H. Qin, J. Shi, and J. Jia, "Path aggregation network for instance segmentation," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 8759–8768, 2018, doi: 10.1109/CVPR.2018.00913.
[3] T.-Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature pyramid networks for object detection," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 936–944, 2017, doi: 10.1109/CVPR.2017.106.
[4] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 779–788, 2016, doi: 10.1109/CVPR.2016.91.
[5] W. Liu et al., "SSD: Single shot multibox detector," Lect. Notes Comput. Sci., vol. 9905 LNCS, pp. 21–37, 2016, doi: 10.1007/978-3-319-46448-0_2.
[6] K. Duan, S. Bai, L. Xie, H. Qi, Q. Huang, and Q. Tian, "CenterNet: Keypoint triplets for object detection," Proc. IEEE Int. Conf. Comput. Vis., pp. 6568–6577, 2019, doi: 10.1109/ICCV.2019.00667.
[7] R. Girshick, "Fast R-CNN," Proc. IEEE Int. Conf. Comput. Vis., pp. 1440–1448, 2015, doi: 10.1109/ICCV.2015.169.
[8] S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards real-time object detection with region proposal networks," pp. 1–9.
[9] K. He, G. Gkioxari, P. Dollár, and R. Girshick, "Mask R-CNN," IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 2, pp. 386–397, 2020, doi: 10.1109/TPAMI.2018.2844175.
[10] S. Sharma, R. Kiros, and R. Salakhutdinov, "Action recognition using visual attention," pp. 1–11, 2015. [Online]. Available: http://arxiv.org/abs/1511.04119.
[11] A. Kar, N. Rai, K. Sikka, and G. Sharma, "AdaScan: Adaptive scan pooling in deep convolutional neural networks for human action recognition in videos," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 5699–5708, 2017, doi: 10.1109/CVPR.2017.604.
[12] Z. Li, K. Gavrilyuk, E. Gavves, M. Jain, and C. G. M. Snoek, "VideoLSTM convolves, attends and flows for action recognition," Comput. Vis. Image Underst., vol. 166, pp. 41–50, 2018, doi: 10.1016/j.cviu.2017.10.011.
[13] N. Ballas, L. Yao, C. Pal, and A. Courville, "Delving deeper into convolutional networks for learning video representations," pp. 1–11, 2016.
[14] A. Karpathy and T. Leung, "Large-scale video classification with convolutional neural networks."
[15] J. Donahue, "Long-term recurrent convolutional networks for visual recognition and description," 2014.
[16] N. Ballas, H. Larochelle, and A. Courville, "Describing videos by exploiting temporal structure," pp. 4507–4515, 2015.
[17] O. Ronneberger, P. Fischer, and T. Brox, "U-Net: Convolutional networks for biomedical image segmentation," pp. 1–8.
[18] J. Pang, K. Chen, J. Shi, H. Feng, W. Ouyang, and D. Lin, "Libra R-CNN: Towards balanced learning for object detection," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 821–830, 2019, doi: 10.1109/CVPR.2019.00091.
[19] G. Ghiasi, T.-Y. Lin, and Q. V. Le, "NAS-FPN: Learning scalable feature pyramid architecture for object detection," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 7029–7038, 2019, doi: 10.1109/CVPR.2019.00720.
[20] N. Wang et al., "NAS-FCOS: Fast neural architecture search for object detection," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 11940–11948, 2020, doi: 10.1109/CVPR42600.2020.01196.
[21] F. Chollet, "Xception: Deep learning with depthwise separable convolutions," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 1251–1258, 2017.
[22] A. Vaswani et al., "Attention is all you need," Adv. Neural Inf. Process. Syst., pp. 5999–6009, 2017.
[23] X. Wang and R. Girshick, "Non-local neural networks."
[24] Y. Chen, "A²-Nets: Double attention networks," NeurIPS, 2018.
[25] J. Fu, J. Liu, H. Tian, Y. Li, Y. Bao, Z. Fang, and H. Lu, "Dual attention network for scene segmentation."
[26] Y. Chen, M. Rohrbach, Z. Yan, S. Yan, J. Feng, and Y. Kalantidis, "Graph-based global reasoning networks."
[27] M. Tan, R. Pang, and Q. V. Le, "EfficientDet: Scalable and efficient object detection," pp. 10781–10790.
[28] Z. Tian, C. Shen, H. Chen, and T. He, "FCOS: Fully convolutional one-stage object detection," Proc. IEEE Int. Conf. Comput. Vis., pp. 9626–9635, 2019, doi: 10.1109/ICCV.2019.00972.
[29] Z. Cai and N. Vasconcelos, "Cascade R-CNN: Delving into high quality object detection," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 6154–6162, 2018, doi: 10.1109/CVPR.2018.00644.
[30] T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, "Focal loss for dense object detection," IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 2, pp. 318–327, 2020, doi: 10.1109/TPAMI.2018.2858826.
[31] C.-Y. Fu, M. Shvets, and A. C. Berg, "RetinaMask: Learning to predict masks improves state-of-the-art single-shot detection for free."
[32] X. Lu, B. Li, Y. Yue, Q. Li, and J. Yan, "Grid R-CNN," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 7355–7364, 2019, doi: 10.1109/CVPR.2019.00754.
[33] K. Chen et al., "Hybrid task cascade for instance segmentation," Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 4969–4978, 2019, doi: 10.1109/CVPR.2019.00511.
[34] S. Qiao, L.-C. Chen, and A. Yuille, "DetectoRS: Detecting objects with recursive feature pyramid and switchable atrous convolution," 2020. [Online]. Available: http://arxiv.org/abs/2006.02334.
[35] X. Zhou, V. Koltun, and P. Krähenbühl, "Probabilistic two-stage detection," 2021. [Online]. Available: http://arxiv.org/abs/2103.07461.