Real Time Object Detection Can be Embedded on Low Powered Devices

Authors

  • Pal JB Dept. Of Computer Science, St. Xavier’s College, Park Street, Kolkata, India
  • Agarwal S Dept. Of Computer Science, St. Xavier’s College, Park Street, Kolkata, India

DOI:

https://doi.org/10.26438/ijcse/v7i2.10051009

Keywords:

TensorFlow, MobileNet, MS COCO, Real-time, and Object detection

Abstract

It has been a real challenge for computers with low computing power and memory to detect objects in real time. After the invention of Convolution Neural Networks (CNN) it is easy for computers to detect images and recognize them. There are several technologies and models which can detect objects in real time, but most of them require high end technologies in terms of GPUs and TPUs. Though, recently many new algorithms and models have been proposed, which runs on low resources. In this paper we studied MobileNets to detect objects using webcam to successfully build a real time object detection system. We observed the pre trained model of the famous MS COCO dataset to achieve our purpose. Moreover, we applied Google’s open source TensorFlow as our back end. This real time object detection system may help in future to solve various complex vision problems.

References

[1] S. Tripathi, G. Dane, B. Kang, V. Bhaskaran, and T. Nguyen, “LCDet: Low-Complexity Fully-Convolutional Neural Networks for Object Detection in Embedded Systems”, work done in part during an internship at Qualcomm, arXiv:1705.05922 [cs.CV], https://arxiv.org/abs/1705.05922 accessed on 27.09.2018, May 2017.

[2] Y. Li, J. Li, W. Lin, and J. Li, “Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usages”, Shanghai Jiao Tong University and Intel Labs, arXiv: 1807.11013 [cs.CV], https://arxiv.org/abs/1807.11013 accessed on 27.09.2018, July 2018.

[3] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You Only Look Once: Unified, Real-Time Object Detection”, arXiv: 1506.02640 [cs.CV], https://arxiv.org/abs/1506.02640 accessed on 27.09.2018, June 2015.

[4] X. Zhang, X. Zhou, M. Lin, and J. Sun, “ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices”, MegviiInc (Face++), arXiv:1707.01083 [cs.CV], https://arxiv.org/abs/1707.01083 accessed on 27.09.2018, Dec 2017.

[5] R. J. Wang, X. Li, S. Ao, and C. X. Ling, “Pelee: A Real-Time Object Detection System on Mobile Devices”, University of Western Ontario, arXiv: 1804.06882v1 [cs.CV], https://arxiv.org/abs/1804.06882 accessed on 27.09.2018, April 2018.

[6] S. Y. Nikouei, Y. Chen, S. Song, and T. R. Faughnan, “Kerman: A Hybrid Lightweight Tracking Algorithm to Enable Smart Surveillance as an Edge Service”, Binghamton University, arXiv: 1808.02134 [cs.DC], https://arxiv.org/abs/1808.02134 accessed on 27.09.2018, August 2018.

[7] T. Liu, M. Elmikaty, and T. Stathaki, “SAM-RCNN: Scale-Aware Multi-Resolution Multi-Channel Pedestrian Detection”, Electrical and Electronic Engineering Imperial College London and Jaguar Land Rover Research Coventy, arXiv: 1808.02246 [cs.CV], https://arxiv.org/abs/1808.02246 accessed on 27.09.2018, August 2018.

[8] T. Y. Lin, M. Maire, S. Belongie, L. Bourdev, R. Girshick, J. Hays, P. Perona, D. Ramanan, C. L. Zitnick, and P. Dollar, “Microsoft COCO: Common Objects in Context”, arXiv: 1405.0312 [cs.CV], https://arxiv.org/abs/1405.0312 accessed on 27.09.2018, May 2014.

[9] A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam, “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications”, arXiv: 1704.04861 [cs.CV], https://arxiv.org/abs/ 1704.04861 accessed on 27.09.2018, April 2017.

[10] M. Abadi, A. Agarwal, P. Barham, E. Brevdo, Z. Chen, C. Citro, G. S. Corrado, A. Davis, J. Dean, M. Devin, S. Ghemawat, I. Goodfellow, A. Harp, G. Irving, M. Isard, Y. Jia, R. Jozefowicz, L. Kaiser, M. Kudlur, J. Levenberg, D. Mane, R. Monga, S. Moore, D. Murray, C. Olah, M. Schuster, J. Shlens, B. Steiner, I. Sutskever, K. Talwar, P. Tucker, V. Vanhoucke, V. Vasudevan, F. Viegas, O. Vinyals, P. Warden, M. Wattenberg, M. Wicke, Yuan Yu, X. Zheng ,“TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems”, Google Research, available on http://download.tensorflow.org/paper/whitepaper2015.pdf accessed on 27.09.2018, November 2015.

[11] J. E. Espinosa, S. A. Velastin, and J. W. Branch, “Motorcycle detection and classification in urban Scenarios using a model based on Faster R-CNN”, University Carlos 3 – Madrid Spain, arXiv: 1808.02299 [cs.CV], https://arxiv.org/abs/1808.02299 accessed on 27.09.2018, August 2018.

[12] M. Rahman, M. Islam, J. Calhoun, and M. Chowdhury,“Real-time Pedestrian Detection Approach with an Efficient Data Communication Bandwidth Strategy”, Clemson University, arXiv: 1808.09023 [cs.CV], https://arxiv.org/abs/1808.09023 accessed on 27.09.2018, August 2018.

Downloads

Published

2019-02-28
CITATION
DOI: 10.26438/ijcse/v7i2.10051009
Published: 2019-02-28

How to Cite

[1]
J. B. Pal and S. Agarwal, “Real Time Object Detection Can be Embedded on Low Powered Devices”, Int. J. Comp. Sci. Eng., vol. 7, no. 2, pp. 1005–1009, Feb. 2019.

Issue

Section

Research Article