University of Oulu

Liu, L., Ouyang, W., Wang, X. et al. Deep Learning for Generic Object Detection: A Survey. Int J Comput Vis 128, 261–318 (2020).

Deep learning for generic object detection : a survey

Saved in:
Author: Liu, Li1,2; Ouyang, Wanli3; Wang, Xiaogang4;
Organizations: 1National University of Defense Technology, Changsha, China
2University of Oulu, Oulu, Finland
3University of Sydney, Camperdown, Australia
4Chinese University of Hong Kong, Sha Tin, China
5University of Waterloo, Waterloo, Canada
Format: article
Version: published version
Access: open
Online Access: PDF Full Text (PDF, 7.5 MB)
Persistent link:
Language: English
Published: Springer Nature, 2020
Publish Date: 2020-01-13


Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful strategy for learning feature representations directly from data and have led to remarkable breakthroughs in the field of generic object detection. Given this period of rapid evolution, the goal of this paper is to provide a comprehensive survey of the recent achievements in this field brought about by deep learning techniques. More than 300 research contributions are included in this survey, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics. We finish the survey by identifying promising directions for future research.

see all

Series: International journal of computer vision
ISSN: 0920-5691
ISSN-E: 1573-1405
ISSN-L: 0920-5691
Volume: 128
Pages: 261 - 318
DOI: 10.1007/s11263-019-01247-4
Type of Publication: A1 Journal article – refereed
Field of Science: 113 Computer and information sciences
Funding: Open access funding provided by University of Oulu including Oulu University Hospital. The authors would like to thank the pioneering researchers in generic object detection and other related fields. The authors would also like to express their sincere appreciation to Professor Jiří Matas, the associate editor and the anonymous reviewers for their comments and suggestions. This work has been supported by the Center for Machine Vision and Signal Analysis at the University of Oulu (Finland) and the National Natural Science Foundation of China under Grant 61872379.
Copyright information: © The Author(s) 2019. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.