Open library

This portal has been archived. Explore the next generation of this technology.

What is Holding Back Convnets for Detection?

lib:d482fbd3deb53095 (v1.0.0)

Authors: Bojan Pepik,Rodrigo Benenson,Tobias Ritschel,Bernt Schiele
ArXiv: 1508.02844
Document: PDF DOI

Abstract URL: http://arxiv.org/abs/1508.02844v2

Convolutional neural networks have recently shown excellent results in general object detection and many other tasks. Albeit very effective, they involve many user-defined design choices. In this paper we want to better understand these choices by inspecting two key aspects "what did the network learn?", and "what can the network learn?". We exploit new annotations (Pascal3D+), to enable a new empirical analysis of the R-CNN detector. Despite common belief, our results indicate that existing state-of-the-art convnet architectures are not invariant to various appearance factors. In fact, all considered networks have similar weak points which cannot be mitigated by simply increasing the training data (architectural changes are needed). We show that overall performance can improve when using image renderings for data augmentation. We report the best known results on the Pascal3D+ detection and view-point estimation tasks.

Relevant initiatives

Related knowledge about this paper

Search on this portal

Reproduced results (crowd-benchmarking and competitions)

Artifact and reproducibility checklists

Common formats for research projects and shared artifacts

Collective Knowledge (organizing research projects based on FAIR principles)

Reproducibility initiatives

Comments

Please log in to add your comments!

If you notice any inapropriate content that should not be here, please report us as soon as possible and we will try to remove it within 48 hours!

What is Holding Back Convnets for Detection?

Relevant initiatives Hide

Comments Hide

Relevant initiatives

Comments