University of Oulu

M. Otani, Y. Nakashima, E. Rahtu and J. Heikkilä, "Rethinking the Evaluation of Video Summaries," 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 2019, pp. 7588-7596. doi: 10.1109/CVPR.2019.00778

Rethinking the evaluation of video summaries

Author: Otani, Mayu (1); Nakashima, Yuta (2); Rahtu, Esa (3); Heikkilä, Janne (4)
Organizations: (1) CyberAgent, Inc.
(2) Osaka University
(3) Tampere University
(4) University of Oulu
Format: article
Version: accepted version
Access: open
Online Access: PDF Full Text (PDF, 1 MB)
Persistent link: http://urn.fi/urn:nbn:fi-fe202003238864
Language: English
Published: Institute of Electrical and Electronics Engineers, 2019
Publish Date: 2020-03-23
Description:

Abstract

Video summarization is a technique for creating a short skim of an original video while preserving its main stories and content. There is substantial interest in automating this process due to the rapid growth of available material. Recent progress has been facilitated by public benchmark datasets, which enable easy and fair comparison of methods. The currently established evaluation protocol is to compare the generated summary against a set of reference summaries provided by the dataset. In this paper, we provide an in-depth assessment of this pipeline using two popular benchmark datasets. Surprisingly, we observe that randomly generated summaries achieve performance comparable to or better than the state of the art. In some cases, the random summaries outperform even the human-generated summaries in leave-one-out experiments. Moreover, it turns out that video segmentation, which is often considered a fixed pre-processing step, has the most significant impact on the performance measure. Based on our observations, we propose alternative approaches for assessing the importance scores, as well as an intuitive visualization of the correlation between the estimated scoring and human annotations.

ISBN: 978-1-7281-3293-8
ISBN Print: 978-1-7281-3294-5
Pages: 7588 - 7596
DOI: 10.1109/CVPR.2019.00778
OADOI: https://oadoi.org/10.1109/CVPR.2019.00778
Host publication: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 15-20 June 2019, Long Beach, USA
Conference: IEEE/CVF Conference on Computer Vision and Pattern Recognition
Type of Publication: A4 Article in conference proceedings
Field of Science: 113 Computer and information sciences
Funding: This work was partly supported by JSPS KAKENHI Grant Nos. 16K16086 and 18H03264.
Copyright information: © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.