University of Oulu

Z. Su et al., "Pixel Difference Networks for Efficient Edge Detection," 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 2021, pp. 5097-5107, doi: 10.1109/ICCV48922.2021.00507.

Pixel difference networks for efficient edge detection

Author: Su, Zhuo1; Liu, Wenzhe2; Yu, Zitong1; et al.
Organizations: 1Center for Machine Vision and Signal Analysis, University of Oulu, Finland
2National University of Defense Technology, China
3Harbin Institute of Technology (Shenzhen), China
4Xidian University, China
Format: article
Version: accepted version
Access: open
Online Access: PDF Full Text (PDF, 2.4 MB)
Persistent link: http://urn.fi/urn:nbn:fi-fe2023032333036
Language: English
Published: IEEE Computer Society, 2021
Publish Date: 2023-03-23

Abstract

Deep Convolutional Neural Networks (CNNs) can achieve human-level performance in edge detection thanks to their rich and abstract edge representation capacities. However, the high performance of CNN-based edge detection is achieved with a large pretrained CNN backbone, which is memory- and energy-consuming. In addition, it is surprising that the previous wisdom of traditional edge detectors, such as Canny, Sobel, and LBP, is rarely investigated in the rapidly developing deep learning era. To address these issues, we propose a simple, lightweight yet effective architecture named Pixel Difference Network (PiDiNet) for efficient edge detection. PiDiNet adopts novel pixel difference convolutions that integrate the traditional edge detection operators into the popular convolutional operations of modern CNNs for enhanced performance on the task, enjoying the best of both worlds. Extensive experiments on BSDS500, NYUD, and Multicue demonstrate its effectiveness and its high training and inference efficiency. Surprisingly, when trained from scratch with only the BSDS500 and VOC datasets, PiDiNet surpasses the recorded result of human perception (0.807 vs. 0.803 in ODS F-measure) on the BSDS500 dataset at 100 FPS with fewer than 1M parameters. A faster version of PiDiNet with fewer than 0.1M parameters still achieves performance comparable to the state of the art at 200 FPS. Results on the NYUD and Multicue datasets show similar observations. The code is available at https://github.com/zhuoinoulu/pidinet.
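The key operation, pixel difference convolution, replaces the raw intensities fed to an ordinary convolution with differences against a reference pixel, so that gradient cues of the kind hard-coded in Sobel or LBP are computed by learnable kernels. As a reading aid only (not the authors' code, which lives in the linked repository and also covers angular and radial variants), a minimal PyTorch sketch of the central variant might look as follows; the class name CentralPDC and all hyperparameters are illustrative assumptions:

import torch
import torch.nn as nn
import torch.nn.functional as F

class CentralPDC(nn.Module):
    """Central pixel difference convolution (illustrative sketch).

    y(p0) = sum_i w_i * (x(p_i) - x(p0))
          = Conv(x, w)(p0) - x(p0) * sum_i w_i,
    i.e. one ordinary convolution minus a 1x1 convolution whose
    kernel is the spatial sum of the learned weights.
    """

    def __init__(self, in_ch, out_ch, kernel_size=3, padding=1):
        super().__init__()
        self.weight = nn.Parameter(
            0.1 * torch.randn(out_ch, in_ch, kernel_size, kernel_size))
        self.padding = padding

    def forward(self, x):
        # Vanilla convolution term: sum_i w_i * x(p_i)
        y = F.conv2d(x, self.weight, padding=self.padding)
        # Center term x(p0) * sum_i w_i, realized as a 1x1 convolution
        w_sum = self.weight.sum(dim=(2, 3), keepdim=True)
        return y - F.conv2d(x, w_sum)

if __name__ == "__main__":
    x = torch.randn(1, 3, 64, 64)
    edges = CentralPDC(3, 8)(x)
    print(edges.shape)  # torch.Size([1, 8, 64, 64])

Expanding the definition shows why this stays cheap: since sum_i w_i (x(p_i) - x(p0)) = Conv(x, w)(p0) - x(p0) * sum_i w_i, the pairwise differences never need to be materialized, and the cost is one standard convolution plus a 1x1 convolution.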


Series: IEEE International Conference on Computer Vision
ISSN: 1550-5499
ISSN-E: 2380-7504
ISSN-L: 1550-5499
ISBN: 978-1-6654-2812-5
ISBN Print: 978-1-6654-2813-2
Pages: 5097 - 5107
DOI: 10.1109/ICCV48922.2021.00507
OADOI: https://oadoi.org/10.1109/iccv48922.2021.00507
Host publication: 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
Conference: IEEE International Conference on Computer Vision
Type of Publication: A4 Article in conference proceedings
Field of Science: 113 Computer and information sciences
Subjects:
Funding: This work was partially supported by the Academy of Finland under Grant 331883 and the National Natural Science Foundation of China under Grants 61872379, 62022091, and 71701205.
Academy of Finland Grant Number: 331883
Detailed Information: 331883 (Academy of Finland Funding decision)
Copyright information: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.